
Add supplementary rake tasks for distributed HLL

Mikołaj Wawrzyniak requested to merge mwaw/296169-distributed-hll-ci-job into master

What does this MR do?

It aims to provide supplementary rake tasks for a robust accuracy analysis of the HyperLogLog implementation.

The proposed rake tasks use statistical methods that assess the accuracy of the algorithm implementation over a large set of estimations, instead of a single run, which is prone to being skewed by the probabilistic nature of the algorithm.

In the original paper, http://algo.inria.fr/flajolet/Publications/FlFuGaMe07.pdf, the mathematical method used to assess accuracy was the relative standard error.

Screenshot_2021-01-08_at_11.03.18
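
As a rough illustration of the metric, the sketch below computes a relative standard error over a batch of estimates, taken here as the standard deviation of the estimates divided by the true cardinality. It is not the code from this MR: the method names, the sample values, and the register count are all hypothetical. The 1.04 / sqrt(m) figure is the typical accuracy the paper states for m registers.

```ruby
# Hypothetical helper, not taken from the MR: relative standard error of a
# batch of cardinality estimates against a known true cardinality.
def relative_standard_error(estimates, true_count)
  raise ArgumentError, 'estimates must not be empty' if estimates.empty?
  raise ArgumentError, 'true_count must be positive' unless true_count.positive?

  mean = estimates.sum.to_f / estimates.size
  # Population variance of the estimates around their own mean.
  variance = estimates.sum { |e| (e - mean)**2 } / estimates.size
  Math.sqrt(variance) / true_count
end

# Typical relative accuracy stated in the paper for m registers: about 1.04 / sqrt(m).
def expected_relative_standard_error(registers)
  1.04 / Math.sqrt(registers)
end

estimates = [9_870, 10_120, 10_050, 9_930, 10_030] # made-up sample values
true_count = 10_000                                # assumed known cardinality
registers = 1_024                                  # assumed register count

observed = relative_standard_error(estimates, true_count)
expected = expected_relative_standard_error(registers)
puts format('observed: %.4f, expected: %.4f', observed, expected)
```

In practice the batch would hold far more than five estimates. Comparing the observed value with the theoretical one is a steadier check than comparing a single estimate with the true count.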

Implements: #299184 (closed)

Screenshots (strongly suggested)

Does this MR meet the acceptance criteria?

Conformity

Availability and Testing

Security

If this MR contains changes to processing or storing of credentials or tokens, authorization and authentication methods and other items described in the security review guidelines:

  • Label as security and @ mention @gitlab-com/gl-security/appsec
  • The MR includes necessary changes to maintain consistency between UI, API, email, or other methods
  • Security reports checked/validated by a reviewer from the AppSec team
Edited by Mikołaj Wawrzyniak
