Improve performance of get_tuids_containing (!536) · Merge requests · Quantify / quantify-core

Explanation of changes

The method get_tuids_containing can be slow because it has to parse all stored experiments. A relatively expensive part of the method is the TUID.is_valid(x[:26]) check in the filter method. We can avoid that check in many cases by performing the (contains in x) check first. The latter check is much faster and (assuming a user stores many the quantify datasets in the quantify folder), the TUID.is_valid(x[:26]) check will almost always be true, while the (contains in x) check can be very selective.

In our system (> 100.000 experiments) performance of get_tuids_containing with the contains argument set goes from 4.1 seconds to 0.6 seconds.

Improve performance of get_tuids_containing

Explanation of changes

Motivation of changes

This improvement has no impact on the public interface. There are other methods, but they require more work.

Merge checklist

Improve performance of get_tuids_containing

Explanation of changes

Motivation of changes

This improvement has no impact on the public interface. There are other methods, but they require more work.

Merge checklist

Merge request reports