Reduce the frequency of random-variable-stream-test failures
This is a continuation of Bugzilla 1927
The remaining work from there was to refactor the Chi-Squared test to reduce code duplication, and make it easier to see where the tests are using different metrics or strategies.