Change scoring metric implementations to take in whole dataframes

Resolve issue #304 (closed)

TODO:

  1. Need test for every metric
  2. Need to change primitive, compute_score caller
Edited by linyang

Merge request reports

Loading