Two different names, but the same ID
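When uploading dataset documents, we see two documents with different names (the ID values ending in _SCORE and _TEST) but the same digest, which the server also uses as _id in the error further down: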
2019-07-02 22:59:32,445:run-log:DEBUG - Uploading documents from /home/eval-dmc-11/output/dataset/normalizedScoreDatasetDoc.json
2019-07-02 22:59:32,971:urllib3.connectionpool:DEBUG - Starting new HTTPS connection (1): metalearning.datadrivendiscovery.org
2019-07-02 22:59:33,007:urllib3.connectionpool:DEBUG - https://metalearning.datadrivendiscovery.org:443 "POST /1.0/dataset/?submitter=dmc HTTP/1.1" 202 182
2019-07-02 22:59:33,008:run-log:DEBUG - {
"message": "Document with ID: 1491_one_hundred_plants_margin_clust_dataset_SCORE and digest: 89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a already exists"
}
2019-07-02 22:59:33,009:run-log:DEBUG - Uploading documents from /home/eval-dmc-11/output/dataset/normalizedTestDatasetDoc.json
2019-07-02 22:59:33,527:urllib3.connectionpool:DEBUG - Starting new HTTPS connection (1): metalearning.datadrivendiscovery.org
2019-07-02 22:59:33,580:urllib3.connectionpool:DEBUG - https://metalearning.datadrivendiscovery.org:443 "POST /1.0/dataset/?submitter=dmc HTTP/1.1" 202 181
2019-07-02 22:59:33,582:run-log:DEBUG - {
"message": "Document with ID: 1491_one_hundred_plants_margin_clust_dataset_TEST and digest: 89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a already exists"
}
Then, when we try to upload the pipeline_run document, which references the _TEST dataset by that ID and digest, the server rejects it with a 400 error:
2019-07-02 22:59:55,456:urllib3.connectionpool:DEBUG - Starting new HTTPS connection (1): metalearning.datadrivendiscovery.org
2019-07-02 22:59:55,730:urllib3.connectionpool:DEBUG - https://metalearning.datadrivendiscovery.org:443 "POST /1.0/pipeline-run/?submitter=dmc HTTP/1.1" 400 326
2019-07-02 22:59:55,733:run-log:ERROR - Failed to upload /home/eval-dmc-11/output/pipeline_runs/8d5f190c-c4be-47d3-9061-936b1258974b.yaml to https://metalearning.datadrivendiscovery.org/1.0/pipeline-run/:
{'context': 'EVALUATION', 'datasets': [{'digest': '89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a', 'id': '1491_one_hundred_plants_margin_clust_dataset_TEST'}],
...
2019-07-02 22:59:55,736:run-log:ERROR - 400: {
"message": "Error ingesting document: Referenced document {\"id\": \"1491_one_hundred_plants_margin_clust_dataset_TEST\", \"digest\": \"89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a\", \"_id\": \"89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a\"} not found in collection: datasets"
}
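The condition seen in these logs (more than one dataset ID reporting the same digest) could be flagged on the client side before any pipeline_run upload is attempted. Below is a minimal sketch, assuming the (ID, digest) pairs have already been read out of the dataset documents; the helper name find_shared_digests is hypothetical and not part of any existing tool, and the sketch does not reproduce or explain the server's behavior.

```python
from collections import defaultdict


def find_shared_digests(docs):
    """Return {digest: sorted list of IDs} for every digest used by more than one ID.

    `docs` is an iterable of (doc_id, digest) pairs. How those pairs are
    extracted from the dataset documents is left out on purpose (assumption:
    the caller already has them).
    """
    ids_by_digest = defaultdict(set)
    for doc_id, digest in docs:
        ids_by_digest[digest].add(doc_id)
    # Keep only digests that are claimed by two or more distinct IDs.
    return {d: sorted(ids) for d, ids in ids_by_digest.items() if len(ids) > 1}


# Values copied from the log output above.
DIGEST = "89448cf507e57167ed723ff45c3899ca9d9a5c38dc9255c21389a047260c7a7a"
docs = [
    ("1491_one_hundred_plants_margin_clust_dataset_SCORE", DIGEST),
    ("1491_one_hundred_plants_margin_clust_dataset_TEST", DIGEST),
]

collisions = find_shared_digests(docs)
for digest, ids in collisions.items():
    print(f"digest {digest} is shared by: {', '.join(ids)}")
```

Running this on the two documents above reports one shared digest with both the _SCORE and _TEST IDs, which matches what the upload responses show.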