Skip to content
Snippets Groups Projects
D

Dataset Transformation

Project ID: 19882269
Select Git revision
  • master default protected
  • 342-adarchive-review-label-generation-and-entity-typing-in-x3ml
  • 276-ind-affairs-data-cleaning
  • 263-rs-metadata-ttl
  • 254-adarchive-validation-error
  • 237-anthpath-august-2023-data-dump-and-postprocess-review
  • 222-anthologia-palatina-update-passage-and-text-in-xml
  • 156-E42-consistency-updates
  • 104-generic-post-processing-script
  • 63-orlando-entity-extraction
10 results
  • Clone with SSH
  • Clone with HTTPS
  • Natalie Hervieux's avatar
    Natalie Hervieux authored
    Resolve "hist-cdns: update web annotation model"
    
    Closes #358
    
    See merge request !266
    270ff0d6
    History

    This repository contains scripts used to convert structured datasets (e.g., spreadsheets, relational data, XML) from source data to LINCS compliant RDF. While datasets are still in their conversion phase, we continue to update and re-run the conversion steps here and then replace the version of the data in the LINCS triplestore. When a given dataset leaves draft form, we stop editing the data here and begin editing it directly in ResearchSpace. For details about our conversion workflows, see our conversion workflow documentation.

    When new source data becomes available, as long as it follows the same structure as the original source data, then it can be converted following the steps stored here.

    Final converted data is stored in Datasets/, while conversion scripts and in-progress data is stored in each project's respective folder.

    Ontologies/ contains ontology files for us in 3M, ResearchSpace, and the user-facing triplestore. Note that the LINCS 3M instance comes pre-loaded with the ontology files needed for most LINCS datasets.

    The X3ML/ folder contains scripts to apply X3ML mappings on source XML files in bulk, rather than needing to use the user interface. For details on using X3ML to convert your data and how to use existing mapping files, see our X3ML documentation.