README
This repository contains metadata dictionaries, class models and their instances of
computations, datasets, and software described in the Multi-Scale Integrated Cell
(MuSIC) pipeline which integrates protein fluorescent images and protein
biophysical association data to create a unified hierarchical map of human cell
architecture. A detailed provenance map here shows each component required to run the pipeline. In our current release, only the components within the rectangle titled Ideker Lab/MuSIC
have been addressed by the data dictionary, metadata model and instances.
More information on the MuSIC pipeline:
The models and instances are expressed using the Linked Data Modeling Language framework called LinkML.
For more information on LinkML:
- linkml.io main website
- specification
Data dictionary
Directory data-dictionary
contains three data dictionary documents which are still at their infancy.
The current release contains metadata about objects using a select set of variables, their data types, preferred mapping using URIs/CURIEs, and allowed values based on the current state of the MuSIC pipeline. More detailed version of the data dictionary is planned for release in future. For example, the dictionary for Dataset (dataset.md
) introduces a slot (aka attribute/property) called schema
which refers to the schema of the content. As different formats of the content demand different schema representation, more work in this direction is required.
Models
Directory linkml-models
contains three LinkML models for Dataset, Software, and Computation. The models are currently represented as flat structure. In future release, nested structure will be used with support from the advanced features in the LinkML ecosystem.
- LinkMLMuSICComputationModel
- LinkMLMuSICDatasetModel
- LinkMLMuSICSoftwareModel
Instances
There are many subsets of profiles of the metamodel, for different purposes: The instances of Computation, Dataset and Software described in the following directories can be validated against the metadata model:
- linkml-computation-instances
- linkml-dataset-instances
- linkml-software-instances