/idr-notebooks

Jupyter Notebooks for the Image Data Resource

Primary LanguageJupyter Notebook

Actions Status

idr-notebooks

Binder

A set of Python Notebooks to demonstrate how to access the images and metadata from the Image Data Resource (IDR), including features and all descriptive tags.

Full access to IDR metadata and images is provided through the standard OMERO API, documentation for which can be found here, with the Python bindings found specifically here. The notebooks in this repository are meant to exemplify the use of that API in the context of the IDR, and the sort of queries that can be done. In particular, they show how to reproduce Figure 1 and Figure 2 of the paper.1 They also make use of the scipy ecosystem, including pandas.

To run the notebooks, you can either run on mybinder.org or build locally with repo2docker.

To build locally:

  • Install Docker if required
  • Create a virtual environment and install repo2docker from PyPI.
  • Clone this repository
  • Run repo2docker.
  • Depending on the permissions, you might have to run the command as an admin
pip install jupyter-repo2docker
git clone https://github.com/IDR/idr-notebooks.git
cd idr-notebooks
repo2docker .
Notebook Lang Level Description
Using Jupyter Markdown Intro Getting a login account
IDR API example script Py Intro Shows an example of using the web API to extract metadata from the IDR. Can also be launched locally.
Getting Started Py Intro How to connect, some simple data access
Figure 1 Sample of Phenotypes Py IDR Paper Reproduces Fig. 1 of the paper: downloads annotations from all screens and computes and plots some statistics
Gene Network Py IDR Paper Reproduces Fig. 2 of the paper: downloads annotations from 3 screens,with a phenotype in common, queries StringDB for interactions and plots the resulting network. Uses a conversion table for orthologues and gene identifiers which was built offline using biomart (see article for more details). It uses Bokeh and NetworkX to display networks.
PCA analysis of Charm features Py Study details Shows how to access the computed CHARM features using OMERO.tables, and performs some analysis on them, showing that single cell information can be accessed from generic tile-based features, without segmentation.
Rohn Phenotype Clustering Py Study details Downloads annotations from idr0008-rohn-actinome, and performs some simple phenotypic clustering, building a figure, similar to Fig. 1 of the corresponding paper. Builds a gallery of thumbnails from images of several phenotypes.
Sysgro ROI Length Py Study details Loads polygons which are linked to the images of idr0001-graml-sysgro and compares the length of cells labelled with a particular gene such as ASH2 versus the wild type.
Calculate Sharpness Py Example Calculates sharpness of images and generates heatmaps.

1: Available on bioRxiv under https://doi.org/10.1101/089359

2: If you don't yet have an account, notebooks can be viewed under https://github.com/IDR/idr-metadata or more completely under https://nbviewer.jupyter.org/github/idr/idr-notebooks/tree/master/