/HistCNN

A set of codes to classify H&E slides using deep learning

Primary LanguagePythonOtherNOASSERTION

This package is an implementation of Inception-v3 CNN to classify TCGA H&E whole slide images (WSI) according to tumor/normal status and cancer subtype. It implements a pipeline in Kubernetes (k8s) under Google Cloud Platform (GCP) for labeling, tiling, and transfer learning on the images.

schematic of the analysis pipeline

For details, please see the following paper:

Javad Noorbakhsh, Saman Farahmand, Sandeep Namburi, Dennis Caruana, David Rimm, Mohammad Soltanieh-ha, Kourosh Zarringhalam, Jeffrey H Chuang, Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images, Nature communications 2020

Installation

To properly use the pipelines, you will need to install the package. After changing current directory to the root of the project run (for now this only works in the development mode):

pip install -e .

Documentation

The Kubernetes pipelines are separated into individual apps which run the corresponding problem of interest on GCP. For details on each app refer to the README in its corresponding folder. The following apps are implemented:

To begin using any of these apps you will need to set up a k8s cluster.

Command line tool

Few functionalities have been implemented through a command line tool. To access its help run:

histcnn --help

or one of the more detailed alternatives:

histcnn gcs --help
histcnn annotate --help
histcnn run-subtype --help