/vaf-experiments

A step-by-step guide to estimating tumor clonality and purity from variant allele frequency data

Primary language: Jupyter Notebook. License: Apache License 2.0.

About

This repository contains a series of Python notebooks, structured as blog posts, that walk the reader through using variant calls to make educated guesses about tumor biology. The ultimate goal is to build, from scratch, a utility that takes a Variant Call Format (VCF) file as input and outputs an annotated version of it with metrics of clonality, purity, and heterogeneity.
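To give a feel for the kind of calculation involved, here is a minimal sketch of deriving a variant allele frequency (VAF) from a single VCF record. It is an illustration only, not the notebooks' actual code: the function names `allele_depths`, `vaf`, and `naive_purity` are hypothetical, the example record is made up, and the sketch assumes the VCF carries an `AD` (allelic depths) field in its FORMAT column. The purity guess relies on a strong simplifying assumption: the variant is clonal, heterozygous, and sits in a copy-neutral diploid region, so its expected VAF is half the tumor purity.

```python
def allele_depths(vcf_line, sample_index=0):
    """Return (ref_depth, alt_depth) from the AD field of one VCF data line.

    Assumes a tab-separated record whose FORMAT column (9th field)
    includes AD, and considers only the first alternate allele.
    """
    fields = vcf_line.rstrip("\n").split("\t")
    format_keys = fields[8].split(":")
    sample_values = fields[9 + sample_index].split(":")
    ad = sample_values[format_keys.index("AD")]
    ref_depth, alt_depth = (int(x) for x in ad.split(",")[:2])
    return ref_depth, alt_depth


def vaf(ref_depth, alt_depth):
    """Fraction of reads supporting the alternate allele."""
    total = ref_depth + alt_depth
    if total == 0:
        raise ValueError("no reads cover this site")
    return alt_depth / total


def naive_purity(variant_vaf):
    """Rough purity guess for a clonal heterozygous variant.

    Assumption: copy-neutral diploid locus, so expected VAF = purity / 2.
    The result is capped at 1.0 because sampling noise can push VAF above 0.5.
    """
    return min(1.0, 2.0 * variant_vaf)


# Made-up example record: 70 reference reads, 30 alternate reads.
record = "chr1\t12345\t.\tA\tT\t50\tPASS\t.\tGT:AD:DP\t0/1:70,30:100"
ref_depth, alt_depth = allele_depths(record)
example_vaf = vaf(ref_depth, alt_depth)
example_purity = naive_purity(example_vaf)
```

A single variant says little on its own; estimates of this kind are usually drawn from the distribution of VAFs across many variants, which is the sort of reasoning the notebooks build up step by step.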

Table of Contents

Setup

For exploration in read-only mode, there is no need to install additional software or libraries. All notebooks can be browsed within the GitHub repository through its integration with nbviewer.

The suggested way to interact with and modify these notebooks is to first create an isolated Python environment with the miniconda utility and then install all the requirements as follows:

$ conda create --name vafexperiments python
$ source activate vafexperiments
$ pip install -r requirements.txt
$ jupyter notebook  # and browse individual notebooks