/clausen_ribonucleotides

Analyze ribonucleotide incorporation data from Clausen et al. 2015 data using script `plot_5prime_end_from_bedgraph.py`.

Primary LanguageJupyter NotebookMIT LicenseMIT

clausen_ribonucleotides

Binder

tl;dr:
Click any launch binder badge on this page to analyze ribonucleotide incorporation data from Clausen et al. 2015 data using script plot_5prime_end_from_bedgraph.py.


Demonstrating the use of my script plot_5prime_end_from_bedgraph.py by analyzing ribonucleotide incorporation data


Usage

This repository is set up to analyze ribonucleotide incorporation data from Clausen et al. 2015 data (see below), demonstrating the use of my script plot_5prime_end_from_bedgraph.py. The script will automatically be obtained when the notebook is run in the active Jupyter session launched from this repository. Click on any launch binder badge on this page to begin.

In the notebook that is launched I illustrate how to use the script and some of the various options it has. Alternatively, the notebook can be viewed statically here.

The necessary data is already available when in the active Jupyter session launched from this repository.

Data Source

The data used here comes from Clausen et al., 2015 (PMID:25622295) (full citation below) and was retrieved here

Users of the data should cite:

  • Tracking replication enzymology in vivo by genome-wide mapping of ribonucleotide incorporation. Clausen AR, Lujan SA, Burkholder AB, Orebaugh CD, Williams JS, Clausen MF, Malc EP, Mieczkowski PA, Fargo DC, Smith DJ, Kunkel TA. Nat Struct Mol Biol. 2015 Mar;22(3):185-91. doi: 10.1038/nsmb.2957. Epub 2015 Jan 26. PMID:25622295

The sequences and analyzed data from that publication have been deposited in the NCBI Gene Expression Omnibus under accession number GSE62181.

Clarifying Data Attribution:
I, Wayne, had nothing to do with that manuscript or data. I simply generated after-the-fact a utility script for analyzing data for specific regions from there. That script is called plot_5prime_end_from_bedgraph.py and is available here. It is demonstrated in the notebook launchable from this repository

Technical Details

This repository is set up to make use of the Binder service offered by MyBinder.org. See their site for more information about Binder.

Python 3 only is available in the Binder sessions launched from this repository. However, the script plot_5prime_end_from_bedgraph.py also works in Python 2. This can be demonstrated by copying the notebook from this repo and uploading it into a Binder session launched from here, which includes Python 2 as well. Run it there with the kernel switched to use Python 2.

Because the Clausen et al., 2015 data are served via FTP, some of the data has been already included in any active Jupyter session launched from the launch binder button in the repository. MyBinder.org cannot access data served from FTP for security purposes.

Developed from my blast-binder repo that was mainly made from adding the ability to run BLAST to much of the binderized repo from here with the ability to also run PatMatch, see here for information about PatMatch and launchable Jupyter notebooks demonstrating its use.

I borrrowed the 'warning' highlight/introductory text about notebooks at the top of the included notebook from Tim Sherratt's notebook here.

Click this button below to begin:

Binder