Metagenomic binning based on Hi-C data

Primary language: Python. License: GNU General Public License v3.0 (GPL-3.0).

metaTOR

Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.

Installation

   pip3 install metator

or, to use the latest version:

   pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator

Python 3.4 or later is required. A standalone version (no installation, just download/unzip/run) is also available, as well as a Python 2 version, but keep in mind that development will focus on the current branch.

Usage

metator {align|partition|annotation|binning} [parameters]

A metaTOR command takes the form metator action --param1 arg1 --param2 arg2, and so on for any further parameters.
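The command form above can be assembled programmatically, which is convenient when scripting several runs. The following is a minimal sketch, not part of metaTOR itself: the helper name build_command is hypothetical, and param1/param2 are the placeholder names from the generic form, not real options. Consult the manual for the parameters each action accepts.

```python
import shlex

# The four pipeline actions listed in the usage line above.
PIPELINE_ACTIONS = ("align", "partition", "annotation", "binning")


def build_command(action, **params):
    """Return the argument list for `metator <action> --key value ...`.

    Parameter names are passed through untouched; this hypothetical
    helper does not know which options an action really accepts.
    """
    if action not in PIPELINE_ACTIONS:
        raise ValueError(f"unknown action: {action!r}")
    command = ["metator", action]
    for name, value in params.items():
        command += [f"--{name}", str(value)]
    return command


# Placeholder parameter names taken from the generic form, not real options.
example = build_command("align", param1="arg1", param2="arg2")
print(shlex.join(example))
```

The resulting list can be handed to subprocess.run as is, which avoids shell quoting problems with file paths.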

There are four actions/steps in the metaTOR pipeline, which must be run in the following order:

  • align : map paired-end reads on a preliminary assembly, then generate a network from detected contacts between DNA chunks.
  • partition : perform the Louvain community detection algorithm many times to isolate chunks that consistently cluster together for binning purposes.
  • annotation : run standard annotation software on the assembly (namely gene prediction and database comparison) to match with the bins.
  • binning : match annotations to bins, extract bin genomes and subnetworks, build bin-local and global contact maps.
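To illustrate what "generate a network from detected contacts" means in the align step, here is a small sketch that turns mapped read pairs into a weighted edge list. It rests on several assumptions that the text above does not confirm: the chunk names are invented, pairs whose two mates fall on the same chunk are simply skipped, and no mapping-quality filtering is modelled. It is not metaTOR's own code.

```python
from collections import Counter


def build_contact_network(read_pairs):
    """Count contacts between chunks from mapped read pairs.

    `read_pairs` is an iterable of (chunk_a, chunk_b) tuples, one per
    paired-end read, naming the chunk each mate mapped to. Returns a
    Counter mapping an unordered chunk pair to its number of contacts.
    """
    network = Counter()
    for chunk_a, chunk_b in read_pairs:
        if chunk_a == chunk_b:
            # Assumption: intra-chunk pairs carry no inter-chunk signal.
            continue
        # Sort the pair so (a, b) and (b, a) count as the same edge.
        edge = tuple(sorted((chunk_a, chunk_b)))
        network[edge] += 1
    return network


# Toy data with hypothetical chunk names.
pairs = [
    ("contig1_0", "contig1_1"),
    ("contig1_1", "contig1_0"),
    ("contig1_0", "contig2_0"),
    ("contig2_0", "contig2_0"),
]
network = build_contact_network(pairs)
for (chunk_a, chunk_b), weight in sorted(network.items()):
    print(chunk_a, chunk_b, weight)
```

The edge weights are what a community detection algorithm such as Louvain would then operate on.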

Once the last step has completed, the output should contain a set of bins, their relative enrichments in various gene categories, and the contact map of each bin.
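The idea behind the partition step, keeping only chunks that consistently cluster together across many Louvain runs, can be sketched as follows. The Louvain runs are simulated here with hand-written labels, the function name core_bins is hypothetical, and requiring agreement in every run is an assumption; the actual criterion used by metaTOR is described in its manual.

```python
from collections import defaultdict


def core_bins(partitions):
    """Group chunks that share a community in every partition run.

    `partitions` is a list of dicts, one per run, mapping chunk name to
    the community label assigned in that run. Two chunks end up in the
    same group only if their labels match in all runs.
    """
    groups = defaultdict(list)
    for chunk in sorted(partitions[0]):
        # The tuple of labels across runs identifies a consistent group.
        signature = tuple(run[chunk] for run in partitions)
        groups[signature].append(chunk)
    # Sort the groups for a deterministic result.
    return sorted(groups.values())


# Three simulated runs over five hypothetical chunks.
# Label values are arbitrary and need not agree between runs.
runs = [
    {"a": 0, "b": 0, "c": 1, "d": 1, "e": 1},
    {"a": 1, "b": 1, "c": 0, "d": 0, "e": 1},
    {"a": 2, "b": 2, "c": 5, "d": 5, "e": 5},
]
bins = core_bins(runs)
print(bins)
```

In this toy input, chunk "e" switches community in the second run, so it is separated from "c" and "d" even though it joins them in the other two runs.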

There are a number of other, optional, miscellaneous actions:

  • pipeline : check that the environment is set up correctly, then run all four of the above actions sequentially. This can take a while.
  • dependencies : download third party dependencies that are not available in most package managers.
  • deploy : set up the environment and all dependencies for Ubuntu 14.04 and higher (run as root).
  • version : display current version number.
  • help : display this help message.

Please refer to the tutorial in order to quickly get the pipeline up and running on example data (or your own), and the metaTOR manual for detailed explanations on the parameters.

Requirements

Most requirements can be installed with your operating system's package manager. Those that cannot (namely prodigal, louvain and the HMM databases) can be fetched with the following command, which may need to be run as root depending on where the package is installed:

    metator dependencies
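Before or after fetching dependencies, it can help to confirm which tools are visible on the PATH. This is a small sketch using only the Python standard library; the executable names "prodigal" and "louvain" are assumptions based on the tool names mentioned above, and the binaries installed on a given system may be named differently.

```python
import shutil


def missing_tools(tool_names):
    """Return the names in `tool_names` that are not found on PATH."""
    return [name for name in tool_names if shutil.which(name) is None]


# Executable names are assumptions based on the tools named above.
missing = missing_tools(["prodigal", "louvain"])
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("All listed tools were found on PATH.")
```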

A Dockerfile is also available.

References

Contact

Authors

Research lab

Spatial Regulation of Genomes (Institut Pasteur, Paris)