/grifo

GRIFO is an analysis pipeline for nanopore metabarcoding sequence data

Primary LanguageJupyter NotebookOtherNOASSERTION

GRIFO

GRIFO is a versatile pipeline for environmental metabarcoding using nanopore sequences with density based clustering for error correction and other utilities. The name is derived of the mythical creature grifo (Esperanto) which is partly eagle, lion, and deer. The name describes how the pipeline integrates existing tools into a new workflow.

Usage

GRIFO is under development. There is currently no user guide and source code is unstable and the latest version may not even be functional at times. Currently, GRIFO runs in a jupyter notebook in Python using wrapper functions for Linux tools.

Dependencies

GRIFO uses the clustering module of ashure, which itself relies on the following tools:

pip install pandas          # for organizing underlying data
pip install scikit-learn    # for clustering
pip install hdbscan         # for clustering
pip install spoa            # for clustering
pip install parasail

Other depencies are: qcat, nanofilt, and vsearch

Citation

GRIFO makes use of several tools and datasets from third parties. Please cite them too when using GRIFO.