Serratus
Serratus is a collaborative Open Science project for ultra-rapid discovery of all viruses.
Paper: Petabase-scale sequence alignment catalyses viral discovery
Background
While there is an intense research effort to sequence and analyze SARS-CoV-2 isolates for tracking real-time virus evolution, our understanding of the virus's evolutionary origins is limited by incomplete genomic characterization of other members of the Coronaviridae (CoV) and related families.
We are re-analyzing all RNA-seq, metagenomics, metatranscriptomics and environmental sequencing data in the NCBI Sequence Read Archive (SRA) to discover new coronaviruses. That amounts to >5 million biological samples, or >10 petabases of sequencing data.
Contribute to Serratus
The Serratus team is actively looking to collaborate with all scientists and developers.
- See: CONTRIBUTING.md
- Email (ababaian AT bccrc DOT ca)
- Join us on Slack (/join #serratus)
Serratus Usage
Learn more on the Serratus Wiki:
- Serratus Data Explorer
- How To: Finding Novel Viruses
- Accessing Raw Data
- Cloud Architecture
- Serratus CLI Usage
Data Release Policy
Our primary goal is to generate viral data that accelerates global research efforts in fighting emerging pathogens. To achieve this:
- All software development is open-source and freely available (GPLv3).
- All sequencing data generated, raw and processed, will be freely and immediately available in the public domain, in accordance with the Bermuda Principles set out by the Human Genome Project.