plate-qc

A pipeline for assessing cross-contamination in high-throughput metagenomic sequencing, with guidelines for upstream preventative measures.

Primary language: Python

Metrics/Tasks

All metrics are calculated at the kmer level for the greatest applicability and computational efficiency.

  1. Count kmers
  2. Row-wise and column-wise cross-correlation
  3. Negative control (blank) similarity to other samples
  4. Positive control (natural sample) similarity to previous independent runs
  5. Positive control (defined sample) similarity to previous independent runs
  6. What else?
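
As a rough illustration of tasks 1 and 3 above, the following is a minimal sketch and not the pipeline's actual implementation. The function names (`count_kmers`, `cosine_similarity`, `blank_similarity`), the default `k=4`, the well labels, and the choice of cosine similarity as the comparison metric are all assumptions made for this example; the README does not specify them.

```python
from collections import Counter
from math import sqrt


def count_kmers(seq, k=4):
    """Count overlapping kmers in one sequence (hypothetical helper).

    Kmers containing characters other than A, C, G, T are skipped.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def cosine_similarity(a, b):
    """Cosine similarity between two kmer count profiles.

    Assumption: cosine similarity stands in for whatever metric
    the pipeline actually uses. Returns 0.0 if either profile is empty.
    """
    dot = sum(a[kmer] * b[kmer] for kmer in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def blank_similarity(profiles, blank_well):
    """Compare the blank well's profile with every other well on the plate.

    `profiles` maps a well label (e.g. "A1", hypothetical) to its kmer counts.
    """
    blank = profiles[blank_well]
    return {
        well: cosine_similarity(blank, profile)
        for well, profile in profiles.items()
        if well != blank_well
    }
```

Under these assumptions, a blank whose profile scores close to 1.0 against one particular well would be a candidate for closer inspection, though any threshold would need to be chosen empirically.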