This algorithm is designed to simulate the generation of sequencing reads from a given reference genome, mimicking the error rate characteristics typical of Oxford Nanopore sequencing technology.
- Input/Output: algorithm takes a string of nucleic acid (i.e. reference genome) and returns a list of sequence reads according to the hypergeometric probability distribution (i.e. sampling without replacement).
- Functionality: algorithm breaks the string (input) into smaller fragments (output) that are within the desired input size (min, max). It will then add errors (substitutions or deletions) to each read to simulate Oxford Nanopore sequencing error rates per read.