Welcome to sORF.

The following covers basic I/O, stats, data visualization, and 3rd party library installation.

Your tasks, should you choose to accept them:

  • Clone this repo.
  • Complete the tasks in Python and/or R.
  1. Using heavy_light_ag_aaseq.csv as input, output a summary file in comma-separated value (CSV) format.
    • Use Python or R.
    • Use column h_species.
    • Content headers: Species, # sequences, and % sequences.
    • See example_output.csv.
  2. Visualize the distribution of paratope lengths.
  3. Complete all tasks in 30 minutes.

Have fun coding!
Rahmad & Farhan.