nextstrain/nextclade_data

ENH: Add key RBD mutations as labelled mutations

Closed this issue · 5 comments


This would help a lot for downstream analytics! I'm currently calculating key RBD mutations from this yaml after running Nextclade, but my Python implementation is very slow (and not multi-threaded).

@ktmeaton Hmm, I don't think this would do what you want since only private mutations that are labelled would get highlighted.

If you want to calculate key RBD mutations fast, the best way may be to:
a) Select only the `aaSubstitutions` column from the nextclade.tsv using `tsv-select` (from eBay's tsv-utils).
b) Then parse that column. If you need it to be super fast, a little Rust script could help.
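
As a rough sketch of step b), the parsing could look something like the following. It assumes the column is named `aaSubstitutions`, that the file has a tab-separated header row, and that each cell is a comma-separated list of entries such as `S:N501Y`. The function name `parse_substitutions` is made up for illustration and is not part of Nextclade or tsv-utils.

```rust
/// Extract the per-sequence substitution lists from TSV text.
/// Assumptions (not confirmed in this thread): the first line is a
/// tab-separated header, and each cell in `column` is a comma-separated
/// list such as "S:N501Y,S:E484K".
/// Returns `None` if the header does not contain `column`.
fn parse_substitutions(tsv: &str, column: &str) -> Option<Vec<Vec<String>>> {
    let mut lines = tsv.lines();
    let header = lines.next()?;
    // Locate the column by name so the code works on the full
    // nextclade.tsv as well as on a file already reduced by tsv-select.
    let idx = header.split('\t').position(|name| name == column)?;

    let rows: Vec<Vec<String>> = lines
        .filter(|line| !line.is_empty())
        .map(|line| {
            line.split('\t')
                .nth(idx)
                .unwrap_or("") // short rows are treated as "no mutations"
                .split(',')
                .map(str::trim)
                .filter(|m| !m.is_empty())
                .map(String::from)
                .collect()
        })
        .collect();
    Some(rows)
}
```

Each inner vector holds the substitutions for one sequence; a sequence with an empty cell yields an empty vector.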

Ah, that makes sense, thanks for clarifying! I've been meaning to get more into Rust, so I'll take a look.

This looks like a perfect task to get started in Rust:
a) read in a config file, read in the mutation file
b) process mutations
c) output count
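
A minimal sketch of those three steps, under some loud assumptions: the config is taken to be a plain list with one key mutation per line (a hypothetical format, not the yaml mentioned above), and the mutation file is taken to have one comma-separated list per sequence. The names `read_key_mutations` and `count_key_mutations` are invented for this example.

```rust
use std::collections::{BTreeMap, HashSet};

/// Step a): parse a config with one key mutation per line.
/// Hypothetical format: blank lines and lines starting with '#' are skipped.
fn read_key_mutations(config: &str) -> HashSet<String> {
    config
        .lines()
        .map(str::trim)
        .filter(|l| !l.is_empty() && !l.starts_with('#'))
        .map(String::from)
        .collect()
}

/// Steps b) and c): count how many sequences carry each key mutation.
/// `mutations` is assumed to hold one line per sequence, each line a
/// comma-separated list such as "S:N501Y,S:D614G".
fn count_key_mutations(keys: &HashSet<String>, mutations: &str) -> BTreeMap<String, usize> {
    // Start every key at zero so absent mutations still appear in the output.
    let mut counts: BTreeMap<String, usize> =
        keys.iter().map(|k| (k.clone(), 0)).collect();

    for line in mutations.lines() {
        // Deduplicate within a line so a repeated entry counts once per sequence.
        let seen: HashSet<&str> = line.split(',').map(str::trim).collect();
        for m in seen {
            if let Some(c) = counts.get_mut(m) {
                *c += 1;
            }
        }
    }
    counts
}
```

In a real script the two input strings would probably come from `std::fs::read_to_string` on paths taken from `std::env::args()`, and the resulting map could be printed one `mutation<TAB>count` pair per line. A `BTreeMap` is used here only so the output order is stable.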

Probably obsolete