Submission for IndabaX 2019 Hackathon - IBM Virus Species Team Members: - Craig James Bester - Christopher Dunderdale We calculate aggregate features for each genome sequence based on the percentage occurences of nucleotides (A,G,T,C) and protein strings (TAG, etc.). These features are then fed into an XGBoost classifier.