/uniparc_xml_parser

The UniParc dataset, which describes ~300 million protein sequences, converted into relational tables that are accessible through Google BigQuery and as Parquet files.

Primary Language: Rust
License: Apache License 2.0 (Apache-2.0)
