This repo contains the source code for ECRECer, an EC number prediction tool that implements our paper "Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework".
Detailed information about the framework can be found in our papers:
1. Zhenkun Shi, Qianqian Yuan, Ruoyu Wang, Haoran Li, Xiaoping Liao*, Hongwu Ma* (2022). ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning. arXiv preprint arXiv:2202.03632.
2. Zhenkun Shi, Rui Deng, Qianqian Yuan, Zhitao Mao, Ruoyu Wang, Haoran Li, Xiaoping Liao*, Hongwu Ma* (2023). Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework. Research.
To simply use our tool to predict EC numbers, please visit our web service at https://ecrecer.biodesign.ac.cn
To reproduce our experiments or to use the tool offline, please read the details below:
- Python >= 3.6
- scikit-learn
- XGBoost
- conda
- jupyter lab
- ...
Clone the repository and create the conda environment from env.yaml:
git clone git@github.com:kingstdio/ECRECer.git
conda env create -f env.yaml
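Once the environment is active, a quick check can confirm that the listed requirements are available. The snippet below is only a sketch: the helper names `missing_modules` and `python_is_supported` are hypothetical and not part of this repo, and `sklearn` / `xgboost` are assumed to be the import names of the Sklearn and Xgboost requirements.

```python
import importlib.util
import sys

# Assumption: import names for the Sklearn and Xgboost requirements.
REQUIRED_MODULES = ["sklearn", "xgboost"]


def missing_modules(names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_is_supported(version_info=sys.version_info, minimum=(3, 6)):
    """Compare the interpreter version against the minimum from the requirements list."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    # Only reports the state of the environment; it does not install anything.
    print("Python version OK:", python_is_supported())
    print("Missing modules:", missing_modules(REQUIRED_MODULES))
```

An empty "Missing modules" list suggests the environment was created correctly.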
Download and prepare the dataset, or directly download the preprocessed data from the AWS public dataset and put it in rootfolder/data/datasets/
python benchmark_train.py
python benchmark_test.py
python benchmark_evaluation.py
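The three benchmark scripts above are listed in train, test, evaluation order. As a convenience, they could be chained from a single Python entry point, as in the sketch below. The helpers `benchmark_commands` and `run_benchmark` are hypothetical and not part of this repo, and the sketch assumes the scripts take no arguments and are run from the repository root, in the order listed.

```python
import subprocess
import sys

# Script names taken from the commands above, in the order they are listed.
BENCHMARK_SCRIPTS = [
    "benchmark_train.py",
    "benchmark_test.py",
    "benchmark_evaluation.py",
]


def benchmark_commands(python=sys.executable, scripts=BENCHMARK_SCRIPTS):
    """Build one argument list per benchmark script."""
    return [[python, script] for script in scripts]


def run_benchmark(cwd="."):
    """Run each script in order; check=True stops at the first failing step."""
    for cmd in benchmark_commands():
        subprocess.run(cmd, cwd=cwd, check=True)


if __name__ == "__main__":
    # Print the commands instead of executing them; call run_benchmark()
    # from the repository root to actually run the steps.
    for cmd in benchmark_commands():
        print(" ".join(cmd))
```

Stopping at the first failure avoids evaluating with stale or missing outputs from an earlier step.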
python production.py -i input_fasta_file -o output_tsv_file -mode [p|r] -topk 5
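When calling production.py from another script, it may help to assemble and validate the arguments first. The sketch below uses only the flags shown in the command above (`-i`, `-o`, `-mode`, `-topk`). The function `production_command` is hypothetical, and the validation rules (mode must be `p` or `r`, topk must be a positive integer) are assumptions inferred from that usage line rather than from the script itself.

```python
import sys


def production_command(input_fasta, output_tsv, mode="p", topk=5,
                       python=sys.executable):
    """Build the argument list for production.py from the documented flags."""
    if mode not in ("p", "r"):
        raise ValueError("mode must be 'p' or 'r'")
    if int(topk) < 1:
        raise ValueError("topk must be a positive integer")
    return [
        python, "production.py",
        "-i", str(input_fasta),
        "-o", str(output_tsv),
        "-mode", mode,
        "-topk", str(int(topk)),
    ]


if __name__ == "__main__":
    # Hypothetical file names; replace them with real paths.
    cmd = production_command("input.fasta", "output.tsv", mode="p", topk=5)
    print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the repository root.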
If you find these methods valuable for your research, please cite the following paper:
@article{shi2023enzyme,
title={Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework},
author={Shi, Zhenkun and Deng, Rui and Yuan, Qianqian and Mao, Zhitao and Wang, Ruoyu and Li, Haoran and Liao, Xiaoping and Ma, Hongwu},
journal={Research},
year={2023},
publisher={AAAS}
}