GOPHER: GenOmic Profile-model compreHensive EvaluatoR
$ pip install bio-gopher
This repository contains scripts for data preprocessing, training deep learning models for DNA sequence to epigenetic function prediction and evaluation of models.
The repo contains a set of tutorial jupyter notebooks that illustrate these steps on a toy dataset. The two notebooks below are required prerequisites for the rest of tutorials:
- preprocessing/preprocessing/quant_dataset_tutorial.ipynb
- tutorials/train_model.ipynb
To replicate the results of the manuscript run the scripts in the analyzis directory. As a prerequisite download and unzip dataset.zip, trained_models.zip from zenodo https://doi.org/10.5281/zenodo.6464031 within the git repo. These contain test sets and pre-trained models. The analysis scripts can be ran in any order as long as paper_run_evaluate.py is ran first, in order to produce model evaluations which is required for further steps.