Primary language: Jupyter Notebook. License: MIT.

regLM

regLM is a toolkit for training HyenaDNA-based autoregressive language models on DNA sequences and for generating novel regulatory elements with them.

[Figure: regLM schematic]

Documentation

Tutorials

Installation

1. Install HyenaDNA

regLM depends on HyenaDNA, so install it first by following the instructions in its GitHub repository: https://github.com/HazyResearch/hyena-dna

2. Install regLM

git clone https://github.com/Genentech/regLM.git
cd regLM
pip install .
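
After both installation steps, a quick check that the packages are visible to the current Python environment can save some debugging later. The sketch below is only an illustration: the import names `torch` and `reglm` are assumptions (they are not stated above), so adjust them if your installation exposes different module names. It uses only the standard library and does not import the packages themselves.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


# Assumed import names (not confirmed by this README); edit as needed.
ASSUMED_MODULES = ["torch", "reglm"]


def report(modules):
    """Map each module name to whether it was found in this environment."""
    return {name: is_installed(name) for name in modules}


if __name__ == "__main__":
    for name, found in report(ASSUMED_MODULES).items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If a module is reported as missing, the most likely causes are that `pip install .` ran in a different environment than the one running this script, or that the assumed import name is wrong.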

Publication

https://genome.cshlp.org/content/early/2024/09/24/gr.279142.124.abstract

Lal, A., Garfield, D., Biancalani, T., & Eraslan, G. (2024). Designing realistic regulatory DNA with autoregressive language models. Genome Research.