Pinned Repositories
beefs
biomedical
Tools for curating biomedical training data for large-scale language modeling
eu-lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
tanl-bee
Structured Prediction as Translation between Augmented Natural Languages
pedl
Search the biomedical literature for protein interactions and protein associations
curated-web-data
A collection of "good and bad" Web domains for LLM pretraining data
euro-lm-evaluation-harness
barthfab's Repositories
barthfab/beefs
barthfab/biomedical
Tools for curating biomedical training data for large-scale language modeling
barthfab/eu-lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
barthfab/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
barthfab/tanl-bee
Structured Prediction as Translation between Augmented Natural Languages