bigscience-workshop/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
PythonMIT
Stargazers
- antferdom@datacrunch-research
- chen-yuxuanFreie Universität Berlin
- ctlllll@Princeton
- dadelani
- FateScript@Megvii-BaseDetection
- fly51flyPRIS
- GeraldCSCToronto, Canada
- giadilli@huggingface
- GitHub30Osaka, Japan
- haileyschoelkopf@EleutherAI
- hakunanatashaA-Alpha Bio
- hammer
- imr555Neovotech
- jdposada
- jiezhangGtMaryland
- jon-chunKenyon College
- jon-towNew York, New York
- leslyarun
- liigoQiFudan University
- LUDOVIC-PERAN
- marcderbauer@beyondwords-io
- nateraw@huggingface
- oskarvanderwalUniversity of Amsterdam
- pdurham2
- QingruZhangGeorgia Institute of Technology
- Real-bojack
- skychwang@ColumbiaNLP
- StellaAthenaBooz Allen Hamilton, EleutherAI
- SunkyoungKAIST
- taesiriPlanet Mars
- tttyuntian
- wilsonyhleeThe Trevor Project
- wjinfengbytedance
- Yuz998FZU
- zhenv5ByteDance Inc.
- zhouyizhuang-megviiMegvii.Inc