stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
PythonApache-2.0
Watchers
- ameliahardyStanford, CA
- AshwinParanjapeStanford University
- davesjoewang
- denissa4NLSQL
- eadeliStanford University
- eemailme
- faneshionict
- GoldenZeroNetherlands
- huaxiuyao
- ImKeTTUC Santa Cruz
- J38
- kappa0xStrasbourg, France
- katezhou
- mahaddadNew York, New York
- manning
- mbofb
- melodyee
- minaek
- ozfSoftware Square, Byte Town, Logicstate, Computronia
- paulpaul91
- percyliangStanford University
- pramitchoudhary@oidlabs.com
- rhudockKing & Spalding, LLP
- RohithKuditipudi
- ryokawajp
- SabrinaLameirasSão Paulo - SP
- SeshatCZCzech republic
- shantanusharmaSharma Labs
- sheshuguang
- smashclay
- teetoneStanford University
- tonywu95Google
- VijayAsokkumar
- yifanmai
- zzyunzhiStanford, CA