Pinned Repositories
air-bench-2024
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
BioMedLM
ecosystem-graphs
EUAIActJune15
Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act
fmti
The Foundation Model Transparency Index
halie
helm
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
helm-efficiency
image2struct
A benchmark for evaluating vision-language models on extracting structured information from images
mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Stanford Center for Research on Foundation Models' Repositories
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
stanford-crfm/BioMedLM
stanford-crfm/mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
stanford-crfm/ecosystem-graphs
stanford-crfm/EUAIActJune15
Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act
stanford-crfm/fmti
The Foundation Model Transparency Index
stanford-crfm/air-bench-2024
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
stanford-crfm/halie
stanford-crfm/helm-efficiency
stanford-crfm/image2struct
A benchmark for evaluating vision-language models on extracting structured information from images
stanford-crfm/data-overlap
stanford-crfm/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
stanford-crfm/lm-evaluation-harness
Fork of lm-evaluation-harness
stanford-crfm/sprucfluo
Data streaming for LMs. WIP
stanford-crfm/composer
Composing methods for ML training efficiency
stanford-crfm/janus
A Streamlit interface that's a doorway into GPT-X.
stanford-crfm/cc-index-server
Common Crawl Index Server
stanford-crfm/chatnoir-resiliparse
A robust web archive analytics toolkit
stanford-crfm/transformers_fsdp_checkpoint_fix
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
stanford-crfm/mosaicml-benchmarks
Fast and flexible reference benchmarks