Pinned Repositories
ailuminate
AILuminate v1.1 is an AI risk assessment benchmark suite developed with broad involvement from leading AI companies, academia, and civil society.
algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
ck
Collective Knowledge (CK), Collective Mind (CM/CMX), and MLPerf automations: community-driven projects that facilitate collaborative and reproducible research and show how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks.
croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers: dataset-level metadata, resource descriptions, data structure, and default ML semantics.
inference
Reference implementations of MLPerf™ inference benchmarks
inference_results_v5.0
This repository contains the results and code for the MLPerf™ Inference v5.0 benchmark.
modelbench
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
tiny
MLPerf™ Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers
training
Reference implementations of MLPerf® training benchmarks
training_results_v4.1
This repository contains the results and code for the MLPerf™ Training v4.1 benchmark.
MLCommons's Repositories
mlcommons/training
Reference implementations of MLPerf® training benchmarks
mlcommons/inference
Reference implementations of MLPerf™ inference benchmarks (a minimal LoadGen harness sketch follows the repository list)
mlcommons/ck
Collective Knowledge (CK), Collective Mind (CM/CMX), and MLPerf automations: community-driven projects that facilitate collaborative and reproducible research and show how to run AI, ML, and other emerging workloads more efficiently and cost-effectively across diverse models, datasets, software, and hardware using MLPerf methodology and benchmarks.
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers: dataset-level metadata, resource descriptions, data structure, and default ML semantics (a short loading sketch follows the repository list).
mlcommons/tiny
MLPerf™ Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers
mlcommons/algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
mlcommons/GaNDLF
A generalizable application framework for segmentation, regression, and classification using PyTorch
mlcommons/medperf
An open benchmarking platform for medical artificial intelligence using Federated Evaluation.
mlcommons/storage
MLPerf® Storage Benchmark Suite
mlcommons/chakra
Repository for MLCommons Chakra schema and tools
mlcommons/training_policies
Issues related to MLPerf™ training policies, including rules and suggested changes
mlcommons/modelbench
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
mlcommons/inference_policies
Issues related to MLPerf™ Inference policies, including rules and suggested changes
mlcommons/mobile_app_open
Mobile App Open: the open-source app for running MLPerf™ Mobile benchmarks on smartphones and other mobile devices
mlcommons/hpc
Reference implementations of MLPerf™ HPC training benchmarks
mlcommons/logging
MLPerf™ logging library (a minimal usage sketch follows the repository list)
mlcommons/policies
General policies for MLPerf™ including submission rules, coding standards, etc.
mlcommons/dynabench
mlcommons/power-dev
Dev repo for power measurement for the MLPerf™ benchmarks
mlcommons/cm4mlops
Legacy CM repository with a collection of portable, reusable, and cross-platform CM automations for MLOps and MLPerf that simplify building, benchmarking, and optimizing AI systems across diverse models, datasets, software, and hardware
mlcommons/ailuminate
AILuminate v1.1 is an AI risk assessment benchmark suite developed with broad involvement from leading AI companies, academia, and civil society.
mlcommons/cm4mlperf-results
CM interface and automation recipes to analyze MLPerf Inference, Tiny, and Training results. The goal is to make it easier for the community to visualize, compare, and reproduce MLPerf results and to add derived metrics such as Performance/Watt or Performance/$.
mlcommons/GaNDLF-Synth
Extension for GaNDLF [gandlf.org] to enable synthesis.
mlcommons/submissions_algorithms
mlcommons/mlcflow
MLCFlow: Simplifying MLPerf Automations
mlcommons/mlperf-automations
This repository contains automation scripts designed to run MLPerf Inference benchmarks. Originally developed for the Collective Mind (CM) automation framework, these scripts have been adapted to leverage the MLC automation framework, maintained by the MLCommons Benchmark Infrastructure Working Group.
mlcommons/datasets-knowledge-graphs
mlcommons/inference_results_visualization_template
mlcommons/mlperf_inference_submissions
mlcommons/mlperf_inference_test_submissions_v5.0
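
Following up on the Croissant entry above, here is a minimal sketch of reading a Croissant dataset description with the mlcroissant Python package from mlcommons/croissant. The file path "my_dataset/croissant.json" and the record-set name "default" are placeholders, and exact constructor argument names may vary across mlcroissant releases, so treat this as a sketch rather than a verified recipe.

    # Load a Croissant JSON-LD description and stream records from one record set.
    # "my_dataset/croissant.json" and "default" are placeholder names.
    import mlcroissant as mlc

    dataset = mlc.Dataset(jsonld="my_dataset/croissant.json")

    # Dataset-level metadata (name, description, license, ...) comes from the
    # metadata layer of the Croissant file.
    print(dataset.metadata.name)

    # Iterate over records defined by the structure and ML-semantics layers.
    for i, record in enumerate(dataset.records(record_set="default")):
        print(record)
        if i >= 4:  # only peek at the first few records
            break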
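
A second sketch shows the mllog helper from mlcommons/logging emitting MLPerf-style structured log lines. The output filename and the "resnet" benchmark value are placeholders; the constant names are assumptions based on common usage in the MLPerf training reference implementations.

    # Emit MLPerf-style structured log lines for a hypothetical training run.
    from mlperf_logging import mllog

    mllog.config(filename="mlperf_run.log")  # placeholder output file
    mllogger = mllog.get_mllogger()

    mllogger.start(key=mllog.constants.INIT_START)
    mllogger.event(key=mllog.constants.SUBMISSION_BENCHMARK, value="resnet")
    mllogger.end(key=mllog.constants.INIT_STOP)

    mllogger.start(key=mllog.constants.RUN_START)
    # ... training loop would go here ...
    mllogger.end(key=mllog.constants.RUN_STOP, metadata={"status": "success"})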
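
Finally, a minimal LoadGen harness sketch for mlcommons/inference: a dummy system-under-test that answers every query with an empty response, just to show the shape of the mlperf_loadgen Python API. The sample counts are placeholders, and the ConstructSUT/ConstructQSL argument lists have changed slightly across LoadGen versions, so this is a sketch rather than a drop-in harness.

    # Dummy MLPerf LoadGen harness: no real model, every query gets an empty answer.
    import array
    import mlperf_loadgen as lg

    _inflight = []  # keep response buffers alive until LoadGen consumes them

    def issue_queries(query_samples):
        responses = []
        for sample in query_samples:
            buf = array.array("B", [0])  # placeholder "prediction"
            _inflight.append(buf)
            addr, _ = buf.buffer_info()
            responses.append(lg.QuerySampleResponse(sample.id, addr, buf.itemsize * len(buf)))
        lg.QuerySamplesComplete(responses)

    def flush_queries():
        pass

    def load_samples(sample_indices):
        pass  # a real QSL would stage these samples in memory

    def unload_samples(sample_indices):
        pass

    settings = lg.TestSettings()
    settings.scenario = lg.TestScenario.Offline
    settings.mode = lg.TestMode.PerformanceOnly

    sut = lg.ConstructSUT(issue_queries, flush_queries)
    qsl = lg.ConstructQSL(1024, 128, load_samples, unload_samples)  # placeholder sample counts
    lg.StartTest(sut, qsl, settings)
    lg.DestroyQSL(qsl)
    lg.DestroySUT(sut)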