BigCode Project
BigCode Project is an open scientific collaboration run by Hugging Face and ServiceNow Research, focused on open and responsible development of LLMs for code.
Pinned Repositories
bigcode-dataset
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
Megatron-LM
Ongoing research training transformer models at scale
octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
starcoder
Home of StarCoder: fine-tuning & inference!
starcoder.cpp
C++ implementation for 💫StarCoder
starcoder2
Home of StarCoder2!
starcoder2-self-align
[NeurIPS'24] Fully Transparent Self-Alignment for Code Generation
the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
BigCode Project's Repositories
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
bigcode-project/starcoder2
Home of StarCoder2!
bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
bigcode-project/starcoder.cpp
C++ implementation for 💫StarCoder
bigcode-project/octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
bigcode-project/Megatron-LM
Ongoing research training transformer models at scale
bigcode-project/bigcode-dataset
bigcode-project/selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
bigcode-project/bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
bigcode-project/jupytercoder
bigcode-project/bigcode-analysis
Repository for analysis and experiments in the BigCode project.
bigcode-project/the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
bigcode-project/astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
bigcode-project/bigcode-encoder
bigcode-project/transformers
bigcode-project/bigcode-website
Source of the website of the BigCode project.
bigcode-project/bigcodebench-annotation
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
bigcode-project/bigcode-inference-benchmark
bigcode-project/bigcode-tokenizer
bigcode-project/pii-lib
Code for PII detection and redaction in code datasets
bigcode-project/admin
A place for generic issues and administrative things.
bigcode-project/opt-out-v2
Repository for opt-out requests.
bigcode-project/Megatron-LM-deprecated
bigcode-project/bigcode-notebooks
bigcode-project/text-generation-inference
Large Language Model Text Generation Inference
bigcode-project/bigcode-data-mix
bigcode-project/bigcode-demo
A place to build and share model demos
bigcode-project/search