Pinned Repositories
architecture-objective
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
biomedical
Tools for curating biomedical training data for large-scale language modeling
data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
promptsource
Toolkit for creating, sharing and using natural language prompts.
t-zero
Reproduce results and replicate the training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
xmtf
Crosslingual Generalization through Multitask Finetuning
BigScience Workshop's Repositories
bigscience-workshop/data_sourcing
This directory gathers the tools developed by the Data Sourcing Working Group
bigscience-workshop/bigscience-workshop.github.io
Alternative to https://github.com/Dynalon/mdwiki-seed
bigscience-workshop/amazon-sagemaker-mlflow-fargate
Managing your machine learning lifecycle with MLflow and Amazon SageMaker
bigscience-workshop/scaling-laws-tokenization
bigscience-workshop/codecarbon
Track emissions from Compute and recommend ways to reduce their impact on the environment.