Pinned Repositories
architecture-objective
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
biomedical
Tools for curating biomedical training data for large-scale language modeling
data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
promptsource
Toolkit for creating, sharing and using natural language prompts.
t-zero
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
xmtf
Crosslingual Generalization through Multitask Finetuning
BigScience Workshop's Repositories
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
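A minimal sketch of what inference through petals looks like, following the API documented in the repo README; the model name is illustrative, so check the README for checkpoints the public swarm currently serves:

    # Sketch: generate text over the public swarm (assumes `pip install petals`).
    from transformers import AutoTokenizer
    from petals import AutoDistributedModelForCausalLM

    model_name = "bigscience/bloom"  # illustrative; pick a swarm-hosted checkpoint
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    # Only a small fraction of the weights is loaded locally; remote peers serve the rest.
    model = AutoDistributedModelForCausalLM.from_pretrained(model_name)

    inputs = tokenizer("A cat sat on", return_tensors="pt")["input_ids"]
    outputs = model.generate(inputs, max_new_tokens=5)
    print(tokenizer.decode(outputs[0]))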
bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
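A minimal sketch of the core promptsource API, as documented in the repo README (assumes ag_news templates ship with the official collection, which they do at the time of writing):

    # Sketch: apply a crowd-sourced template to a dataset example
    # (assumes `pip install promptsource datasets`).
    from datasets import load_dataset
    from promptsource.templates import DatasetTemplates

    example = load_dataset("ag_news", split="train")[0]
    ag_news_prompts = DatasetTemplates("ag_news")
    template = ag_news_prompts[ag_news_prompts.all_template_names[0]]

    # `apply` renders the example into an (input, target) pair of strings.
    input_text, target_text = template.apply(example)
    print(input_text)
    print(target_text)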
bigscience-workshop/Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning
bigscience-workshop/biomedical
Tools for curating biomedical training data for large-scale language modeling
bigscience-workshop/t-zero
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
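The released T0 checkpoints are published on the Hugging Face Hub, so a quick way to try the model is the snippet from the model card (shown here with the `bigscience/T0_3B` checkpoint; larger variants exist):

    # Sketch: zero-shot inference with a released T0 checkpoint
    # (assumes `pip install transformers torch`).
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained("bigscience/T0_3B")
    model = AutoModelForSeq2SeqLM.from_pretrained("bigscience/T0_3B")

    prompt = ("Is this review positive or negative? "
              "Review: this is the best cast iron skillet you will ever buy")
    inputs = tokenizer.encode(prompt, return_tensors="pt")
    outputs = model.generate(inputs)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))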
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
bigscience-workshop/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
bigscience-workshop/architecture-objective
bigscience-workshop/lam
Libraries, Archives and Museums (LAM)
bigscience-workshop/data_tooling
Tools for managing datasets for governance and training.
bigscience-workshop/multilingual-modeling
BLOOM+1: Adapting the BLOOM model to support a new, unseen language
bigscience-workshop/evaluation
Code and Data for Evaluation WG
bigscience-workshop/metadata
Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.
bigscience-workshop/model_card
bigscience-workshop/tokenization
bigscience-workshop/bloom-dechonk
A repo for running model shrinking experiments
bigscience-workshop/carbon-footprint
A repository for `codecarbon` logs.
bigscience-workshop/catalogue_data
Scripts to prepare catalogue data
bigscience-workshop/historical_texts
BigScience working group on language models for historical texts
bigscience-workshop/massive-probing-framework
Framework for BLOOM probing
bigscience-workshop/pii_processing
PII processing code to detect and remediate PII in BigScience datasets. Reference implementation for the PII Hackathon
bigscience-workshop/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
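This fork tracks upstream 🤗 Transformers, so the canonical quickstart applies; a minimal sketch (the pipeline downloads a default sentiment model on first use):

    # Sketch: one-line inference via the pipeline API
    # (assumes `pip install transformers torch`).
    from transformers import pipeline

    classifier = pipeline("sentiment-analysis")
    print(classifier("BigScience was a remarkable collaboration."))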
bigscience-workshop/training_dynamics
bigscience-workshop/bibliography
A list of BigScience publications
bigscience-workshop/datasets_stats
Generate statistics over datasets used in the context of BigScience
bigscience-workshop/evaluation-robustness-consistency
Tools for evaluating model robustness and consistency
bigscience-workshop/multilingual-modeling-1
bigscience-workshop/interpretability-ideas