intfloat's Stars
meta-llama/llama
Inference code for Llama models
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
meta-llama/codellama
Inference code for CodeLlama models
Stability-AI/StableLM
StableLM: Stability AI Language Models
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods on single- or multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions (e.g., HF TGI, vLLM) for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat & pretrained large language model proposed by Alibaba Cloud.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
mosaicml/llm-foundry
LLM training code for Databricks foundation models
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation, and Advertising, covering Embedding, Matching, Ranking (CTR/CVR prediction), Post-Ranking, Large Models (Generative Recommendation, LLMs), Transfer Learning, Reinforcement Learning, and more.
CStanKonrad/long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
jzbjyb/FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
facebookresearch/Sphere
Web-scale retrieval for knowledge-intensive NLP
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
LAION-AI/Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on a massive set of synthetic instructions to perform many millions of tasks
huggingface/OBELICS
Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.
thu-coai/PICL
Code for the ACL 2023 paper: Pre-Training to Learn in Context