intfloat's Stars
xai-org/grok-1
Grok open release
microsoft/autogen
A programming framework for agentic AI 🤖 PyPI: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine and an open-source alternative to Perplexity AI
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
microsoft/BitNet
Official inference framework for 1-bit LLMs
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Million-Length Context
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
gkamradt/LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
jxzhangjhu/Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval-augmented generation (RAG) work in large language models
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
mistralai/megablocks-public
NVIDIA/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
haoliuhl/ringattention
Large Context Attention
huggingface/huggingface-llama-recipes
ContextualAI/gritlm
Generative Representational Instruction Tuning
nomic-ai/contrastors
Train models contrastively in PyTorch
google-deepmind/loft
LOFT: A 1 Million+ Token Long-Context Benchmark
princeton-nlp/ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
microsoft/SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs) that uses a variety of games to test important agentic LLM capabilities. It is designed to be easy to use and to support future LLM development.
dwzhu-pku/LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)