Pinned Repositories
CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
flash-attention
Fast and memory-efficient exact attention
guanzhchen.github.io
PETuning
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
InfiniteBench
Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
LongAlign
LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation