zeyuliu1037's Stars

meta-llama/llama
Inference code for Llama models
Language: Python 55.8k 521 9629.5k
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook 28.1k 303 913.2k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python 19.6k 302 1.4k2.5k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python 16k 108 1k1.6k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language: Python 13.6k 101 1k1.1k
state-spaces/mamba
Mamba SSM architecture
Language: Python 12.7k 101 5131.1k
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
Language: HTML 10.7k 24 55111k
mistralai/mistral-inference
Official inference library for Mistral models
Language: Jupyter Notebook 9.6k 125 142846
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language: Python 6.6k 65 80362
paperswithcode/ai-deadlines
⏰ AI conference deadline countdowns
Language: JavaScript 5.6k 100 92962
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Language: Python 2.9k 33 133258
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language: Python 2.6k 12 173268
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling research for code and related datasets.
1.4k 35 7103
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Language: Python 1.4k 18 52143
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Language: Python 1.2k 27 4466
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
Language: Python 1.2k 13 26100
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
Language: Python 1.1k 12 5899
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
Language: Python 984 22 7052
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
880 15 749
mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
Language: Python 670 16 9665
lava-nc/lava
A Software Framework for Neuromorphic Computing
Language: Jupyter Notebook 555 28 314143
Beomi/InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Language: Python 336 8 2429
allenai/Holodeck
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
Language: Python 324 13 5429
yxli2123/LoftQ
Language: Python 193 4 3718
loongson-community/areweloongyet
Are we Loong yet? Your one-stop portal for following LoongArch upstream work.
Language: TypeScript 169 14 3317
microsoft/LongRoPE
LongRoPE is a novel method that can extend the context window of pre-trained LLMs to an impressive 2048k tokens.
Language: Python 87 2 710
wy1iu/butterfly-oft
Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"
72 10 50
goombalab/phi-mamba
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)
Language: Python 693
zju3dv/text_scene_motion
[CVPR 2024] Generating Human Motion in 3D Scenes from Text Descriptions
Language: Python 40 9 21
ee538/AutoGradingScript
AutoGradingScript for USC EE-538
Language: Python 5 2 02