zeyuliu1037's Stars
meta-llama/llama
Inference code for Llama models
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
state-spaces/mamba
Mamba SSM architecture
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
mistralai/mistral-inference
Official inference library for Mistral models
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
paperswithcode/ai-deadlines
⏰ AI conference deadline countdowns
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling research for code and related datasets.
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
mobiusml/hqq
Official implementation of Half-Quadratic Quantization (HQQ)
lava-nc/lava
A Software Framework for Neuromorphic Computing
Beomi/InfiniTransformer
Unofficial PyTorch/🤗 Transformers (Gemma/Llama3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
allenai/Holodeck
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
yxli2123/LoftQ
loongson-community/areweloongyet
Are we Loong yet? Your one-stop portal for following LoongArch upstream work.
microsoft/LongRoPE
LongRoPE is a novel method that extends the context window of pre-trained LLMs to 2048k tokens.
wy1iu/butterfly-oft
Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"
goombalab/phi-mamba
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)
zju3dv/text_scene_motion
[CVPR 2024] Generating Human Motion in 3D Scenes from Text Descriptions
ee538/AutoGradingScript
AutoGradingScript for USC EE-538