xinlong-yang's Stars
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
UbiquitousLearning/Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
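The core idea behind SmoothQuant can be illustrated in a few lines: activation outliers are migrated into the weights via a per-channel scale `s_j = max|X_j|^alpha / max|W_j|^(1-alpha)`, leaving the matmul mathematically unchanged while making both operands easier to quantize. This is a minimal NumPy sketch of that smoothing step only (not the repo's API; shapes and names are illustrative):

```python
import numpy as np

def smooth(X, W, alpha=0.5):
    """Migrate activation outliers into weights, per input channel.

    X: (tokens, channels) activations; W: (channels, out) weights.
    Returns (X / s, diag(s) @ W) with (X / s) @ (diag(s) @ W) == X @ W.
    """
    act_max = np.abs(X).max(axis=0)   # per-channel activation range
    w_max = np.abs(W).max(axis=1)     # per-channel weight range
    s = act_max ** alpha / w_max ** (1 - alpha)
    return X / s, W * s[:, None]

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 4))
X[:, 2] *= 50                         # simulate an outlier channel
W = rng.normal(size=(4, 3))
Xs, Ws = smooth(X, W)
assert np.allclose(Xs @ Ws, X @ W)    # the product is unchanged
```

After smoothing, the outlier channel's activation range shrinks to roughly the geometric mean of the activation and weight ranges, which is what makes 8-bit post-training quantization of both tensors viable.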
mit-han-lab/duo-attention
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
alibaba/EasyRec
A framework for large scale recommendation algorithms.
Infini-AI-Lab/TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
zhentingqi/rStar
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
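The greedy variant of speculative decoding is easy to sketch: a cheap draft model proposes k tokens, the target model verifies them, and the longest agreeing prefix is accepted (with the first mismatch replaced by the target's own choice). The toy deterministic "models" below are stand-ins; the repo implements the full rejection-sampling scheme from the speculative-decoding papers:

```python
def draft_model(context, k):
    """Toy draft: propose the next k tokens via a simple counting rule."""
    out, cur = [], context[-1]
    for _ in range(k):
        cur = (cur + 1) % 10          # occasionally diverges from the target
        out.append(cur)
    return out

def target_model(context):
    """Toy target: greedy next token (counting, with a twist at 5)."""
    last = context[-1]
    return (last + 2) % 10 if last == 5 else (last + 1) % 10

def speculative_decode(context, steps, k=4):
    context = list(context)
    for _ in range(steps):
        proposal = draft_model(context, k)
        accepted = []
        for tok in proposal:
            expect = target_model(context + accepted)
            if tok != expect:
                accepted.append(expect)   # replace first mismatch, stop
                break
            accepted.append(tok)
        context.extend(accepted)
    return context
```

Each round accepts at least one token and produces exactly the sequence greedy decoding with the target alone would produce; the speedup comes from verifying a whole draft in one target forward pass instead of k sequential ones.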
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
haroldsultan/MCTS
Python Implementations of Monte Carlo Tree Search
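The four MCTS phases (selection via UCB1, expansion, simulation, backpropagation) fit in a short sketch. This toy version picks bits to maximize the count of 1s chosen; it is illustrative only and does not mirror the repo's own Node/State classes:

```python
import math, random

class Node:
    def __init__(self, bits=(), parent=None):
        self.bits, self.parent = bits, parent
        self.children, self.visits, self.value = [], 0, 0.0

    def expand(self):
        self.children = [Node(self.bits + (b,), self) for b in (0, 1)]

def ucb1(child, parent_visits, c=1.4):
    if child.visits == 0:
        return float("inf")
    exploit = child.value / child.visits
    explore = c * math.sqrt(math.log(parent_visits) / child.visits)
    return exploit + explore

def rollout(bits, depth):
    """Finish the game randomly; reward = number of 1s chosen."""
    tail = [random.randint(0, 1) for _ in range(depth - len(bits))]
    return sum(bits) + sum(tail)

def mcts(depth=3, iterations=200):
    root = Node()
    for _ in range(iterations):
        node = root
        while node.children:                       # 1. selection
            node = max(node.children, key=lambda ch: ucb1(ch, node.visits))
        if len(node.bits) < depth:                 # 2. expansion
            node.expand()
            node = random.choice(node.children)
        reward = rollout(node.bits, depth)         # 3. simulation
        while node:                                # 4. backpropagation
            node.visits += 1
            node.value += reward
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).bits[0]
```

Returning the most-visited (rather than highest-value) root child is the standard robust choice, since visit counts concentrate on the move UCB1 found most promising.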
YunjiaXi/DARE_code
EurekaLabsAI/micrograd
The Autograd Engine
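A scalar autograd engine in the micrograd style is small enough to sketch whole: each `Value` records its inputs and a local backward rule, and `backward()` applies the chain rule in reverse topological order. Minimal and illustrative, not the module's actual code:

```python
class Value:
    def __init__(self, data, parents=()):
        self.data, self.grad = data, 0.0
        self._parents = parents
        self._backward = lambda: None   # local chain-rule step, set by ops

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        topo, seen = [], set()
        def build(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    build(p)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

a, b = Value(2.0), Value(3.0)
c = a * b + a        # dc/da = b + 1 = 4, dc/db = a = 2
c.backward()
assert (a.grad, b.grad) == (4.0, 2.0)
```

Note the `+=` in every backward rule: a node used twice (like `a` above) accumulates gradient from both paths, which is why grads start at 0 rather than being assigned.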
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing" — an efficient, high-quality synthetic data generation pipeline.
JackHCC/PKU-Lessons-Summary
Summary of knowledge points, assignments, etc. for master's courses at the Peking University School of Software and Microelectronics (Integrated Circuit major).
AMD-AIG-AIMA/AMD-LLM
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
GATECH-EIC/Linearized-LLM
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
EurekaLabsAI/tensor
The Tensor (or Array)
Equationliu/Kangaroo
Implementation of Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
ShiArthur03/ShiArthur03
astramind-ai/Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
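BPE training itself is tiny: count adjacent token pairs over the byte sequence, merge the most frequent pair into a new token id, repeat. A minimal sketch in the spirit of minbpe (helper names are illustrative, not the repo's API):

```python
def get_pair_counts(ids):
    """Count occurrences of each adjacent token pair."""
    counts = {}
    for a, b in zip(ids, ids[1:]):
        counts[(a, b)] = counts.get((a, b), 0) + 1
    return counts

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

def train_bpe(text, num_merges):
    """Learn `num_merges` merge rules over the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))
    merges = {}
    for step in range(num_merges):
        counts = get_pair_counts(ids)
        if not counts:
            break
        pair = max(counts, key=counts.get)
        new_id = 256 + step          # new ids start after the 256 raw bytes
        merges[pair] = new_id
        ids = merge(ids, pair, new_id)
    return ids, merges
```

For example, `train_bpe("aaabdaaabac", 2)` first merges the byte pair `(97, 97)` ("aa") into token 256, then `(256, 97)` into 257, compressing the 11-byte input to 7 tokens.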
GAIR-NLP/anole
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
xinlong-yang/Noise_Dense_Retrieval
[ICCV2023] Prototypical Mixing and Retrieval-based Refinement for Label Noise-resistant Image Retrieval
karpathy/LLM101n
LLM101n: Let's build a Storyteller
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
WillDreamer/LOG