dwzhu-pku's Stars
mem0ai/mem0
The Memory layer for your AI apps
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
jbhuang0604/awesome-tips
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
multimodal-art-projection/MAP-NEO
zhijing-jin/nlp-phd-global-equality
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
microsoft/MInference
To speed up long-context LLM inference, MInference computes attention with approximate, dynamic sparse methods, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
okhat/blog
TIGER-AI-Lab/LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
HaozheZhao/UltraEdit
metame-ai/awesome-llm-plaza
Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g., LLMs for coding, robotics, reasoning, multimodal, etc.
google-deepmind/loft
LOFT: A 1 Million+ Token Long-Context Benchmark
KbsdJames/Awesome-LLM-Preference-Learning
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
hkust-nlp/llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
AI21Labs/Parallel-Context-Windows
llyx97/TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
MozerWang/Loong
[EMNLP 2024 Main] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
alonj/Same-Task-More-Tokens
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
Zefan-Cai/Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙 Awesome LLM KV Cache Papers with Codes.
mutonix/pyramidinfer
Yifan-Song793/GoodBadGreedy
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
WeiminXiong/IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement
KbsdJames/Omni-MATH
The official repository of the Omni-MATH benchmark.
chenllliang/MMEvalPro
Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
FranxYao/Retrieval-Head-with-Flash-Attention
Efficient retrieval head analysis with Triton flash attention that supports top-K probabilities
geronimi73/accelerate_tricks