hanlinxuy's Stars
QSCTech/zju-icicles
Zhejiang University course materials sharing project
HigherOrderCO/Bend
A massively parallel, high-level programming language
lltcggie/waifu2x-caffe
Caffe implementation of waifu2x
Buuntu/fastapi-react
🚀 Cookiecutter template for FastAPI + React projects, using PostgreSQL, SQLAlchemy, and Docker
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its applications
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
wolfparticle/machineLearningDeepLearning
Notes, slides, and assignments for Hung-yi Lee's 2021 Machine Learning / Deep Learning course
Ai00-X/ai00_server
The all-in-one RWKV runtime box with embeddings, RAG, AI agents, and more.
Leeroo-AI/mergoo
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
Cornell-RelaxML/QuIP
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"
FlagOpen/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
cryscan/web-rwkv
Implementation of the RWKV language model in pure WebGPU/Rust.
FasterDecoding/SnapKV
RWKV/RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
jshuadvd/LongRoPE
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
neromous/RWKV-Ouroboros
A project for real-time training of the RWKV model.
LuJunru/MemoChat
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
radarFudan/mamba
Dan-wanna-M/bnf_sampler
StarRing2022/RingRWKV
Fixes the RWKV compatibility issues in the official Transformers library, so that all converted RWKV model series can be deployed and fine-tuned through the RingRWKV library as easily as any other transformer model.
fsndzomga/rag-fastapi
A simple implementation of RAG (Retrieval Augmented Generation) using FastAPI and PostgreSQL
hanlinxuy/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.