LanDisen's Stars
deepseek-ai/DeepSeek-V3
triton-lang/triton
Development repository for the Triton language and compiler
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
pytorch/torchtitan
A PyTorch native library for large model training
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
srush/Triton-Puzzles
Puzzles for learning Triton
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
hao-ai-lab/FastVideo
FastVideo is a lightweight framework for accelerating large video diffusion models.
lucidrains/titans-pytorch
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
WECENG/ticket-purchase
大麦自动抢票,支持人员、城市、日期场次、价格选择
mengchaoheng/SCUT_thesis
华南理工大学硕博士学位论文模板(LaTeX)。Latex templates for the thesis of South China University of Technology
tensorgi/T6
The official implementation of Tensor ProducT ATTenTion Transformer (T6)
rkinas/triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
facebookresearch/memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense feed-forward layers, providing dedicated capacity to store and retrieve information cheaply.
SakanaAI/evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
fla-org/flame
🔥 A minimal training framework for scaling FLA models
ShevonKuan/SCUT-thesis
(更新于2024年) 华南理工大学 LaTeX 论文模板项目,star一下嘛~(☆▽☆),应该是最完善也是最容易使用的华工本科生论文模板了
ssmisya/PRMBench
The official code repository for PRMBench.
hkust-nlp/mstar
M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
OpenNLPLab/HGRN2
HGRN2: Gated Linear RNNs with State Expansion
OChicken/SCUT-Bachelor-Thesis-Template
Latex template for the bachelor graduation thesis of South China University of Technology (SCUT) 华南理工大学 本科毕业论文LaTeX模板
hychaochao/EMMA
The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"
Jellyfish042/RWKV_Othello
A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. Its performance scales with the number of test-time tokens.
kazuki-irie/kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
abdelfattah-lab/attamba
h-hg/latex-scut-bachelor-thesis
华南理工大学本科毕业论文模板
tile-lang/tile
tile compiler frontend with antlr4