LanDisen

Fudan UniversityShanghai, China

LanDisen's Stars

deepseek-ai/DeepSeek-V3
Language:Python95.2k 742 49815.4k
triton-lang/triton
Development repository for the Triton language and compiler
Language:MLIR15.1k 195 1.7k1.9k
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Language:Python4.5k 29 52244
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language:Python3.9k 77 191243
pytorch/torchtitan
A PyTorch native library for large model training
Language:Python3.6k 51 290330
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.7k 21 7975
srush/Triton-Puzzles
Puzzles for learning Triton
Language:Jupyter Notebook1.6k 11 17124
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
Language:Python1.5k 10 3989
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Language:Python1.5k 18 8282
hao-ai-lab/FastVideo
FastVideo is a lightweight framework for accelerating large video diffusion models.
Language:Python1.3k 16 15877
lucidrains/titans-pytorch
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch
Language:Python1.3k 29 30112
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language:Python1k 33 3246
WECENG/ticket-purchase
大麦自动抢票，支持人员、城市、日期场次、价格选择
Language:Python926 4 64132
mengchaoheng/SCUT_thesis
华南理工大学硕博士学位论文模板(LaTeX)。Latex templates for the thesis of South China University of Technology
Language:TeX371 6 5765
tensorgi/T6
The official implementation of Tensor ProducT ATTenTion Transformer (T6)
Language:Python354 8 532
rkinas/triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
Language:Python330 5 021
facebookresearch/memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense feed-forward layers, providing dedicated capacity to store and retrieve information cheaply.
Language:Python312 8 118
SakanaAI/evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
Language:Python302 10 331
fla-org/flame
🔥 A minimal training framework for scaling FLA models
Language:Python94 7 1015
ShevonKuan/SCUT-thesis
(更新于2024年) 华南理工大学 LaTeX 论文模板项目，star一下嘛~(☆▽☆)，应该是最完善也是最容易使用的华工本科生论文模板了
Language:TeX91 1 57
ssmisya/PRMBench
The official code repository for PRMBench.
Language:Jupyter Notebook68 1 55
hkust-nlp/mstar
M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
56 2 01
OpenNLPLab/HGRN2
HGRN2: Gated Linear RNNs with State Expansion
Language:Python54 2 12
OChicken/SCUT-Bachelor-Thesis-Template
Latex template for the bachelor graduation thesis of South China University of Technology (SCUT) 华南理工大学本科毕业论文LaTeX模板
Language:TeX50 3 312
hychaochao/EMMA
The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"
Language:Python47 1 11
Jellyfish042/RWKV_Othello
A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. Its performance scales with the number of test-time tokens.
Language:Python39 1 03
kazuki-irie/kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
Language:Jupyter Notebook24 4 02
abdelfattah-lab/attamba
Language:Python14 2 11
h-hg/latex-scut-bachelor-thesis
华南理工大学本科毕业论文模板
Language:TeX14 1 24
tile-lang/tile
tile compiler frontend with antlr4
Language:Java4 0 40

LanDisen

LanDisen's Stars

deepseek-ai/DeepSeek-V3

triton-lang/triton

facebookresearch/lingua

NVlabs/Sana

pytorch/torchtitan

FoundationVision/LlamaGen

srush/Triton-Puzzles

PRIME-RL/PRIME

LTH14/mar

hao-ai-lab/FastVideo

lucidrains/titans-pytorch

lucidrains/transfusion-pytorch

WECENG/ticket-purchase

mengchaoheng/SCUT_thesis

tensorgi/T6

rkinas/triton-resources

facebookresearch/memory

SakanaAI/evo-memory

fla-org/flame

ShevonKuan/SCUT-thesis

ssmisya/PRMBench

hkust-nlp/mstar

OpenNLPLab/HGRN2

OChicken/SCUT-Bachelor-Thesis-Template

hychaochao/EMMA

Jellyfish042/RWKV_Othello

kazuki-irie/kv-memory-brain

abdelfattah-lab/attamba

h-hg/latex-scut-bachelor-thesis

tile-lang/tile