lambda7xx's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
bytedance/monolith
A Lightweight Recommendation System
NVIDIA/Cosmos
Cosmos is a world model development platform consisting of world foundation models, tokenizers, and a video processing pipeline, purpose-built to accelerate Physical AI development at robotics and AV labs. The repository enables end users to run the Cosmos models, run inference scripts, and generate videos.
MingchaoZhu/DeepLearning
Python code for the book Deep Learning (the "flower book"): mathematical derivations, principle analysis, and source-level implementations.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
huggingface/search-and-learn
Recipes to scale inference-time compute of open models
FoundationVision/Infinity
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for educational purposes
stanford-crfm/levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
MarioSieg/magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
WooooDyy/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
rkinas/triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
hustvl/LightningDiT
[arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
pytorch/torchft
PyTorch per step fault tolerance (actively under development)
yuandong-tian/arXiv_recbot
A Telegram bot to recommend arXiv papers
ServiceNow/AgentLab
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
ezelikman/STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
MLSys-Learner-Resources/Awesome-MLSys-Blogger
A curated collection of noteworthy MLSys bloggers (algorithms/systems)
DefTruth/cuffpa-py
📚[WIP] FFPA: Yet Another Faster Flash Prefill Attention with O(1)🎉GPU SRAM complexity for headdim > 256, 1.5x~2x🎉faster vs SDPA EA.
LemonTwoL/ReNeg
ReNeg: Learning Negative Embedding with Reward Guidance
thuwzy/ZhuSuan-PyTorch
An Elegant Library for Bayesian Deep Learning in PyTorch
feifeibear/DPSKV3MFU
Estimate MFU for DeepSeekV3
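MFU (Model FLOPs Utilization) is the ratio of the FLOPs a training run actually sustains to the hardware's peak FLOPs. A minimal sketch of the estimate, using the common ~6N FLOPs-per-token rule for dense training; the parameter count, throughput, and peak-FLOPs numbers below are illustrative assumptions, not measurements from the DPSKV3MFU repo:

```python
# Minimal MFU (Model FLOPs Utilization) estimate.
# MFU = achieved model FLOPs per second / peak hardware FLOPs per second.
# Numbers below are illustrative assumptions, not DeepSeek-V3 measurements.

def estimate_mfu(params: float, tokens_per_sec: float, peak_flops: float) -> float:
    """Rough MFU using the common ~6*N FLOPs-per-token rule for
    training (forward + backward) of a dense model with N parameters."""
    achieved_flops_per_sec = 6 * params * tokens_per_sec
    return achieved_flops_per_sec / peak_flops

# Hypothetical example: 37e9 active parameters, 1000 tokens/s per GPU,
# on an accelerator with ~989e12 peak BF16 FLOPs.
mfu = estimate_mfu(params=37e9, tokens_per_sec=1000, peak_flops=989e12)
print(f"MFU ≈ {mfu:.1%}")
```

MoE models complicate this: only the active parameters per token enter the FLOPs count, which is why the sketch above takes a single `params` value rather than total parameters.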
thomaschlt/mla.c
Implementation from scratch in C of the multi-head latent attention (MLA) described in the DeepSeek-V3 technical report.
cylinbao/cylinbao.github.io