junxu

cmccsuzhou

junxu's Stars

PipeFusion/PipeFusion
A Suite of Parallel Approaches for Inference of Diffusion Transformer Models on GPU Clusters
Language:Python945
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
Language:Python4.8k518
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
Language:Python9.3k590
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism
Language:Python37
google/vizier
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
Language:Python1.2k71
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook4.5k296
bayesoptbook/bayesoptbook.github.io
Companion webpage for the book "Bayesian Optimization" by Roman Garnett
Language:HTML86142
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook13.6k1.2k
byungsoo-oh/ml-systems-papers
Curated collection of papers in machine learning systems
745
microsoft/triton-shared
Shared Middle-Layer for Triton Compilation
Language:MLIR13226
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python1.8k143
openobserve/openobserve
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
Language:Rust10.2k364
OpenDevin/OpenDevin
🐚 OpenDevin: Code Less, Make More
Language:Python28.4k3.3k
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Language:Python11.9k1.2k
astra-sim/astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
Language:C++21287
opencomputeproject/OCP-NET-Falcon
348
SymbioticLab/frdma_benchmark
Language:C62
tkn-tub/ns3-gym
ns3-gym - The Playground for Reinforcement Learning in Networking Research
Language:C++511196
hust-diangroup/ns3-ai
Enable the interaction between ns-3 and popular frameworks using Python, which mean you can train and test your AI algorithms in ns-3 without changing any frameworks you are using now!
Language:C++21076
Terabit-Ethernet/hostCC
hostCC is a congestion control architecture which handles host congestion, along with in-network congestion
Language:Shell345
volcengine/veScale
A PyTorch Native LLM Training Framework
Language:Python48119
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
1.9k21
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
1876
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Language:Python54522
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python8.7k790
LargeWorldModel/LWM
Language:Python7k540
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
Language:Python11k1.2k
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook2k130
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language:Python2k177
TencentARC/PhotoMaker
PhotoMaker
Language:Jupyter Notebook8.6k675

junxu

junxu's Stars

PipeFusion/PipeFusion

Lyken17/pytorch-OpCounter

bentoml/OpenLLM

feifeibear/Odysseus-Transformer

google/vizier

tencent-ailab/IP-Adapter

bayesoptbook/bayesoptbook.github.io

KindXiaoming/pykan

byungsoo-oh/ml-systems-papers

microsoft/triton-shared

eric-mitchell/direct-preference-optimization

openobserve/openobserve

OpenDevin/OpenDevin

princeton-nlp/SWE-agent

astra-sim/astra-sim

opencomputeproject/OCP-NET-Falcon

SymbioticLab/frdma_benchmark

tkn-tub/ns3-gym

hust-diangroup/ns3-ai

Terabit-Ethernet/hostCC

volcengine/veScale

layerdiffusion/LayerDiffuse

dongzhuoyao/awesome-flow-matching

willisma/SiT

karpathy/minbpe

LargeWorldModel/LWM

ludwig-ai/ludwig

FasterDecoding/Medusa

ModelTC/lightllm

TencentARC/PhotoMaker