junxu's Stars
PipeFusion/PipeFusion
A Suite of Parallel Approaches for Inference of Diffusion Transformer Models on GPU Clusters
Lyken17/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism
google/vizier
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
bayesoptbook/bayesoptbook.github.io
Companion webpage for the book "Bayesian Optimization" by Roman Garnett
KindXiaoming/pykan
Kolmogorov Arnold Networks
byungsoo-oh/ml-systems-papers
Curated collection of papers in machine learning systems
microsoft/triton-shared
Shared Middle-Layer for Triton Compilation
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
openobserve/openobserve
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
OpenDevin/OpenDevin
🐚 OpenDevin: Code Less, Make More
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
astra-sim/astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
opencomputeproject/OCP-NET-Falcon
SymbioticLab/frdma_benchmark
tkn-tub/ns3-gym
ns3-gym - The Playground for Reinforcement Learning in Networking Research
hust-diangroup/ns3-ai
Enable the interaction between ns-3 and popular frameworks using Python, which mean you can train and test your AI algorithms in ns-3 without changing any frameworks you are using now!
Terabit-Ethernet/hostCC
hostCC is a congestion control architecture which handles host congestion, along with in-network congestion
volcengine/veScale
A PyTorch Native LLM Training Framework
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
LargeWorldModel/LWM
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
TencentARC/PhotoMaker
PhotoMaker