apeforest's Stars
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
amazon-science/chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
dottxt-ai/outlines
Structured Text Generation
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DeepRec-AI/DeepRec
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
alibaba/EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
alibaba/BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
BBuf/tvm_mlir_learn
compiler learning resources collect.
apeforest/incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
mosaicml/composer
Supercharge Your Model Training
https-deeplearning-ai/machine-learning-engineering-for-production-public
Public repo for DeepLearning.AI MLEP Specialization
couler-proj/couler
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
deepjavalibrary/djl
An Engine-Agnostic Deep Learning Framework in Java
apeforest/horovod
Distributed training framework for TensorFlow, Keras, and PyTorch.
apeforest/pow3
A Low Power Finite State Machine Encoding Package for Sequential Logic Synthesis
TalkAI/apache-mxnet-odsc-2018
Introduction to deep learning with Apache MXNet GLUON. MLP, CNN, RNN and Model Server
dmlc/gluon-nlp
NLP made easy
dmlc/gluon-cv
Gluon CV Toolkit
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
apache/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
szha/KDD18-Gluon
KDD18 Tutorial: Deep Learning and Natural Language Processing with Apache MXNet (Incubating) Gluon
d2l-ai/d2l-zh
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。