Pinned Repositories
android-CardView
deeplearning-cfn
Distributed Deep Learning on AWS Using CloudFormation (CFN), MXNet and TensorFlow
horovod
Distributed training framework for TensorFlow, Keras, and PyTorch.
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
mxnet-benchmark
MXNet distributed training using Horovod
mxnet-build-script
Script to build the MXNet pip wheel
pow3
A Low Power Finite State Machine Encoding Package for Sequential Logic Synthesis
tensorflow
An Open Source Machine Learning Framework for Everyone
tvm
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
apeforest's Repositories
apeforest/mxnet-build-script
Script to build the MXNet pip wheel
apeforest/mxnet-benchmark
MXNet distributed training using Horovod
apeforest/horovod
Distributed training framework for TensorFlow, Keras, and PyTorch.
apeforest/incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
apeforest/tensorflow
An Open Source Machine Learning Framework for Everyone
apeforest/mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
apeforest/assignment1
Assignment 1 for the CMU 15-418 course
apeforest/byteps
A high performance and general PS framework for distributed training
apeforest/d2l-en
Dive into Deep Learning: an interactive deep learning book with code, math, and discussions
apeforest/deeplearning-benchmark
apeforest/dlaicourse
Notebooks for learning deep learning
apeforest/dlpack
RFC for common in-memory tensor structure and operator interface for deep learning system
apeforest/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
apeforest/doraemon
This is my toolbox for debugging and testing.
apeforest/elasticdl
Kubernetes-native Deep Learning Framework
apeforest/FAR-HO
Gradient based hyperparameter optimization & meta-learning package for TensorFlow
apeforest/gluon-cv
Gluon CV Toolkit
apeforest/gluon-nlp
NLP made easy
apeforest/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
apeforest/machine-learning-systems-design
A booklet on machine learning systems design with exercises
apeforest/Mnasnet.MXNet
A Gluon implementation of Mnasnet
apeforest/mpitutorial
MPI programming lessons in C and executable code examples
apeforest/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
apeforest/NanoGpt-JAX
JAX implementation of nanoGPT by Andrej Karpathy
apeforest/new-docs
https://beta.mxnet.io/
apeforest/parallax
A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
apeforest/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
apeforest/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
apeforest/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
apeforest/workshop
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker