reminisce's Stars
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
AmadeusChan/Awesome-LLM-System-Papers
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
thenifty/neon-guide
Makes ARM NEON documentation accessible (with examples)
wangshusen/DeepLearning
matterport/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
XiaoMi/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
Tencent/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
amdegroot/ssd.pytorch
A PyTorch Implementation of Single Shot MultiBox Detector
zhanghang1989/ResNeSt
ResNeSt: Split-Attention Networks
HuaizhengZhang/AI-System-School
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
hgt312/NumpyXBench
Benchmarks for NumPy compatible frameworks.
mli/d2l-1day-notebooks-zh
Notebooks for a single-day DL crash course in Chinese
d2l-ai/d2l-tvm
Dive into Deep Learning Compiler
toutiaoio/awesome-architecture
架构师技术图谱,助你早日成为架构师
dgasmith/opt_einsum
⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
bytedance/byteps
A high performance and generic framework for distributed DNN training
d2l-ai/d2l-book
Books with Jupyter notebooks
mli/deepnumpy-doc
Documents for MXNet's deepnumpy API
rougier/from-python-to-numpy
An open-access book on numpy vectorization techniques, Nicolas P. Rougier, 2017
sebastianstarke/AI4Animation
Bringing Characters to Life with Computer Brains in Unity
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
numpy/numpy
The fundamental package for scientific computing with Python.
uwsampl/relay-aot
An experimental ahead of time compiler for Relay.
dmlc/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
996icu/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.