Pinned Repositories
Algorithm_Interview_Notes-Chinese
2018/2019/Campus Recruiting/Spring Recruiting/Autumn Recruiting/Algorithms/Machine Learning/Deep Learning/Natural Language Processing (NLP)/C/C++/Python/Interview Notes
algorithms_and_data_structures
160+ Algorithm & Data Structure Problems using C++
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
mlperf
NCF and Transformer models based on the MLPerf benchmark
op_bench
Resnext3d-for-video-classification
Using https://github.com/facebookresearch/ClassyVision to implement Resnext3d
XiaobingSuper's Repositories
XiaobingSuper/Resnext3d-for-video-classification
Using https://github.com/facebookresearch/ClassyVision to implement Resnext3d
XiaobingSuper/op_bench
XiaobingSuper/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
XiaobingSuper/examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
XiaobingSuper/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
XiaobingSuper/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
XiaobingSuper/fused_linear_triton
XiaobingSuper/HBONet
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
XiaobingSuper/ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
XiaobingSuper/incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
XiaobingSuper/inference
Reference implementations of inference benchmarks
XiaobingSuper/intel-extension-for-pytorch
XiaobingSuper/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
XiaobingSuper/llama.cpp
LLM inference in C/C++
XiaobingSuper/llm_dynamo
XiaobingSuper/mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
XiaobingSuper/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
XiaobingSuper/opacus
Training PyTorch models with differential privacy
XiaobingSuper/openNMT
XiaobingSuper/optimized-models
XiaobingSuper/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
XiaobingSuper/resnet50_lars_training
XiaobingSuper/RunningScript
XiaobingSuper/T5-NLP
XiaobingSuper/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
XiaobingSuper/tensorrtllm_backend
The Triton TensorRT-LLM Backend
XiaobingSuper/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
XiaobingSuper/tutorials
PyTorch tutorials.
XiaobingSuper/vision
Datasets, Transforms and Models specific to Computer Vision
XiaobingSuper/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs