wildkid1024

MLSys | ML Acceleration |Inference engine

Capital Normal UniversityHaidian Beijing

Pinned Repositories

awesome-model-compression-and-acceleration
a list of awesome papers on deep model ompression and acceleration
0 1 00
DFI
Deep Fault Injection(DFI) is a framework which can help you train,verify,quantize and fault-inject for your deep neural network(DNN) models fastly and concisely!
Language:Python1 2 00
EmotiVoice-TensorRT
The faster EmotiVoice infer engine with ~8x speedup
Language:Python5 1 10
fastllm
纯c++实现，无第三方依赖的大模型库，支持CUDA加速，目前支持国产大模型ChatGLM-6B，MOSS; 可以在安卓设备上流畅运行ChatGLM-6B
Language:C++1 0 00
Neural-Networks-on-Chip
This is a collection of works on neural networks and neural accelerators.
0 1 00
NVpower
NVpower is a tool to measure the energy consumption of NVIDIA GPUs.
Language:Python0 2 00
Paddle2ONNX
ONNX Model Exporter for PaddlePaddle
Language:Python0 0 00
pymnn-llm
Language:Python2 1 02
weekly-papers
3 2 01
wildkid1024
0 2 00

wildkid1024/EmotiVoice-TensorRT
The faster EmotiVoice infer engine with ~8x speedup
Language:Python5 1 10
wildkid1024/weekly-papers
3 2 01
wildkid1024/pymnn-llm
Language:Python2 1 02
wildkid1024/DFI
Deep Fault Injection(DFI) is a framework which can help you train,verify,quantize and fault-inject for your deep neural network(DNN) models fastly and concisely!
Language:Python1 2 00
wildkid1024/fastllm
纯c++实现，无第三方依赖的大模型库，支持CUDA加速，目前支持国产大模型ChatGLM-6B，MOSS; 可以在安卓设备上流畅运行ChatGLM-6B
Language:C++1 0 00
wildkid1024/awesome-model-compression-and-acceleration
a list of awesome papers on deep model ompression and acceleration
0 1 00
wildkid1024/Neural-Networks-on-Chip
This is a collection of works on neural networks and neural accelerators.
0 1 00
wildkid1024/NVpower
NVpower is a tool to measure the energy consumption of NVIDIA GPUs.
Language:Python0 2 00
wildkid1024/Paddle2ONNX
ONNX Model Exporter for PaddlePaddle
Language:Python0 0 00
wildkid1024/wildkid1024
0 2 00