China-LiuXiaopeng's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
AlexeyAB/darknet
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet)
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
mozillazg/python-pinyin
Convert Chinese characters to pinyin (pypinyin)
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
chenyuntc/simple-faster-rcnn-pytorch
A simplified implementation of Faster R-CNN that replicates the performance of the original paper
Tencent/FaceDetection-DSFD
Tencent Youtu high-accuracy dual-branch face detector
liweiwei1419/LeetCode-Solutions-in-Good-Style
The home page has been updated; I hope it is helpful to everyone.
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
666DZY666/micronet
micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b) / ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular, and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT, fp32/fp16/int8 (PTQ calibration), op-adapt (upsample), dynamic_shape
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
SeanNaren/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
arkingc/note
Organized study notes 📚
ruotianluo/pytorch-faster-rcnn
Updated for PyTorch 1.0. Supports CPU test and demo. (Use detectron2; it's a masterpiece.)
google/gemmlowp
Low-precision matrix multiplication
facebookresearch/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
facebookresearch/VMZ
VMZ: Model Zoo for Video Modeling
athena-team/athena
An open-source implementation of a sequence-to-sequence-based speech processing engine
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
hirofumi0810/neural_sp
End-to-end ASR/LM implementation with PyTorch
Syencil/tensorRT
TensorRT-7 Network Lib, covering common object detection, keypoint detection, face detection, OCR, and more; can be trained on your own data
facebookresearch/gtn
Automatic differentiation with weighted finite-state transducers.
tencent-ailab/pika
A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi
cywang97/StreamingTransformer
dukebw/lintel
A Python module to decode video frames directly, using the FFmpeg C API.
RizhaoCai/PyTorch_ONNX_TensorRT
A tutorial about how to build a TensorRT Engine from a PyTorch Model with the help of ONNX
yaysummeriscoming/DALI_pytorch_demo
Example code showing how to use NVIDIA DALI in PyTorch, with fallback to torchvision. Contains a few differences from the official NVIDIA example, namely a fully CPU pipeline and improved memory usage
China-LiuXiaopeng/BraTS-DMFNet
ferreirafabio/video2tfrecord
Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training, e.g., a NN in TensorFlow. This implementation allows limiting the number of frames per video stored in the tfrecords.