Pinned Repositories
asv-subtools
An Open Source Toolkit for Speaker Recognition
AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
cube-studio
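A cloud-native, one-stop machine learning platform: multi-tenancy, data assets, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, multi-cluster scheduling, multiple project and resource groups, edge computing, real-time large-model training, and an AI app store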
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
kaldi_org
This is now the official location of the Kaldi project.
LeetcodeTop
A collection of the high-frequency LeetCode problems most often asked by major internet companies 🔥 Recommended practice site: https://www.lintcode.com/?utm_source=tf-github-codetop
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
whisper-jax
Faster Whisper inference
zh-google-styleguide
Google open-source project style guides (Chinese edition)
donstang's Repositories
donstang/zh-google-styleguide
Google open-source project style guides (Chinese edition)
donstang/3m-asr
donstang/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
donstang/asr-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
donstang/BeamformIt
BeamformIt acoustic beamforming software
donstang/CAT
A CRF-based ASR Toolkit
donstang/Conv-TasNet
Speech Separation
donstang/covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
donstang/DeepLearningExamples
Deep Learning Examples
donstang/dockerswarm.rocks
Docker Swarm mode rocks! Ideas, tools and recipes. Get a production-ready, distributed, HTTPS served, cluster in minutes, not weeks.
donstang/e2e_lfmmi
This is the implementation of the paper "Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI", submitted to ICASSP 2022
donstang/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
donstang/flashlight
A C++ standalone library for machine learning
donstang/jtubespeech
Crawl YouTube videos
donstang/kubefed
Kubernetes Cluster Federation
donstang/Lichee
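A multimodal content understanding algorithm framework, with modules for data processing, pre-trained models, common models, and model acceleration.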
donstang/lingvo
Lingvo
donstang/NeuralNLP-NeuralClassifier
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
donstang/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
donstang/pychain
PyTorch implementation of LF-MMI for End-to-end ASR
donstang/R-FCN
R-FCN: Object Detection via Region-based Fully Convolutional Networks
donstang/shellcheck
ShellCheck, a static analysis tool for shell scripts
donstang/silk-v3-decoder
[Skype Silk Codec SDK] Decode Silk v3 audio files (such as WeChat amr and aud files, and QQ slk files) and convert them to other formats (such as mp3). Supports batch conversion.
donstang/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple apps from Tencent, such as Mobile QQ, Weishi, and Pitu. Contributions are welcome: collaborate with us to make TNN a better framework.
donstang/tts-survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
donstang/TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, Decoders, etc.) on CPU and GPU.
donstang/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
donstang/UniPunc
The case study and multilingual performance of the ICASSP submission
donstang/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
donstang/wenet-onnx