yaozengwei

Xiaomi CorporationBeijing

yaozengwei's Stars

ossrs/srs
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Language:C++25.5k 844 1.4k5.4k
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++3.2k 51 488380
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.2k 52 218420
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2k 49 126319
jeonsworld/ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
Language:Jupyter Notebook1.9k 13 55364
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language:Cuda1.1k 77 379213
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
Language:C++996 36 143154
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language:Python936 44 410214
k2-fsa/icefall
Language:Python902 48 650287
Snowdar/asv-subtools
An Open Source Tools for Speaker Recognition
Language:Python592 21 52135
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Language:C++534 33 196107
alibaba-damo-academy/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Language:Python292 16 4223
Amshaker/SwiftFormer
[ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Language:Python246 6 1425
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Language:C++186 7 3735
k2-fsa/libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Language:Python172 7 610
CorentinJ/librispeech-alignments
Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset
Language:Python149 4 823
csukuangfj/transducer-loss-benchmarking
Language:Python64 5 810
k2-fsa/text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
Language:Python57 12 1314
danpovey/quantization
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
Language:Python50 7 03
k2-fsa/multi_quantization
Language:Python41 8 09
csukuangfj/kaldilm
Python wrapper for kaldi's arpa2fst
Language:C++38 4 126
xunguangwang/ProS-GAN
[CVPR 2021] Official repository for "Prototype-supervised Adversarial Network for Targeted Attack of Deep Hashing"
Language:Python35 1 515
csukuangfj/kaldi-hmm-gmm
Language:C++25 5 1
winlinvip/srs-k2
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
Language:Go20 3 13
k2-fsa/divide_lm
Language:Python4 6 03
tuyanglin/Fingerprint-Restoration
Fingerprint Restoration using Cubic Bezier Curve
Language:Python4
csukuangfj/kfj-vim
My vim settings.
Language:Shell2 2 01
csukuangfj/piper-phonemize
C++ library for converting text to phonemes for Piper
Language:C++1 0 0
Jarellin/HITszDailyHealth
哈工大深圳每日健康上报
Language:Python1 0 00
yilinyl/hybrid_nmm
A hybrid framework (neural mass model + ML) for SC-to-FC prediction
Language:Python1 1 00

yaozengwei

yaozengwei's Stars

ossrs/srs

k2-fsa/sherpa-onnx

asteroid-team/asteroid

lifeiteng/vall-e

jeonsworld/ViT-pytorch

k2-fsa/k2

k2-fsa/sherpa-ncnn

lhotse-speech/lhotse

k2-fsa/icefall

Snowdar/asv-subtools

k2-fsa/sherpa

alibaba-damo-academy/FunCodec

Amshaker/SwiftFormer

csukuangfj/kaldifeat

k2-fsa/libriheavy

CorentinJ/librispeech-alignments

csukuangfj/transducer-loss-benchmarking

k2-fsa/text_search

danpovey/quantization

k2-fsa/multi_quantization

csukuangfj/kaldilm

xunguangwang/ProS-GAN

csukuangfj/kaldi-hmm-gmm

winlinvip/srs-k2

k2-fsa/divide_lm

tuyanglin/Fingerprint-Restoration

csukuangfj/kfj-vim

csukuangfj/piper-phonemize

Jarellin/HITszDailyHealth

yilinyl/hybrid_nmm