jiaoza

jiaoza's Stars

openai/openai-realtime-embedded-sdk
A SDK to using the Realtime API with Microcontrollers like the ESP32
Language:C++1.2k109
google-ai-edge/ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
Language:Jupyter Notebook40052
openvinotoolkit/openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Language:C++7.5k2.4k
alibaba/TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Language:Python771117
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
Language:Python73374
onnx/onnx-tensorflow
Tensorflow Backend for ONNX
Language:Python1.3k297
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Language:HTML21158
gitwukeyi/FSPEN
Language:Python4111
santi-pdp/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Language:Python822282
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language:Jupyter Notebook1.6k343
ruizhecao96/CMGAN
Conformer-based Metric GAN for speech enhancement
Language:Python32760
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language:C++2.3k407
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook36.5k4.3k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python36.4k4.5k
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Language:Python34050
eliberis/tflite-tools
TFLite model analyzer & memory optimizer
Language:Python12020
k2-fsa/icefall
Language:Python969308
csukuangfj/optimized_transducer
Memory efficient transducer loss computation
Language:CMake6812
markostam/active-noise-cancellation
Active noise cancellation using various algorithms (FxLMS, FuLMS, NLMS) in Matlab, VST and C
Language:Matlab348102
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.6k2.6k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python73.4k8.8k
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
Language:Python3k235
mozillazg/phrase-pinyin-data
词语拼音数据
Language:Python46099
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1.3k141
double22a/speech_dataset
The dataset of Speech Recognition
39376
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python976179
chenweiphd/LargeLanguageModel-and-GPT-4-ResourceMap
37659
openai/following-instructions-human-feedback
1.2k143
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.5k798
chester256/Model-Compression-Papers
Papers for deep neural network compression and acceleration
39680