jiaoza's Stars
openai/openai-realtime-embedded-sdk
An SDK for using the Realtime API with microcontrollers like the ESP32
google-ai-edge/ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
openvinotoolkit/openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
alibaba/TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
onnx/onnx-tensorflow
TensorFlow backend for ONNX
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
gitwukeyi/FSPEN
santi-pdp/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
ruizhecao96/CMGAN
Conformer-based Metric GAN for speech enhancement
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
eliberis/tflite-tools
TFLite model analyzer & memory optimizer
k2-fsa/icefall
csukuangfj/optimized_transducer
Memory efficient transducer loss computation
markostam/active-noise-cancellation
Active noise cancellation using various algorithms (FxLMS, FuLMS, NLMS) in MATLAB, VST and C
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
mozillazg/phrase-pinyin-data
Pinyin data for words and phrases
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
double22a/speech_dataset
A dataset for speech recognition
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
chenweiphd/LargeLanguageModel-and-GPT-4-ResourceMap
openai/following-instructions-human-feedback
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
chester256/Model-Compression-Papers
Papers for deep neural network compression and acceleration