ignite720

minimalist

ignite720's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python74.7k 606 08.9k
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python53.3k 942 1.1k8.9k
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python35.3k 180 5.2k2.7k
TabbyML/tabby
Self-hosted AI coding assistant
Language:Rust28.4k 126 7961.3k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python15.4k 113 1.1k1.2k
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Language:C++15.3k 253 6.9k3k
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.5k 694 1.7k5.3k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python13.4k 141 7561.5k
huggingface/transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Language:JavaScript12.6k 81 717814
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell11.8k 72 905708
SubtitleEdit/subtitleedit
the subtitle editor :)
Language:C#9.2k 164 5k929
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.2k 134 1.1k1.4k
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.8k 71 1.3k810
Morizeyao/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
Language:Python7.5k 161 2511.7k
librosa/librosa
Python library for audio and music analysis
Language:Python7.3k 138 1.2k972
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.7k 74 1k810
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Language:Python6k 209 3101.2k
luau-lang/luau
A fast, small, safe, gradually typed embeddable scripting language derived from Lua
Language:C++4.2k 76 640394
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python4.1k 48 167363
LlamaEdge/LlamaEdge
The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge
Language:Rust1.2k 22 109103
segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Language:Python823 13 8146
KoljaB/LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
Language:Python560 12 1461
Xirider/finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
Language:Python436 5 2274
oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Language:Python336 7 1656
Perlmint/glew-cmake
GLEW(https://github.com/nigels-com/glew, source updated nightly) with Cmake and pre-generated sources
Language:C238 12 3895
eastonYi/wav2vec
a simplified version of wav2vec(1.0, vq, 2.0) in fairseq
Language:Python138 6 221
bmx-ng/bmx-ng
The Open Source BlitzMax Compiler Project
Language:BlitzMax105 14 15915
MatijaNovosel/montage
🎬 A clip editor made with Tauri.
Language:Vue97 3 09
Recordscript/recordscript
Cross-platform screen recorder, transcript, subtitle. Built with Tauri & Whisper-rs (rust port of whisper.cpp)
Language:Rust21 1 1
drakang4/jamak
A subtitle editor built with Electron, React and Redux.
Language:TypeScript19 2 04

ignite720

ignite720's Stars

openai/whisper

CorentinJ/Real-Time-Voice-Cloning

gradio-app/gradio

TabbyML/tabby

QwenLM/Qwen

microsoft/onnxruntime

kaldi-asr/kaldi

m-bain/whisperX

huggingface/transformers.js

QwenLM/Qwen2.5

SubtitleEdit/subtitleedit

speechbrain/speechbrain

modelscope/FunASR

Morizeyao/GPT2-Chinese

librosa/librosa

pyannote/pyannote-audio

tyiannak/pyAudioAnalysis

luau-lang/luau

FunAudioLLM/SenseVoice

LlamaEdge/LlamaEdge

segment-any-text/wtpsplit

KoljaB/LocalAIVoiceChat

Xirider/finetune-gpt2xl

oliverguhr/wav2vec2-live

Perlmint/glew-cmake

eastonYi/wav2vec

bmx-ng/bmx-ng

MatijaNovosel/montage

Recordscript/recordscript

drakang4/jamak