0nutation

Fudan UniversityShanghai, China

0nutation's Stars

mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript20.7k 114 4161.6k
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python18.8k 160 1.4k1.4k
ml-explore/mlx
MLX: An array framework for Apple silicon
Language:C++18k 148 5891k
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python8.3k 79 4441.1k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.6k 73 1k800
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Language:Python4.3k 27 40224
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Language:Python4k 40 139232
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language:Python2.6k 45 172259
usefulsensors/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Language:Python2.4k 32 26124
openai/openai-realtime-console
React app for inspecting, building and debugging with the Realtime API
Language:JavaScript2.4k 38 67901
alaskasquirrel/Chinese-Podcasts
播客 🎧 编程、设计、Vlog、音乐、访谈、博客...
2k 50 4109
homebrewltd/ichigo
Local realtime voice AI
Language:Python1.9k 19 6991
baaivision/Emu3
Next-Token Prediction is All You Need
Language:Python1.9k 33 4777
qiuqiangkong/audioset_tagging_cnn
Language:Python1.4k 14 69258
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
Language:Python1.3k 18 2096
haoheliu/voicefixer
General Speech Restoration
Language:Python1.1k 17 59132
SmartFlowAI/EmoLLM
心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1
Language:Python911 7 43128
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Language:Python852 18 1856
lifeiteng/OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
Language:Python769 9 1030
JusperLee/Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
763 27 2137
lhl/voicechat2
Local SRT/LLM/TTS Voicechat
Language:Python573 7 1661
facebookresearch/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Language:Python518 18 2256
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language:Python505 31 2335
yeyupiaoling/AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
Language:Python436 7 3485
xinchen-ai/Westlake-Omni
Language:Python187 6 1016
wenet-e2e/wesep
Target Speaker Extraction Toolkit
Language:Python136 6 914
alibabasglab/MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
Language:Python115 4 79
mlcommons/peoples-speech
The People’s Speech Dataset
Language:Jupyter Notebook99 16 2612
thuhcsi/SpeechCraft
The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
64 3 11
LAION-AI/emotional-speech-annotations
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
31 4 01

0nutation

0nutation's Stars

mendableai/firecrawl

Anjok07/ultimatevocalremovergui

ml-explore/mlx

SWivid/F5-TTS

pyannote/pyannote-audio

facebookresearch/lingua

linkedin/Liger-Kernel

jy0205/Pyramid-Flow

usefulsensors/moonshine

openai/openai-realtime-console

alaskasquirrel/Chinese-Podcasts

homebrewltd/ichigo

baaivision/Emu3

qiuqiangkong/audioset_tagging_cnn

hendrycks/test

haoheliu/voicefixer

SmartFlowAI/EmoLLM

facebookresearch/spiritlm

lifeiteng/OmniSenseVoice

JusperLee/Speech-Separation-Paper-Tutorial

lhl/voicechat2

facebookresearch/voxpopuli

FireRedTeam/FireRedTTS

yeyupiaoling/AudioClassification-Pytorch

xinchen-ai/Westlake-Omni

wenet-e2e/wesep

alibabasglab/MossFormer2

mlcommons/peoples-speech

thuhcsi/SpeechCraft

LAION-AI/emotional-speech-annotations