zqlsnr's Stars
localsend/localsend
An open-source cross-platform alternative to AirDrop
laurent22/joplin
Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
TranslucentTB/TranslucentTB
A lightweight utility that makes the Windows taskbar translucent/transparent.
obsidianmd/obsidian-releases
Community plugins list, theme list, and releases of Obsidian.
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
zetaloop/ExplorerPatcher
ExplorerPatcher Chinese L10n - 在 Windows 11 上恢复高效的工作环境
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
alibaba-damo-academy/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
JusperLee/TDANet
An efficient speech separation method
descriptinc/audiotools
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Edresson/VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
mct10/RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
CODEJIN/NaturalSpeech2
0nutation/USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
PlayVoice/Grad-SVC
Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei
ZhangXInFD/soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
DarrenZZhang/Survey_IMC
This is a comprehensive survey of incompate multi-view clustering algorithms.
Qqi-HE/DeepChorus
An end-to-end chorus detection model DeepChorus.
krystalan/MMCR
:musical_note: ICANN‘2021: Multi-Modal Chorus Recognition for Improving Song Search
Xiaobin-Rong/lite-rtse
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
zeynepgulhanuslu/denoiser-onnx
Denoiser onnx model usage