FChin39

勤学多问

NTUSingapore

FChin39's Stars

X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.4k133
Plachtaa/seed-vc
State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning
Language:Python57164
divan/txqr
Transfer data via animated QR codes
Language:Go3k172
sz3/libcimbar
Optimized implementation for color-icon-matrix barcodes
Language:C++4.2k297
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java45.7k3.7k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python18.9k1.8k
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。
Language:Python10.6k1.2k
facebookresearch/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Language:Python51051
CosmosShadow/gptpdf
Using GPT to parse PDF
Language:Python3k226
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python32.2k3.5k
spatialaudio/jackclient-python
🂻 JACK Audio Connection Kit (JACK) Client for Python :snake:
Language:Python13727
niedev/RTranslator
Open source real-time translation app for Android that runs locally
Language:C++6.8k510
andrewyng/translation-agent
Language:Python4.8k545
CrazyBoyM/llama3-Chinese-chat
Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）
Language:Python4k332
ben0oil1/GPT-SoVITS-Server
【脱离复杂的环境配置和整合包，极简配置推理服务】从GPT-SoVITS项目里面提取出来的，纯粹的推理服务方案。
Language:Python19930
PantsuDango/Dango-Translator
团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器
Language:Python7.1k525
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python29.7k2.9k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.5k6.4k
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Language:JavaScript18.6k1.4k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.4k550
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language:Python1.6k224
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language:Jupyter Notebook2.8k399
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python35.3k4k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python20.9k2.1k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.9k1.1k
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python44555
wavmark/wavmark
AI-based Audio Watermarking Tool
Language:Python22529
Far-Se/win32audio
Flutter package to handle windows audio devices. Also extracts native icon to bytes in dart
Language:C++165
jackaudio/jack2
jack2 codebase
Language:C++2.2k376
yxlllc/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Language:Python1.9k250

FChin39

FChin39's Stars

X-LANCE/AniTalker

Plachtaa/seed-vc

divan/txqr

sz3/libcimbar

Stirling-Tools/Stirling-PDF

microsoft/graphrag

jianchang512/pyvideotrans

facebookresearch/voxpopuli

CosmosShadow/gptpdf

2noise/ChatTTS

spatialaudio/jackclient-python

niedev/RTranslator

andrewyng/translation-agent

CrazyBoyM/llama3-Chinese-chat

ben0oil1/GPT-SoVITS-Server

PantsuDango/Dango-Translator

myshell-ai/OpenVoice

facebookresearch/fairseq

koodo-reader/koodo-reader

open-mmlab/Amphion

yerfor/GeneFacePlusPlus

KevinWang676/Bark-Voice-Cloning

RVC-Boss/GPT-SoVITS

facebookresearch/audiocraft

facebookresearch/seamless_communication

facebookresearch/audioseal

wavmark/wavmark

Far-Se/win32audio

jackaudio/jack2

yxlllc/DDSP-SVC