WhiteFu

speech synthesis & voice conversion & speech enhancement

Pinned Repositories

ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
1 0 00
bigcode-dataset
Language:Jupyter Notebook1 0 00
llm-paper-daily
Daily updated LLM papers. 每日更新 LLM 相关的论文，欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
1 0 00
MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
Language:Python1 0 00
nmt_data_tools
machine translation data process tools
Language:Perl1 0 00
pycorrector
pycorrector is a toolkit for text error correction. 文本纠错，Kenlm，Seq2Seq_Attention，BERT，MacBERT，ELECTRA，ERNIE，Transformer等模型实现，开箱即用。
Language:Python1 0 00
speech_process
语音处理基本教程
Language:Jupyter Notebook2 0 00
TikTokDownloader
完全免费开源，基于 Requests 模块实现：TikTok 主页/视频/图集/原声；抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
Language:Python1 0 00
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1 1 00
Wav2Lip-GFPGAN
Language:Python1 0 00

WhiteFu's Repositories

WhiteFu/ai-audio-startups
Community list of startups working with AI in audio and music technology
0 0
WhiteFu/audio-pipeline
Language:Python0 0
WhiteFu/AudioEditingCode
Language:Python0 0
WhiteFu/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
0 0
WhiteFu/Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
0 0
WhiteFu/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML0 0
WhiteFu/Bunny
A family of lightweight multimodal models.
Language:Python0 0
WhiteFu/codec-bpe
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
Language:Python0 0
WhiteFu/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Language:Python0 0
WhiteFu/diarizers
Language:Python0 0
WhiteFu/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python0 0
WhiteFu/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language:Jupyter Notebook0 0
WhiteFu/i-Code
Language:Jupyter Notebook0 0
WhiteFu/lina-speech
lina-speech : linear attention based text-to-speech
Language:Jupyter Notebook0 0
WhiteFu/llava-phi
Language:Python0 0
WhiteFu/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
0 0
WhiteFu/M2UGen
This is the official repository for M2UGen
Language:Jupyter Notebook0 0
WhiteFu/Mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Language:Python0 0
WhiteFu/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python0 0
WhiteFu/MoneyPrinterTurbo
利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.
Language:Python0 0
WhiteFu/Open-Sora
Building your own video generation model like OpenAI's Sora
Language:Python0 0
WhiteFu/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python0 0
WhiteFu/overseas-website-note
「海外工具网站」已经是我人生主要事业了，很庆幸还来得及，感谢这个伟大的 AI 时代。
0 0
WhiteFu/pyannote-whisper
Language:Python0 0
WhiteFu/pytorch-speech-features
Language:Python0 0
WhiteFu/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音
Language:Python0 0
WhiteFu/snac
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Language:Python0 0
WhiteFu/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
0 0
WhiteFu/tts-qa
Language:Python0 0
WhiteFu/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Python0 0