yaleimeng

Improve everyday！

ItibiaSuZhou

yaleimeng's Stars

2dust/v2rayNG
A V2Ray client for Android, support Xray core and v2fly core
Language:Kotlin37.3k5.7k
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.4k162
MetaCubeX/ClashMetaForAndroid
A rule-based tunnel for Android.
Language:Kotlin17.5k1.3k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python12.6k2.6k
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Language:Python1.6k234
yerfor/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
Language:Python983113
lyswhut/lx-music-mobile
一个基于 React native 开发的音乐软件
Language:TypeScript11.6k1.5k
IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
Language:Python1.4k127
Zz-ww/SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形，设置面部区域可配置的增强方式进行合成唇形（人脸）区域画面增强，提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧，补充帧间合成唇形的动作过渡，使合成的唇形更为流畅、真实以及自然。
Language:Python1.9k329
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python61.4k6.6k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python37.4k4.2k
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Language:Python36.1k6k
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程
Language:Jupyter Notebook10.5k1.2k
zvxme/wav2lip_chinese
Language:Python7010
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Language:Python14.1k1.9k
Picovoice/cobra
On-device voice activity detection (VAD) powered by deep learning
Language:Python18312
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.6k799
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
Language:Python3k240
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python17.8k1.3k
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python6.6k653
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language:Python4.8k719
PlayVoice/vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
Language:Python1.2k167
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language:Python7.7k770
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python28.2k2.8k
LC044/WeChatMsg
提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手
Language:Python35.7k3.7k
Raudaschl/rag-fusion
Language:Python809101
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python8.1k1.2k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.9k2.8k
SagerNet/sing-box
The universal proxy platform
Language:Go21.1k2.5k
leoFitz1024/wallhaven
基于wallhaven.cc的一款壁纸管理工具
Language:CSS1.5k99

yaleimeng

yaleimeng's Stars

2dust/v2rayNG

ZiqiaoPeng/SyncTalk

MetaCubeX/ClashMetaForAndroid

NVIDIA/NeMo

yerfor/GeneFacePlusPlus

yerfor/Real3DPortrait

lyswhut/lx-music-mobile

IntelLabs/fastRAG

Zz-ww/SadTalker-Video-Lip-Sync

comfyanonymous/ComfyUI

RVC-Boss/GPT-SoVITS

TencentARC/GFPGAN

datawhalechina/self-llm

zvxme/wav2lip_chinese

eosphoros-ai/DB-GPT

Picovoice/cobra

pyannote/pyannote-audio

guoqincode/Open-AnimateAnyone

fishaudio/fish-speech

rany2/edge-tts

Plachtaa/VITS-fast-fine-tuning

PlayVoice/vits_chinese

Plachtaa/VALL-E-X

hiroi-sora/Umi-OCR

LC044/WeChatMsg

Raudaschl/rag-fusion

fishaudio/Bert-VITS2

Stability-AI/generative-models

SagerNet/sing-box

leoFitz1024/wallhaven