Pinned Repositories
ActivityNet_task_1
data_preparation+TSN+BSN
Agently-Daily-News-Collector
An open-source, LLM-based showcase of an automated daily news collection workflow, powered by the Agently AI application development framework.
AI-Vtuber
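AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain (local/llm)/chatglm/text-generation-webui/Wenda/Qwen/kimi]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声] to voice its replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger Stable Diffusion image generation.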
AI-with-code
Hands-on code written while learning AI
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
AnyDoor
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
auto-video-generateor
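Automatic video generator: given a topic, it automatically produces a narrated video. The user enters a topic as text; the system calls a large language model to write a story or narration script, then calls a speech synthesis API to generate the narration audio and a text-to-image API to generate illustrations that match the text, and finally merges the audio and images into a narrated video.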
awesome-active-learning
Hope you can find everything you need about active learning in this repository.
Awesome-PyTorch-Chinese
[Practical resources] The most comprehensive collection of PyTorch learning resources to date
Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
OpenSorceYCW's Repositories
OpenSorceYCW/Agently-Daily-News-Collector
An open-source, LLM-based showcase of an automated daily news collection workflow, powered by the Agently AI application development framework.
OpenSorceYCW/AI-Vtuber
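AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain (local/llm)/chatglm/text-generation-webui/Wenda/Qwen/kimi]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine. It uses TTS [edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声] to voice its replies, with optional voice conversion via [so-vits-svc/DDSP-SVC]; commands can also trigger Stable Diffusion image generation.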
OpenSorceYCW/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
OpenSorceYCW/auto-video-generateor
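Automatic video generator: given a topic, it automatically produces a narrated video. The user enters a topic as text; the system calls a large language model to write a story or narration script, then calls a speech synthesis API to generate the narration audio and a text-to-image API to generate illustrations that match the text, and finally merges the audio and images into a narrated video.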
OpenSorceYCW/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
OpenSorceYCW/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) a lightweight network (899.06M parameters in total), 2) parameter-efficient training (49.57M trainable parameters), and 3) simplified inference (< 8G VRAM at 1024×768 resolution).
OpenSorceYCW/DouyinLiveRecorder
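Live-stream recording software that supports continuous looped monitoring and recording multiple streamers at once. It can record live streams from Douyin, TikTok, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, afreecatv, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, Huajiao, Liuxing, Twitch, and other platforms.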
OpenSorceYCW/duix.ai
OpenSorceYCW/facefusion
Next generation face swapper and enhancer
OpenSorceYCW/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
OpenSorceYCW/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
OpenSorceYCW/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
OpenSorceYCW/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
OpenSorceYCW/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
OpenSorceYCW/metahuman-stream
Real-time streaming digital human based on NeRF
OpenSorceYCW/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
OpenSorceYCW/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
OpenSorceYCW/Omost
Your image is almost there!
OpenSorceYCW/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
OpenSorceYCW/Open-LLM-VTuber
Talk to any LLM with fast hands-free voice interaction, a Live2D talking face, and long-term memory, running locally across platforms
OpenSorceYCW/reactor
Reactor Bill Of Materials (tracking reactor-core, reactor-netty and more)
OpenSorceYCW/ReHiFace-S
Real Time High-Fidelity Faceswap
OpenSorceYCW/sd-webui-regional-prompter
Set prompts for divided regions
OpenSorceYCW/seed-tts-eval
OpenSorceYCW/Streamer-Sales
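Streamer-Sales 销冠 (Sales Champion): a sales-streamer LLM 🛒🎁 that presents products based on their given features, aiming to stimulate users' desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️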
OpenSorceYCW/tango
A family of diffusion models for text-to-audio generation.
OpenSorceYCW/ToonCrafter
a research paper for generative cartoon interpolation
OpenSorceYCW/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
OpenSorceYCW/VideoLingo
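Netflix-level subtitle segmentation and translation, precise alignment, and personalized dubbing; one-click, fully automated video repurposing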
OpenSorceYCW/YesPlayMusic
A good-looking third-party NetEase Cloud Music player, supporting Windows / macOS / Linux :electron: