Pinned Repositories
AI-Vtuber-chatglm
Deploy chatglm locally to generate replies and answer your bilibili live danmu with voice
animatediff-cli-prompt-travel
animatediff prompt travel
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
AsyncDiff-
Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"
Audio-driven-TalkingFace-HeadPose
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models. A collection of video generation models.
QAnything
RAG with support for multiple file formats
xuniren-NeRF-
Virtual human talking head generation (real-time NeRF-driven virtual human), API included
yaospacetim's Repositories
yaospacetim/QAnything
RAG with support for multiple file formats
yaospacetim/AsyncDiff-
Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"
yaospacetim/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
yaospacetim/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
yaospacetim/cobalt-
save what you love
yaospacetim/CogVideo-
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
yaospacetim/ComicCrawler
An image crawler written in Python.
yaospacetim/CosyVoice-TTS
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
yaospacetim/DCT-Net_Webui
Gradio web UI for DCT-Net-based image/video stylization
yaospacetim/EasySpider-
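A visual no-code/code-free web crawler/spider. EasySpider: a visual browser automation testing, data collection, and crawler tool that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.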
yaospacetim/EchoMimic-
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
yaospacetim/EvTexture-
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
yaospacetim/firecrawl-
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
yaospacetim/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
yaospacetim/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
yaospacetim/groqbook-
Groqbook: Generate entire books in seconds using Groq and Llama3
yaospacetim/IMAGDressing-
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
yaospacetim/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
yaospacetim/IOPaint-
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by stable diffusion) anything in your pictures.
yaospacetim/Long-Novel-GPT
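Long-Novel-GPT is a long-form novel generator based on large language models such as GPT. It uses a hierarchical outline/chapter/text structure to keep a long novel's plot coherent, reduces API call costs through context management, and keeps refining its output based on its own or user feedback until it reaches a preset goal.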
yaospacetim/MimicBrush-P-
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
yaospacetim/MinerU-PDF
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
yaospacetim/MotionClone-
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
yaospacetim/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
yaospacetim/ReplaceAnything-hf
Replace anything in an image
yaospacetim/SenseVoice
Multilingual Voice Understanding Model
yaospacetim/StoryDiffusion
Create Magic Story! Generate consistent comics and videos.
yaospacetim/StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
yaospacetim/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
yaospacetim/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)