ytzhangscr

ytzhangscr's Stars

xinsir6/ControlNetPlus
ControlNet++: All-in-one ControlNet for image generations and editing!
Language:Python1.7k36
AIVFI/Monocular-Depth-Estimation-Rankings-and-2D-to-3D-Video-Conversion-Rankings
Rankings include: BetterDepth Depth Anything DPT FutureDepth GBDMF GenPercept GeoWizard LeReS LightedDepth LFVRT Marigold Metric3D MiDaS NeWCRFs PatchFusion UniDepth ZoeDepth
38
datawhalechina/intro-mathmodel
《数学建模导论》教程，全网最全数学建模模型与算法教程系列，带你走进数学建模的大门！
46163
YangLing0818/VideoTetris
[NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation
Language:Python1996
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.4k128
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Language:Python3.4k358
replicate/cog
Containers for machine learning
Language:Python7.9k550
Lxiangyue/GenN2N
[CVPR'24 - Rebuttal Score 554] GenN2N: Generative NeRF2NeRF Translation
766
mshumer/gpt-prompt-engineer
Language:Jupyter Notebook9.3k645
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python4.5k561
aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
8.4k1.8k
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.4k968
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.5k739
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python32.3k3.7k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.3k1k
marcelscruz/public-apis
A collaborative list of public APIs for developers
Language:JavaScript3.8k371
HeyPuter/puter
🌐 The Internet OS! Free, Open-Source, and Self-Hostable.
Language:JavaScript25.1k1.6k
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language:Go91.5k7.2k
Hillobar/Rope
GUI-focused roop
Language:Python4.4k691
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
Language:Python7.6k1k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python33.3k3.8k
mckaywrigley/chatbot-ui
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.
Language:TypeScript28.4k7.9k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python28.7k2.8k
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
Language:Python2.3k225
fishaudio/fish-speech
Brand new TTS solution
Language:Python12.7k955
ansible/ansible
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
Language:Python62.5k23.8k
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Language:Python1.8k155
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Language:Python90553
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.5k384
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python44k5.2k

ytzhangscr

ytzhangscr's Stars

xinsir6/ControlNetPlus

AIVFI/Monocular-Depth-Estimation-Rankings-and-2D-to-3D-Video-Conversion-Rankings

datawhalechina/intro-mathmodel

YangLing0818/VideoTetris

X-LANCE/AniTalker

modelscope/FunClip

replicate/cog

Lxiangyue/GenN2N

mshumer/gpt-prompt-engineer

myshell-ai/MeloTTS

aishwaryanr/awesome-generative-ai-guide

HumanAIGC/AnimateAnyone

jasonppy/VoiceCraft

All-Hands-AI/OpenHands

PKU-YuanGroup/Open-Sora-Plan

marcelscruz/public-apis

HeyPuter/puter

ollama/ollama

Hillobar/Rope

microsoft/UFO

RVC-Boss/GPT-SoVITS

mckaywrigley/chatbot-ui

myshell-ai/OpenVoice

dvmazur/mixtral-offloading

fishaudio/fish-speech

ansible/ansible

Ucas-HaoranWei/Vary

AILab-CVC/UniRepLKNet

open-mmlab/Amphion

geekan/MetaGPT