gluttony-10

gluttony-10's Stars

xai-org/grok-1
Grok open release
Language:Python49.8k 591 2148.3k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python37.4k 218 1.4k4.2k
chatchat-space/Langchain-Chatchat
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Language:TypeScript32.7k 288 4k5.6k
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
14.5k 673 94978
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python11.3k 128 233823
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10k 126 483940
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language:Jupyter Notebook9.7k 101 163771
lepoco/wpfui
WPF UI provides the Fluent experience in your known and loved WPF framework. Intuitive design, themes, navigation and new immersive controls. All natively and effortlessly.
Language:C#7.8k 82 694773
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.5k 69 1.3k798
Stability-AI/StableCascade
Official Code for Stable Cascade
Language:Jupyter Notebook6.6k 60 124533
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python5.9k 76 213846
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python5.6k 36 590471
lllyasviel/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Language:Python3.9k 42 117335
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Language:Jupyter Notebook3k 26 114201
FujiwaraChoki/MoneyPrinterV2
Automate the process of making money online.
Language:Python2.6k 27 62335
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Language:Python2.3k 38 54286
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.2k 29 174148
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
2k 114 3628
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Language:Python1.9k 24 356159
dreamoving/dreamoving-project
Official implementation of DreaMoving
1.8k 130 1097
kijai/ComfyUI-CogVideoXWrapper
Language:Python1.2k 21 31276
Flode-Labs/vid2densepose
Convert your videos to densepose and use it on MagicAnimate
Language:Python1.1k 12 16129
jiayev/GPT4V-Image-Captioner
Language:Python799 13 5358
THUDM/AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
Language:Python772 27 1363
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Language:Python441 11 1533
thu-coai/CharacterGLM-6B
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
Language:Python428 13 1933
padeoe/hf-mirror-site
a huggingface mirror site.
242 4 3529
THUDM/CogCoM
Language:Jupyter Notebook154 9 2810
SAIS-FUXI/VidGen
Language:Python57 2 64
zRzRzRzRzRzRzR/chatgpt-on-wechat
基于大模型搭建的微信聊天机器人，同时支持微信、企业微信、公众号、飞书、钉钉接入，可选择GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/通义千问/Gemini/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。
Language:Python3 0 0