gluttony-10's Stars
xai-org/grok-1
Grok open release
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
lepoco/wpfui
WPF UI provides the Fluent experience in your known and loved WPF framework. Intuitive design, themes, navigation and new immersive controls. All natively and effortlessly.
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Stability-AI/StableCascade
Official Code for Stable Cascade
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
lllyasviel/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
FujiwaraChoki/MoneyPrinterV2
Automate the process of making money online.
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
dreamoving/dreamoving-project
Official implementation of DreaMoving
kijai/ComfyUI-CogVideoXWrapper
Flode-Labs/vid2densepose
Convert your videos to densepose and use it on MagicAnimate
jiayev/GPT4V-Image-Captioner
THUDM/AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
thu-coai/CharacterGLM-6B
[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models
padeoe/hf-mirror-site
a huggingface mirror site.
THUDM/CogCoM
SAIS-FUXI/VidGen
zRzRzRzRzRzRzR/chatgpt-on-wechat
基于大模型搭建的微信聊天机器人,同时支持微信、企业微信、公众号、飞书、钉钉接入,可选择GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/通义千问/Gemini/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。