sflowing's Stars
baaivision/Emu3
Next-Token Prediction is All You Need
Henry-23/VideoChat
基于开源项目的数字人,可自定义形象与音色,支持语音克隆,对话首包延迟3-5s。A digital human dialogue demo based on open-source projects with an first package latency of 3-5 seconds.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
lipku/LiveTalking
Real time interactive streaming digital human
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Stability-AI/stable-fast-3d
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
KwaiKEG/KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
Tencent/APIJSON
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can be customized by Frontend(Client) users
AlibabaPAI/llumnix
Efficient and easy multi-instance LLM serving
alibaba/animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
botpress/botpress
The open-source hub to build & deploy GPT/LLM Agents ⚡️
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
NetEase-Media/grps
【深度学习模型部署框架】支持tf/torch/trt/trtllm/vllm以及更多nn框架,支持dynamic batching、streaming模式,支持python/c++双语言,可限制,可拓展,高性能。帮助用户快速地将模型部署到线上,并通过http/rpc接口方式提供服务。
vesoft-inc/nebula
A distributed, fast open-source graph database featuring horizontal scalability and high availability
karpathy/LLM101n
LLM101n: Let's build a Storyteller
li-plus/chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
6drf21e/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
Human3DAIGC/Make-A-Character
Official repo for Make-A-Character: High Quality Text-to-3D Character Generation within Minutes
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
2noise/ChatTTS
A generative speech model for daily dialogue.
VAST-AI-Research/TripoSR
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
openai/shap-e
Generate 3D objects conditioned on text or images
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.