hly990's Stars
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
s0md3v/roop
one-click face swap
chatwoot/chatwoot
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
songquanpeng/one-api
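OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys through a single API for all LLMs, ships as a single executable with a prebuilt Docker image for one-click, out-of-the-box deployment, and features an English UI.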
jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing. Also supports speech recognition and transcription, speech synthesis, and subtitle translation.
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
gluestack/gluestack-ui
React & React Native Components & Patterns (copy-paste components & patterns crafted with Tailwind CSS (NativeWind))
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
omerbt/TokenFlow
Official PyTorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
fofr/cog-face-to-many
Turn any face into a video game character, pixel art, claymation, 3D or toy
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
X-T-E-R/Uni-TTS
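This project aims to unify how different speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or started as a backend service.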
sentient-engineering/sentient
The framework/SDK that lets you build browser-controlling agents in 3 lines of code. Join the chat at https://discord.gg/umgnyQU2K8
ai365vip/chat-api
A secondary development built on top of One API and New API
Farzad-R/Advanced-QA-and-RAG-Series
This repository contains advanced LLM-based chatbots for Q&A using LLM agents and Retrieval Augmented Generation (RAG) with different databases (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.).
sdbds/hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
JianYing-Automation/JianYingApi
Third-party JianYing (剪映) API.
GiilDe/turbo-edit
owent-utils/font
OWenT's Utils -- Font branch
JianYing-Automation/JianYingSrt
Simulates JianYing's subtitle conversion
hedizekri/bark-rvc-pipeline
TTS pipeline that uses RVC to enhance Bark audio quality and cloning
aniket-work/Lets_Build_Market_Analysis_Team_w_AI_Agents
Let's Build Market Analysis Team w/ AI Agents
axinc-ai/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)