hly990

hly990's Stars

RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 36.9k 218 1.4k 4.2k
s0md3v/roop
one-click face swap
Language: Python 28.8k 263 0 7.1k
chatwoot/chatwoot
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
Language: Ruby 21.6k 236 4.4k 3.7k
songquanpeng/one-api
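OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to redistribute and manage keys, ships as a single executable with a prebuilt Docker image, and supports one-click deployment out of the box. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.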
Language: JavaScript 20k 111 1.5k 4.4k
jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation.
Language: Python 11k 66 6251.2k
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language: Python 8.1k 49 0 1.1k
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python 7.1k 77 634736
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language: Jupyter Notebook 6k 85 147602
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Language: Python 5.7k 56 90670
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language: Python 3.6k 30 437224
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
Language: Python 3.4k 34 217411
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Language: Python 3.3k 25 52251
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python 3.1k 55 218382
gluestack/gluestack-ui
React & React Native Components & Patterns (copy-paste components & patterns crafted with Tailwind CSS (NativeWind))
Language: TypeScript 2.9k 20 485120
axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Language: Python 2.1k 53 750332
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
Language: Python 1.6k 77 44138
fofr/cog-face-to-many
Turn any face into a video game character, pixel art, claymation, 3D or toy
Language: Python 1.3k 9 48201
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language: Python 953 11 11262
X-T-E-R/Uni-TTS
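This project aims to unify the way various speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or launched as a backend service.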
Language: Python 665 8 3258
sentient-engineering/sentient
The framework/SDK that lets you build browser-controlling agents in 3 lines of code. Join the chat at https://discord.gg/umgnyQU2K8
Language: Python 469 8 2154
ai365vip/chat-api
Secondary development built on top of One API and New API.
Language: JavaScript 439 6 95101
Farzad-R/Advanced-QA-and-RAG-Series
This repository contains advanced LLM-based chatbots for Q&A using LLM agents and Retrieval Augmented Generation (RAG) with different databases (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.).
Language: Jupyter Notebook 205 6 11128
sdbds/hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python 193 6 630
JianYing-Automation/JianYingApi
Third-party JianYing (剪映) API.
Language: Python 163 5 928
GiilDe/turbo-edit
Language: Python 86 1 24
owent-utils/font
OWenT's Utils -- Font branch
Language: Python 40 2 0 11
JianYing-Automation/JianYingSrt
Simulates JianYing (剪映) to convert subtitles.
Language: Python 38 1 1322
hedizekri/bark-rvc-pipeline
TTS pipeline that uses RVC to enhance Bark audio quality and cloning
Language: Python 6
aniket-work/Lets_Build_Market_Analysis_Team_w_AI_Agents
Let's Build Market Analysis Team w/ AI Agents
Language: Python 31
axinc-ai/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 2 1 0