jdola's Stars
ground-creative/openvoice-docker
Docker container with api for OpenVoice
jdola/fastapi-openvoice-tts
norbertkross/fastapi-openvoice-tts
whitebabyblackmarket/Ventriloquist-v2
Ventriloquist v2 is an AI-powered voice assistant that combines speech recognition, natural language processing, and text-to-speech capabilities using OpenVoice technology.
codename0og/rvc-realtime-voice-changer
RVC realtime voice changer - standalone/lightweight
codename0og/codename-rvc-fork
Retrieval-based-Voice-Conversion ( RVC ) modified and enhanced by codename;0
PGRjoystick/rvc-fastapi
a simple Fast API server that act as an proxy to inference voice on RVC project to convert a voice with voice2voice
fumiama/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
michaelkamprath/multi-service-rtmp-broadcaster
A dockerized livestream rebroadcaster
datarhei/restreamer
The Restreamer is a complete streaming server solution for self-hosting. It has a visually appealing user interface and no ongoing license costs. Upload your live stream to YouTube, Twitch, Facebook, Vimeo, or other streaming solutions like Wowza. Receive video data from OBS and publish it with the RTMP and SRT server.
alibaba/nacos
an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.
TraceMachina/nativelink
NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster builds, reduced test flakiness, and specialized hardware.
HKoon/ChatTTS-OpenVoice
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
yuvraj108c/stable-audio-1-docker
Docker image for Stable Audio Open 1
bunkerity/bunkerweb
🛡️ Open-source and next-generation Web Application Firewall (WAF)
soumik-kanad/diff2lip
timhagel/MeloTTS-Docker-API-Server
A docker image to access MeloTTS through API calls
YoungSeng/DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
zmwv823/ComfyUI-AnyText
Unofficial implementation of AnyText. Generate or edit image with text (Mainly English & Chinese) in ComfyUI
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Fictionarry/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
xg-chu/lightning_track
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
lipku/metahuman-stream
Real time interactive streaming digital human
tanshuai0219/EDTalk
[ECCV 2024] EDTalk - Official PyTorch Implementation
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
landing-ai/vision-agent
Vision agent