jdola

CT TNHH VPAGOHanoi

jdola's Stars

ground-creative/openvoice-docker
Docker container with api for OpenVoice
Language:Shell11
jdola/fastapi-openvoice-tts
1
norbertkross/fastapi-openvoice-tts
Language:Python21
whitebabyblackmarket/Ventriloquist-v2
Ventriloquist v2 is an AI-powered voice assistant that combines speech recognition, natural language processing, and text-to-speech capabilities using OpenVoice technology.
Language:Python1
codename0og/rvc-realtime-voice-changer
RVC realtime voice changer - standalone/lightweight
Language:Python112
codename0og/codename-rvc-fork
Retrieval-based-Voice-Conversion ( RVC ) modified and enhanced by codename;0
Language:Python72
PGRjoystick/rvc-fastapi
a simple Fast API server that act as an proxy to inference voice on RVC project to convert a voice with voice2voice
Language:Python1
fumiama/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language:Python8412
michaelkamprath/multi-service-rtmp-broadcaster
A dockerized livestream rebroadcaster
Language:Python13325
datarhei/restreamer
The Restreamer is a complete streaming server solution for self-hosting. It has a visually appealing user interface and no ongoing license costs. Upload your live stream to YouTube, Twitch, Facebook, Vimeo, or other streaming solutions like Wowza. Receive video data from OBS and publish it with the RTMP and SRT server.
Language:HTML3.7k433
alibaba/nacos
an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.
Language:Java29.6k12.7k
TraceMachina/nativelink
NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster builds, reduced test flakiness, and specialized hardware.
Language:Rust80090
HKoon/ChatTTS-OpenVoice
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Language:Python7410
yuvraj108c/stable-audio-1-docker
Docker image for Stable Audio Open 1
Language:Python1
bunkerity/bunkerweb
🛡️ Open-source and next-generation Web Application Firewall (WAF)
Language:Python5.1k288
soumik-kanad/diff2lip
Language:Python26329
timhagel/MeloTTS-Docker-API-Server
A docker image to access MeloTTS through API calls
Language:Python85
YoungSeng/DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)
Language:Python14119
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python76858
zmwv823/ComfyUI-AnyText
Unofficial implementation of AnyText. Generate or edit image with text (Mainly English & Chinese) in ComfyUI
Language:Python685
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Language:Python4.1k271
Fictionarry/TalkingGaussian
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Language:Python12516
xg-chu/lightning_track
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
Language:Python575
lipku/metahuman-stream
Real time interactive streaming digital human
Language:Python1k237
tanshuai0219/EDTalk
[ECCV 2024] EDTalk - Official PyTorch Implementation
Language:Python1595
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Jupyter Notebook2.3k181
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
Language:Python7k850
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Language:JavaScript4.9k609
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python20.9k2k
landing-ai/vision-agent
Vision agent
Language:Python1k99

jdola

jdola's Stars

ground-creative/openvoice-docker

jdola/fastapi-openvoice-tts

norbertkross/fastapi-openvoice-tts

whitebabyblackmarket/Ventriloquist-v2

codename0og/rvc-realtime-voice-changer

codename0og/codename-rvc-fork

PGRjoystick/rvc-fastapi

fumiama/Retrieval-based-Voice-Conversion-WebUI

michaelkamprath/multi-service-rtmp-broadcaster

datarhei/restreamer

alibaba/nacos

TraceMachina/nativelink

HKoon/ChatTTS-OpenVoice

yuvraj108c/stable-audio-1-docker

bunkerity/bunkerweb

soumik-kanad/diff2lip

timhagel/MeloTTS-Docker-API-Server

YoungSeng/DiffuseStyleGesture

ictnlp/StreamSpeech

zmwv823/ComfyUI-AnyText

tyxsspa/AnyText

Fictionarry/TalkingGaussian

xg-chu/lightning_track

lipku/metahuman-stream

tanshuai0219/EDTalk

Camb-ai/MARS5-TTS

microsoft/UFO

ltdrdata/ComfyUI-Manager

hpcaitech/Open-Sora

landing-ai/vision-agent