gauravk95

👨‍💻 Android and Backend Dev, Experienced in AR, 3D Graphics, ML, Camera, Audio, Video, OTT, Social Media apps. 📱 Building apps for billions

Bangalore, India

gauravk95's Stars

supabase/supabase
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
Language: TypeScript 76.2k 529 4.1k 7.5k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 36.9k 296 1.1k 4.6k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Language: Python 30.5k 224 2653k
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python 15.4k 120 1.1k 2.1k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook 13.5k 175 5241.9k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python 11.4k 184 1.9k 1.9k
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language: Python 10.7k 125 217785
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Language: Python 8.7k 139 2501.1k
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Language: Python 8k 107 3661.2k
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.6k 335 268928
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 5.3k 77 205453
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language: Jupyter Notebook 4k 76 112221
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language: Python 2.7k 31 62263
android/play-billing-samples
Samples for Google Play In-app Billing
Language: Kotlin 2.4k 189 5431.3k
Zz-ww/SadTalker-Video-Lip-Sync
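This project implements Wav2lip-style video lip synthesis based on SadTalker. Lip shapes are generated by driving a video file with speech, and a configurable enhancement mode for the face region is applied to the synthesized lip (face) area to improve the clarity of the generated lips. The DAIN deep learning frame-interpolation algorithm is used to add frames to the generated video, filling in the transitions between synthesized lip movements so that the lips look smoother, more realistic, and more natural.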
Language: Python 1.9k 36 96333
FACEGOOD/FACEGOOD-Audio2Face
http://www.facegood.cc
Language: Python 1.8k 30 90361
Nutlope/notesGPT
Record voice notes & transcribe, summarize, and get tasks
Language: TypeScript 1.8k 24 22296
xiaobai1217/Awesome-Video-Datasets
Video datasets
1.3k 29 1297
YuanxunLu/LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
Language: Python 1.2k 22 92214
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language: Python 1.2k 56 54136
SociallyIneptWeeb/AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
Language: Python 1.2k 26 130281
wladradchenko/wunjo.wladradchenko.ru
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
Language: Python 928 19 5896
Doubiiu/CodeTalker
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Language: Jupyter Notebook 550 24 8159
RenYurui/PIRender
The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"
Language: Python 528 20 3867
choyingw/SynergyNet
3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
Language: Jupyter Notebook 388 18 3457
amirbar/speech2gesture
code for training the models from the paper "Learning Individual Styles of Conversational Gestures"
Language: Python 380 27 2544
yfeng95/DELTA
Learning Disentangled Avatars with Hybrid 3D Representations. (Face, Body, Hair and Clothing)
Language: Python 251 14 1115
KangweiiLiu/Awesome_Audio-driven_Talking-Face-Generation
A curated list of resources of audio-driven talking face generation
136 2 311
zhongshaoyy/Audio2Face
Language: Python 94 6 725
gauravk95/SadTalker-Video
This project is based on SadTalker to implement video lip synthesis.
Language: Python 12 2 05
