junhwanjang's Stars
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
cupy/cupy
NumPy & SciPy for GPU
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
yeemachine/kalidokit
Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
boost-devs/ai-tech-interview
π©βπ»π¨βπ» AI μμ§λμ΄ κΈ°μ λ©΄μ μ€ν°λ (βοΈ 1k+)
draftbit/avatar-generator
Personas, an avatar generator by Draftbit
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
declare-lab/MELD
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
microsoft/FaceSynthetics
EvelynFan/FaceFormer
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
x4nth055/emotion-recognition-using-speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
SuperKogito/spafe
:sound: spafe: Simplified Python Audio Features Extraction
NumesSanguis/FACSvatar
An Open Source Modular Framework From Face to FACS Based Avatar Animation (Unity3D / Blender)
Emotional-Text-to-Speech/dl-for-emo-tts
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
ethanhe42/epipolar-transformers
Epipolar Transformers (best paper award, CVPR 2020 workshop)
Demfier/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
zhangchenxu528/FACIAL
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
numediart/EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
Sxela/face2comics
face2comics datasets
habla-liaa/ser-with-w2v2
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
srcnalt/3d-profile-avatars
Add your website a 3D Profile Avatar using Ready Player Me with a single line of code!
emotiontts/emotiontts_open_db
λ‘λ΄μ κ°μ λ° κ°μ±μ ννν μ μλ λνν μμ±ν©μ± μ€νμμ€ νλ«νΌ
lelechen63/talking-head-generation-survey
Official github repo for paper "What comprises a good talking-head video generation?: A Survey and Benchmark"
junhwanjang/visemenet-inference
3D Avatar Lip Synchronization from speech (JALI based face-rigging)
gau-nernst/centernet-lightning
Implementation of CenterNet and FairMOT with PyTorch Lightning
subinium/web3-onboarding
λ€μν Web3 μλ£λ₯Ό ν΅ν μ¨λ³΄λ©
varunnr/camera_and_pose
Reconstructing 3D Human Pose from 2D Image Landmarks
UttaranB127/speech2affective_gestures
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".