Metokarski's Stars
facebookresearch/AnimalAvatar
Code from the ECCV 2024 paper "Animal Avatar Reconstructing Animatable 3D Animals from Casual Videos".
facebookresearch/UHM
Official PyTorch implementation of "Authentic Hand Avatar from a Phone Scan via Universal Hand Model", CVPR 2024.
facebookresearch/dva
Drivable Volumetric Avatars
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
meta-llama/llama3
The official Meta Llama 3 GitHub site
zengqunzhao/DFER-CLIP
[BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
x4nth055/emotion-recognition-using-speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
MiteshPuthran/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
ihp-lab/OpenSense
OpenSense: A Platform for Multimodal Data Acquisition and Behavior Perception
ihp-lab/LibreFace
[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
yoheinakajima/autofinetune
auto fine tune of models with synthetic data
aras-p/UnityGaussianSplatting
Toy Gaussian Splatting visualization in Unity
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
zubair-irshad/NeO-360
Pytorch code for ICCV'23 paper. NEO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes
pierotofy/OpenSplat
Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀
fraunhoferhhi/Self-Organizing-Gaussians
[ECCV '24] Compressing 3D Gaussian Splats by placing their parameters into a 2D grid with local smoothness
snuvclab/gtu
[CVPR 2024] Official Repo of Guess The Unseen (GTU)
Florian-Barthel/splatviz
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
initialneil/SplattingAvatar
[CVPR2024] Official implementation of SplattingAvatar.
zslrmhb/Omniverse-Virtual-Assisstant
Audio2Face Avatar with Riva SDK functionality
metaiintw/build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
deepgram-devs/deepgram-ai-agent-demo
Deepgram Conversational AI demo
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA/flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer