Pinned Repositories
ACTOR
Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021
Audio2Face
bvh-python
Python module for parsing BVH (Biovision hierarchical data) mocap files
Emote-hack
using chatgpt (now Claude 3) to reverse engineer code from Emote white paper. WIP
GenMotion
Deep motion generator collections
ignite-1
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
MachineLearning-AI
This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).
sd_webui
secutron
TesTime
collection of sample colab files
secutron's Repositories
secutron/TesTime
collection of sample colab files
secutron/ACTOR
Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021
secutron/Emote-hack
using chatgpt (now Claude 3) to reverse engineer code from Emote white paper. WIP
secutron/MachineLearning-AI
This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).
secutron/sd_webui
secutron/secutron
secutron/stable-diffusion-webui
Stable Diffusion web UI
secutron/DefenseHackathonSatImage
secutron/DiffFace
DiffFace: Diffusion-based Face Swapping with Facial Guidance
secutron/dna.node
secutron/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
secutron/football_analysis
This repository contains a comprehensive computer vision/machine learning football project that uses YOLO for object detection, Kmeans for pixel segmentation, optical flow for motion tracking, and perspective transformation to analyze player movements in football videos
secutron/GPT-4o-Assistant
secutron/hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
secutron/ImageBind
ImageBind One Embedding Space to Bind Them All
secutron/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
secutron/KoProgressiveTransformersSLP
Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)
secutron/ListenDenoiseAction
Code to reproduce the results for our SIGGRAPH 2023 paper "Listen Denoise Action"
secutron/LivePortrait-Advanced-Portrait-Animation-System
LivePortrait is an advanced deep learning-based system for animating portrait images. It uses a two-stage training process to create realistic and controllable animations from static portrait images.
secutron/LSLM-Listening-while-Speaking-Language-Model
LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue capabilities.
secutron/MARLIN
[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg
secutron/mediapipe_pose_compare
Joint angle comparison of mediapipe prediction results bvh conversion with ground truth bvh
secutron/Meteor
Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances for diverse capabilities. (Under Review)
secutron/minimal-diffusion
A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)
secutron/MultiTalk
[INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset"
secutron/nitec
NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction (Accepted at WACV24)
secutron/realistic_talking_facegen
secutron/SHOW
This is the codebase for SHOW.
secutron/Speech-driven-expressions
Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)
secutron/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)