sky24h
Research engineer, Ph.D. in CS. Focusing on visual content editing and synthesis.
@CyberAgentAILab
Pinned Repositories
AnimateDiff_Serverless_Runpod
A serverless application that uses AnimateDiff to run a Text-to-Video task on RunPod.
ChatGPT_Telegram_Bot
Simple implementation using ChatGPT (and GPT-4) API, deployed as a Telegram Bot.
Face_Animation_Real_Time
One-shot face animation using webcam, capable of running in real time.
Free-View_Expressive_Talking_Head_Video_Editing
Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)
image2ink
image-to-image translation from image to ink wash painting
PLACE
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis (CVPR2024 Highlight)
SDXL_Serverless_Runpod
A serverless application that uses Stable Diffusion XL to run a Text-to-Image task on RunPod.
SIS_from_Sparse_Layouts
Code for the paper "Diffusion-based Semantic Image Synthesis from Sparse Layouts" (CGI 2023)
sketch2ink_pix2pix
input sketch, output ink wash painting
User_study_tool_flask
sky24h's Repositories
sky24h/Face_Animation_Real_Time
One-shot face animation using webcam, capable of running in real time.
sky24h/AnimateDiff_Serverless_Runpod
A serverless application that uses AnimateDiff to run a Text-to-Video task on RunPod.
sky24h/image2ink
image-to-image translation from image to ink wash painting
sky24h/SDXL_Serverless_Runpod
A serverless application that uses Stable Diffusion XL to run a Text-to-Image task on RunPod.
sky24h/sketch2ink_pix2pix
input sketch, output ink wash painting
sky24h/Free-View_Expressive_Talking_Head_Video_Editing
Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)
sky24h/PLACE
PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis (CVPR2024 Highlight)
sky24h/SIS_from_Sparse_Layouts
Code for the paper "Diffusion-based Semantic Image Synthesis from Sparse Layouts" (CGI 2023)
sky24h/ChatGPT_Telegram_Bot
Simple implementation using ChatGPT (and GPT-4) API, deployed as a Telegram Bot.
sky24h/User_study_tool_flask
sky24h/3DDFA_V2
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
sky24h/deeplab-pytorch
PyTorch implementation of DeepLab v2 on COCO-Stuff / PASCAL VOC
sky24h/DocTr_Geometric_Unwarping
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
sky24h/Face_Recognition_Comparisons
sky24h/Glance_Detetction
Real time detection for whether a person is looking at the screen or not.
sky24h/Human-Pose-Estimation-with-Deep-Learning_backup
MATLAB example of deep learning based human pose estimation.
sky24h/Training-Free_Zero-Shot_Semantic_Segmentation_with_LLM_Refinement
This repository contains official implementation of the paper "Training-Free Zero-Shot Semantic Segmentation with LLM Refinement" (BMVC 2024).
sky24h/accelerate_utils
Custom functions and use cases of [accelerate](https://github.com/huggingface/accelerate).
sky24h/flatten
Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)
sky24h/One-Shot_Free-View_Neural_Talking_Head_Synthesis
sky24h/Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
sky24h/sky24h
sky24h/sky24h.github.io
sky24h/Stable-Makeup
Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model"
sky24h/StoryDiffusion-ControlNet
Create Magic Story with ControlNet!
sky24h/TF-ICON
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
sky24h/utils_cv
sky24h/validation_for_image_generation
sky24h/video-preprocessing
sky24h/websites