Pinned Repositories
SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
shimomurakei's Repositories
shimomurakei/4dfy
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
shimomurakei/4DGaussians
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
shimomurakei/analog-virtualbox-vm-sky130a
Virtual machine for analog design with the open-source Sky130A PDK
shimomurakei/APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
shimomurakei/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
shimomurakei/ChatMusician
shimomurakei/colorize-line-art-replicate
shimomurakei/comfyui-ipadapter-latentupscale-replicate
shimomurakei/daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
shimomurakei/DiffMorpher
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
shimomurakei/DiffSketcher
[NeurIPS 2023] Official implementation of "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685
shimomurakei/DSINE
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
shimomurakei/IDM-VTON
IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
shimomurakei/IDM-VTON-jupyter
shimomurakei/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
shimomurakei/IPAdapter-jupyter
shimomurakei/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
shimomurakei/OpenVoice
Instant voice cloning by MyShell.
shimomurakei/PARE
Code for the ICCV 2021 paper "PARE: Part Attention Regressor for 3D Human Body Estimation"
shimomurakei/Perturbed-Attention-Guidance
Official implementation of "Perturbed-Attention Guidance"
shimomurakei/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images with text prompts, combined with DreamBooth, to produce stunning videos.
shimomurakei/PuLID-jupyter
shimomurakei/ReNoise-Inversion
Official implementation of "ReNoise: Real Image Inversion Through Iterative Noising"
shimomurakei/StableStudio
Community interface for generative AI
shimomurakei/StoryDiffusion-jupyter
shimomurakei/styletts-colab
shimomurakei/T2I-Adapter
T2I-Adapter
shimomurakei/TCD
Official Repository of the paper "Trajectory Consistency Distillation"
shimomurakei/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
shimomurakei/ZoeDepth
Metric depth estimation from a single image