Pinned Repositories
SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
shimomurakei's Repositories
shimomurakei/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
shimomurakei/4DGen-colab
shimomurakei/agency-swarm
An opensource agent orchestration framework built on top of the latest OpenAI Assistants API.
shimomurakei/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
shimomurakei/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
shimomurakei/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
shimomurakei/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
shimomurakei/discord
shimomurakei/elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
shimomurakei/EMAGE-jupyter
shimomurakei/ExplorerBlurMica
Add background Blur effect or Acrylic (Mica for win11) effect to explorer for win10 and win11
shimomurakei/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
shimomurakei/GeoWizard
[arXiv'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
shimomurakei/ges-splatting-jupyter
shimomurakei/GRM-jupyter
shimomurakei/IC-Light
More relighting!
shimomurakei/IC-Light-jupyter
shimomurakei/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
shimomurakei/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
shimomurakei/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
shimomurakei/moondream
tiny vision language model
shimomurakei/MVDream-threestudio
3D generation code for MVDream
shimomurakei/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
shimomurakei/OSINT-Framework
OSINT Framework
shimomurakei/PatchFusion
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
shimomurakei/ScoreHMR-jupyter
shimomurakei/sdxl-colab
shimomurakei/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
shimomurakei/StreamMultiDiffusion
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
shimomurakei/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding