shimomurakei

Pinned Repositories

SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Language:Python1 1 00

shimomurakei's Repositories

shimomurakei/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
Language:Python0 0
shimomurakei/4DGen-colab
shimomurakei/agency-swarm
An opensource agent orchestration framework built on top of the latest OpenAI Assistants API.
Language:Python0 0
shimomurakei/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python0 0
shimomurakei/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language:Jupyter Notebook0 0
shimomurakei/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Language:Jupyter Notebook0 0
shimomurakei/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python0 0
shimomurakei/discord
shimomurakei/elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
Language:Python0 0
shimomurakei/EMAGE-jupyter
Language:Jupyter Notebook0 0
shimomurakei/ExplorerBlurMica
Add background Blur effect or Acrylic (Mica for win11) effect to explorer for win10 and win11
Language:C++0 0
shimomurakei/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
shimomurakei/GeoWizard
[arXiv'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Language:Python0 0
shimomurakei/ges-splatting-jupyter
shimomurakei/GRM-jupyter
Language:Jupyter Notebook0 0
shimomurakei/IC-Light
More relighting!
Language:Python0 0
shimomurakei/IC-Light-jupyter
Language:Jupyter Notebook0 0
shimomurakei/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
shimomurakei/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language:Python0 0
shimomurakei/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Language:Python0 0
shimomurakei/moondream
tiny vision language model
Language:Jupyter Notebook0 0
shimomurakei/MVDream-threestudio
3D generation code for MVDream
shimomurakei/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python0 0
shimomurakei/OSINT-Framework
OSINT Framework
Language:JavaScript0 0
shimomurakei/PatchFusion
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Language:Python0 0
shimomurakei/ScoreHMR-jupyter
Language:Jupyter Notebook0 0
shimomurakei/sdxl-colab
Language:Jupyter Notebook0 0
shimomurakei/stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Language:Python0 0
shimomurakei/StreamMultiDiffusion
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
Language:Jupyter Notebook0 0
shimomurakei/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python0 0