zhixinshu's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
HarisIqbal88/PlotNeuralNet
Latex code for making neural networks diagrams
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
danielgatis/rembg
Rembg is a tool to remove images background
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
extreme-assistant/CVPR2024-Paper-Code-Interpretation
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理
mlfoundations/open_clip
An open source implementation of CLIP.
kornia/kornia
🐍 Geometric Computer Vision Library for Spatial AI
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
carson-katri/dream-textures
Stable Diffusion built-in to Blender
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
dreamworksanimation/openmoonray
MoonRay is DreamWorks’ open-source, award-winning, state-of-the-art production MCRT renderer.
richzhang/PerceptualSimilarity
LPIPS metric. pip install lpips
unsplash/datasets
🎁 5,400,000+ Unsplash images made available for research and machine learning
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
sicxu/Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
OpenMotionLab/MotionGPT
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
facebookresearch/MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
TEXTurePaper/TEXTurePaper
Official Implementation for "TEXTure: Text-Guided Texturing of 3D Shapes"
buaacyw/MeshAnythingV2
From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
microsoft/MeshTransformer
Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"
open-mmlab/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
saic-violet/bilayer-model
yasumat/RobustPhotometricStereo
Robust Photometric Stereo
roy-hachnochi/cross-domain-compositing
zqbai-jeremy/INORig
Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimization"
microsoft/ConfigNet
Official implementation for ECCV 2020 paper CONFIG: Controllable Neural Face Image Generation