Huang9495
Research include computer vision, pattern recognition, and deep learning, focusing on fine-grained recognition, retail product recognition, object tracking.
China
Huang9495's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
lllyasviel/stable-diffusion-webui-forge
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
jamriska/ebsynth
Fast Example-based Image Synthesis and Style Transfer
ltdrdata/ComfyUI-Impact-Pack
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
VAST-AI-Research/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
lucidrains/meshgpt-pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
OpenTexture/Paint3D
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model
storyicon/comfyui_segment_anything
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
G-U-N/AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
shibing624/ChatPDF
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF
thu-ml/CRM
Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
Tangshitao/MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
limuloo/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
benhenryL/Deblurring-3D-Gaussian-Splatting
Tangshitao/MVDiffusion_plusplus
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
Trentonom0r3/Ezsynth
An Implementation of Ebsynth for video stylization, and the original ebsynth for image stylization as an importable python library!
snap-research/AToM
Official implementation of `AToM: Amortized Text-to-Mesh using 2D Diffusion`
shibing624/chatgpt-webui
ChatGPT WebUI using gradio. 给 LLM 对话和检索知识问答RAG提供一个简单好用的Web UI界面
ming1993li/Instant3DCodes
ctrotz/stylizing-video
Stylizing Video by Example (Jamriska et al., 2019)
t-Authenting/AnimateLCM
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning