Huang9495

Research interests include computer vision, pattern recognition, and deep learning, with a focus on fine-grained recognition, retail product recognition, and object tracking.

China

Huang9495's Stars

karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python 32.4k 346 2914.9k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 17k 154 2571.6k
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
Language: Python 16.3k 140 231869
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language: Python 8.1k 79 31719
lllyasviel/stable-diffusion-webui-forge
Language: Python 4.7k 52 419422
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
Language: Python 4.4k 50 85479
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
My ComfyUI workflows collection
3.3k 22 7329
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
Language: Python 2.7k 31 352411
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Language: Python 2.4k 31 40224
jamriska/ebsynth
Fast Example-based Image Synthesis and Style Transfer
Language: C 1.5k 46 38193
ltdrdata/ComfyUI-Impact-Pack
Language: Python 1.3k 20 551130
XPandora/PhysGaussian
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Language: Python 882 58 2032
VAST-AI-Research/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
Language: Python 632 19 2044
lucidrains/meshgpt-pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Language: Python 554 18 6747
OpenTexture/Paint3D
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model
Language: Python 540 63 714
storyicon/comfyui_segment_anything
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
Language: Python 457 6 5059
G-U-N/AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
Language: Python 454 27 2538
shibing624/ChatPDF
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF
Language: Python 451 4 2582
thu-ml/CRM
Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
Language: Python 439 17 1934
Tangshitao/MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
Language: Python 435 24 4619
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python 434 6 1925
limuloo/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
Language: Python 386 14 519
benhenryL/Deblurring-3D-Gaussian-Splatting
187 39 55
Tangshitao/MVDiffusion_plusplus
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
118 18 12
Trentonom0r3/Ezsynth
An Implementation of Ebsynth for video stylization, and the original ebsynth for image stylization as an importable python library!
Language: Python 81 1 2210
snap-research/AToM
Official implementation of `AToM: Amortized Text-to-Mesh using 2D Diffusion`
80 20 11
shibing624/chatgpt-webui
ChatGPT WebUI using gradio. Provides a simple, easy-to-use web UI for LLM chat and retrieval-based knowledge Q&A (RAG).
Language: Python 63 3 010
ming1993li/Instant3DCodes
56 14 10
ctrotz/stylizing-video
Stylizing Video by Example (Jamriska et al., 2019)
Language: C++ 429
t-Authenting/AnimateLCM
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning
10
