albert100121
A Computer Vision and Machine Learning Lover, especially interested in 3D vision and its application (Autonomous driving, robotics, and VR/AR Application)
VSLab@NTHU; MediaTek; AILabsTaipei, Taiwan
albert100121's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google-research/google-research
Google Research
lllyasviel/ControlNet
Let us control diffusion models!
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
NVlabs/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
bmaltais/kohya_ss
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
rasbt/machine-learning-book
Code Repository for Machine Learning with PyTorch and Scikit-Learn
princeton-vl/RAFT
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
johannakarras/DreamPose
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
fuxiao0719/GeoWizard
[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
tomrunia/OpticalFlow_Visualization
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
lzqsd/InverseRenderingOfIndoorScene
NVlabs/I2SB
cfernandezlab/CFL
Tensorflow implementation of our end-to-end model to recover 3D layouts. Also with equirectangular convolutions!
howardyclo/CLCC-CVPR21
An official TensorFlow implementation of “CLCC: Contrastive Learning for Color Constancy” accepted at CVPR 2021.
sunset1995/PanoPlane360
Indoor Panorama Planar 3D Reconstruction via Divide and Conquer
mvlchallenge/mvl_toolkit
Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.