albert100121

A Computer Vision and Machine Learning Lover, especially interested in 3D vision and its application (Autonomous driving, robotics, and VR/AR Application)

VSLab@NTHU; MediaTek; AILabsTaipei, Taiwan

albert100121's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python146k 1.1k 7.7k27.3k
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Language:Python134k 2.2k 26.7k10.2k
meta-llama/llama
Inference code for Llama models
Language:Python57.2k 527 1.1k9.7k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook48.5k 313 6825.7k
google-research/google-research
Google Research
Language:Jupyter Notebook34.7k 757 1.3k8k
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python31.2k 219 5592.8k
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
Language:Python16.5k 448 3.4k7k
NVlabs/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
Language:Cuda16.2k 203 1k1.9k
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.2k 98 3481.6k
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python11.1k 100 8211.1k
bmaltais/kohya_ss
Language:Python9.9k 95 2.1k1.3k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.9k 78 584636
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python7.2k 50 219550
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.7k 130 30490
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python4.3k 36 216374
rasbt/machine-learning-book
Code Repository for Machine Learning with PyTorch and Scikit-Learn
Language:Jupyter Notebook3.8k 57 981.4k
princeton-vl/RAFT
Language:Python3.4k 38 170641
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Language:Python3.3k 36 156256
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Language:Python2.5k 42 110145
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Language:Python2.2k 19 58139
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Language:Python1k 10 3374
johannakarras/DreamPose
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
Language:Python986 25 6975
fuxiao0719/GeoWizard
[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Language:Python807 24 4234
tomrunia/OpticalFlow_Visualization
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
Language:Python428 10 657
lzqsd/InverseRenderingOfIndoorScene
Language:Python304 13 1835
NVlabs/I2SB
Language:Python265 5 1926
cfernandezlab/CFL
Tensorflow implementation of our end-to-end model to recover 3D layouts. Also with equirectangular convolutions!
Language:Jupyter Notebook106 6 1518
howardyclo/CLCC-CVPR21
An official TensorFlow implementation of “CLCC: Contrastive Learning for Color Constancy” accepted at CVPR 2021.
Language:Python66 2 510
sunset1995/PanoPlane360
Indoor Panorama Planar 3D Reconstruction via Divide and Conquer
Language:Python52 5 710
mvlchallenge/mvl_toolkit
Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.
Language:Python16 1 23

albert100121

albert100121's Stars

AUTOMATIC1111/stable-diffusion-webui

ytdl-org/youtube-dl

meta-llama/llama

facebookresearch/segment-anything

google-research/google-research

lllyasviel/ControlNet

pytorch/vision

NVlabs/instant-ngp

CompVis/latent-diffusion

Lightning-AI/litgpt

bmaltais/kohya_ss

facebookresearch/xformers

LiheYoung/Depth-Anything

cmhungsteve/Awesome-Transformer-Attention

DepthAnything/Depth-Anything-V2

rasbt/machine-learning-book

princeton-vl/RAFT

MooreThreads/Moore-AnimateAnyone

prs-eth/Marigold

fudan-zvg/Semantic-Segment-Anything

lukasHoel/text2room

johannakarras/DreamPose

fuxiao0719/GeoWizard

tomrunia/OpticalFlow_Visualization

lzqsd/InverseRenderingOfIndoorScene

NVlabs/I2SB

cfernandezlab/CFL

howardyclo/CLCC-CVPR21

sunset1995/PanoPlane360

mvlchallenge/mvl_toolkit