zhouyao4321's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
traveller59/spconv
Spatial Sparse Convolution Library
SHI-Labs/Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
chenhsuanlin/bundle-adjusting-NeRF
BARF: Bundle-Adjusting Neural Radiance Fields š¤® (ICCV 2021 oral)
IDEA-Research/DN-DETR
[CVPR 2022 Oral] Official implementation of DN-DETR
ranandalon/mtl
Unofficial implementation of: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics
bradyz/cross_view_transformers
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)
Megvii-BaseDetection/DeFCN
End-to-End Object Detection with Fully Convolutional Network
TRI-ML/dd3d
Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.
Owen-Liuyuxuan/visualDet3D
Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving / YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection
DrSleep/multi-task-refinenet
Multi-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet
TRI-ML/PF-Track
Implementation of PF-Track
kakaobrain/sparse-detr
PyTorch Implementation of Sparse DETR
kienduynguyen/BoxeR
Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"
SuperMHP/GUPNet
Gorilla-Lab-SCUT/VISTA
This repo presents you the official code of "VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention"
lucidrains/uniformer-pytorch
Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, debuted in ICLR 2022
SJSU-AD/FusionAD
An open source autonomous driving stack by San Jose State University Autonomous Driving Team