monstre0731's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
xingyizhou/CenterNet
Object detection, 3D detection, and pose estimation using center point detection:
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
google-research-datasets/Objectron
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
NVlabs/VoxFormer
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
noahcao/OC_SORT
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
dyhBUPT/StrongSORT
[TMM 2023] StrongSORT: Make DeepSORT Great Again
DataXujing/YOLOv8
:fire: Official YOLOv8模型训练和部署
kimyoon-young/centerNet-deep-sort
realtime multiple people tracking (centerNet based person detector + deep sort algorithm with pytorch)
TRI-ML/dd3d
Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.
AIR-THU/DAIR-V2X
Owen-Liuyuxuan/visualDet3D
Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving / YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection
youngwanLEE/MPViT
[CVPR 2022] MPViT:Multi-Path Vision Transformer for Dense Prediction
SysCV/idisc
iDisc: Internal Discretization for Monocular Depth Estimation [CVPR 2023]
syncdoth/RetNet
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
zxcqlf/MonoViT
Self-supervised monocular depth estimation with a vision transformer
dvlab-research/ECCV22-P3AFormer-Tracking-Objects-as-Pixel-wise-Distributions
The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.
aminshabani/house_diffusion
The implementation of "HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising", https://arxiv.org/abs/2211.13287
AIR-THU/DAIR-V2X-Seq
TRI-ML/permatrack
Implementation for Learning to Track with Object Permanence
cnexah/VA-DepthNet
VA-DepthNet: A Variational Approach to Single Image Depth Prediction
liyingying0113/rope3d-dataset-tools
zhiruiluo/my_cv
Academic CV Latex template publishing PDF on github pages
neuralint/neuralint
NeuraLint: Automatic Fault Detection for Deep Learning Programs Using Graph Transformations
Cram3r95/BEV-MOT-DeepSORT-LiDAR-clustering
lobanov-m/dcn-v2-pytorch
Deformable Convolution v2 PyTorch Layer for old networks like CenterNet v1
PratikMishra/AnomalyThresholds
Code for the paper "Empirical Thresholding on Spatio-temporal Autoencoders Trained on Surveillance Videos in a Dementia Care Unit".