daitranskku's Stars
facebookresearch/sapiens
High-resolution models for human tasks.
nota-github/AIC2024_Track1_Nota
ZhenyuX1E/PoseTrack
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
daitranskku/forest-fire-damage-mapping
[Remote Sensing] Damage-Map Estimation Using UAV Images and Deep Learning Algorithms for Disaster Management System
daitranskku/CRDDC_2022_Code
[CRDDC2022] Crowdsensing-based Road Damage Detection Challenge
daitranskku/Image2Hazard
[ISARC 2024] Image-to-Hazard: GPT-based Logic Reasoning for Hazard Identification in Construction Site using CCTV Data
daitranskku/VizWiz2024-VQA-AnswerTherapy
[2024VizWiz] Vision-Language Model-based PolyFormer for Recognizing Visual Questions with Multiple Answer Groundings
daitranskku/AIC2024-TRACK4-TEAM15
[CVPRW 2024] Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets
colorfulfuture/Awesome-Trajectory-Motion-Prediction-Papers
aras62/PIE
Annotations for Pedestrian Intention Estimation (PIE) dataset
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
NationalGAILab/HoT
[CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"
TaatiTeam/MotionAGFormer
Official implementation of the paper "MotionAGFormer: Enhancing 3D Pose Estimation with a Transformer-GCNFormer Network" (WACV 2024).
xifen523/COD
Towards Consistent Object Detection via LiDAR-Camera Synergy (official code, SMC 2024)
karpathy/LLM101n
LLM101n: Let's build a Storyteller
callsys/DynRefer
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
jacobmarks/fiftyone_florence2_plugin
Run SOTA Vision-Language Model Florence-2 on your data!
callsys/ControlCap
[ECCV 2024] ControlCap: Controllable Region-level Captioning
spacewalk01/depth-anything-tensorrt
TensorRT implementation of Depth-Anything V1, V2
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
johnowusuduah/DivNEDS
DivNEDS: Diverse Naturalistic Edge Driving Scene Dataset for Autonomous Vehicle Scene Understanding
salmank255/ROAD_Waymo_Baseline
lez-s/StereoDiffusion
Implementation of StereoDiffusion
siyuanliii/masa
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
ipl-uw/AIC23_Track1_UWIPL_ETRI
hailanyi/VirConv
Virtual Sparse Convolution for Multimodal 3D Object Detection
UCF-SST-Lab/AICity-2024-Track2-CVPRW
This is open-source code for AI City Challenge Track 2: Traffic Safety Description and Analysis.
NVIDIAAICITYCHALLENGE/2024AICITY_Code_From_Top_Teams
zhengchen1999/DAT
PyTorch code for our ICCV 2023 paper "Dual Aggregation Transformer for Image Super-Resolution"