xiilei99's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Stability-AI/generative-models
Generative Models by Stability AI
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
wgwang/awesome-LLMs-In-China
**大模型
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
mapillary/OpenSfM
Open source Structure-from-Motion pipeline
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
isl-org/ZoeDepth
Metric depth estimation from a single image
ytongbai/LVM
OpenDriveLab/Birds-eye-view-Perception
[IEEE T-PAMI] Awesome BEV perception research and cookbook for all level audience in autonomous diriving
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Sense-X/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
4DVLab/Vision-Centric-BEV-Perception
Vision-Centric BEV Perception: A Survey
RaymondWang987/NVDS
The official repository of the ICCV2023 paper "Neural Video Depth Stabilizer" (NVDS).
awaisrauf/Awesome-CV-Foundational-Models
facebookresearch/OrienterNet
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
amir32002/3D_Street_View
The repo of Street View Image, Pose, and 3D Cities Dataset. Used in "Generic 3D Representation via Pose Estimation and Matching", ECCV16
avishkarsaha/translating-images-into-maps
Official PyTorch code for 'Translating Images Into Maps' ICRA 2022 (Outstanding Paper Award)
yzd-v/cls_KD
'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)
hsouri/Battle-of-the-Backbones
google-research/snap
SNAP: Self-supervised Neural Maps for Visual Positioning and Semantic Understanding (NeurIPS 2023)
ggjy/DeLVM
fudan-zvg/Ego3RT
[ECCV 2022] Learning Ego 3D Representation as Ray Tracing
ShenZheng2000/TPSeNCE
TPSeNCE for image rain generation, deraining, and object detection.
YujiaoShi/Boosting3DoFAccuracy
ICCV2023: Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer
woxihuanjiangguo/BEVNeXt