xiilei99

xiilei99's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python130k 1.1k 15.3k25.8k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python52.3k 435 1305.4k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python23.5k 252 2872.6k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.3k 297 1.3k2.5k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
10.9k 251 105720
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python6.5k 48 200502
wgwang/awesome-LLMs-In-China
**大模型
5k 102 24423
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
Language:Python4.4k 43 376341
mapillary/OpenSfM
Open source Structure-from-Motion pipeline
Language:Python3.3k 145 638850
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Language:Python3k 57 201624
isl-org/ZoeDepth
Metric depth estimation from a single image
Language:Jupyter Notebook2.1k 31 106199
ytongbai/LVM
Language:Python1.7k 123 2053
OpenDriveLab/Birds-eye-view-Perception
[IEEE T-PAMI] Awesome BEV perception research and cookbook for all level audience in autonomous diriving
Language:Python1.1k 34 1896
YvanYin/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Language:Python1.1k 25 11875
Sense-X/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Language:Python910 11 14595
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
700 19 746
4DVLab/Vision-Centric-BEV-Perception
Vision-Centric BEV Perception: A Survey
654 30 171
RaymondWang987/NVDS
The official repository of the ICCV2023 paper "Neural Video Depth Stabilizer" (NVDS).
Language:Python452 22 3124
awaisrauf/Awesome-CV-Foundational-Models
431 19 625
facebookresearch/OrienterNet
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
Language:Python430 12 4241
amir32002/3D_Street_View
The repo of Street View Image, Pose, and 3D Cities Dataset. Used in "Generic 3D Representation via Pose Estimation and Matching", ECCV16
428 25 1264
avishkarsaha/translating-images-into-maps
Official PyTorch code for 'Translating Images Into Maps' ICRA 2022 (Outstanding Paper Award)
Language:Python395 23 4049
yzd-v/cls_KD
'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)
Language:Python199 9 1916
hsouri/Battle-of-the-Backbones
187 5 25
google-research/snap
SNAP: Self-supervised Neural Maps for Visual Positioning and Semantic Understanding (NeurIPS 2023)
Language:Python160 8 212
ggjy/DeLVM
Language:Python106 2 96
fudan-zvg/Ego3RT
[ECCV 2022] Learning Ego 3D Representation as Ray Tracing
Language:Python105 12 77
ShenZheng2000/TPSeNCE
TPSeNCE for image rain generation, deraining, and object detection.
Language:Python72 1 80
YujiaoShi/Boosting3DoFAccuracy
ICCV2023: Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer
Language:Python32 1 34
woxihuanjiangguo/BEVNeXt
160