speedinghzl's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
julycoding/The-Art-Of-Programming-By-July-2nd
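This project once reached number one worldwide. A collection of key material is at the very bottom of this page, and the complete, polished print edition, 《编程之法:面试和算法心得》 (The Method of Programming: Interview and Algorithm Insights), is on sale at JD.com and Dangdang.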
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
google-deepmind/alphafold
Open source code for AlphaFold 2.
mlfoundations/open_clip
An open source implementation of CLIP.
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Tencent/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several features, including cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple Tencent apps, such as Mobile QQ, Weishi, and Pitu. Contributions are welcome; collaborate with us to make TNN a better framework.
boston-dynamics/spot-sdk
Spot SDK repo
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
shunsukesaito/PIFu
This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"
megvii-model/ShuffleNet-Series
speedinghzl/CCNet
CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).
facebookresearch/MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
SHI-Labs/Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
DingXiaoH/RepLKNet-pytorch
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
astra-vision/MonoScene
[CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image
IDEA-Research/DN-DETR
[CVPR 2022 Oral] Official implementation of DN-DETR
Jeff-sjtu/res-loglikelihood-regression
Code for "Human Pose Regression with Residual Log-likelihood Estimation", ICCV 2021 Oral
hustvl/TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022
youngwanLEE/MPViT
[CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction
hustvl/MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
ZhangGongjie/SAM-DETR
[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation
Dw1010/IntagHand
Picsart-AI-Research/SeMask-Segmentation
[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
hustvl/TeViT
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral
rom1504/laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
OpenGVLab/gv-benchmark
General Vision Benchmark, GV-B, a project from OpenGVLab
tsunghan-wu/RandLA-Net-pytorch
PyTorch implementation of RandLA-Net (https://arxiv.org/abs/1911.11236)
openseg-group/RankSeg
[ECCV 2022] This is an official implementation of the paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation".