CVPR 2020 |
Flow2Stereo: Effective Self-Supervised Learning of Optical Flow and Stereo Matching |
Optical Flow |
F1 : 7.63% (KITTI 2012) |
CVPR 2020 |
Self-Supervised Viewpoint Learning From Image Collections |
Viewpoint learning |
MAE : 4.0 (BIWI) |
CVPR 2020 |
Self-Supervised Scene De-occlusion |
Remove occlusion |
mAP : 29.3 % (KINS) |
CVPR 2020 |
Distilled Semantics for Comprehensive Scene Understanding from Videos |
Scene Understanding |
- |
CVPR 2020 |
Learning by Analogy : Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation |
Optical Flow |
F1 : 11.79% (KITTI 2015) |
CVPR 2020 |
D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features |
3D Local Features |
- |
CVPR 2020 |
SpeedNet: Learning the Speediness in Videos |
predict the "speediness" |
- |
CVPR 2020 |
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation |
Action Segmentation |
F1@10 : 83.0 (GTEA) |
CVPR 2020 |
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation |
Robotic Navigation |
- |
arXiv:2003.06734 |
Active Perception and Representation for Robotic Manipulation |
Robot manipulation |
- |
arXiv:2005.01655 |
Words aren’t enough, their order matters: On the Robustness of Grounding Visual Referring Expressions |
Visual Referring Expressions |
- |
arXiv:2004.11362 |
Supervised Contrastive Learning |
Supervised Contrastive Learning |
ImageNet Acc: 80.8 (Top-1) |
arXiv:2007.14449 |
Learning from Scale-Invariant Examples for Domain Adaptation in Semantic Segmentation |
Domain Adaptation |
GTA5 to Cityscape : 47.5 (mIoU) |
arXiv:2007.12360 |
On the Effectiveness of Image Rotation for Open Set Domain Adaptation |
Domain Adaptation |
- |
arXiv:2003.12283 |
LIMP: Learning Latent Shape Representations with Metric Preservation Priors |
Geneartive models |
- |
arXiv:2004.04312 |
Learning to Scale Multilingual Representations for Vision-Language Tasks |
Vision-Language |
MSCOCO: 81.5 |
arXiv:2003.08934 |
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis |
View Synthesis |
- |
arXiv:2001.01536 |
Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification |
Knowledge Distillation, Long-tail classification |
- |
arXiv:2006.07114 |
Knowledge Distillation Meets Self-Supervision |
Knowledge Distillation |
Res50 --> MobileNetv2 Acc: 72.57 (Top-1) |
AAAI2020 |
Fast and Robust Face-to-Parameter Translation for Game Character Auto-Creation |
Game Character Auto-Creation |
- |
arXiv:2009.07719 |
Domain-invariant Similarity Activation Map Metric Learning for Retrieval-based Long-term Visual Localization |
Similarity Activation Map |
- |
arXiv:2008.10312 |
Self-Supervised Learning for Large-Scale Unsupervised Image Clustering |
Image Clustering |
ImageNet Acc: 38.60 (cluster assignment) |
ICLR2021 under review |
SSD: A UNIFIED FRAMEWORK FOR SELFSUPERVISED OUTLIER DETECTION |
Outlier Detection |
CIFAR10/CIFAR100 : 94.1% (in/out) |