Awesome Self-Supervised Learning

A curated list of awesome Self-Supervised Learning resources. Inspired by awesome-deep-vision, awesome-adversarial-machine-learning, awesome-deep-learning-papers, and awesome-architecture-search

Self-Supervised Learning has become an exciting direction in Computer Vision, Machine Learning, and Robotics community. These are some of the awesome resources!

Contributing

Please help contribute this list by contacting me or add pull request

Markdown format:

- Paper Name. 
  [[pdf]](link) 
  [[code]](link)
  - Author 1, Author 2, and Author 3. *Conference Year*

Change Log

Call for PRs for Robotic papers.
Dec.1: NIPS'18 papers added!
Jul.29: ECCV'18 papers updated!

Computer Vision
Machine Learning
- Reinforcement Learning
Robotics
Talks

Computer Vision

Image Representation Learning

2015

Unsupervised Visual Representation Learning by Context Prediction. [pdf] [code]
- Doersch, Carl and Gupta, Abhinav and Efros, Alexei A. ICCV 2015
Unsupervised Learning of Visual Representations using Videos. [pdf] [code]
- Wang, Xiaolong and Gupta, Abhinav. ICCV 2015
Learning to See by Moving. [pdf] [code]
- Agrawal, Pulkit and Carreira, Joao and Malik, Jitendra. ICCV 2015
Learning image representations tied to ego-motion. [pdf] [code]
- Jayaraman, Dinesh and Grauman, Kristen. ICCV 2015

2016

Joint Unsupervised Learning of Deep Representations and Image Clusters. [pdf] [code-torch] [code-caffe]
- Jianwei Yang, Devi Parikh, Dhruv Batra. CVPR 2016
Unsupervised Deep Embedding for Clustering Analysis. [pdf] [code]
- Junyuan Xie, Ross Girshick, and Ali Farhadi. ICML 2016
Slow and steady feature analysis: higher order temporal coherence in video. [pdf]
- Jayaraman, Dinesh and Grauman, Kristen. CVPR 2016
Context Encoders: Feature Learning by Inpainting. [pdf] [code]
- Pathak, Deepak and Krahenbuhl, Philipp and Donahue, Jeff and Darrell, Trevor and Efros, Alexei A. CVPR 2016
Colorful Image Colorization. [pdf] [code]
- Zhang, Richard and Isola, Phillip and Efros, Alexei A. ECCV 2016
Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles. [pdf] [code]
- Noroozi, Mehdi and Favaro, Paolo. ECCV 2016
Ambient Sound Provides Supervision for Visual Learning. [pdf] [code]
- Owens, Andrew and Wu, Jiajun and McDermott, Josh and Freeman, William and Torralba, Antonio. ECCV 2016
Learning Representations for Automatic Colorization. [pdf] [code]
- Larsson, Gustav and Maire, Michael and Shakhnarovich, Gregory. ECCV 2016
Unsupervised Visual Representation Learning by Graph-based Consistent Constraints. [pdf] [code]
- Li, Dong and Hung, Wei-Chih and Huang, Jia-Bin and Wang, Shengjin and Ahuja, Narendra and Yang, Ming-Hsuan. ECCV 2016

2017

Adversarial Feature Learning. [pdf] [code]
- Donahue, Jeff and Krahenbuhl, Philipp and Darrell, Trevor. ICLR 2017
Self-supervised learning of visual features through embedding images into text topic spaces. [pdf] [code]
- L. Gomez* and Y. Patel* and M. Rusiñol and D. Karatzas and C.V. Jawahar. CVPR 2017
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction. [pdf] [code]
- Zhang, Richard and Isola, Phillip and Efros, Alexei A. CVPR 2017
Learning Features by Watching Objects Move. [pdf] [code]
- Pathak, Deepak and Girshick, Ross and Dollar, Piotr and Darrell, Trevor and Hariharan, Bharath. CVPR 2017
Colorization as a Proxy Task for Visual Understanding. [pdf] [code]
- Larsson, Gustav and Maire, Michael and Shakhnarovich, Gregory. CVPR 2017
DeepPermNet: Visual Permutation Learning. [pdf] [code]
- Cruz, Rodrigo Santa and Fernando, Basura and Cherian, Anoop and Gould, Stephen. CVPR 2017
Unsupervised Learning by Predicting Noise. [pdf] [code]
- Bojanowski, Piotr and Joulin, Armand. ICML 2017
Multi-task Self-Supervised Visual Learning. [pdf]
- Doersch, Carl and Zisserman, Andrew. ICCV 2017
Representation Learning by Learning to Count. [pdf]
- Noroozi, Mehdi and Pirsiavash, Hamed and Favaro, Paolo. ICCV 2017
Transitive Invariance for Self-supervised Visual Representation Learning. [pdf]
- Wang, Xiaolong and He, Kaiming and Gupta, Abhinav. ICCV 2017
Look, Listen and Learn. [pdf]
- Relja, Arandjelovic and Zisserman, Andrew. ICCV 2017
Unsupervised Representation Learning by Sorting Sequences. [pdf] [code]
- Hsin-Ying Lee, Jia-Bin Huang, Maneesh Kumar Singh, and Ming-Hsuan Yang. ICCV 2017

2018

Learning Image Representations by Completing Damaged Jigsaw Puzzles. [pdf]
- Kim, Dahun and Cho, Donghyeon and Yoo, Donggeun and Kweon, In So. WACV 2018
Unsupervised Representation Learning by Predicting Image Rotations. [pdf] [code]
- Spyros Gidaris and Praveer Singh and Nikos Komodakis. ICLR 2018
Improvements to context based self-supervised learning. [pdf]
- Terrell Mundhenk and Daniel Ho and Barry Chen. CVPR 2018
Self-Supervised Feature Learning by Learning to Spot Artifacts.
- Simon Jenni and Universität Bern and Paolo Favaro. CVPR 2018
Boosting Self-Supervised Learning via Knowledge Transfer. [pdf]
- Mehdi Noroozi and Ananth Vinjimoor and Paolo Favaro and Hamed Pirsiavash. CVPR 2018
Cross-domain Self-supervised Multi-task Feature Learning Using Synthetic Imagery. [pdf] [code]
- Zhongzheng Ren and Yong Jae Lee. CVPR 2018
ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids. [pdf]
- Dinesh Jayaraman*, UC Berkeley; Ruohan Gao, University of Texas at Austin; Kristen Grauman. ECCV 2018
Deep Clustering for Unsupervised Learning of Visual Features [pdf]
- Mathilde Caron, Piotr Bojanowski, Armand Joulin, Matthijs Douze. ECCV 2018
Cross Pixel Optical-Flow Similarity for Self-Supervised Learning. [pdf]
- Aravindh Mahendran, James Thewlis, Andrea Vedaldi. ACCV 2018

Video Representation Learning

Unsupervised Learning of Video Representations using LSTMs. [pdf] [code]
- Srivastava, Nitish and Mansimov, Elman and Salakhudinov, Ruslan. ICML 2015
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification. [pdf] [code]
- Ishan Misra, C. Lawrence Zitnick and Martial Hebert. ECCV 2016
LSTM Self-Supervision for Detailed Behavior Analysis [pdf]
- Biagio Brattoli*, Uta Büchler*, Anna-Sophia Wahl, Martin E. Schwab, and Björn Ommer. CVPR 2017
Self-Supervised Video Representation Learning With Odd-One-Out Networks. [pdf]
- Basura Fernando and Hakan Bilen and Efstratios Gavves and Stephen Gould. CVPR 2017
Unsupervised Learning of Long-Term Motion Dynamics for Videos. [pdf]
- Luo, Zelun and Peng, Boya and Huang, De-An and Alahi, Alexandre and Fei-Fei, Li. CVPR 2017
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning. [pdf]
- Chuang Gan and Boqing Gong and Kun Liu and Hao Su and Leonidas J. Guibas. CVPR 2018
Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning. [pdf]
- Biagio Brattoli*, Uta Büchler*, and Björn Ommer. ECCV 2018
Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles. [pdf]
- Kim, Dahun and Cho, Donghyeon and Yoo, Donggeun and Kweon, In So. AAAI 2019

Geometry

Self-supervised Learning of Motion Capture. [pdf] [code] [web]
- Tung, Hsiao-Yu and Tung, Hsiao-Wei and Yumer, Ersin and Fragkiadaki, Katerina. NIPS 2017
Unsupervised Learning of Depth and Ego-Motion from Video. [pdf] [code] [web]
- Zhou, Tinghui and Brown, Matthew and Snavely, Noah and Lowe, David G. CVPR 2017
Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo Systems. [project]
- Yinda Zhang*, Sean Fanello, Sameh Khamis, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Shahram Izadi, Thomas Funkhouser. ECCV 2018
Self-Supervised Relative Depth Learning for Urban Scene Understanding. [pdf] [project]
- Huaizu Jiang*, Erik Learned-Miller, Gustav Larsson, Michael Maire, Greg Shakhnarovich. ECCV 2018

Audio

Audio-Visual Scene Analysis with Self-Supervised Multisensory Features. [pdf] [code]
- Andrew Owens, Alexei A. Efros. ECCV 2018
Objects that Sound. [pdf]
- R. Arandjelović, A. Zisserman. ECCV 2018
Learning to Separate Object Sounds by Watching Unlabeled Video. [pdf] [project]
- Ruohan Gao, Rogerio Feris, Kristen Grauman. ECCV 2018
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization. [pdf]
- Bruno Korbar,Dartmouth College, Du Tran, Lorenzo Torresani. NIPS 2018
Self-Supervised Generation of Spatial Audio for 360° Video. [pdf]
- Pedro Morgado, Nuno Nvasconcelos, Timothy Langlois, Oliver Wang. NIPS 2018

Others

Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model. [pdf]
- Qixiang Ye, Tianliang Zhang, Qiang Qiu, Baochang Zhang, Jie Chen, Guillermo Sapiro. CVPR 2017
Free Supervision from Video Games. [pdf] [project+code]
- Philipp Krähenbühl. CVPR 2018
Fighting Fake News: Image Splice Detection via Learned Self-Consistency [pdf] [code]
- Minyoung Huh*, Andrew Liu*, Andrew Owens, Alexei A. Efros. ECCV 2018
Self-supervised Tracking by Colorization (Tracking Emerges by Colorizing Videos). [pdf]
- Carl Vondrick*, Abhinav Shrivastava, Alireza Fathi, Sergio Guadarrama, Kevin Murphy. ECCV 2018

Machine Learning

Self-taught Learning: Transfer Learning from Unlabeled Data. [pdf]
- Raina, Rajat and Battle, Alexis and Lee, Honglak and Packer, Benjamin and Ng, Andrew Y. ICML 2007
Representation Learning: A Review and New Perspectives. [pdf]
- Bengio, Yoshua and Courville, Aaron and Vincent, Pascal. TPAMI 2013.

Reinforcement Learning

Curiosity-driven Exploration by Self-supervised Prediction. [pdf] [code]
- Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, and Trevor Darrell. ICML 2017
Large-Scale Study of Curiosity-Driven Learning. [pdf]
- Yuri Burda*, Harri Edwards*, Deepak Pathak*, Amos Storkey, Trevor Darrell and Alexei A. Efros
Playing hard exploration games by watching YouTube. [pdf]
- Yusuf Aytar, Tobias Pfaff, David Budden, Tom Le Paine, Ziyu Wang, Nando de Freitas. NIPS 2018

Robotics

2015

Online self-supervised learning for dynamic object segmentation [pdf]
- Vitor Guizilini and Fabio Ramos, The International Journal of Robotics Research

2016

The Curious Robot: Learning Visual Representations via Physical Interactions. [pdf]
- Lerrel Pinto and Dhiraj Gandhi and Yuanfeng Han and Yong-Lae Park and Abhinav Gupta. ECCV 2016
Learning to Poke by Poking: Experiential Learning of Intuitive Physics. [pdf]
- Agrawal, Pulkit and Nair, Ashvin V and Abbeel, Pieter and Malik, Jitendra and Levine, Sergey. NIPS 2016
Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours. [pdf]
- Pinto, Lerrel and Gupta, Abhinav. ICRA 2016

2017

Supervision via Competition: Robot Adversaries for Learning Tasks. [pdf]
- Pinto, Lerrel and Davidson, James and Gupta, Abhinav. ICRA 2017
Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge. [pdf] [Project]
- Andy Zeng, Kuan-Ting Yu, Shuran Song, Daniel Suo, Ed Walker Jr., Alberto Rodriguez, Jianxiong Xiao. ICRA 2017

2018

Time-Contrastive Networks: Self-Supervised Learning from Video. [pdf] [Project]
- Pierre Sermanet and Corey Lynch and Yevgen Chebotar and Jasmine Hsu and Eric Jang and Stefan Schaal and Sergey Levine. ICRA 2018
Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning. [pdf] [Project]
- Andy Zeng, Shuran Song, Stefan Welker, Johnny Lee, Alberto Rodriguez, Thomas Funkhouser. IROS 2018

2019

Learning Long-Range Perception Using Self-Supervision from Short-Range Sensors and Odometry [pdf]
- Mirko Nava, Jerome Guzzi, R. Omar Chavez-Garcia, Luca M. Gambardella, Alessandro Giusti

Talks

Supersizing Self-Supervision: Learning Perception and Action without Human Supervision. Abhinav Gupta (CMU) [link]
Self-supervision, Meta-supervision, Curiosity: Making Computers Study Harder. Alyosha Efros (UCB) [link]
Unsupervised Visual Learning Tutorial. CVPR 2018 [part 1] [part 2]

Thesis

Supervision Beyond Manual Annotations for Learning Visual Representations. Carl Doersch. [pdf].
Image Synthesis for Self-Supervised Visual Representation Learning. Richard Zhang. [pdf].
Visual Learning beyond Direct Supervision. Tinghui Zhou. [pdf].
Visual Learning with Minimal Human Supervision. Ishan Misra. [pdf].

License

To the extent possible under law, Zhongzheng Ren has waived all copyright and related or neighboring rights to this work.

ahavenoname/awesome-self-supervised-learning

Awesome Self-Supervised Learning

Contributing

Change Log

Table of Contents

Computer Vision

Image Representation Learning

2015

2016

2017

2018

Video Representation Learning

Geometry

Audio

Others

Machine Learning

Reinforcement Learning

Robotics

2015

2016

2017

2018

2019

Talks

Thesis

License