Awesome Self-Supervised LearningAwesome

A curated list of awesome Self-Supervised Learning resources. Inspired by awesome-deep-vision, awesome-adversarial-machine-learning, awesome-deep-learning-papers, and awesome-architecture-search

Self-Supervised Learning has become an exciting direction in Computer Vision, Machine Learning, and Robotics community. These are some of the awesome resources!

Contributing

We Need You!

Please help contribute this list by contacting me or add pull request

Markdown format:

- Paper Name. 
  [[pdf]](link) 
  [[code]](link)
  - Author 1, Author 2, and Author 3. *Conference Year*

Change Log

  • Call for PRs for Robotic papers.
  • Dec.1: NIPS'18 papers added!
  • Jul.29: ECCV'18 papers updated!

Table of Contents

Computer Vision

Image Representation Learning

2015

  • Unsupervised Visual Representation Learning by Context Prediction. [pdf] [code]

    • Doersch, Carl and Gupta, Abhinav and Efros, Alexei A. ICCV 2015
  • Unsupervised Learning of Visual Representations using Videos. [pdf] [code]

    • Wang, Xiaolong and Gupta, Abhinav. ICCV 2015
  • Learning to See by Moving. [pdf] [code]

    • Agrawal, Pulkit and Carreira, Joao and Malik, Jitendra. ICCV 2015
  • Learning image representations tied to ego-motion. [pdf] [code]

    • Jayaraman, Dinesh and Grauman, Kristen. ICCV 2015

2016

  • Joint Unsupervised Learning of Deep Representations and Image Clusters. [pdf] [code-torch] [code-caffe]

    • Jianwei Yang, Devi Parikh, Dhruv Batra. CVPR 2016
  • Unsupervised Deep Embedding for Clustering Analysis. [pdf] [code]

    • Junyuan Xie, Ross Girshick, and Ali Farhadi. ICML 2016
  • Slow and steady feature analysis: higher order temporal coherence in video. [pdf]

    • Jayaraman, Dinesh and Grauman, Kristen. CVPR 2016
  • Context Encoders: Feature Learning by Inpainting. [pdf] [code]

    • Pathak, Deepak and Krahenbuhl, Philipp and Donahue, Jeff and Darrell, Trevor and Efros, Alexei A. CVPR 2016
  • Colorful Image Colorization. [pdf] [code]

    • Zhang, Richard and Isola, Phillip and Efros, Alexei A. ECCV 2016
  • Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles. [pdf] [code]

    • Noroozi, Mehdi and Favaro, Paolo. ECCV 2016
  • Ambient Sound Provides Supervision for Visual Learning. [pdf] [code]

    • Owens, Andrew and Wu, Jiajun and McDermott, Josh and Freeman, William and Torralba, Antonio. ECCV 2016
  • Learning Representations for Automatic Colorization. [pdf] [code]

    • Larsson, Gustav and Maire, Michael and Shakhnarovich, Gregory. ECCV 2016
  • Unsupervised Visual Representation Learning by Graph-based Consistent Constraints. [pdf] [code]

    • Li, Dong and Hung, Wei-Chih and Huang, Jia-Bin and Wang, Shengjin and Ahuja, Narendra and Yang, Ming-Hsuan. ECCV 2016

2017

  • Adversarial Feature Learning. [pdf] [code]

    • Donahue, Jeff and Krahenbuhl, Philipp and Darrell, Trevor. ICLR 2017
  • Self-supervised learning of visual features through embedding images into text topic spaces. [pdf] [code]

    • L. Gomez* and Y. Patel* and M. Rusiñol and D. Karatzas and C.V. Jawahar. CVPR 2017
  • Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction. [pdf] [code]

    • Zhang, Richard and Isola, Phillip and Efros, Alexei A. CVPR 2017
  • Learning Features by Watching Objects Move. [pdf] [code]

    • Pathak, Deepak and Girshick, Ross and Dollar, Piotr and Darrell, Trevor and Hariharan, Bharath. CVPR 2017
  • Colorization as a Proxy Task for Visual Understanding. [pdf] [code]

    • Larsson, Gustav and Maire, Michael and Shakhnarovich, Gregory. CVPR 2017
  • DeepPermNet: Visual Permutation Learning. [pdf] [code]

    • Cruz, Rodrigo Santa and Fernando, Basura and Cherian, Anoop and Gould, Stephen. CVPR 2017
  • Unsupervised Learning by Predicting Noise. [pdf] [code]

    • Bojanowski, Piotr and Joulin, Armand. ICML 2017
  • Multi-task Self-Supervised Visual Learning. [pdf]

    • Doersch, Carl and Zisserman, Andrew. ICCV 2017
  • Representation Learning by Learning to Count. [pdf]

    • Noroozi, Mehdi and Pirsiavash, Hamed and Favaro, Paolo. ICCV 2017
  • Transitive Invariance for Self-supervised Visual Representation Learning. [pdf]

    • Wang, Xiaolong and He, Kaiming and Gupta, Abhinav. ICCV 2017
  • Look, Listen and Learn. [pdf]

    • Relja, Arandjelovic and Zisserman, Andrew. ICCV 2017
  • Unsupervised Representation Learning by Sorting Sequences. [pdf] [code]

    • Hsin-Ying Lee, Jia-Bin Huang, Maneesh Kumar Singh, and Ming-Hsuan Yang. ICCV 2017

2018

  • Learning Image Representations by Completing Damaged Jigsaw Puzzles. [pdf]

    • Kim, Dahun and Cho, Donghyeon and Yoo, Donggeun and Kweon, In So. WACV 2018
  • Unsupervised Representation Learning by Predicting Image Rotations. [pdf] [code]

    • Spyros Gidaris and Praveer Singh and Nikos Komodakis. ICLR 2018
  • Improvements to context based self-supervised learning. [pdf]

    • Terrell Mundhenk and Daniel Ho and Barry Chen. CVPR 2018
  • Self-Supervised Feature Learning by Learning to Spot Artifacts.

    • Simon Jenni and Universität Bern and Paolo Favaro. CVPR 2018
  • Boosting Self-Supervised Learning via Knowledge Transfer. [pdf]

    • Mehdi Noroozi and Ananth Vinjimoor and Paolo Favaro and Hamed Pirsiavash. CVPR 2018
  • Cross-domain Self-supervised Multi-task Feature Learning Using Synthetic Imagery. [pdf] [code]

    • Zhongzheng Ren and Yong Jae Lee. CVPR 2018
  • ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids. [pdf]

    • Dinesh Jayaraman*, UC Berkeley; Ruohan Gao, University of Texas at Austin; Kristen Grauman. ECCV 2018
  • Deep Clustering for Unsupervised Learning of Visual Features [pdf]

    • Mathilde Caron, Piotr Bojanowski, Armand Joulin, Matthijs Douze. ECCV 2018
  • Cross Pixel Optical-Flow Similarity for Self-Supervised Learning. [pdf]

    • Aravindh Mahendran, James Thewlis, Andrea Vedaldi. ACCV 2018

Video Representation Learning

  • Unsupervised Learning of Video Representations using LSTMs. [pdf] [code]

    • Srivastava, Nitish and Mansimov, Elman and Salakhudinov, Ruslan. ICML 2015
  • Shuffle and Learn: Unsupervised Learning using Temporal Order Verification. [pdf] [code]

    • Ishan Misra, C. Lawrence Zitnick and Martial Hebert. ECCV 2016
  • LSTM Self-Supervision for Detailed Behavior Analysis [pdf]

    • Biagio Brattoli*, Uta Büchler*, Anna-Sophia Wahl, Martin E. Schwab, and Björn Ommer. CVPR 2017
  • Self-Supervised Video Representation Learning With Odd-One-Out Networks. [pdf]

    • Basura Fernando and Hakan Bilen and Efstratios Gavves and Stephen Gould. CVPR 2017
  • Unsupervised Learning of Long-Term Motion Dynamics for Videos. [pdf]

    • Luo, Zelun and Peng, Boya and Huang, De-An and Alahi, Alexandre and Fei-Fei, Li. CVPR 2017
  • Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning. [pdf]

    • Chuang Gan and Boqing Gong and Kun Liu and Hao Su and Leonidas J. Guibas. CVPR 2018
  • Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning. [pdf]

    • Biagio Brattoli*, Uta Büchler*, and Björn Ommer. ECCV 2018
  • Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles. [pdf]

    • Kim, Dahun and Cho, Donghyeon and Yoo, Donggeun and Kweon, In So. AAAI 2019

Geometry

  • Self-supervised Learning of Motion Capture. [pdf] [code] [web]

    • Tung, Hsiao-Yu and Tung, Hsiao-Wei and Yumer, Ersin and Fragkiadaki, Katerina. NIPS 2017
  • Unsupervised Learning of Depth and Ego-Motion from Video. [pdf] [code] [web]

    • Zhou, Tinghui and Brown, Matthew and Snavely, Noah and Lowe, David G. CVPR 2017
  • Active Stereo Net: End-to-End Self-Supervised Learning for Active Stereo Systems. [project]

    • Yinda Zhang*, Sean Fanello, Sameh Khamis, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Shahram Izadi, Thomas Funkhouser. ECCV 2018
  • Self-Supervised Relative Depth Learning for Urban Scene Understanding. [pdf] [project]

    • Huaizu Jiang*, Erik Learned-Miller, Gustav Larsson, Michael Maire, Greg Shakhnarovich. ECCV 2018

Audio

  • Audio-Visual Scene Analysis with Self-Supervised Multisensory Features. [pdf] [code]

    • Andrew Owens, Alexei A. Efros. ECCV 2018
  • Objects that Sound. [pdf]

    • R. Arandjelović, A. Zisserman. ECCV 2018
  • Learning to Separate Object Sounds by Watching Unlabeled Video. [pdf] [project]

    • Ruohan Gao, Rogerio Feris, Kristen Grauman. ECCV 2018
  • Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization. [pdf]

    • Bruno Korbar,Dartmouth College, Du Tran, Lorenzo Torresani. NIPS 2018
  • Self-Supervised Generation of Spatial Audio for 360° Video. [pdf]

    • Pedro Morgado, Nuno Nvasconcelos, Timothy Langlois, Oliver Wang. NIPS 2018

Others

  • Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model. [pdf]

    • Qixiang Ye, Tianliang Zhang, Qiang Qiu, Baochang Zhang, Jie Chen, Guillermo Sapiro. CVPR 2017
  • Free Supervision from Video Games. [pdf] [project+code]

    • Philipp Krähenbühl. CVPR 2018
  • Fighting Fake News: Image Splice Detection via Learned Self-Consistency [pdf] [code]

    • Minyoung Huh*, Andrew Liu*, Andrew Owens, Alexei A. Efros. ECCV 2018
  • Self-supervised Tracking by Colorization (Tracking Emerges by Colorizing Videos). [pdf]

    • Carl Vondrick*, Abhinav Shrivastava, Alireza Fathi, Sergio Guadarrama, Kevin Murphy. ECCV 2018

Machine Learning

  • Self-taught Learning: Transfer Learning from Unlabeled Data. [pdf]

    • Raina, Rajat and Battle, Alexis and Lee, Honglak and Packer, Benjamin and Ng, Andrew Y. ICML 2007
  • Representation Learning: A Review and New Perspectives. [pdf]

    • Bengio, Yoshua and Courville, Aaron and Vincent, Pascal. TPAMI 2013.

Reinforcement Learning

  • Curiosity-driven Exploration by Self-supervised Prediction. [pdf] [code]

    • Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, and Trevor Darrell. ICML 2017
  • Large-Scale Study of Curiosity-Driven Learning. [pdf]

    • Yuri Burda*, Harri Edwards*, Deepak Pathak*, Amos Storkey, Trevor Darrell and Alexei A. Efros
  • Playing hard exploration games by watching YouTube. [pdf]

    • Yusuf Aytar, Tobias Pfaff, David Budden, Tom Le Paine, Ziyu Wang, Nando de Freitas. NIPS 2018

Robotics

2015

  • Online self-supervised learning for dynamic object segmentation [pdf]
    • Vitor Guizilini and Fabio Ramos, The International Journal of Robotics Research

2016

  • The Curious Robot: Learning Visual Representations via Physical Interactions. [pdf]

    • Lerrel Pinto and Dhiraj Gandhi and Yuanfeng Han and Yong-Lae Park and Abhinav Gupta. ECCV 2016
  • Learning to Poke by Poking: Experiential Learning of Intuitive Physics. [pdf]

    • Agrawal, Pulkit and Nair, Ashvin V and Abbeel, Pieter and Malik, Jitendra and Levine, Sergey. NIPS 2016
  • Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours. [pdf]

    • Pinto, Lerrel and Gupta, Abhinav. ICRA 2016

2017

  • Supervision via Competition: Robot Adversaries for Learning Tasks. [pdf]

    • Pinto, Lerrel and Davidson, James and Gupta, Abhinav. ICRA 2017
  • Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge. [pdf] [Project]

    • Andy Zeng, Kuan-Ting Yu, Shuran Song, Daniel Suo, Ed Walker Jr., Alberto Rodriguez, Jianxiong Xiao. ICRA 2017

2018

  • Time-Contrastive Networks: Self-Supervised Learning from Video. [pdf] [Project]

    • Pierre Sermanet and Corey Lynch and Yevgen Chebotar and Jasmine Hsu and Eric Jang and Stefan Schaal and Sergey Levine. ICRA 2018
  • Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning. [pdf] [Project]

    • Andy Zeng, Shuran Song, Stefan Welker, Johnny Lee, Alberto Rodriguez, Thomas Funkhouser. IROS 2018

2019

  • Learning Long-Range Perception Using Self-Supervision from Short-Range Sensors and Odometry [pdf]
    • Mirko Nava, Jerome Guzzi, R. Omar Chavez-Garcia, Luca M. Gambardella, Alessandro Giusti

Talks

  • Supersizing Self-Supervision: Learning Perception and Action without Human Supervision. Abhinav Gupta (CMU) [link]
  • Self-supervision, Meta-supervision, Curiosity: Making Computers Study Harder. Alyosha Efros (UCB) [link]
  • Unsupervised Visual Learning Tutorial. CVPR 2018 [part 1] [part 2]

Thesis

  • Supervision Beyond Manual Annotations for Learning Visual Representations. Carl Doersch. [pdf].
  • Image Synthesis for Self-Supervised Visual Representation Learning. Richard Zhang. [pdf].
  • Visual Learning beyond Direct Supervision. Tinghui Zhou. [pdf].
  • Visual Learning with Minimal Human Supervision. Ishan Misra. [pdf].

License

To the extent possible under law, Zhongzheng Ren has waived all copyright and related or neighboring rights to this work.