SOS4NLP

SOS4NLP: A survey list of surveys for natural language processing.

Mainly Contributed and Maintained by Yuan Zang.

Reading the surveys is an efficient way to learn about an academic field. This repository provides a paperlist of surveys for different areas of natural language processing.

Thanks for all great contributors! Everyone in Github is welcomed to make contribution to this repository.

0. Surveys of Natural Language Processing
1. Language Parsing
2. Natural Language Understanding and Generation
3. Information Extraction
4. Information Retrieval
5. Dialogue and Question Answering
- 5.1 Dialogue
- 5.2 Question Answering
6. Representation Learning
7. Machine Learning for Natural Language Processing
8. Interdisciplinary Natural language Processing Application
Acknowledgements

0. Surveys of Natural Language Processing

Advances in natural language processing. Julia Hirschberg, Christopher D Manning. Science 2015. [pdf]
Jumping NLP Curves: A Review of Natural Language Processing Research.

Erik Cambria, Bebo White. IEEE Computational Intelligence Magazine 2014. [pdf]
Natural Language Processing: An Introduction. Prakash M Nadkarni, Lucila Ohno-Machado, Wendy W Chapman. Journal of the American Medical Informatics Association 2011. [pdf]
Natural Language Processing. Gobinda G Chowdhury. Annual Review of Information Science and Technology 2003. [pdf]

1. Language Parsing

1.1 Semantic Parsing

A Survey on Semantic Parsing.

Aishwarya Kamath, Rajarshi Das. AKBC 2018. [pdf]

1.2 Text Segmentation

Text Segmentation Techniques: A Critical Review. Irina Pak, Phoey Lee Teh. Innovative Computing, Optimization and Its Applications 2018. [pdf]

1.3 Part of Speech Tagging

Part‐of‐speech Tagging.

Angel R Martinez. Wiley Interdisciplinary Reviews: Computational Statistics 2012. [pdf]

1.4 Coreference Resolution

Coreference Resolution: A Survey.

Pradheep Elango. University of Wisconsin, Madison, WI 2005. [pdf]

1.5 Word Sense Disambiguation

Word Sense Disambiguation: A Survey.

Alok Ranjan Pal. Arxiv 2015. [pdf]
Word Sense Disambiguation: A Survey.

Roberto Navigli. CSUR 2009. [pdf]

1.6 Named Entity Recognization

A Survey on Deep Learning for Named Entity Recognition.

Jing Li, Aixin Sun, Jianglei Han, Chenliang Li. TKDE 2020. [pdf]
A Survey of Named Entity Recognition and Classification.

David Nadeau, Satoshi Sekine. Lingvisticae Investigationes 2007. [pdf]

1.7 Dependency Parsing

Dependency Parsing.

Sandra Kubler, Ryan McDonald, Joakim Nivre. Synthesis Lectures on Human Language Technologies 2009. [pdf]

2. Natural Language Understanding and Generation

2.1 Text Classification

Text Classification Algorithms: A Survey. Kamran Kowsari, Kiana Jafari Meimandi, Mojtaba Heidarysafa, Sanjana Mendu, Laura Barnes, Donald Brown. Information 2019. [pdf]
Semantic Text Classification: A Survey of Past and Recent Advances.

Berna Altinel, Murat Can Ganiz. Information Processing & Management 2018. [pdf]

2.2 Sentiment Analysis

A Survey of Sentiment Analysis in Social Media. Lin Yue, Weitong Chen, Xue Li, Wanli Zuo, Minghao Yin. Knowledge and Information Systems 2019. [pdf]
Deep Learning for Sentiment Analysis: A Survey.

Lei Zhang, Shuai Wang, Bing Liu. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 2018. [pdf]

2.3 Natural Language Inference

Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches. Shane Storks, Qiaozi Gao, Joyce Y Chai. Arxiv 2019. [pdf]

2.4 Reading Comprehension

A Survey on Machine Reading Comprehension—Tasks, Evaluation Metrics and Benchmark Datasets. Changchang Zeng, Shaobo Li, Qin Li, Jie Hu, Jianjun Hu. Applied Sciences 2020. [pdf]
Neural Machine Reading Comprehension: Methods and Trends.

Shanshan Liu, Xin Zhang, Sheng Zhang, Hui Wang, Weiming Zhang. Applied Sciences 2019. [pdf]

2.5 Text Generation

Pretrained Language Models for Text Generation: A Survey.

Junyi Li, Tianyi Tang, Wayne Xin Zhao, Ji-Rong Wen. Arxiv 2021. [pdf]
Survey of the State of the Art in Natural Language Generation: Core Tasks, Applications and Evaluation. Albert Gatt, Emiel Krahmer. Journal of Artificial Intelligence Research 2018. [pdf]

2.6 Machine Translation

A Survey of Multilingual Neural Machine Translation. Raj Dabre, Chenhui Chu, Anoop Kunchukuttan. CSUR 2020. [pdf]
A Survey of Machine Translation: Its History, Current Status and Future Prospects.

Jonathan Slocum. Computational Linguistics 1985. [pdf]

2.7 Text Summarization

Recent Automatic Text Summarization Techniques: A Survey. Mahak Gambhir, Vishal Gupta. Artificial Intelligence Review 2017. [pdf]
A Survey on Dialogue Summarization: Recent Advances and New Frontiers.

Xiachong Feng, Xiaocheng Feng, Bing Qin. arxiv 2021. [pdf]
What Have We Achieved on Text Summarization?

Dandan Huang, Leyang Cui, Sen Yang, Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang. EMNLP 2020. [pdf]

3. Information Extraction

3.1 Relation Extraction

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.

Xu Han, Tianyu Gao, Yankai Lin, Hao Peng, Yaoliang Yang, Chaojun Xiao, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. AACL 2020. [pdf]
Relation extraction: a survey.

Sachin Pawar, Girish K. Palshikara, Pushpak Bhattacharyyab. Arxiv 2017. [pdf]

3.2 Event Extration

A Survey of Event Extraction from Text. Wei Xiang, Bang Wang. IEEE Access 2019. [pdf]

3.3 Open Information Extraction

A Survey on Open Information Extraction. Christina Niklaus, Matthias Cetto, Andre Freitas, Siegfried Handschuh. COLING 2018. [pdf]

4. Information Retrieval

Data Mining and Information Retrieval in the 21st Century: A Bibliographic Review.

Jiaying Liu, Xiangjie, Kong, Xinyu Zhou, Lei Wang, Da Zhang, Ivan Lee, Bo Xu, Feng Xia. Science Review 2019. [pdf]
Neural Models for Information Retrieval.

Bhaskar Mitra, Nick Craswell. Arxiv 2017. [pdf]

5. Dialogue and Question Answering

5.1 Dialogue

A Survey on Dialogue Systems: Recent Advances and New Frontiers.

Hongshen Chen, Xiaorui Liu, Dawei Yin, Jiliang Tang. Acm SIGKDD Explorations Newsletter 2017. [pdf]

5.2 Question Answering

Core Techniques of Question Answering Systems over Knowledge Bases: A Survey. Dennis Diefenbach, Vanessa Lopez, Kamal Singh, Pierre Maret. Knowledge and Information Systems 2018. [pdf]
Question Answering Systems: Survey and Trends.

Abdelghani Bouziane, Djelloul Bouchiha, Noureddine Doumi, Mimoun Malki. Procedia Computer Science 2015. [pdf]
A Survey on Question Answering Technology from an Information Retrieval Perspective.

Oleksandr Kolomiyets, Marie-Francine Moens. Information Science 2011. [pdf]
Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering.

Fengbin Zhu, Wenqiang Lei, Chao Wang, Jianming Zheng, Soujanya Poria, Tat-Seng Chua. arxiv 2021. [pdf]

6. Representation Learning

6.1 Representation Learning

Representation Learning: A Review and New Perspectives.

Yoshua Bengio, Aaron Courville, and Pascal Vincent. TPAMI 2013. [pdf]

6.2 Knowledge Representation Learning

Sememe Knowledge Computation: A Review of Recent Advances in Application and Expansion of Sememe Knowledge Bases.

Fanchao Qi, Ruobing Xie, Yuan Zang, Zhiyuan Liu, Maosong Sun. Frontiers of Computer Science 2021. [pdf]
Knowledge Graph Embedding: A Survey of Approaches and Applications.

Quan Wang, Zhendong Mao, Bin Wang, Li Guo. TKDE 2017. [pdf]
Knowledge Representation Learning: A Review.

Zhiyuan Liu, Maosong Sun, Yankai Lin, Ruobing Xie. Journal of Computer Research and Development 2016. [pdf]
A Review of Relational Machine Learning for Knowledge Graphs.

Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich. Proceedings of the IEEE 2015. [pdf]

6.3 Word Representation Learning

From Word to Sense Embeddings: A Survey on Vector Representations of Meaning.

Jose Camacho-Collados, Mohammad Taher Pilehvar. JAIR 2018. [pdf]

6.4 Network Representation Learning

A Survey on Network Embedding.

Peng Cui, Xiao Wang, Jian Pei, Wenwu Zhu. TKDE 2018. [pdf]
Network Representation Learning: A Survey.

Daokun Zhang, Jie Yin, Xingquan Zhu, Chengqi Zhang. IEEE Transactions on Big Data 2018. [pdf]
Network Representation Learning: An Overview.

Cunchao Tu, Cheng Yang, Zhiyuan Liu, Maosong Sun. Scientia Sinica Informationis 2017. [pdf]

7. Machine Learning for Natural Language Processing

7.1 Deep Learning for Natural Language Processing

A Survey of the Usages of Deep Learning for Natural Language Processing.

Daniel W. Otter, Julian R. Medina, Jugal K. Kalita. IEEE Transactions on Neural Networks and Learning Systems 2021. [pdf]
Recent Trends in Deep Learning Based Natural Language Processing.

Tom Young, Devamanyu Hazarika, Soujanya Poria, Erik Cambria. IEEE Computational Intelligence Magazine 2018. [pdf]

7.2 Transformers and Pretrain Language Models

A Survey of Transformers.

Tianyang Lin, Yuxin Wang, Xiangyang Liu, Xipeng Qiu. Arxiv 2021. [pdf]
Pre-Trained Models: Past, Present and Future.

Xu Han, Zhengyan Zhang, Ning Ding, Yuxian Gu, Xiao Liu, Yuqi Huo, Jiezhong Qiu, Liang Zhang, Wentao Han, Minlie Huang, Qin Jin, Yanyan Lan, Yang Liu, Zhiyuan Liu, Zhiwu Lu, Xipeng Qiu, Ruihua Song, Jie Tang, Ji-Rong Wen, Jinhui Yuan, Wayne Xin Zhao, Jun Zhu. Arxiv 2021. [pdf]
Pre-trained models for natural language processing: A survey.

XiPeng Qiu, TianXiang Sun, YiGe Xu, YunFan Shao, Ning Dai, XuanJing Huang. Science China Technological Sciences 2020. [pdf]
Efficient Transformers: A Survey.

Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler. Arxiv 2020. [pdf]

7.3 Graph Neural Networks

Graph Neural Networks for Natural Language Processing: A Survey.

Lingfei Wu, Yu Chen, Kai Shen, Xiaojie Guo, Hanning Gao, Shucheng Li, Jian Pei, Bo Long. Arxiv 2021. [pdf]
Graph Neural Networks: A Review of Methods and Applications.

Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun. AI Open 2020. [pdf]
A Comprehensive Survey on Graph Neural Networks.

Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, Philip S. Yu. IEEE transactions on neural networks and learning systems 2020. [pdf]

7.4 Reinforcement Learning

A Survey of Reinforcement Learning Informed by Natural Language.

Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel. IJCAI 2019. [pdf]

7.5 Data Augmentation

A Survey of Text Data Augmentation.

Pei Liu, Xuemin Wang, Chao Xiang, Weiye Meng. CCNS 2020. [pdf]
A Survey of Data Augmentation Approaches for NLP.

Steven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura, Eduard Hovy. ACL Finings 2021. [pdf]
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP.

Jiaao Chen, Derek Tam, Colin Raffel, Mohit Bansal, Diyi Yang. arxiv 2021. [pdf]

7.6 Few/Zero Shot Learning

Generalizing from a Few Examples: A Survey on Few-Shot Learning.

Yaqing Wang, Quanming Yao, James Kwok, Lionel M. Ni. CSUR 2020. [pdf]
A Survey of Zero-Shot Learning: Settings, Methods, and Applications.

Wei Wang, Vincent W. Zheng, Han Yu, Chunyan Miao. ACM Transactions on Intelligent Systems and Technology 2019. [pdf]

7.7 Meta Learning

Meta-Learning in Neural Networks: A Survey.

Timothy Hospedales, Antreas Antoniou, Paul Micaelli, Amos Storkey. PAMI 2020. [pdf]
Meta-Learning: A Survey.

Joaquin Vanschoren. Arxiv 2018. [pdf]

7.8 Continual Learning

A Continual Learning Survey: Defying Forgetting in Classification Tasks.

Matthias Delange, Rahaf Aljundi, Marc Masana, Sarah Parisot, Xu Jia, Ales Leonardis, Greg Slabaugh, Tinne Tuytelaars. PAMI 2021. [pdf]

7.9 Contrastive Learning

A Survey on Contrastive Self-Supervised Learning.

Ashish Jaiswal, Ashwin Ramesh Babu, Mohammad Zaki Zadeh, Debapriya Banerjee, Fillia Makedon. Technologies 2021. [pdf]
Contrastive Representation Learning: A Framework and Review.

Phuc H. Le-Khac, Graham Healy, Alan F. Smeaton. IEEE Access 2020. [pdf]

7.10 Multi-Task Learning

Multi-task learning for natural language processing in the 2020s: Where are we going?

Joseph Worsham, Jugal Kalita. Pattern Recognition Letters 2020. [pdf]
An Overview of Multi-Task Learning in Deep Neural Networks.

Sebastian Ruder. Arxiv 2017. [pdf]

7.11 Intepretability and Analysis

On Interpretability of Artificial Neural Networks: A Survey.

Feng-Lei Fan, Jinjun Xiong, Mengzhou Li, Ge Wang. IEEE Transactions on Radiation and Plasma Medical Sciences 2021. [pdf]
A Survey of the State of Explainable AI for Natural Language Processing.

Marina Danilevsky, Kun Qian, Ranit Aharonov, Yannis Katsis, Ban Kawas, Prithviraj Sen. AACL 2020. [pdf].
Machine Learning Interpretability: A Survey on Methods and Metrics.

Diogo V. Carvalho, Eduardo M. Pereira, Jaime S. Cardoso. Electronics 2019. [pdf]
Analysis Methods in Neural Language Processing: A Survey.

Yonatan Belinkov, James Glass. TACL 2019. [pdf]
Teach Me to Explain: A Review of Datasets for Explainable NLP.

Sarah Wiegreffe, Ana Marasović. arxiv 2021. [pdf]

7.12 Security Threats and Defense

Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey.

Wei Emma Zhang, Quan Z. Sheng, Ahoud Alhazmi, Chenliang Li. ACM TIST 2020. [pdf]
Backdoor Learning: A Survey.

Yiming Li, Baoyuan Wu, Yong Jiang, Zhifeng Li, Shu-Tao Xia. Arxiv 2020. [pdf].
A Survey of Privacy Attacks in Machine Learning.

Maria Rigaki, Sebastian Garcia. Arxiv 2020. [pdf]

8. Interdisciplinary Natural language Processing Application

8.1 Legal Intelligence

How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence.

Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu, Maosong Sun. ACL 2020. [pdf]

8.2 Bioinformation

Survey of Natural Language Processing Techniques in Bioinformatics.

Zhiqiang Zeng, Hua Shi, Yun Wu, Zhiling Hong. Computational and Mathematical Methods in Medicine 2015. [pdf].

8.3 Financial Intelligence

Natural Language Based Financial Forecasting: A Survey.

Frank Z. Xing, Erik Cambria, Roy E. Welsch. Artificial Intelligence Review 2018. [pdf].

Acknowledgements

Great thanks to other contributors Shengding Hu and Chenglei Si! (names are not listed in particular order)

Please contact us if we miss your names in this list, we will add you back ASAP!

uoneway/SOS4NLP