A curated list of Open Information Extraction (OIE) resources: research papers, code, data, applications, etc. The list is not limited to OIE systems; it also covers closely related work, such as taxonomizing open relations and using OIE in downstream applications.
- Introduction to OIE
- Papers sorted in chronological order
- Papers grouped by category
- Slides
- Talks
- Code
- Data
- PhD theses
- Demos
Open Information Extraction (OIE) systems aim to extract unseen relations and their arguments from unstructured text in an unsupervised manner. In its simplest form, given a natural language sentence, an OIE system extracts information in the form of triples, each consisting of a subject (S), a relation (R) and an object (O).
Suppose we have the following input sentence:
AMD, which is based in U.S., is a technology company.
An OIE system aims to make the following extractions:
("AMD"; "is based in"; "U.S.")
("AMD"; "is"; "technology company")
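The triple notation above can be mirrored in a small data structure. The following is only a minimal sketch: the `Triple` class and the `parse_triple` helper are hypothetical names introduced for illustration and do not belong to any OIE system listed here. It does not perform extraction; it only represents, prints, and parses triples in the `("S"; "R"; "O")` format.

```python
import re
from typing import NamedTuple


class Triple(NamedTuple):
    """A (subject; relation; object) extraction (hypothetical helper type)."""
    subject: str
    relation: str
    obj: str

    def __str__(self) -> str:
        # Render in the ("S"; "R"; "O") notation used in this README.
        return f'("{self.subject}"; "{self.relation}"; "{self.obj}")'


# Assumes that the three fields themselves contain no double quotes.
_TRIPLE_RE = re.compile(r'^\("([^"]*)"; "([^"]*)"; "([^"]*)"\)$')


def parse_triple(text: str) -> Triple:
    """Parse a string in the ("S"; "R"; "O") notation back into a Triple."""
    match = _TRIPLE_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a triple: {text!r}")
    return Triple(*match.groups())


# The two extractions from the example sentence, written out by hand.
extractions = [
    Triple("AMD", "is based in", "U.S."),
    Triple("AMD", "is", "technology company"),
]

for triple in extractions:
    print(triple)
```

Real systems usually attach more information to each extraction (confidence scores, provenance, n-ary arguments), so this flat structure should be read as the simplest possible case.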
-
"Open Information Extraction from the Web" - IJCAI 2007
Michele Banko, Michael J. Cafarella, Stephen Soderland, Matthew Broadhead, Oren Etzioni
-
"Autonomously Semantifying Wikipedia" - CIKM 2007
Fei Wu, Daniel S. Weld
-
"Unsupervised Resolution of Objects and Relations on the Web" - NAACL 2007
Alexander Yates, Oren Etzioni
-
"TextRunner: Open Information Extraction on the Web" - HLT-NAACL 2007
Alexander Yates, Michele Banko, Matthew Broadhead, Michael J. Cafarella, Oren Etzioni, Stephen Soderland
-
"The Tradeoffs between Open and Traditional Relation Extraction" - ACL 2008
Michele Banko, Oren Etzioni
-
"Open Knowledge Extraction through Compositional Language Processing" - STEP 2008
Benjamin Van Durme, Lenhart K. Schubert
-
"Open Information Extraction from the Web" - Commun. ACM 2008
Oren Etzioni, Michele Banko, Stephen Soderland, Daniel S. Weld
-
"Using Wikipedia to Bootstrap Open Information Extraction" - SIGMOD 2009
Daniel S. Weld, Raphael Hoffmann, Fei Wu
-
"Open Information Extraction Using Wikipedia" - ACL 2010
Fei Wu, Daniel S. Weld
-
"Identifying Functional Relations in Web Text" - EMNLP 2010
Thomas Lin, Mausam, Oren Etzioni
-
"Adapting Open Information Extraction to Domain-Specific Relations" - AI Magazine (31), 2010
Stephen Soderland, Brendan Roof, Bo Qin, Shi Xu, Mausam, Oren Etzioni
-
"Open Information Extraction: The Second Generation" - IJCAI 2011 (slides)
Oren Etzioni, Anthony Fader, Janara Christensen, Stephen Soderland, Mausam
-
"Identifying Relations for Open Information Extraction" - EMNLP 2011 (resources (code, data))
Anthony Fader, Stephen Soderland, Oren Etzioni
-
"Filtering and Clustering Relations for Unsupervised Information Extraction in Open Domain" - CIKM 2011
Wei Wang, Romaric Besançon, Olivier Ferret, Brigitte Grau
-
"An Analysis of Open Information Extraction based on Semantic Role Labeling" - K-CAP 2011
Janara Christensen, Mausam, Stephen Soderland, Oren Etzioni
-
"Open Language Learning for Information Extraction" - EMNLP-CoNLL 2012 (resources (code, data, binaries))
Mausam, Michael Schmitz, Stephen Soderland, Robert Bart, Oren Etzioni
-
"PATTY: A Taxonomy of Relational Patterns with Semantic Types" - EMNLP-CoNLL 2012
Ndapandula Nakashole, Gerhard Weikum, Fabian M. Suchanek
-
"Ensemble Semantics for Large-scale Unsupervised Relation Extraction" - EMNLP-CoNLL 2012
Bonan Min, Shuming Shi, Ralph Grishman, Chin-Yew Lin
-
"WiSeNet: building a wikipedia-based semantic network with ontologized relations" - CIKM 2012 (resources)
Andrea Moro, Roberto Navigli
-
"Open Information Extraction for SOV Language Based on Entity-Predicate Pair Detection" - COLING 2012
Woong-Ki Lee, Yeon-Su Lee, Hyoung-Gyu Lee, Won-Ho Ryu, Hae-Chang Rim
-
"A Weighting Scheme for Open Information Extraction" - HLT-NAACL 2012
Yuval Merhav
-
"Dependency-based open information extraction" - Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP @ ACL 2012
Pablo Gamallo, Marcos Garcia
-
"KrakeN: N-ary Facts in Open Information Extraction" - AKBC-WEKEX@NAACL-HLT 2012
Alan Akbik, Alexander Löser
-
"Improving Open Information Extraction for Informal Web Documents with Ripple-Down Rules" - PKAW 2012
Myung Hee Kim, Paul Compton
-
"ClausIE: clause-based open information extraction" - WWW 2013 (slides, code, all resources)
Luciano Del Corro, Rainer Gemulla
-
"Integrating Syntactic and Semantic Analysis into the Open Information Extraction Paradigm" - IJCAI 2013 (resources)
Andrea Moro, Roberto Navigli
-
"Effectiveness and Efficiency of Open Relation Extraction" - EMNLP 2013 (code)
Filipe de Sá Mesquita, Jordan Schmidek, Denilson Barbosa
-
"Open Information Extraction with Tree Kernels" - HLT-NAACL 2013
Ying Xu, Mi-Young Kim, Kevin Quinn, Randy Goebel, Denilson Barbosa
-
"Relation Extraction with Matrix Factorization and Universal Schemas" - HLT-NAACL 2013
Sebastian Riedel, Limin Yao, Andrew McCallum, Benjamin M. Marlin
-
"Open Information Extraction via Contextual Sentence Decomposition" - ICSC 2013
Hannah Bast, Elmar Haussmann
-
"Integrating Open and Closed Information Extraction: Challenges and First Steps" - NLP-DBPEDIA@ISWC 2013
Arnab Dutta, Christian Meilicke, Mathias Niepert, Simone Paolo Ponzetto
-
"Open Information Extraction to KBP Relations in 3 Hours" - TAC 2013
Stephen Soderland, John Gilmer, Robert Bart, Oren Etzioni, Daniel S. Weld
-
"ReNoun: Fact Extraction for Nominal Attributes" - EMNLP 2014
Mohamed Yahya, Steven Whang, Rahul Gupta, Alon Y. Halevy
-
"ZORE: A Syntax-based System for Chinese Open Relation Extraction" - EMNLP 2014
Likun Qiu, Yue Zhang
-
"Canonicalizing Open Knowledge Bases" - CIKM 2014
Luis Galárraga, Geremy Heitz, Kevin Murphy, Fabian M. Suchanek
-
"Focused Entailment Graphs for Open IE Propositions" - CoNLL 2014
Omer Levy, Ido Dagan, Jacob Goldberger
-
"Boosting Open Information Extraction with Noun-Based Relations" - LREC 2014
Clarissa Castellã Xavier, Vera Lúcia Strube de Lima
-
"Improving Open Relation Extraction via Sentence Re-Structuring" - LREC 2014
Jordan Schmidek, Denilson Barbosa
-
"More Informative Open Information Extraction via Simple Inference" - ECIR 2014
Hannah Bast, Elmar Haussmann
-
"Semantifying Triples from Open Information Extraction Systems" - STAIRS 2014
Arnab Dutta, Christian Meilicke, Heiner Stuckenschmidt
-
"Entity-Centric Coreference Resolution of Person Entities for Open Information Extraction" - Procesamiento del Lenguaje Natural (2014)
Marcos García, Pablo Gamallo
-
"Leveraging Linguistic Structure For Open Domain Information Extraction" - ACL 2015 (code (Java), code (Python))
Gabor Angeli, Melvin Jose Johnson Premkumar, Christopher D. Manning
-
"Open IE as an Intermediate Structure for Semantic Tasks" - ACL 2015
Gabriel Stanovsky, Ido Dagan, Mausam
-
"Large-Scale Information Extraction from Textual Definitions through Deep Syntactic and Semantic Analysis" - TACL 2015 (resources)
Claudio Delli Bovi, Luca Telesca, Roberto Navigli
-
"Inferring Binary Relation Schemas for Open Information Extraction" - EMNLP 2015
Kangqi Luo, Xusheng Luo, Kenny Qili Zhu
-
"Knowledge Base Unification via Sense Embeddings and Disambiguation" - EMNLP 2015 (resources)
Claudio Delli Bovi, Luis Espinosa Anke, Roberto Navigli
-
"CORE: Context-Aware Open Relation Extraction with Factorization Machines" - EMNLP 2015 (code)
Fabio Petroni, Luciano Del Corro, Rainer Gemulla
-
"Multilingual Open Relation Extraction Using Cross-lingual Projection" - HLT-NAACL 2015
Manaal Faruqui, Shankar Kumar
-
"Enriching Structured Knowledge with Open Information" - WWW 2015
Arnab Dutta, Christian Meilicke, Heiner Stuckenschmidt
-
"SRDF: Korean Open Information Extraction using Singleton Property" - ISWC 2015
Sangha Nam, YoungGyun Hahm, Sejin Nam, Key-Sun Choi
-
"Multilingual Open Information Extraction" - EPIA 2015
Pablo Gamallo, Marcos García
-
"Open information extraction based on lexical semantics" - J. Braz. Comp. Soc. 21 2015
Clarissa Castellã Xavier, Vera Lúcia Strube de Lima, Marlo Souza
-
"Nested Propositions in Open Information Extraction" - EMNLP 2016 (talk)
Nikita Bhutani, H. V. Jagadish, Dragomir R. Radev
-
"Creating a Large Benchmark for Open Information Extraction" - EMNLP 2016 (code, talk)
Gabriel Stanovsky, Ido Dagan
-
"Porting an Open Information Extraction System from English to German" - EMNLP 2016
Tobias Falke, Gabriel Stanovsky, Iryna Gurevych, Ido Dagan
-
"Relation Schema Induction using Tensor Factorization with Side Information" - EMNLP 2016
Madhav Nimishakavi, Uday Singh Saini, Partha P. Talukdar
-
"Open Information Extraction Systems and Downstream Applications" - IJCAI 2016
Mausam
-
"Demonyms and Compound Relational Nouns in Nominal Open IE" - AKBC@NAACL-HLT 2016
Harinder Pal, Mausam
-
"A Rule Based Open Information Extraction Method Using Cascaded Finite-State Transducer" - PAKDD 2016
Hailun Lin, Yuanzhuo Wang, Peng Zhang, Weiping Wang, Yinliang Yue, Zheng Lin
-
"Getting More Out Of Syntax with PropS" - CoRR (2016)
Gabriel Stanovsky, Jessica Ficler, Ido Dagan, Yoav Goldberg
-
"Improving Open Information Extraction for Semantic Web Tasks" - Trans. Computational Collective Intelligence 21, 2016
Cheikh Kacfah Emani, Catarina Ferreira Da Silva, Bruno Fiés, Parisa Ghodous
-
"An Informativeness Approach to Open IE Evaluation" - CICLing 2016 (slides, code + data)
William Léchelle, Philippe Langlais
-
"MinIE: Minimizing Facts in Open Information Extraction" - EMNLP 2017 (code, poster, all resources)
Kiril Gashteovski, Rainer Gemulla, Luciano Del Corro
-
"Answering Complex Questions Using Open Information Extraction" - ACL 2017
Tushar Khot, Ashish Sabharwal, Peter Clark
-
"Pocket Knowledge Base Population" - ACL 2017
Travis Wolfe, Mark Dredze, Benjamin Van Durme
-
"Bootstrapping for Numerical Open IE" - ACL 2017
Swarnadeep Saha, Harinder Pal, Mausam
-
"MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models" - EACL 2017 (code)
Sheng Zhang, Kevin Duh, Benjamin Van Durme
-
"Syntactic Representation Learning for Open Information Extraction on Web" - WWW 2017
Chengsen Ru, Jintao Tang, Shasha Li, Ting Wang
-
"MetaPAD: Meta Pattern Discovery from Massive Text Corpora" - KDD 2017 (code)
Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han
-
"RelVis: Benchmarking OpenIE Systems" - ISWC 2017
Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, Alexander Löser
-
"Open Relation Extraction and Grounding" - IJCNLP 2017
Dian Yu, Lifu Huang, Heng Ji
-
"Selective Decoding for Cross-lingual Open Information Extraction" - IJCNLP(1) 2017
Sheng Zhang, Kevin Duh, Benjamin Van Durme
-
"An assessment of open relation extraction systems for the semantic web" - Inf. Syst. 71, 2017
Amal Zouaq, Michel Gagnon, Ludovic Jean-Louis
-
"An Evaluation of PredPatt and Open IE via Stage 1 Semantic Role Labeling" - IWCS 2017
Sheng Zhang, Rachel Rudinger, Benjamin Van Durme
-
"Discovering Relational Phrases for Qualia Roles Through Open Information Extraction" - KESW 2017
Giovanni Siragusa, Valentina Leone, Luigi Di Caro, Claudio Schifanella
-
"Open Relation Extraction Based on Core Dependency Phrase Clustering" - DSC 2017
Chengsen Ru, Shasha Li, Jintao Tang, Yi Gao, Ting Wang
-
"Analysing Errors of Open Information Extraction Systems" - Workshop on Building Linguistically Generalizable NLP Systems @ EMNLP 2017
Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, Alexander Löser
-
"Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction" - WSDM 2018
Mingming Sun, Xu Li, Xin Wang, Miao Fan, Yue Feng, Ping Li
-
"Assertion-Based QA With Question-Aware Open Information Extraction" - AAAI 2018
Zhao Yan, Duyu Tang, Nan Duan, Shujie Liu, Wendi Wang, Daxin Jiang, Ming Zhou, Zhoujun Li
-
"Neural Open Information Extraction" - ACL 2018
Lei Cui, Furu Wei, Ming Zhou
-
"Supervised Open Information Extraction" - NAACL-HLT 2018
Gabriel Stanovsky, Julian Michael, Luke Zettlemoyer, Ido Dagan
-
"Logician and Orator: Learning from the Duality between Language and Knowledge in Open Domain" - EMNLP 2018
Mingming Sun, Xu Li, Ping Li
-
"Open Information Extraction from Conjunctive Sentences" - COLING 2018
Swarnadeep Saha, Mausam
-
"Graphene: Semantically-Linked Propositions in Open Information Extraction" - COLING 2018 (code, documentation)
Matthias Cetto, Christina Niklaus, André Freitas, Siegfried Handschuh
-
"Open Information Extraction on Scientific Text: An Evaluation" - COLING 2018
Paul T. Groth, Michael Lauruhn, Antony Scerri, Ron Daniel
-
"A Survey on Open Information Extraction" - COLING 2018
Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh
-
"StuffIE: Semantic Tagging of Unlabeled Facets Using Fine-Grained Information Extraction" - CIKM 2018
Radityo Eko Prasojo, Mouna Kacimi, Werner Nutt
-
"Towards Practical Open Knowledge Base Canonicalization" - CIKM 2018
Tien-Hsuan Wu, Zhiyong Wu, Ben Kao, Pengcheng Yin
-
"Open Information Extraction with Global Structure Constraints" - WWW 2018
Qi Zhu, Xiang Ren, Jingbo Shang, Yu Zhang, Frank F. Xu, Jiawei Han
-
"CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information" - WWW 2018 (code)
Shikhar Vashishth, Prince Jain, Partha Talukdar
-
"Revisiting the Task of Scoring Open IE Relations" - LREC 2018 (poster)
William Léchelle, Philippe Langlais
-
"Employing Semantic Context for Sparse Information Extraction Assessment" - TKDD 2018 (resources)
Pei-Pei Li, Haixun Wang, Hongsong Li, Xindong Wu
-
"Open Information Extraction with Meta-pattern Discovery in Biomedical Literature" - BCB 2018
Xuan Wang, Yu Zhang, Qi Li, Yinyin Chen, Jiawei Han
-
"Modeling and Summarizing News Events Using Semantic Triples" - ESWC 2018
Radityo Eko Prasojo, Mouna Kacimi, Werner Nutt
-
"Disambiguating Open IE: Identifying Semantic Similarity in Relation Extraction by Word Embeddings" - PROPOR 2018
Leandro M. P. Sanches, Victor S. Cardel, Larissa S. Machado, Marlo Souza, Laís do Nascimento Salvador
-
"Task-Oriented Evaluation of Dependency Parsing with Open Information Extraction" - PROPOR 2018
Pablo Gamallo, Marcos Garcia
-
"Challenges of an Annotation Task for Open Information Extraction in Portuguese" - PROPOR 2018
Rafael Glauber, Leandro Souza de Oliveira, Cleiton Fernando Lima Sena, Daniela Barreiro Claro, Marlo Souza
-
"A systematic mapping study on open information extraction" - Expert Syst. Appl. 2018
Rafael Glauber, Daniela Barreiro Claro
-
"Self-training on refined clause patterns for relation extraction" - Inf. Process. Manage. 54(4): 686-706 (2018)
Duc-Thuan Vo, Ebrahim Bagheri
-
"Supervised Neural Models Revitalize the Open Relation Extraction" - CoRR 2018
Shengbin Jia, Yang Xiang, Xiaojun Chen
-
"Chinese Open Relation Extraction and Knowledge Base Establishment" - ACM Trans. Asian & Low-Resource Lang. Inf. Process. 2018 (slides, code)
Shengbin Jia, Shijia E, Maozhen Li, Yang Xiang
-
"WiRe57: A Fine-Grained Benchmark for Open Information Extraction" - CoRR 2018
William Léchelle, Fabrizio Gotti, Philippe Langlais
-
"OPIEC: An Open Information Extraction Corpus" - AKBC 2019 (data + resources, code (data reading), code (pipeline))
Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla
-
"MinScIE: Citation-centered Open Information Extraction" - JCDL 2019 (code)
Anne Lauscher, Yide Song, Kiril Gashteovski
-
"EAL: A Toolkit and Dataset for Entity-Aspect Linking" - JCDL 2019 (data, code, demo)
Federico Nanni, Jingyi Zhang, Ferdinand Betz, Kiril Gashteovski
-
"Integrating Local Context and Global Cohesiveness for Open Information Extraction" - WSDM 2019 (code)
Qi Zhu, Xiang Ren, Jingbo Shang, Yu Zhang, Ahmed El-Kishky, Jiawei Han
-
"Open Information Extraction from Question-Answer Pairs" - NAACL 2019
Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish
-
"OpenKI: Integrating Open Information Extraction and Knowledge Bases with Relation Inference" - NAACL 2019
Dongxu Zhang, Subhabrata Mukherjee, Colin Lockard, Xin Luna Dong, Andrew McCallum
-
"OpenCeres: When Open Information Extraction Meets the Semi-Structured Web" - NAACL 2019
Colin Lockard, Prashant Shiralkar, Xin Luna Dong
-
"Improving Open Information Extraction via Iterative Rank-Aware Learning" - ACL 2019 (code)
Zhengbao Jiang, Pengcheng Yin, Graham Neubig
-
"Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data" - EMNLP 2019
Ruidong Wu, Yuan Yao, Xu Han, Ruobing Xie, Zhiyuan Liu, Fen Lin, Leyu Lin, Maosong Sun
-
"Supervising Unsupervised Open Information Extraction Models" - EMNLP 2019
Arpita Roy, Youngja Park, Taesung Lee, Shimei Pan
-
"CaRB: A Crowdsourced Benchmark for Open IE" - EMNLP 2019
Sangnie Bhardwaj, Samarth Aggarwal, Mausam
-
"CaRe: Open Knowledge Graph Embeddings" - EMNLP 2019
Swapnil Gupta, Sreyash Kenkre, Partha Talukdar
-
"Collaborative Policy Learning for Open Knowledge Graph Reasoning" - EMNLP 2019
Cong Fu, Tong Chen, Meng Qu, Woojeong Jin, Xiang Ren
-
"Canonicalization of Open Knowledge Bases with Side Information from the Source Text" - ICDM 2019
Xueling Lin, Lei Chen
-
"Open Relation Extraction for Chinese Noun Phrases" - TKDE 2019
Chengyu Wang, Xiaofeng He, Aoying Zhou
-
"Lexicon-Grammar based Open Information Extraction from Natural Language Sentences in Italian" - Expert Systems with Applications 2019
Raffaele Guarasci, Emanuele Damiano, Aniello Minutolo, Massimo Esposito, Giuseppe De Pietro
-
"Weakly Supervised, Data-Driven Acquisition of Rules for Open Information Extraction" - CAIAC 2019
Fabrizio Gotti, Philippe Langlais
-
"Aligning Open IE Relations and KB Relations using a Siamese Network Based on Word Embedding" - ICCS 2019
Rifki Afina Putri, Giwon Hong, Sung-Hyon Myaeng
-
"Contextualized Word Embeddings in a Neural Open Information Extraction Model" - NLDB 2019
Injy Sarhan, Marco R. Spruit
-
"Multilingual Open Information Extraction: Challenges and Opportunities" - Information 10(7): 228, 2019
Daniela Barreiro Claro, Marlo Souza, Clarissa Castellã Xavier, Leandro Souza de Oliveira
-
"Learning Open Information Extraction of Implicit Relations from Reading Comprehension Datasets" - CoRR 2019
Jacob Beckerman, Theodore Christakis
-
"Span Based Open Information Extraction" - CoRR 2019
Junlang Zhan, Hai Zhao
-
"Open Information Extraction Systems and Downstream Applications" - IJCAI 2016
Mausam
-
"A Survey on Open Information Extraction" - COLING 2018
Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh
-
"A systematic mapping study on open information extraction" - Expert Syst. Appl. 2018
Rafael Glauber, Daniela Barreiro Claro
-
"Multilingual Open Information Extraction: Challenges and Opportunities" - Information 10(7): 228, 2019
Daniela Barreiro Claro, Marlo Souza, Clarissa Castellã Xavier, Leandro Souza de Oliveira
-
"Creating a Large Benchmark for Open Information Extraction" - EMNLP 2016 (code, talk)
Gabriel Stanovsky, Ido Dagan
-
"An Informativeness Approach to Open IE Evaluation" - CICLing 2016 (slides, code + data)
William Léchelle, Philippe Langlais
-
"An Evaluation of PredPatt and Open IE via Stage 1 Semantic Role Labeling" - IWCS 2017
Sheng Zhang, Rachel Rudinger, Benjamin Van Durme
-
"An assessment of open relation extraction systems for the semantic web" - Inf. Syst. 71, 2017
Amal Zouaq, Michel Gagnon, Ludovic Jean-Louis
-
"RelVis: Benchmarking OpenIE Systems" - ISWC 2017
Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, Alexander Löser
-
"Analysing Errors of Open Information Extraction Systems" - Workshop on Building Linguistically Generalizable NLP Systems @ EMNLP 2017
Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, Alexander Löser
-
"Open Information Extraction on Scientific Text: An Evaluation" - COLING 2018
Paul T. Groth, Michael Lauruhn, Antony Scerri, Ron Daniel
-
"WiRe57: A Fine-Grained Benchmark for Open Information Extraction" - CoRR 2018
William Léchelle, Fabrizio Gotti, Philippe Langlais
-
"CaRB: A Crowdsourced Benchmark for Open IE" - EMNLP 2019
Sangnie Bhardwaj, Samarth Aggarwal, Mausam
OIE output has been shown to be a useful input for many downstream tasks. This section lists work on several downstream tasks that benefit from OIE output.
-
"Assertion-based QA with Question-Aware Open Information Extraction" - AAAI 2018
Zhao Yan, Duyu Tang, Nan Duan, Shujie Liu, Wendi Wang, Daxin Jiang, Ming Zhou, Zhoujun Li
-
"Answering Complex Questions Using Open Information Extraction" - ACL 2017
Tushar Khot, Ashish Sabharwal, Peter Clark
-
"Paraphrase-Driven Learning for Open Question Answering" - ACL 2013
Anthony Fader, Luke S. Zettlemoyer, Oren Etzioni
-
"Open Information Extraction to KBP Relations in 3 Hours" - TAC 2013
Stephen Soderland, John Gilmer, Robert Bart, Oren Etzioni, Daniel S. Weld
-
"Generating Coherent Event Schemas at Scale" - EMNLP 2013
Niranjan Balasubramanian, Stephen Soderland, Mausam, Oren Etzioni
-
"Facts That Matter" - EMNLP 2018
Marco Ponza, Luciano Del Corro, Gerhard Weikum
-
"Pocket Knowledge Base Population" - ACL 2017
Travis Wolfe, Mark Dredze, Benjamin Van Durme
-
"Relating Legal Entities via Open Information Extraction" - MTSR 2018
Giovanni Siragusa, Rohan Nanda, Valeria De Paiva, Luigi Di Caro
Most OIE systems focus on extractions from text written in English. However, some OIE systems either target a language other than English or are multilingual. This section lists such non-English and multilingual OIE systems.
-
"Multilingual Open Information Extraction: Challenges and Opportunities" - Information 10(7): 228, 2019
Daniela Barreiro Claro, Marlo Souza, Clarissa Castellã Xavier, Leandro Souza de Oliveira
-
"Multilingual Open Relation Extraction Using Cross-lingual Projection" - HLT-NAACL 2015
Manaal Faruqui, Shankar Kumar
-
"Multilingual Open Information Extraction" - EPIA 2015
Pablo Gamallo, Marcos García
-
"GerIE - An Open Information Extraction System for the German Language" - J. UCS 2018
Akim Bassa, Mark Kröll, Roman Kern
-
"Porting an Open Information Extraction System from English to German" - EMNLP 2016
Tobias Falke, Gabriel Stanovsky, Iryna Gurevych, Ido Dagan
-
"Challenges of an Annotation Task for Open Information Extraction in Portuguese" - PROPOR 2018
Rafael Glauber, Leandro Souza de Oliveira, Cleiton Fernando Lima Sena, Daniela Barreiro Claro, Marlo Souza
-
"Inference Approach to Enhance a Portuguese Open Information Extraction" - ICEIS 2017
Cleiton Fernando Lima Sena, Rafael Glauber, Daniela Barreiro Claro
-
"DependentIE: An Open Information Extraction system on Portuguese by a Dependence Analysis" - ENIAC 2017
Leandro Souza de Oliveira, Rafael Glauber, Daniela Barreiro Claro
-
"ZORE: A Syntax-based System for Chinese Open Relation Extraction" - EMNLP 2014
Likun Qiu, Yue Zhang
-
"Chinese Open Relation Extraction and Knowledge Base Establishment" - ACM Trans. Asian & Low-Resource Lang. Inf. Process. 2018 (slides, code)
Shengbin Jia, Shijia E, Maozhen Li, Yang Xiang
-
"Open Relation Extraction for Chinese Noun Phrases" - TKDE 2019
Chengyu Wang, Xiaofeng He, Aoying Zhou
-
"A recursive algorithm for open information extraction from Persian texts" - IJCAT 2018
Mahmoud Rahat, Alireza Talebpour, Seyedamin Monemian
-
"Open information extraction as an intermediate semantic structure for Persian text summarization" - Int. J. on Digital Libraries (2018)
Mahmoud Rahat, Alireza Talebpour
-
"Parsa: An open information extraction system for Persian" - DSH 2018
Mahmoud Rahat, Alireza Talebpour
-
"Lexicon-Grammar based Open Information Extraction from Natural Language Sentences in Italian" - Expert Systems with Applications 2019
Raffaele Guarasci, Emanuele Damiano, Aniello Minutolo, Massimo Esposito, Giuseppe De Pietro
-
"Supervised Open Information Extraction" - NAACL-HLT 2018
Gabriel Stanovsky, Julian Michael, Luke Zettlemoyer, Ido Dagan
-
"Neural Open Information Extraction" - ACL 2018
Lei Cui, Furu Wei, Ming Zhou
-
"Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction" - WSDM 2018
Mingming Sun, Xu Li, Xin Wang, Miao Fan, Yue Feng, Ping Li
-
"Contextualized Word Embeddings in a Neural Open Information Extraction Model" - NLDB 2019
Injy Sarhan, Marco R. Spruit
-
"Weakly Supervised, Data-Driven Acquisition of Rules for Open Information Extraction" - CAIAC 2019
Fabrizio Gotti, Philippe Langlais
-
"Learning Open Information Extraction of Implicit Relations from Reading Comprehension Datasets" - CoRR 2019
Jacob Beckerman, Theodore Christakis
- [pdf] "Compact Open Information Extraction on Large Corpora". Talk by Kiril Gashteovski given at NEC Labs Europe GmbH, 2019.
- [pdf] "(Information Extraction) Lecture 10 – Ontological and Open IE": A lecture on Open IE, which is part of the course "Information Extraction", by Prof. Dr. Alexander Fraser, from LMU München
- Open IE Tutorial: Open Information Extraction for QA by André Freitas. The tutorial was presented at OKBQA 2018
- [pdf] "Chinese Open Relation Extraction and Knowledge Base Establishment", 2018
- [pdf] "Brief Introduction and Review of Open Information Extraction (Open-IE) Systems". Project Presentation by Sina Miran.
- [pdf] "Open Information Extraction Systems and Downstream Applications" by Prof. Mausam. The talk was presented at IJCAI 2016
- [pptx] "Open Information Extraction from the Web", presented by Prof. Oren Etzioni. The tutorial was presented at AKBC-WEKEX 2012
- [pdf] "ClausIE: Clause-Based Open Information Extraction" by Luciano Del Corro.
- [pdf] "Open Information Extraction: the Second Generation"
- [pdf] "Open Information Extraction: Where Are We Going?" by Claudio Delli Bovi, 2016
- [pdf] "An Informativeness Approach to Open Information Extraction Evaluation" by William Léchelle, 2016
- [video] "Open Information Extraction from the Web", by Prof. Oren Etzioni, presented at AKBC-WEKEX 2012. Slides: [pptx]
- [video] "Open Information Extraction: Where Are We Going?", by Claudio Delli Bovi. The talk was given at AI2 in 2016. Slides [pdf]
- [video] "Nested Propositions in Open Information Extraction" by Nikita Bhutani at EMNLP 2016
- [video] "Creating a Large Benchmark for Open Information Extraction" by Gabriel Stanovsky at EMNLP 2016
- MinIE: Open Information Extraction System
- MinIE: originally written in Java
- Python wrapper for MinIE
- MinScIE - an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations (based on MinIE).
- SalIE - Salient Open Information Extraction (based on MinIE)
- ClausIE: Clause-based OIE
- ClausIE: originally written in Java
- ClausIE (mavenized version)
- ClausIEpy: Python wrapper for ClausIE
- OpenIE 5.0: the latest version of OpenIE at AI2.
- OLLIE
- ReVerb
- Stanford's OpenIE:
- Stanford OpenIE: Stanford's OpenIE system.
- Stanford OpenIE Spider: Extract Information from WebCorpus using Stanford Open Information Extraction.
- Python wrapper for Stanford OpenIE: The unofficial cross-platform Python wrapper for the state-of-the-art information extraction library from Stanford University.
- Graphene: OpenIE system containing coreference resolution, simplification and open relation extraction pipeline
- EXEMPLAR
- DefIE: Open information extraction from textual definitions
- ReMine: Integrating Local and Global Cohesiveness for Open Information Extraction
- OIE systems for languages other than English or cross-lingual systems:
- Zhopenie - Chinese OIE: OIE system for Chinese language written in Python.
- Open Relation Extraction for Chinese: Knowledge triples extraction (entities and relations extraction) and knowledge base construction based on dependency syntax for open domain text (for Chinese)
- Baaz: Open information extraction from Persian web (Python)
- MT/IE: Cross-lingual Open IE. Attention-based sequence-to-sequence model for cross-lingual open IE. Written in Python
- Relation Extraction on German Websites: This repository holds a collection of three Open Information Extraction approaches for the German language
- DptOIE: A Portuguese Open Information Extraction system based on Dependency Analysis
- PragmaticOIE: a rule-based approach to extract facts in Portuguese at a first pragmatic level
- CORE: Context-Aware Open Relation Extraction with Factorization Machines
- CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information
- IMPLIE: IMPLIE (IMPLicit relation Information Extraction) is a program that extracts binary relations from English sentences where the relationship between the two entities is not explicitly stated in the text.
- Ranking: Iterative Rank-Aware Open IE (confidence score).
OIE output serves as input for many downstream tasks, such as question answering, event schema induction or generating inference rules. Moreover, OIE output can be used as "fuel" to derive further resources. Here, the data is organized into two major categories: 1) OIE corpora; 2) resources derived from OIE output.
- OPIEC: An Open Information Extraction Corpus: the largest OIE corpus to date, containing more than 341M triples extracted from the entire English Wikipedia. Each triple in the corpus comes with rich metadata: each token of the subj / obj / rel along with NLP annotations (POS tag, NER tag, ...), the provenance sentence along with its dependency parse, the original (golden) links from Wikipedia, sentence order, space / time, etc.
- [.gz] ReVerb extractions: 15 million high-precision OIE extractions (826MB compressed) from the OIE system ReVerb. The extractions were made from the ClueWeb09 corpus. The data contains (subject, relation, object) triples, each accompanied by a confidence score (estimating the likelihood that the triple was correctly extracted) and provenance information (the link to the web page from which the triple was extracted).
- ReVerb extractions (linked): 3 million triples with linked arguments (a subset of the 15M high-precision ReVerb extractions). The links (to Freebase) are provided by an entity linker. The data fields are: argument 1, relation phrase, argument 2, Freebase ID for the argument 1 link, corresponding Freebase entity name, link score, link ambiguity score.
- PATTY: PATTY is a system that takes open relations between two arguments, structures them into relational synsets and then organizes the synsets into a taxonomy. This resource contains over 15M triples with disambiguated arguments (links to Wikipedia articles) and the relation synset ID between them. Additionally, the resource contains: 1) relation pattern synsets with type signatures; 2) relation pattern subsumptions; 3) relation paraphrases; 4) evaluation data.
- WiSeNet (1.0 and 2.0): similarly to PATTY, WiSeNet 1.0/2.0 is a resource consisting of OIE triples, where the arguments are disambiguated and the open relations are organized into relation synsets and then taxonomized. One of the main differences between PATTY and WiSeNet is that WiSeNet contains "golden links" for the arguments (annotated by humans), obtained by keeping the original links from the Wikipedia articles.
- KB-Unify: KB-Unify takes several OIE corpora as input and unifies them into a single disambiguated OIE repository. The open relations are organized into relational synsets and the arguments are disambiguated with Babelfy.
- Functional relations: 10K functional relations. This resource comes from the paper "Identifying Functional Relations in Web Text", published at EMNLP 2010.
- Entailment rules: 10M predicative entailment rules learned using local and global algorithms. From the documentation: "This resource of predicative entailment rules contains three resources in two formats – shallow and syntactic. Resources are learned over the REVERB data set and using the local and [global] algorithms described in Chapter 5 of Jonathan Berant’s thesis (which is part of the package)."
- Entailment rules: 36K high-precision entailment rules (data and code). The resource is the result of the work of Prachi Jain and Mausam, "Knowledge-Guided Linguistic Rewrites for Inference Rule Verification", published at NAACL-HLT 2016.
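As a rough illustration of how the linked ReVerb extractions listed above might be consumed, the sketch below reads rows with the seven data fields named in that entry. Several things here are assumptions rather than facts about the resource: that the file is tab-separated, that the fields appear in the listed order, and that the two scores are numeric. The `LinkedExtraction` class, the `read_linked` function, the 0.5 threshold, and the sample rows are all hypothetical.

```python
import csv
import io
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class LinkedExtraction:
    """One row with the seven fields listed for the linked ReVerb data."""
    arg1: str
    relation: str
    arg2: str
    freebase_id: str
    freebase_name: str
    link_score: float
    link_ambiguity: float


def read_linked(lines: Iterable[str]) -> Iterator[LinkedExtraction]:
    """Yield extractions from tab-separated lines (assumed layout)."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 7:
            continue  # skip rows that do not have exactly seven fields
        arg1, rel, arg2, fb_id, fb_name, score, ambiguity = row
        yield LinkedExtraction(
            arg1, rel, arg2, fb_id, fb_name, float(score), float(ambiguity)
        )


# Made-up rows for illustration only; they are not taken from the resource.
SAMPLE = (
    "AMD\tis based in\tU.S.\texample-id-1\tAdvanced Micro Devices\t0.9\t0.1\n"
    "Apple\tis\ta fruit\texample-id-2\tApple Inc.\t0.2\t0.8\n"
)

rows = list(read_linked(io.StringIO(SAMPLE)))

# Keep only rows whose link score passes a hypothetical threshold.
confident = [row for row in rows if row.link_score >= 0.5]
for row in confident:
    print(row.arg1, "|", row.relation, "|", row.arg2, "->", row.freebase_name)
```

For the real file, the same function could be fed an open file handle instead of the in-memory sample; the actual column layout and score ranges should be checked against the resource's own documentation first.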
-
"Constructing Lexicons of Relational Phrases" by Adam Grycner, University of Saarland, Germany, 2017
-
"Methods for open information extraction and sense disambiguation on natural language text" by Luciano Del Corro, University of Saarland, Germany, 2016
-
"Automated Knowledge Base Extension Using Open Information" by Arnab Kumar Dutta, University of Mannheim, Germany, 2015
-
"Exploiting Knowledge in Unsupervised Open Information Extraction" by Yuval Merhav, Illinois Institute of Technology, USA, 2012
-
"Open Information Extraction for the Web" by Michele Banko, University of Washington, USA, 2009
- ClausIE: Demo for ClausIE, an OIE system.
- Fact retrieval: Fact retrieval with OpenIE on large corpora.