/Dissertation_Supplementary_Materials

Datasets, tools and results from my doctoral dissertation

Primary LanguageShellGNU General Public License v3.0GPL-3.0

This is the main repository that contains the supplementary materials related to my doctoral dissertation.

Abstract

The unprecedented rate of scientific publications is a major threat to the productivity of knowledge workers, who rely on scrutinizing the latest scientific discoveries for their daily tasks. In this dissertation, we introduce a Personal Research Agent that can help scientists in discovering, reading and learning from scientific documents, primarily in the computer science domain. We demonstrate how a confluence of techniques from the Natural Language Processing and Semantic Web domains can construct a knowledge base of semantically-rich, inter-connected graph of scholarly artifacts, effectively transforming scientific literature from written content in isolation, into a queryable web of knowledge, suitable for machine interpretation.

Repository Structure

  • datasets contains the data used in the evaluation chapter of the dissertation
  • knowledgebase contains the populated knowledge bases that resulted from the experiments in the dissertation
  • pipelines contains the GATE text mining pipelines
  • recommendation contains the supplementary materials related to evaluation of our semantic vector-based recommendation of scientific articles
  • utilities contains tools and scripts frequently used in my research

License

All the files included in this repository are licensed under GNU Lesser General Public License version 3 (LGPL v3).