
Solutions for extracting information from massive free-text resources, such as Wikipedia, and generating a database of facts.

Primary Language: Python


The web is full of informative free-text resources that make sense to a human reader but not to a computer, which can treat them only as unstructured data or a bag of words. In this project, we implement the Snowball system for the crucial task of extracting facts from massive free-text resources, such as Wikipedia, and generating a semantic database of those facts.

Semantic data allow machines to work with real-world information without human interpretation. A semantic representation of a fact consists of two entities and the relationship between them. For example, the sentence "Einstein was born in Germany." would be represented as the entity pair <Einstein, Germany> with the relationship "wasBornIn".
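
To make the representation concrete, here is a minimal sketch of how such a fact could be stored and pulled out of a sentence. It is not the Snowball algorithm and not this project's actual code: the `Fact` class, the `BORN_IN` pattern, and the `extract_born_in` function are hypothetical names used only for illustration, and the single hand-written regular expression stands in for the patterns a real system would learn.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """One semantic fact: two entities and the relationship between them."""
    subject: str
    relation: str
    obj: str


# Hypothetical hand-written pattern for a single relation.
# An entity is assumed to be one or more capitalized words.
ENTITY = r"[A-Z]\w+(?: [A-Z]\w+)*"
BORN_IN = re.compile(rf"(?P<subject>{ENTITY}) was born in (?P<obj>{ENTITY})")


def extract_born_in(text):
    """Return a Fact for every 'X was born in Y' match found in text."""
    return [
        Fact(m.group("subject"), "wasBornIn", m.group("obj"))
        for m in BORN_IN.finditer(text)
    ]


if __name__ == "__main__":
    for fact in extract_born_in("Einstein was born in Germany."):
        print(f"<{fact.subject}, {fact.obj}> {fact.relation}")
```

A fixed pattern like this only covers one phrasing of one relation, which is why a pattern-learning approach is needed to scale to a resource the size of Wikipedia.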


