wikidrugs

Extraction of medical drug side effects from Wikipedia

Project description

• Developed and automated a process that extracts candidate side effects from Wikipedia articles on medical drugs, using the Google NL API to retrieve entities and select them by salience score.
• Preprocessed 7 million lines of text and extracted side-effect patterns from it using regular expressions and Python's XML and JSON parsers.

(Python, Scrapy, Google NL API, HTML, Excel)
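The two steps described above might look roughly like the following minimal sketch. It is illustrative only and is not the project's actual code: the regular expression, the function names `extract_side_effects` and `salient_entities`, and the `0.05` salience threshold are all assumptions. The second function makes no API call; it only assumes a JSON response containing an `entities` list whose items carry `name` and `salience` fields, which is the shape returned by the Google NL API entity analysis.

```python
import json
import re

# Hypothetical pattern (an assumption, not the project's real one): matches
# phrases such as "side effects include nausea, headache, and dizziness."
SIDE_EFFECT_PATTERN = re.compile(
    r"side effects?\s+(?:may\s+)?includ(?:e|es|ing)\s+([^.]+)",
    re.IGNORECASE,
)


def extract_side_effects(text):
    """Return candidate side-effect terms matched by the pattern in text."""
    terms = []
    for match in SIDE_EFFECT_PATTERN.finditer(text):
        # Split the captured list on commas, semicolons, "and", and "or".
        for part in re.split(r",|;|\band\b|\bor\b", match.group(1)):
            term = part.strip().lower()
            if term:
                terms.append(term)
    return terms


def salient_entities(response_json, threshold=0.05):
    """Keep entity names whose salience is at or above threshold.

    The threshold value is an illustrative assumption.
    """
    response = json.loads(response_json)
    return [
        entity["name"]
        for entity in response.get("entities", [])
        if entity.get("salience", 0.0) >= threshold
    ]
```

For example, `extract_side_effects("Common side effects include nausea, headache, and dizziness.")` yields the three listed terms in lowercase. A single fixed pattern like this will miss many phrasings, so in practice several patterns would probably be needed.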