Primary language: Jupyter Notebook

S&P 500 Wikipedia Company Info Extractor

A simple script that scrapes Wikipedia and outputs a JSON file with basic company information, products, services, and key executives for each S&P 500 company.

TO DO

  • Use regex to clean raw text in Key People, Products, and Services columns
  • Convert output to JSON
  • Convert notebook to script.py file
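
The first two items could be approached roughly as follows. This is only a sketch: the function names (`clean_field`, `to_json`), the field names, and the sample row are hypothetical and are not taken from the notebook. It also assumes that the raw column text contains bracketed citation markers and uses newlines, semicolons, or bullets to separate entries.

```python
import json
import re


def clean_field(raw: str) -> list[str]:
    """Split one raw text cell into a list of cleaned entries (hypothetical helper)."""
    # Drop bracketed markers such as [1] or [citation needed].
    text = re.sub(r"\[[^\]]*\]", "", raw)
    # Assumption: newlines, semicolons, and bullets separate entries.
    parts = re.split(r"[\n;•]+", text)
    # Collapse runs of whitespace and discard empty entries.
    cleaned = [re.sub(r"\s+", " ", part).strip() for part in parts]
    return [part for part in cleaned if part]


def to_json(records: list[dict]) -> str:
    """Serialize the cleaned records; ensure_ascii=False keeps non-ASCII names readable."""
    return json.dumps(records, indent=2, ensure_ascii=False)


# Hypothetical sample row, standing in for one scraped company.
sample = {
    "company": "Example Corp",
    "key_people": "Jane Doe (CEO)[1]\nJohn Roe  (CFO)[2]",
    "products": "Widgets; Gadgets[3]",
    "services": "",
}

record = {
    "company": sample["company"],
    "key_people": clean_field(sample["key_people"]),
    "products": clean_field(sample["products"]),
    "services": clean_field(sample["services"]),
}

print(to_json([record]))
```

Storing each column as a list rather than one long string makes the JSON easier to consume downstream. For the third item, `jupyter nbconvert --to script` followed by the notebook's filename is one possible way to generate a `.py` file from the notebook.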