GeoInvest

Primary language: OpenEdge ABL

Environment:

  • Python 2.x
  • Pathos (used for multiprocessing)
  • bs4 (Beautiful Soup 4)

Minimum Viable Product

Download file from SEC

Parallelization

Generate Cities and Forms Classes

(Details are omitted.)

Process document

  1. identify sections 1, 2, 6 and 7
  2. import tables from city utilities
  3. make state count table
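
A minimal sketch of steps 1 and 3 above, under assumptions not stated in this README: the filing is already plain text, sections start with headings of the form "Item N.", and the state list is hard-coded. The names `split_sections`, `count_states`, `STATES` and `ITEM_HEADING` are hypothetical, and step 2 (the city utility tables) is not covered.

```python
import re
from collections import Counter

# Hypothetical, abbreviated state list; a real run would cover all 50 states.
STATES = ["California", "New York", "Texas", "Virginia", "West Virginia"]

# Assumed heading format: "Item 1." or "ITEM 7." at the start of a line.
ITEM_HEADING = re.compile(r"^\s*item\s+(\d+[a-z]?)\s*\.",
                          re.IGNORECASE | re.MULTILINE)

# One alternation over all state names. Matches are non-overlapping and
# leftmost-first, so "West Virginia" is not counted again as "Virginia".
STATE_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(name) for name in STATES) + r")\b")


def split_sections(text, wanted=("1", "2", "6", "7")):
    """Return {item number: section text} for the wanted items only."""
    matches = list(ITEM_HEADING.finditer(text))
    sections = {}
    for i, match in enumerate(matches):
        number = match.group(1).upper()
        # A section runs until the next heading, or to the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        if number in wanted:
            sections[number] = text[match.end():end].strip()
    return sections


def count_states(text):
    """Count how often each known state name appears in the text."""
    return Counter(STATE_PATTERN.findall(text))


sample = """Item 1. Business
We operate stores in Texas and California. Texas is our largest market.
Item 1A. Risk Factors
Weather in New York may affect results.
Item 2. Properties
Our headquarters are in West Virginia.
"""

sections = split_sections(sample)
counts = count_states(" ".join(sections.values()))
print(sorted(counts.items()))
```

If an item heading appears more than once (for example in a table of contents and again in the body), this sketch keeps only the last occurrence, which may or may not be the desired behavior.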

To do:
  4. store the state count table in a CSV file

We will need the following info:

  • company name
  • time (year) the document was published
  • URL of the 10-K
  • count of states (of course)
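
One possible shape for this step, as a sketch only: it assumes a long-format file with one row per (document, state) pair, and the column names, the helper `write_state_counts` and the sample values are all hypothetical. It is written for Python 3; under Python 2 the csv module expects a byte-oriented file, so the in-memory buffer would need to change.

```python
import csv
import io

# Hypothetical column names covering the four pieces of info listed above.
FIELDS = ["company", "year", "url", "state", "count"]


def write_state_counts(out, company, year, url, counts):
    """Write a header plus one row per state to an open text file object."""
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(FIELDS)
    # Sorted so the output is stable from run to run.
    for state in sorted(counts):
        writer.writerow([company, year, url, state, counts[state]])


# Usage with an in-memory buffer; a real run would pass
# open("state_counts.csv", "w", newline="") instead.
buf = io.StringIO()
write_state_counts(buf, "Example Corp", 2015,
                   "https://example.com/10-k.htm",
                   {"Texas": 2, "California": 1})
csv_text = buf.getvalue()
print(csv_text)
```

Keeping one row per state avoids a 50-column table and makes it easy to append further documents to the same file.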