- Python 2.x
- Pathos (used for multiprocessing)
- bs4
Parallelization
(Details are omitted.)
- identify sections 1, 2, 6 and 7
- import tables from city utilities
- make state count table
To do:
4. store states count table into csv file
we will need info as below:
- company name
- time (year) the document was published
- url of 10-k
- count of states (of course)