Source code for web scraping football draft prospects from the DraftTek NFL Big Board.
The scraper is written in Go and uses the Colly scraping framework. The binary in the repo is compiled for Linux, but it can be recompiled for another operating system if needed.
Once the ranks have been scraped, I use csvkit to merge the scraped files and join them with information about the locations of the schools. The csvkit commands are in the csvkitcommands.txt file.
Once the ranks have been assembled, I use OpenRefine to clean the data for consistency. The data cleaning steps are contained in openRefineDataMerge.json.