There is an Iani site called Nan, which contains all the needs of the country.
address this website:
First, scroll the page with the Selenium package until it reaches the end and download the HTML file (the size of the file is almost one gigabyte).
Then with the package BeautifulSoup extract all the links from the HTML file and save it in a text file named link.txt
Then we started extracting information from each link in the main.py file and after finishing, we saved it as a csv file
Notebook is also availble on google colab.
Now, using the Pandas package, diagrams about :
- key words
- Application areas
- city and province
- Important application areas in each province
- Super words (regarding the summary of requirements)