A simple web scraping script that extracts the h-index, citation count, and number of publications from Scopus author profiles.
An input file (Excel .xlsx) must be prepared beforehand with at least the following headers:
- Main reference ID: any reference ID, used only to identify each unique person in your records
- Scopus ID: the Scopus ID of the individual; multiple Scopus IDs for the same person can be separated by a semicolon followed by a space (;[space])
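
As an illustration of the multi-ID format described above, here is a minimal sketch of how one "Scopus ID" cell could be split into individual IDs. The function name `split_scopus_ids` and the example IDs are hypothetical and not taken from the script. The sketch assumes a cell may arrive as text, as a number (Excel often stores a single ID as a numeric cell), or as an empty value.

```python
def split_scopus_ids(cell):
    """Split one 'Scopus ID' cell into a list of ID strings.

    Hypothetical helper: the name and behaviour are assumptions,
    not the script's actual implementation.
    """
    if cell is None:
        return []
    if isinstance(cell, float):
        if cell != cell:  # NaN, which is how pandas represents an empty cell
            return []
        if cell.is_integer():
            cell = int(cell)  # avoid a trailing ".0" on numeric cells
    # Split on ";" and trim whitespace so "; " and ";" both work.
    return [part.strip() for part in str(cell).split(";") if part.strip()]


# Made-up IDs, for illustration only.
print(split_scopus_ids("12345678900; 98765432100"))
print(split_scopus_ids(12345678900.0))
```

Splitting on the semicolon alone and trimming each part keeps the helper tolerant of a missing or extra space after the separator.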
Python 3, Pandas, Selenium
- A more efficient way of doing this is to use the Scopus API. For more details, please visit https://dev.elsevier.com/
- The script is not perfect. Sometimes, perhaps because of the connection to Scopus.com, problems such as timeouts or pending JavaScript rendering will result in certain indicators being set to zero (I took the easy route of catching every Exception and resetting those indicators to zero). It is important to go through the output after mining has finished and fix these errors manually.
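
To make the manual check described above easier, a small helper could list the output rows that contain a zero indicator. This is only a sketch: the column names in `INDICATORS`, the function name `rows_to_review`, and the sample records are all hypothetical and should be adapted to the headers in your actual output file.

```python
# Hypothetical column names; replace with the headers used in your output.
INDICATORS = ("h_index", "citations", "documents")


def rows_to_review(rows):
    """Return the rows in which at least one indicator is zero or missing."""
    return [
        row for row in rows
        if any(row.get(name, 0) == 0 for name in INDICATORS)
    ]


# Made-up records, for illustration only.
sample = [
    {"ref_id": "A1", "h_index": 12, "citations": 540, "documents": 30},
    {"ref_id": "A2", "h_index": 0, "citations": 0, "documents": 0},
    {"ref_id": "A3", "h_index": 3, "citations": 0, "documents": 5},
]
for row in rows_to_review(sample):
    print(row["ref_id"])
```

A zero can also be genuine (for example, an author with no citations yet), so flagged rows still need to be compared against the Scopus profile by hand.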