This repository contains a pipeline written in Python (Luigi) that performs the following steps:
- Crawls DBLP and downloads open access papers
- Extracts the reference list from each paper and builds a citation network
- Computes citation counts and PageRank-based scores for ranking papers in the network
- Creates an analytical report on the most influential and trending papers found in that network
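
To illustrate the ranking step, here is a minimal sketch of citation counting and a plain PageRank iteration over a toy citation network. It is not the code used in this repository: the function names, the `citations` dictionary layout (citing paper mapped to the papers it cites), the damping factor of 0.85, and the fixed iteration count are all assumptions made for the example.

```python
from collections import defaultdict


def citation_counts(citations):
    """Count how many times each paper is cited (hypothetical helper)."""
    counts = defaultdict(int)
    for citing, cited_list in citations.items():
        counts[citing] += 0  # make sure uncited papers appear with a count of 0
        for cited in cited_list:
            counts[cited] += 1
    return dict(counts)


def pagerank(citations, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over a citation graph (illustrative only)."""
    nodes = set(citations)
    for cited_list in citations.values():
        nodes.update(cited_list)
    if not nodes:
        return {}
    n = len(nodes)
    rank = {paper: 1.0 / n for paper in nodes}
    for _ in range(iterations):
        # Every paper starts each round with the "random jump" share.
        new_rank = {paper: (1.0 - damping) / n for paper in nodes}
        dangling = 0.0
        for paper in nodes:
            cited_list = citations.get(paper, [])
            if cited_list:
                # A paper passes its rank evenly to the papers it cites.
                share = damping * rank[paper] / len(cited_list)
                for cited in cited_list:
                    new_rank[cited] += share
            else:
                # Papers with no outgoing references spread their rank evenly.
                dangling += damping * rank[paper]
        for paper in nodes:
            new_rank[paper] += dangling / n
        rank = new_rank
    return rank


# Toy network (made-up identifiers): A and B both cite C, and C cites nothing.
citations = {"A": ["C"], "B": ["C"], "C": []}
counts = citation_counts(citations)
ranks = pagerank(citations)
print(counts)
print(sorted(ranks, key=ranks.get, reverse=True))
```

In this toy network the most cited paper also gets the highest PageRank. On a real network the two rankings can differ, because PageRank weights a citation by the rank of the citing paper.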
Please visit the project wiki for details.

To get started, install Python, then run the commands below:
# install pip
sudo apt-get install python-pip
# install requirements
pip install -r requirements.txt
# run the application
PYTHONPATH='.' luigi --module crawler Crawler --local-scheduler
...