This project was carried out by Group 19 of Algorithmic Methods for Data Mining, consisting of:
NAME and SURNAME | |
---|---|
Pasquale Luca Tommasino | pl.tommasino@gmail.com |
Deniz Yilmaz | denizyilmazz@yahoo.com |
Emmanuele De Lucia | delucia.2099678@studenti.uniroma1.it |
Paolo Zilviano | zilviano.1916518@studenti.uniroma1.it |
All the points of the project are contained in a single main file called HW5_main.ipynb
.
In this part, we build the two graph: Citation graph and Collaboration graph. For this, we created a file (dictionary.py
in the path dict/
) which contained all the functions that allowed us to transform the dataset from .json to dictionary.
In funct/
path, we create five files (functionality1.py
, functionality2.py
, functionality3.py
, functionality4.py
, functionality5.py
) for define all the Functionalities function, and then we import all the function in their respective section in the main file.
In this part, the following 3 command line questions are answered using the AWS CloudShell;
1)Is there any node that acts as an important "connector" between the different parts of the graph? 2)How does the degree of citation vary among the graph nodes? 3)What is the average length of the shortest path among nodes?
CommandLine.sh
: This file contains the code for commandline question.
CLQ_screenshots
: This folder contains screenshots of running command-line scripts to three CLQ questions individually.
In the final part of the homework it's provided an algorithmic code for the resolution of the problem requested in the part A and a heuristic demostration of the point in the part B