/anacAnalysis

Contains the code to crawl the Anac dataset and jupyter notebooks for explorative analysis

Primary LanguageJupyter Notebook

ANAC analysis

This project allow us to extract Anac dataset, to realize some descriptive analysis, apply machine learning algorithms, and generate a graph of semantic relationships among public administrations and private companies.

We present this project at Data Driven Conference. Here the presentation.

The project in organized in three main sections:

  • crawler: java project for crawling data. Starting from a json file containing information about all public administration, it extract the dataset used for our analysis.
  • notebooks: python project used for analyzing the ANAC dataset extracted by the crawling process.
  • web_api: a docker-composer file which runs a Neo4j instance and a React web-app and visualizes the semantic graph G(V, E), where the nodes in V are public administrations and private companies contained in the ANAC dataset and the edges E are the semantic relationships among nodes.

This is a prototype, so feel free to improve this project o suggest us alternative solutions.