Big Data course (81932), University of Bologna

This repository contains the instructions, code, and materials for the laboratory exercises of the course for the Academic Year 2023/24.

Course info: Link

Teacher: Enrico Gallinucci

Software requirements

All laboratory machines are equipped with the following software.


  • PuTTY or MobaXterm
    • To connect via SSH to the Virtual Lab machines and to establish SSH tunnels that enable access to web-based services
  • WinSCP
    • To transfer files to the Virtual Lab machines
  • Power BI
    • To explore and visualize datasets
  • IntelliJ IDEA
    • To write and compile Spark applications