/biofuel_brazil_plants

[EN] Brazilian biofuel plants / [PT-BR] Usinas de biocombustível brasileiras

Primary LanguagePythonApache License 2.0Apache-2.0

Brazilian biofuel plants

Python version CI codecov License

Para acessar a versão em português, clique aqui.

Overview

This project aims to extract, refine, and geocode data about Brazilian biofuel plants obtained through Certificates of Efficient Biofuel Production issued by accredited inspection firms under RenovaBio, approved by the National Agency for Petroleum, Natural Gas and Biofuels (ANP).

The data generated after all processing can be used in geospatial analysis, with a certain margin of error in relation to the plants' locations.

The data pipeline was built using kedro 0.18.12.

Key Features

  1. Extraction of raw data from efficient biofuel production certificates issued by ANP.
  2. Refinement of raw data, including data type standardization and arrangement.
  3. Geocoding of refined data using Selenium and Google Maps.
  4. Map visualization of the geocoded data using Folium.

Data Pipeline

pipeline

Requirements

The required libraries for project execution are listed in the src/requirements.txt file. To install them, use the following command:

pip install -r src/requirements.txt

Additionally, you need to have the latest version of geckodriver in a known directory.

How to Run the Pipeline

To execute the project, use the command:

kedro run

To execute a specific pipeline, use the command:

kedro run --pipeline <pipeline-name>

Contact

LinkedIn: Daniel Rodrigues