This project is an Extract, Transform, Load (ELT) Python script that extracts data from a given URL, transforms it into a polars DataFrame, and loads it into CSV files.
To use this script, follow these steps:
-
Open your terminal or command prompt.
-
Navigate to the directory where you want to clone the repository.
-
Run the following command:
git clone https://github.com/aureliowozhiak/elt-python-2
This will create a local copy of the repository in your current directory.
-
Enter in the folder:
cd elt-python-2
-
Execute the code with your URL, example:
python3 -m app https://pt.wikipedia.org/wiki/Python https://pt.wikipedia.org/wiki/Java
Projeto feito no curso de engenharia de dados da Udemy: https://dataengineer.help/cursos/curso-de-engenharia-de-dados/