The objective of this project is to create an open-source, collaborative, and comprehensive dataset about football in Brazil, encompassing teams, players, matches, coaches, and more. Our mission is to collect and process data from all Brazilian states.
In the first version, our focus is on gathering data exclusively about football clubs. We aim to answer questions such as:
- Which is the oldest team in each state? Which is the oldest team in the country?
- Which is the youngest team in each state? Which is the youngest team in the country?
- What are the most common club names?
- How many clubs are registered in each state?
- Can we visualize all the clubs together on a map?
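As a rough illustration of how the questions above could be answered once club records exist, here is a minimal sketch using only the Python standard library. The record layout (`name`, `state`, `founded`) and the sample rows are assumptions made for this example; they are placeholders, not real data and not the project's actual schema.

```python
from collections import Counter

# Hypothetical sample records: names, states, and years are placeholders.
clubs = [
    {"name": "Clube A", "state": "SC", "founded": 1910},
    {"name": "Clube B", "state": "SC", "founded": 1985},
    {"name": "Clube A", "state": "AC", "founded": 1950},
    {"name": "Clube C", "state": "AC", "founded": 2001},
]


def oldest_by_state(records):
    """Return {state: record} with the earliest-founded club of each state."""
    oldest = {}
    for club in records:
        current = oldest.get(club["state"])
        if current is None or club["founded"] < current["founded"]:
            oldest[club["state"]] = club
    return oldest


def clubs_per_state(records):
    """Count how many clubs are registered in each state."""
    return Counter(club["state"] for club in records)


def most_common_names(records, n=3):
    """Return the n most frequent club names as (name, count) pairs."""
    return Counter(club["name"] for club in records).most_common(n)


oldest = oldest_by_state(clubs)
# Oldest club in the whole dataset, regardless of state.
oldest_overall = min(clubs, key=lambda club: club["founded"])
```

The youngest club per state follows the same pattern with the comparison reversed. Plotting clubs on a map would additionally need coordinates, which this sketch does not assume.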
You can run this project by following the instructions in the Makefile. Ensure that Docker and Python are installed on your system. The project uses the following technologies: Python, Docker, Docker Compose, Airflow.
To start the project with Airflow, execute the following commands:
make install_dependencies
make create_shared_network
make start
Once all services are running, you can access Airflow through your browser and execute the DAGs.
Read more about our catalog in the docs folder.
- Collect data from Santa Catarina: Federação Catarinense de Futebol (FCF) (Under Development)
- Collect data from Acre: Federação de Futebol do Estado do Acre (FFAC)
- Collect data from Alagoas: Federação Alagoana de Futebol (FAF)
- Collect data from Amapá: Federação Amapaense de Futebol (FAF)
- Collect data from Amazonas: Federação Amazonense de Futebol (FAF)
- Collect data from Bahia: Federação Bahiana de Futebol (FBF)
- Collect data from Ceará: Federação Cearense de Futebol (FCF)
- Collect data from Distrito Federal: Federação de Futebol do Distrito Federal (FFDF)
- Collect data from Espírito Santo: Federação de Futebol do Estado do Espírito Santo (FES)
- Collect data from Goiás: Federação Goiana de Futebol (FGF)
- Collect data from Maranhão: Federação Maranhense de Futebol (FMF)
- Collect data from Mato Grosso: Federação Mato-Grossense de Futebol (FMF)
- Collect data from Mato Grosso do Sul: Federação de Futebol de Mato Grosso do Sul (FFMS)
- Collect data from Minas Gerais: Federação Mineira de Futebol (FMF)
- Collect data from Pará: Federação Paraense de Futebol (FPF)
- Collect data from Paraíba: Federação Paraibana de Futebol (FPF)
- Collect data from Paraná: Federação Paranaense de Futebol (FPF)
- Collect data from Pernambuco: Federação Pernambucana de Futebol (FPF)
- Collect data from Piauí: Federação de Futebol do Piauí (FFP)
- Collect data from Rio de Janeiro: Federação de Futebol do Estado do Rio de Janeiro (FFERJ)
- Collect data from Rio Grande do Norte: Federação Norte-Riograndense de Futebol (FNF)
- Collect data from Rio Grande do Sul: Federação Gaúcha de Futebol (FGF)
- Collect data from Rondônia: Federação de Futebol do Estado de Rondônia (FFER)
- Collect data from Roraima: Federação Roraimense de Futebol (FRF)
- Collect data from São Paulo: Federação Paulista de Futebol (FPF)
- Collect data from Sergipe: Federação Sergipana de Futebol (FSF)
- Collect data from Tocantins: Federação Tocantinense de Futebol (FTF)
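Several federations in the list above share an acronym (FPF, for instance, appears for five states), so an acronym alone cannot identify a data source. The sketch below shows one possible way to keep a registry keyed by state (UF) code instead. The `FEDERATIONS` mapping covers only a subset of the list, and `collector_id` and its naming scheme are hypothetical, not names taken from this project.

```python
from collections import defaultdict

# Subset of the federations listed above, keyed by state (UF) code.
# Each value is (federation name, acronym).
FEDERATIONS = {
    "AC": ("Federação de Futebol do Estado do Acre", "FFAC"),
    "PA": ("Federação Paraense de Futebol", "FPF"),
    "PB": ("Federação Paraibana de Futebol", "FPF"),
    "PR": ("Federação Paranaense de Futebol", "FPF"),
    "PE": ("Federação Pernambucana de Futebol", "FPF"),
    "SP": ("Federação Paulista de Futebol", "FPF"),
}


def states_by_acronym(federations):
    """Group state codes by federation acronym to expose collisions."""
    grouped = defaultdict(list)
    for uf, (_name, acronym) in federations.items():
        grouped[acronym].append(uf)
    return dict(grouped)


def collector_id(uf):
    """Hypothetical naming scheme: one collection task per state code."""
    return f"collect_clubs_{uf.lower()}"


collisions = {
    acronym: states
    for acronym, states in states_by_acronym(FEDERATIONS).items()
    if len(states) > 1
}
```

Keying by state code keeps task names unique even when acronyms repeat; whether the project's DAGs follow this convention is not stated in this document.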