Pinned Repositories
cloudrun-api-flask
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
data-engineering-roadmap
Data engineering roadmap for the Jornada 2024
DataProjectStarterKit
Complete structure for starting a data project with Python, covering environment, Git, development, testing, and documentation.
duckdb-on-linux
Step-by-step installation guide for Linux/WSL Ubuntu
fancy-git-on-wsl
Step-by-step guide to installing fancy-git on WSL/Ubuntu
fastapi-docker
Building an optimized Docker image using Poetry
OneBillionRowsDataQuality
The OneBillionRows with DataQuality project aims to ensure the integrity and quality of massive datasets comprising one billion rows. It uses Python 3.11.5 together with libraries such as Pydantic, Pandera, DuckDB, Pandas, and Dash.
workshop-streamlit-aovivo
Real-time dashboard with Kafka
kaiohp's Repositories
kaiohp/duckdb-on-linux
Step-by-step installation guide for Linux/WSL Ubuntu
kaiohp/fastapi-docker
Building an optimized Docker image using Poetry
kaiohp/workshop-streamlit-aovivo
Real-time dashboard with Kafka
kaiohp/fancy-git-on-wsl
Step-by-step guide to installing fancy-git on WSL/Ubuntu
kaiohp/OneBillionRowsDataQuality
The OneBillionRows with DataQuality project aims to ensure the integrity and quality of massive datasets comprising one billion rows. It uses Python 3.11.5 together with libraries such as Pydantic, Pandera, DuckDB, Pandas, and Dash.
kaiohp/cloudrun-api-flask
kaiohp/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
kaiohp/data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
kaiohp/data-engineering-roadmap
Data engineering roadmap for the Jornada 2024
kaiohp/DataProjectStarterKit
Complete structure for starting a data project with Python, covering environment, Git, development, testing, and documentation.
kaiohp/duckdb_get_start
Start a DuckDB and Python project end to end
kaiohp/fakeapiinjest
This project generates a fictitious dataset and uses it to populate a relational database hosted on Google Cloud. The data is generated with the Faker and Random libraries, with SQLAlchemy as the ORM. The API follows REST standards, is implemented in Flask, and is containerized with Docker.
kaiohp/freecurrencyapi
Study project: collect data from FreeCurrencyAPI, a private API, then process and store it using Google Cloud services (Cloud Functions, Cloud Scheduler, Secret Manager, and Cloud Storage).
kaiohp/pydantic_pandas
Example of how to use Pydantic and pandas
kaiohp/webscraping
Advanced web scraping bootcamp
kaiohp/workshop
Workshop on project structuring
kaiohp/workshop-streamlit