J0hnG4lt
A computer scientist who is interested in data science, big data, distributed systems, and scalability.
Spain
Pinned Repositories
AdeIndexer
A command-line tool that uses Lucene to build an inverted index over a folder of .txt files and run efficient searches against it.
airflow-GKE-k8sExecutor-helm
Quickly provision an Airflow environment with the Kubernetes executor on GKE. Instructions for Azure Kubernetes Service and docker-for-mac are also included.
airflow-helm
bq-snitch
Get visibility into expensive Google BigQuery queries on Slack
ClienteServidor
delta-live-tables-notebooks
ml-ops
Get your MLOps (Level 1) platform started and going fast.
puzzle
survey_frontend
An online questionnaire showcasing a simple React app
TweetFeatureExtractionTools
J0hnG4lt's Repositories
J0hnG4lt/airflow-helm
J0hnG4lt/bq-snitch
Get visibility into expensive Google BigQuery queries on Slack
J0hnG4lt/delta-live-tables-notebooks
J0hnG4lt/ml-ops
Get your MLOps (Level 1) platform started and going fast.
J0hnG4lt/AdeIndexer
A command-line tool that uses Lucene to build an inverted index over a folder of .txt files and run efficient searches against it.
J0hnG4lt/airflow-GKE-k8sExecutor-helm
Quickly provision an Airflow environment with the Kubernetes executor on GKE. Instructions for Azure Kubernetes Service and docker-for-mac are also included.
J0hnG4lt/survey_frontend
An online questionnaire showcasing a simple React app
J0hnG4lt/AcidOnSpark-ETL
Delta-Lake, ETL, Spark, Airflow
J0hnG4lt/airflow-toolkit
From day 1 of any Airflow project, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines (DAGs) :desktop_computer: >> [ :rocket:, :ship: ]
J0hnG4lt/CD4ML-Scenarios
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
J0hnG4lt/code-with-engineering-playbook
This is the playbook for "code-with" customer or partner engagements
J0hnG4lt/data_exploration_spark
This repo is only used for learning Spark with Scala
J0hnG4lt/dbt-metabase
Model synchronization from dbt to Metabase
J0hnG4lt/DbtDagParser
Parses dbt DAGs into graphs, and graphs into Airflow DAGs.
J0hnG4lt/DO180-apps
DO180 Repository for Sample Applications
J0hnG4lt/ehr
Electronic Health Record that uses git submodules and docker-compose to integrate all of its microservices
J0hnG4lt/freeflow
An Apache Airflow development and deployment template intended to simplify your development process. Includes file-based configuration (encrypted!), operator testing, and more!
J0hnG4lt/gitignore
A collection of useful .gitignore templates
J0hnG4lt/inmobiliaria
J0hnG4lt/k8s-UAP
⚙ Universal Analytics Platform: k8s-based Data-Driven Analytics/Data Science (ML/DeepML) PaaS/SaaS Platform for Data Analyst/Data Engineer/Data Scientist/DataOps/MLOps playground (R&D/MVP/POC/environments)
J0hnG4lt/microservices-demo
Sample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus.
J0hnG4lt/orchestra
Advertising Data Lakes and Workflow Automation
J0hnG4lt/patient
Microservice that uses Spring Boot and Mongo
J0hnG4lt/platys
A tool for generating docker-compose environments
J0hnG4lt/platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
J0hnG4lt/pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
J0hnG4lt/semaphore-demo-python-pants
Demo for building Python projects with The Pants Build System.
J0hnG4lt/smart_scraper
Currently, this is only an example of how to use Sphinx
J0hnG4lt/starthinker
Framework for building data workflows provided by Google.
J0hnG4lt/superset
Apache Superset is a Data Visualization and Data Exploration Platform