pablo-git8
Data Engineer @ International Airlines Group (IAG)
International Airlines GroupBarcelona, Spain
Pinned Repositories
Air-Quality-Classification-India-Cities
Analyze and classify air quality in some cities in India before and after COVID-19 using machine learning. This project involves data wrangling, exploratory data analysis (EDA), feature engineering, model development and model deployment, to predict air quality conditions. Explore the impact of the pandemic on air pollution levels.
AirlineDataHub
Streamlines aviation data analysis and warehousing, combining Python, SQL, and AWS for EDA, cleaning, and modeling. This repo guides from CSV ingestion to Redshift analytics, highlighting best practices in data engineering.
Breast-Cancer-Detection-Tool-Project-BCDT-
In this project I will be developing a software tool for predicting breast cancer based on patient´s public clinical information retrieved from the records found at BCSC web page. The purpose of this project is to help physicians on determining whether a patient can be diagnosed with breast cancer based on its demographic data, prior studies evaluation, historical records, technologist assessment (BI-RADS score) and family medical records. The scope of the project is to develop and deploy a tool that can be used in a radiography software that may classify breast images and, together (as a complete solution) be able to work as Artificial Intelligence for medical decision support in mammography. All text included in _italics_ is retrieved from public relevant web pages. And its proper references are included in [#] with direct access to them through their http link.
FinSentNewsNLP
Explore the fusion of NLP with financial insights through our project, focusing on categorizing financial texts and sentiment tagging using advanced classifiers and pre-trained models finetuned on rich datasets, including finance-related tweets and articles, to decode the nuanced language of the financial world.
GlobalLogisticsInsights
A repository dedicated to aggregating and analyzing news and data from leading maritime and air transport websites, aiming to provide comprehensive insights into global logistics trends, challenges, and opportunities affecting the shipping and supply chain sectors.
GreenEnergyForecast-Europe
This repository aims to predict which European country will have the highest surplus of green energy in the next hour. It uses data from the ENTSO-E Transparency portal and features an ARIMA model for forecasting, with a focus on optimizing green energy usage to reduce the computing industry's carbon footprint.
IoT23-Malware-Detection
IoT23 Malware Detection Tool - Repository for the Capstone Project 2 at Springboard Data Science Career Track Bootcamp.
retinopathy-detection
This is a deep learning capstone project aimed at detecting aneurysms in patients with diabetic retinopathy.
spark-project
This is a project to run and easily setup Apache Spark, Postgres and JupyterLab for projects ranging multiple features.
Springboard-DS-Career-Track
Repository for uploads related to the Data Science Carreer Track Bootcamp at Springboard - Pablo Ruiz Lopez
pablo-git8's Repositories
pablo-git8/IoT23-Malware-Detection
IoT23 Malware Detection Tool - Repository for the Capstone Project 2 at Springboard Data Science Career Track Bootcamp.
pablo-git8/LoveDataInsights
Uncover the secrets of dating profiles with LoveDataInsights—a deep dive into user behaviors and trends using Pandas, Seaborn, and insightful data analysis. Explore, visualize, and decode the world of digital dating through a rich dataset and engaging storytelling.
pablo-git8/Springboard-DS-Career-Track
Repository for uploads related to the Data Science Carreer Track Bootcamp at Springboard - Pablo Ruiz Lopez
pablo-git8/Air-Quality-Classification-India-Cities
Analyze and classify air quality in some cities in India before and after COVID-19 using machine learning. This project involves data wrangling, exploratory data analysis (EDA), feature engineering, model development and model deployment, to predict air quality conditions. Explore the impact of the pandemic on air pollution levels.
pablo-git8/GlobalLogisticsInsights
A repository dedicated to aggregating and analyzing news and data from leading maritime and air transport websites, aiming to provide comprehensive insights into global logistics trends, challenges, and opportunities affecting the shipping and supply chain sectors.
pablo-git8/ai-logistics
Revolutionary Supply Chain Intelligence Software: Empowering smart and efficient operational management for supply chain companies
pablo-git8/AirlineDataHub
Streamlines aviation data analysis and warehousing, combining Python, SQL, and AWS for EDA, cleaning, and modeling. This repo guides from CSV ingestion to Redshift analytics, highlighting best practices in data engineering.
pablo-git8/business_analytics_with_SQL
Mini-project to showcase types of queries made with standard ANSI SQL using a database stored in PHPMyAdmin. First part are queries using MySQL. Second part are queries using SQLite + Python integration.
pablo-git8/decision_tree_coffe_cs
Case study in Python that simulates a business scenario focused on adopting and selling a new coffee product from a different supplier. It utilizes decision tree algorithms, implemented using Jupyter notebooks and the sklearn library.
pablo-git8/FinSentNewsNLP
Explore the fusion of NLP with financial insights through our project, focusing on categorizing financial texts and sentiment tagging using advanced classifiers and pre-trained models finetuned on rich datasets, including finance-related tweets and articles, to decode the nuanced language of the financial world.
pablo-git8/frequentist_inference_with_python
The purpose of this case study is to apply the concepts associated with Frequentist Inference using Python. Frequentist Inference is the process of deriving conclusions about an underlying distribution via the observation of data.
pablo-git8/GreenEnergyForecast-Europe
This repository aims to predict which European country will have the highest surplus of green energy in the next hour. It uses data from the ENTSO-E Transparency portal and features an ARIMA model for forecasting, with a focus on optimizing green energy usage to reduce the computing industry's carbon footprint.
pablo-git8/hypothesis_testing_apps
In this notebook, I will show an example of hypothesis testing to determine if Apple Store receive better reviews than Google Play and if those reviews are statistically significant.
pablo-git8/retinopathy-detection
This is a deep learning capstone project aimed at detecting aneurysms in patients with diabetic retinopathy.
pablo-git8/spark-project
This is a project to run and easily setup Apache Spark, Postgres and JupyterLab for projects ranging multiple features.
pablo-git8/Barcelona-EDA-Rent-Noise
Data-driven analysis of Barcelona's rent and noise levels using Python, API data ingestion, and CSV handling. Features EDA, PCA, and insightful visualizations on urban living costs.
pablo-git8/BarcelonaVehicleAnalysis-PandasVsPolar
Analysis of Barcelona's vehicle data comparing Pandas and Polar libraries in a Jupyter notebook. Explore the efficiency of data processing and the strengths of each library.
pablo-git8/BeverageConsumptionByCountry
Analysis of alcoholic beverage consumption across countries using pandas, numpy, matplotlib, and seaborn in a Jupyter notebook. Explore trends, patterns, and statistics of drink servings by country and continent.
pablo-git8/exoplanet-discovery
This is my third capstone project for the Springboard Data Science Career Bootcamp
pablo-git8/ITESM-Projects
pablo-git8/jump2digital-hackato
This repo contains the database component of our application dedicated to promoting sustainable tourism in the vibrant city of Barcelona. This repository is designed to assist travelers and city planners in identifying and managing high-concurrency monuments, ultimately contributing to a more balanced and responsible tourism experience.
pablo-git8/linear-reg_red-wine-dataset
Using Linear Regression for dealing with the Red Wine Dataset
pablo-git8/logistic_regression-cs109_2015
This is a case study on logistic regression adapted from CS109-2015. In this notebook we will show the math behind LR and focus on classification problems where this model can be used effectively.
pablo-git8/michelin_restaurants_app
MichelinMap: An R-based geospatial application that allows users to seamlessly browse Michelin restaurants on a map. Features intuitive filters for price and star rating, providing a tailored dining exploration experience.
pablo-git8/pablo-git8
This is my GitHub profile!
pablo-git8/RadianceInsight-StartupAnalytics
This project, developed by Radiance Group Consulting, is a comprehensive statistical analysis tool designed to dissect the dynamic world of startup foundations. Our primary aim is to provide in-depth insights into the patterns and trends shaping the startup ecosystem.
pablo-git8/RetailProductNLP-MatchCluster
NLP project for matching and clustering products from multiple retailers using tokenization, TF-IDF, and clustering algorithms. It offers insights into retail market dynamics.
pablo-git8/rmxc-tsts-app
This project entails the complete development of a tech support ticketing system application. The objective is to create a robust software solution that facilitates efficient ticket management and enhances the overall tech support process.
pablo-git8/SmartInvestment-SP500-Fund
SmartInvestment.com's FAANG+_forever fund mirrors the S&P 500, focusing on key tech stocks for optimal balance of risk and return. This repo details our commitment to regulatory compliance and investor insight, with quarterly reports and visual short-term analytics on performance against the SPX500.
pablo-git8/yushi1007