thecodemancer
Public Speaking | Generative AI | LLMs | Cloud | Data Science | Data Engineering | Mentorships
Data Engineering Latam
Perú
Pinned Repositories
Airflow-Data-Orchestration-between-AWS-and-GCP
An Airflow DAG that moves S3 bucket files to Google Cloud Storage, keeping the same folder partitioning and file formats.
BIMarathon2021
Goal: Analyze the most popular songs and genres on Spotify by release year and by the following attributes: duration, explicit content, artist, danceability, energy, key, loudness, mode, speechiness, acousticness, instrumentalness, liveness, valence, tempo, time signature, genre.
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Food-Classification-Model
This model is trained to recognize 2023 food dishes from images.
gemini_examples
A repo with code examples of Gemini AI
infrastructure_as_code_terraform
Prediction-Model-for-Loan-applications
This is a binary classification problem where we have information about a sample of applicants and we need to predict whether or not to grant a loan based on that data.
Prediction-Model-for-the-duration-of-taxi-trips-in-NYC
This project uses machine learning to predict the duration of taxi trips in NYC.
Streaming-Data-Pipeline-for-a-Real-Time-Dashboard-with-Dataflow
You own a fleet of New York City taxi cabs and are looking to monitor how well your business is doing in real-time. You build a streaming data pipeline to capture taxi revenue, passenger count, ride status, and much more, and then visualize the results in a management dashboard.
study-with-me
Lots of code, resources, examples, some graphs and so much fun ahead!
thecodemancer's Repositories
thecodemancer/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
thecodemancer/infrastructure_as_code_terraform
thecodemancer/Data-Science--Cheat-Sheet
Cheat Sheets
thecodemancer/Spotify_API
Build an ETL using your own Spotify data
thecodemancer/BIMarathon2021
Goal: Analyze the most popular songs and genres on Spotify by release year and by the following attributes: duration, explicit content, artist, danceability, energy, key, loudness, mode, speechiness, acousticness, instrumentalness, liveness, valence, tempo, time signature, genre.
thecodemancer/Efficient_Python_tricks_and_tools_for_data_scientists
Efficient Python Tricks and Tools for Data Scientists
thecodemancer/ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
thecodemancer/Playing-with-Graphs
📊👨‍💻
thecodemancer/Residential_property_prices_2020
In this code, we're applying data cleansing to this dataset so that we can properly work with it later. The goal is to build a data model with a fact table and dimension tables.
thecodemancer/EasyBroker-API
Send your properties to your clients, real estate colleagues and portals with a single click.
thecodemancer/App1
thecodemancer/Assignment2
...
thecodemancer/cp100-appengine-cloudstorage-python
A Python App Engine application that demonstrates usage of the Google Cloud Storage API
thecodemancer/datasharing
The Leek group guide to data sharing
thecodemancer/DS-courses
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
thecodemancer/GettingAndCleaningDataWeek4
thecodemancer/Linear_Regression_Detailed_Implementation
thecodemancer/PML
thecodemancer/ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
thecodemancer/RepData_PeerAssessment1
Peer Assessment 1 for Reproducible Research