wrangling-cleaning
There are 54 repositories under wrangling-cleaning topic.
Adongo/HR-Employee-Attrition
Exploratory Data Analysis to uncover factors data lead to employee attrition.
jayronsoares/automated_data_engineering
Simple ETL pipeline to extract information from CSV, LOG, JSON files and load it into MySQL database using Python and SQL language.
Amr-YA/tmdb_analysis
Movies data analysis to produce visuals and insights about the data-set of 10,000 movies.
amribrahim2011/Ford_GoBike_2019_Udacity_Project
Ford GoBike 2019 Dataset is a dataset for the bikeshare system, in this study I have presented the data on the slides file as a part of the visualization Learning process of the Data Analysis Nanodegree of Udacity.
Braim016/weratedogs-wrangling
Wrangling the WeRateDogs datasets to showcase data gathering, assessing, cleaning, and documentation skills.
Daniel-Elston/Hum-Whistle-Song-Recognition-Software
Machine learning, signal processing pipeline used to identify song name from user input (hum/whistle to song).
dirkkadijk/analyse-and-wrangle-WeRateDogs-data
project in Udacity Data Analyst Nanodegree. This project focused on advanced data gathering (several sources incl twitter API), wrangling and cleaning of data. Plus 2 reports.
dirkkadijk/Ford-GoBike-Data-Exploration-and-Communication-of-insights
Capstone project of Udacity Data Analyst Nanodegree. Focus on advanced visualizations to explore data and to communicate insights and patterns. Final slide deck is made with Jupyter notebook with interactive HTML slides (based on reveal.js).
LucasDeMatheo/DataScienceProject_Titanic
This project, carried out in Jupyter Notebook, aims to explore the main Data Analysis techniques with Python tools. Pandas, Numpy, Seaborn, Matplotlib, Plotly and sklearn are used. Divided into three notebooks, I separate the data cleaning, data analysis and machine learning part. For more details and goals, see README
ujunwa-DS/SPACEX-FALCON-9-CAPSTONE-PROJECT
A total package of what data science is all about. from dashboard building to data wrangling, sql, data collection, vizualization, webscrapping to presentaion.
1anza/DataVisualizations
Utilized JavaScript and D3 to create polished visualizations that provide a meaningful interaction experience with a variety of datasets.
AjmalSarwary/BRENT-Model
Predictive Model for BRENT price movements
brianmaleek/project_workspace_2_tweepy
Wrangling and analyzing we rate dogs twitter account which rates people's dogs with a humorous comment about the dog.
chukwuyem20/No-Show-Appointment
This contains an exploration of the no-show appointment dataset from UDACITY's Nano Degree Programme
cristhianc001/Argentinian-Internet-Usage
Language: Spanish. Data Visualization project of the evolution of the Argentinian internet access by state, download speed and technology
Daniel-Elston/Credit-Card-Default-Prediction-Algorithm
Algorithm used to predict whether a bank customer will default on given credit cards using bank telemarketing dataset.
Joeymmes/Investigate_No_Show_Appointment_Data
Investigates a dataset that collects information from 110,527 medical appointments in Brazil and is focused on the question of whether or not patients show up for their medical appointment.
Ken-Vu/Event-Category-Disparities-in-Elm-City-Stories-
A data analysis project for the American Statistical Association's DataFest competition that won "Best in Show" in 2022
lunaloclet/movie-analytics
Analysis of movie data.
Sadiq-marcelo/investigate-FordGoBike-tripdata
Investigate Ford GoBike Project
Tola-adelase/data-visualization-udacityproject
Udacity Nano degree Project 5. (Communicate Data Findings)
Wb-az/ML-airbnb-paris-analytics-and-price-prediction
Airbnb Paris - analytics and accommodation price prediction
AbiolaBajo10/Prosper-Loan-Data
The Loan Data from Prosper dataset is a financial dataset which is related to the loan, borrowers, interest rates, etc. Prosper or Prosper Marketplace Inc. is a San Francisco, California based company specializing in loans at low interest rates to the borrowers. We are using the dataset from Prosper for exploratory data analysis.The dataset from prosper is comprised of 81 variables and contains 113937 entries.
akhmadtaufik/project-wrangling-pacmann
Medium Article
Anshuboom/Anshuboom_SAFP_repo
SharkAttackFiles based Fatality Prediction
c13mora/traffic_accidents_prediction
Machine learning models to predict if a given traffic accident will end up in casualties
codeninja2020/Telecomdata
Data analysis Project for A Telecoms Company
EZBanks/BellaBeat-Data-Analytics-Project
Bellabeat Analysis using R
EZBanks/Bike-Share-Data-Analytics-Project
Bike Share analysis using R
I-Sobe/absenteeism
My First Machine Learning Project (A 365 careers project)
kalyanrmk/Capstone_Applied_Data_Science
Coursera Data Science Specialization Capstone Project
s-njeru/Data-Analysis-Process
This project is from the Data Analysis Nano Degree from Udacity. The intent is to walk through the data analysis process on a medical patient show/no-show dataset, identifying the relationshp between various dependent variables to the dependent variable - whether a patient will show up or not.
s-njeru/Data-Wrangling
Data Wrangling Project from the Udacity Data Analytics Nano Degree
shiraen/sea_countries_debt_analysis
Analysis on external debt of select South East Asian Countries for the past 10 years.