Disaster Response Pipelines

Table of Contents

Installation

Project Motivation

Project Descriptions

Files Descriptions

Instructions

Installation

All of the libraries used are available in the Anaconda distribution of Python. The libraries used are listed below (a quick import check follows the list):

  • pandas
  • re
  • sys
  • json
  • sklearn
  • nltk
  • sqlalchemy
  • pickle
  • Flask
  • plotly
  • sqlite3
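
If you want to verify your environment before running anything, a quick import check (an optional sketch, not part of the project code) is:

```python
# Sanity check: confirm the third-party packages are importable.
# json, pickle, re, sqlite3, and sys ship with Python itself.
import flask
import nltk
import pandas
import plotly
import sklearn
import sqlalchemy

print("All third-party libraries are importable.")
```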

Project Motivation

The goal of the project is to analyze disaster data from Figure Eight in order to build a model that classifies disaster messages into categories. Through a web app, the user can input a new message and get classification results across several categories. This project will help people and organizations quickly categorize messages in the event of a disaster.

Project Descriptions

The project has three components:

ETL Pipeline: the process_data.py file contains the script that creates the ETL pipeline, which (see the sketch after this list):

  • Loads the messages and categories datasets
  • Merges the two datasets
  • Cleans the data
  • Stores the data in a SQLite database
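
A minimal sketch of what such a script might look like (the table name and the category-splitting details are assumptions based on the Figure Eight data format, not necessarily what process_data.py does verbatim):

```python
import sys

import pandas as pd
from sqlalchemy import create_engine


def run_etl(messages_path, categories_path, database_path):
    # Load the two datasets and merge them on their shared "id" column
    messages = pd.read_csv(messages_path)
    categories = pd.read_csv(categories_path)
    df = messages.merge(categories, on="id")

    # Split the semicolon-separated categories string (e.g. "related-1;request-0;...")
    # into one column per category, keeping only the trailing 0/1 value
    split = df["categories"].str.split(";", expand=True)
    split.columns = [value.split("-")[0] for value in split.iloc[0]]
    for column in split.columns:
        split[column] = split[column].str[-1].astype(int)

    # Replace the raw categories column with the binary columns, drop duplicates
    df = pd.concat([df.drop(columns="categories"), split], axis=1)
    df = df.drop_duplicates()

    # Store the cleaned data in a SQLite database (table name is an assumption)
    engine = create_engine(f"sqlite:///{database_path}")
    df.to_sql("messages", engine, index=False, if_exists="replace")


if __name__ == "__main__":
    run_etl(*sys.argv[1:4])
```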

ML Pipeline: the train_classifier.py file contains the script that creates the ML pipeline, which (see the sketch after this list):

  • Loads data from the SQLite database
  • Splits the dataset into training and test sets
  • Builds a text processing and machine learning pipeline
  • Trains and tunes a model using GridSearchCV
  • Outputs results on the test set
  • Exports the final model as a pickle file
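
A hedged sketch of such a pipeline, assuming the "messages" table written by the ETL step and that the binary category columns start at the fifth column; the real train_classifier.py likely also uses nltk for tokenization and a larger parameter grid:

```python
import pickle

import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import Pipeline
from sqlalchemy import create_engine

# Load the cleaned data written by the ETL step
engine = create_engine("sqlite:///data/DisasterResponse.db")
df = pd.read_sql_table("messages", engine)
X = df["message"]
y = df.iloc[:, 4:]  # assumes the binary category columns start at index 4

# Split the dataset into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Text processing + multi-output classification in one pipeline
pipeline = Pipeline([
    ("vect", CountVectorizer()),
    ("tfidf", TfidfTransformer()),
    ("clf", MultiOutputClassifier(RandomForestClassifier())),
])

# Tune the model with GridSearchCV over a small, illustrative grid
parameters = {"clf__estimator__n_estimators": [50, 100]}
model = GridSearchCV(pipeline, param_grid=parameters, cv=3)
model.fit(X_train, y_train)

# Report results on the test set, then export the best model as a pickle file
print("Test set score:", model.score(X_test, y_test))
with open("models/classifier.pkl", "wb") as f:
    pickle.dump(model.best_estimator_, f)
```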

Flask Web App: the web app lets the user enter a disaster message and then view the categories assigned to that message (a minimal sketch follows the list below).

  • The web app also contains some visualizations that describe the data.
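
A minimal sketch of what run.py might look like (the routes, template variables, and relative paths are assumptions based on the file structure described below):

```python
import pickle

import pandas as pd
from flask import Flask, render_template, request
from sqlalchemy import create_engine

app = Flask(__name__)

# Load the cleaned data (for visualizations) and the trained model
engine = create_engine("sqlite:///../data/DisasterResponse.db")
df = pd.read_sql_table("messages", engine)
with open("../models/classifier.pkl", "rb") as f:
    model = pickle.load(f)


@app.route("/")
def index():
    # Main page: message input form plus visualizations of the data
    return render_template("master.html")


@app.route("/go")
def go():
    # Classify the user's message: one binary label per category
    query = request.args.get("query", "")
    labels = model.predict([query])[0]
    classification = dict(zip(df.columns[4:], labels))
    return render_template("go.html", query=query, classification=classification)


if __name__ == "__main__":
    app.run(host="0.0.0.0", port=3001, debug=True)
```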

Files Descriptions

The file structure is arranged as shown below:

  • README.md: read me file
  • Project Components.txt: Project requirements to complete
  • workspace
    • \app
      • run.py: flask file to run the app
      • \templates
        • master.html: main page of the web application
        • go.html: result web page
    • \data
      • disaster_categories.csv: categories dataset
      • disaster_messages.csv: messages dataset
      • DisasterResponse.db: disaster response database
      • process_data.py: ETL process
    • \models
      • train_classifier.py: classification code
      • classifier.pkl: model pickle file

Instructions

To execute the app, follow these instructions:

Run the following commands in the project's root directory to set up your database and model:

  • To run the ETL pipeline that cleans the data and stores it in the database:
    `python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db`
  • To run the ML pipeline that trains the classifier and saves it as a pickle file:
    `python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl`

Run the following command in the app's directory to start the web app: `python run.py`

Go to http://0.0.0.0:3001/