Disaster Response Pipeline: Udacity's Data Science Nanodegree project.
An ETL pipeline reads the provided datasets, cleans the data, and stores it in a SQLite database. A machine learning pipeline then uses NLTK together with scikit-learn's `Pipeline` and `GridSearchCV` to produce a final model that uses the `message` column to predict classifications for 36 categories (multi-output classification). The model is exported to a pickle file, and in the last step the results are displayed in a Flask web app.
The project has 3 main parts:
- ETL Pipeline (`process_data.py`), sketched below:
  - Loads the `messages` and `categories` datasets
  - Merges the two datasets
  - Cleans the data
  - Stores it in a SQLite database
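A minimal sketch of these ETL steps, assuming the standard layout of the Udacity disaster datasets (a shared `id` column and a single `categories` string such as `related-1;request-0;...`); function, column, and table names here are illustrative, not the project's exact code:

```python
import pandas as pd
from sqlalchemy import create_engine

def run_etl(messages_path, categories_path, db_path):
    # Load the two CSVs and merge them on the shared id column
    messages = pd.read_csv(messages_path)
    categories = pd.read_csv(categories_path)
    df = messages.merge(categories, on="id")

    # Split the single 'categories' string into 36 separate 0/1 columns
    cats = df["categories"].str.split(";", expand=True)
    cats.columns = [value.split("-")[0] for value in cats.iloc[0]]
    for col in cats:
        cats[col] = cats[col].str[-1].astype(int)
    df = pd.concat([df.drop(columns="categories"), cats], axis=1)

    # Drop duplicates and store the cleaned data in SQLite
    df = df.drop_duplicates()
    engine = create_engine(f"sqlite:///{db_path}")
    df.to_sql("DisasterResponse", engine, index=False, if_exists="replace")
```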
- ML Pipeline (`train_classifier.py`), sketched below:
  - Loads data from the SQLite database
  - Splits the dataset into training and test sets
  - Builds a text processing and machine learning pipeline
  - Trains and tunes a model using GridSearchCV
  - Outputs results on the test set
  - Exports the final model as a pickle file
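The pipeline below is a minimal sketch of this approach; the NLTK tokenizer, `Pipeline`, and `GridSearchCV` usage follow the project description, while the choice of `RandomForestClassifier` and the parameter grid are assumptions:

```python
import re
import nltk
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.multioutput import MultiOutputClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

nltk.download(["punkt", "wordnet"], quiet=True)

def tokenize(text):
    # Normalize, tokenize, and lemmatize the message text
    text = re.sub(r"[^a-zA-Z0-9]", " ", text.lower())
    lemmatizer = WordNetLemmatizer()
    return [lemmatizer.lemmatize(token) for token in word_tokenize(text)]

# Text features feed a multi-output classifier, one output per category
pipeline = Pipeline([
    ("vect", CountVectorizer(tokenizer=tokenize)),
    ("tfidf", TfidfTransformer()),
    ("clf", MultiOutputClassifier(RandomForestClassifier())),
])

# GridSearchCV tunes the grid and refits the best model on the training set
parameters = {"clf__estimator__n_estimators": [50, 100]}
model = GridSearchCV(pipeline, param_grid=parameters, cv=3)
```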
- Flask Web App (`run.py`):
  - Web application where new messages can be entered to get classification results across the 36 categories
  - Includes data visualizations using Plotly
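A hypothetical sketch of the classification route in `app/run.py`, assuming the database table and pickle file produced by the pipelines above; route and template names follow the file structure below:

```python
import pickle
import pandas as pd
from flask import Flask, render_template, request
from sqlalchemy import create_engine

app = Flask(__name__)
engine = create_engine("sqlite:///data/DisasterResponse.db")
df = pd.read_sql_table("DisasterResponse", engine)
# NOTE: unpickling requires the same tokenize() used at training time
# to be importable in this module
model = pickle.load(open("models/classifier.pkl", "rb"))

@app.route("/go")
def go():
    # Classify the user's message and pass per-category labels to go.html
    query = request.args.get("query", "")
    labels = model.predict([query])[0]
    # Assumes the first four columns (id, message, ...) are not categories
    results = dict(zip(df.columns[4:], labels))
    return render_template("go.html", query=query, classification_result=results)

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=3001)
```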
File structure:

- `app/templates/go.html`: web page that handles the user query and displays the model's classification results
- `app/templates/master.html`: index web page that displays visuals and receives the user's input text for the model
- `app/run.py`: Flask application that runs the web app
- `data/disaster_categories.csv`: categories dataset
- `data/disaster_messages.csv`: messages dataset
- `data/DisasterResponse.db`: SQLite database containing the merged and cleaned dataset
- `data/process_data.py`: ETL pipeline that cleans and merges the two datasets above
- `models/classifier.pkl`: pickle file containing the trained classifier
- `models/train_classifier.py`: machine learning pipeline that builds the classifier above
- `README.md`: description of the project
Instructions:

- Run the following commands in the project's root directory to set up the database and model:
  - To run the ETL pipeline that cleans the data and stores it in a SQLite database:
    `python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db`
  - To run the ML pipeline that trains the classifier and saves the model as a pickle file:
    `python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl`
- Run the following command in the project's root directory to start the web app:
  `python app/run.py`
- Go to http://0.0.0.0:3001/



