This project analyzes the disaster response dataset from Appen to build a model for an API that classifies disaster messages. The dataset contains real messages that were sent during disaster events. A machine learning pipeline is developed to categorize these events so that the messages can be routed to the appropriate disaster relief agency.
The project also includes a web app where an emergency worker can input a new message and get classification results across several categories. The web app also displays visualizations of the data.
Below are a few screenshots of the web app.
Some statistics of the dataset
An example of classifying a disaster response message
The disaster response dataset contains 30,000 messages drawn from events including the 2010 earthquake in Haiti, the 2010 earthquake in Chile, the 2010 floods in Pakistan, Super Storm Sandy in the U.S.A. in 2012, and news articles spanning a large number of years and hundreds of different disasters. The data has been encoded with 36 different categories related to disaster response, and messages containing sensitive information have been removed in their entirety. Upon its release, this was the featured dataset of a new Udacity course on Data Science and the AI4ALL summer school, and it is especially useful for text analytics and natural language processing (NLP) tasks and models. The input data contains thousands of untranslated disaster-related messages and their English translations.
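To make the category encoding concrete, the snippet below shows one way the labels could be expanded into 36 binary columns with pandas. It assumes the raw categories field is a semicolon-separated string of label-value pairs (e.g. related-1;request-0;…) and that the two CSVs share an id column; these schema details are assumptions for illustration, not taken from the project's process_data.py.

```python
# A minimal sketch, assuming the categories column looks like
# "related-1;request-0;offer-0;..." and both CSVs share an "id" column.
# This illustrates the idea only; it is not the project's process_data.py.
import pandas as pd

messages = pd.read_csv("data/disaster_messages.csv")
categories = pd.read_csv("data/disaster_categories.csv")

# Join each message with its category labels (the merge key is an assumption).
df = messages.merge(categories, on="id")

# Split the single categories string into one column per category...
split = df["categories"].str.split(";", expand=True)
split.columns = split.iloc[0].str.rsplit("-", n=1).str[0]

# ...and keep only the trailing 0/1 value in each cell.
for col in split.columns:
    split[col] = split[col].str.rsplit("-", n=1).str[1].astype(int)

df = pd.concat([df.drop(columns="categories"), split], axis=1)
print(df.shape)  # (n_messages, original columns + 36 category columns)
```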
├── app/
│ ├── static/img
│ ├── templates/
│ │ ├── master.html # main page of web app
│ │ └── go.html # classification result page of web app
│ └── run.py # script that runs the webapp using Flask
├── data/
│ ├── disaster_categories.csv # data to be processed: message categories
│ ├── disaster_messages.csv # data to be processed: disaster response messages
│ ├── DisasterResponse.db # cleaned data will be exported to this SQL database
│ └── process_data.py # ETL pipeline for data cleaning
├── models/
│ ├── train_classifier.py # NLP pipeline for training a text-based classifier
│ └── classifier.pkl # trained model
├── README.md
└── environment.yml # dependencies of the conda environment
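For orientation, the snippet below is a minimal sketch of the kind of text-classification pipeline models/train_classifier.py could build with scikit-learn: TF-IDF features feeding a multi-output classifier, one output per category. The estimator choices and parameters are illustrative assumptions rather than the script's actual configuration.

```python
# A minimal sketch, assuming scikit-learn, of a multi-output text classification
# pipeline of the sort train_classifier.py could build. Estimators and
# parameters are illustrative assumptions, not the script's actual settings.
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import Pipeline

pipeline = Pipeline([
    # Turn raw message text into TF-IDF features.
    ("tfidf", TfidfVectorizer(stop_words="english")),
    # Fit one binary classifier per disaster-response category.
    ("clf", MultiOutputClassifier(RandomForestClassifier(n_estimators=100))),
])

# X: iterable of message strings; Y: (n_samples, 36) binary label matrix.
# pipeline.fit(X, Y)
# pipeline.predict(["We need water and medical supplies after the earthquake."])
```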
- Create a conda environment with the dependencies specified in environment.yml, e.g. conda env create -f environment.yml.
- Run the following commands in the project's root directory to set up the database and train a classification model.
- To run the ETL pipeline that cleans the data and stores it in the database:
python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db
- To run the ML pipeline that trains a classifier and saves the trained model to a pickle file:
python models/train_classifier.py data/DisasterResponse.db models/classifier.pkl
- Go to the app directory:
cd app
- Run the web app (a minimal sketch of what run.py might look like is shown after these steps):
python run.py
- Access 127.0.0.1:3000 in a web browser (e.g., Google Chrome) to open the web app's homepage.
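Below is a hedged sketch of how app/run.py might load the pickled model and serve classification results with Flask. The real app renders the master.html and go.html templates; this sketch returns JSON instead, and the route name, query parameter, and loading details are assumptions rather than the project's exact code.

```python
# A minimal sketch, assuming Flask and a pickled scikit-learn pipeline, of how
# app/run.py might serve classifications. The route name, query parameter, and
# model-loading details are assumptions; only the port matches the README.
import pickle

from flask import Flask, jsonify, request

app = Flask(__name__)

# Load the trained multi-output text classifier exported by train_classifier.py.
with open("models/classifier.pkl", "rb") as f:
    model = pickle.load(f)

@app.route("/classify")
def classify():
    # Example: /classify?query=We+need+water+and+shelter
    message = request.args.get("query", "")
    labels = model.predict([message])[0]
    return jsonify({"message": message, "labels": labels.tolist()})

if __name__ == "__main__":
    # Matches the address used in the instructions above.
    app.run(host="127.0.0.1", port=3000)
```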