NLP Cyberbullying Detection

Project Overview

This project focuses on detecting cyberbullying using Natural Language Processing (NLP) techniques. Cyberbullying is a significant issue in the digital age, affecting individuals' mental health and well-being. The aim of this project is to develop a robust model that can identify and flag instances of cyberbullying from textual data.

Dataset

The dataset used for this project is the Cyberbullying Tweets dataset available on Kaggle. It contains tweets labeled as cyberbullying or not, which are used to train and test the detection model.

Cyberbullying Tweets Dataset
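
As a rough illustration of how the labeled data might be read, the sketch below loads (text, label) pairs from a CSV and counts the labels. The column names `tweet_text` and `cyberbullying_type`, the label values, and the helper `load_labeled_texts` are assumptions made for this example, not details confirmed by the project; check the header of the actual CSV before relying on them. An in-memory sample stands in for the real file so the snippet runs on its own.

```python
import csv
import io
from collections import Counter


def load_labeled_texts(file_obj, text_col="tweet_text", label_col="cyberbullying_type"):
    """Read (text, label) pairs from an open CSV file.

    The default column names are assumptions; adjust them to the real header.
    """
    reader = csv.DictReader(file_obj)
    pairs = []
    for row in reader:
        text = (row.get(text_col) or "").strip()
        label = (row.get(label_col) or "").strip()
        if text and label:  # skip rows with a missing text or label
            pairs.append((text, label))
    return pairs


# Hypothetical in-memory sample standing in for the real dataset file.
sample_csv = io.StringIO(
    "tweet_text,cyberbullying_type\n"
    "have a great day everyone,not_cyberbullying\n"
    "nobody likes you just leave,cyberbullying\n"
    ",cyberbullying\n"  # empty text: this row is skipped
)

pairs = load_labeled_texts(sample_csv)
label_counts = Counter(label for _, label in pairs)
print(label_counts)
```

To use the repository's own file instead, open `CyberBullying_Dataset.csv` with `open(..., newline="", encoding="utf-8")` and pass the file object to the same function. In a notebook, `pandas.read_csv` would be the more usual choice; the standard-library version is used here only to keep the sketch dependency-free.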

Key Features

Data Collection and Preprocessing: Collecting and preprocessing data to build a comprehensive dataset for training and testing.
Textual Analysis: Utilizing NLP techniques such as tokenization, stemming, and lemmatization to analyze the textual data.
Feature Extraction: Implementing feature extraction methods including TF-IDF, word embeddings, and sentiment analysis to capture the nuances of the text.
Model Development: Developing and training machine learning models, such as Logistic Regression, Support Vector Machines (SVM), and Naive Bayes, to detect cyberbullying.
Evaluation: Evaluating the models using metrics such as accuracy, precision, recall, and F1-score to determine their effectiveness.
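
The steps listed above can be sketched end to end with scikit-learn. This is a minimal example under stated assumptions, not the code from the project notebooks: the toy texts, the binary labels (1 = cyberbullying, 0 = not), and the manual train/test split are all hypothetical, and stemming, lemmatization, word embeddings, and sentiment features are left out for brevity. It shows TF-IDF feature extraction feeding a Logistic Regression classifier, followed by accuracy, precision, recall, and F1-score.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_recall_fscore_support
from sklearn.pipeline import make_pipeline

# Hypothetical toy data; 1 = cyberbullying, 0 = not cyberbullying.
train_texts = [
    "you are so stupid and ugly",
    "nobody likes you loser",
    "shut up you worthless idiot",
    "go away everyone hates you",
    "great game last night team",
    "thanks for the lovely birthday wishes",
    "looking forward to the weekend trip",
    "congratulations on the new job",
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]

test_texts = [
    "you stupid loser",
    "lovely weekend with the team",
]
test_labels = [1, 0]

# TF-IDF turns each text into a weighted bag-of-words vector;
# Logistic Regression then learns a linear decision boundary on those vectors.
model = make_pipeline(
    TfidfVectorizer(lowercase=True, stop_words="english"),
    LogisticRegression(max_iter=1000),
)
model.fit(train_texts, train_labels)
predicted = model.predict(test_texts)

# Standard binary-classification metrics, treating label 1 as the positive class.
accuracy = accuracy_score(test_labels, predicted)
precision, recall, f1, _ = precision_recall_fscore_support(
    test_labels, predicted, average="binary", zero_division=0
)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} f1={f1:.2f}")
```

The other model families named above fit the same pipeline shape: replacing `LogisticRegression` with `sklearn.svm.LinearSVC` or `sklearn.naive_bayes.MultinomialNB` gives an SVM or Naive Bayes variant. Scores on a toy set this small are not meaningful; they only demonstrate how the metrics are computed.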

Technologies Used

Programming Languages: Python
Libraries and Frameworks: NLTK, spaCy, scikit-learn, pandas, NumPy, Matplotlib, Seaborn
Tools: Jupyter Notebook

A Project by Quah Seng Kit, Sattish Pratap Shewkani, Yeo Yee Tao

