Skip to content

Latest commit

 

History

History
12 lines (8 loc) · 336 Bytes

README.md

File metadata and controls

12 lines (8 loc) · 336 Bytes

Spam-Detection-using-NLTK

This project is created to classify a certain incoming message as Ham(legitimate) or spam.

Dataset from UCI

  • used NLTK
  • used Normalisation
  • used Vectorisation
  • used TF-IDF (Term-Frequency - Inverse_Document_frequency)

for more details about Dataset , check out the other readme file in codespace.