Email Classification #79 #83

Open: 1 commit to be merged into base: master

NeonKazuha (Contributor) commented Mar 23, 2024

  • Tokenization: Split the text of each email into individual words or tokens.
    Screenshot 2024-03-23 143647

  • Normalization: Convert all text to lowercase, remove punctuation, and handle special cases (like email addresses or URLs).

  • Stopword Removal: Remove common words that don't carry much meaning (e.g., "the", "is", "and").
    Screenshot 2024-03-23 143810

  • Feature Extraction: Represent each email as a numerical vector using techniques like bag-of-words, TF-IDF (Term Frequency-Inverse Document Frequency), or word embeddings.
    Screenshot 2024-03-23 143853
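
The four preprocessing steps above can be sketched in plain Python. This is a minimal illustration, not the code from the notebook: the tiny `STOPWORDS` set, the `urltoken`/`emailtoken` placeholders, and all function names are assumptions made for the example, and the TF-IDF here is the plain unsmoothed variant (term frequency times `log(N / df)`), which differs from the smoothed default in libraries such as scikit-learn.

```python
import math
import re
import string
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real project would
# likely use a fuller list (for example the one shipped with NLTK).
STOPWORDS = {"the", "is", "and", "a", "an", "to", "of", "in", "for", "at"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
EMAIL_RE = re.compile(r"\S+@\S+\.\S+")
# Maps every punctuation character to a space.
PUNCT_TABLE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def normalize(text):
    """Lowercase, replace URLs and email addresses with placeholder
    tokens (an assumed way to handle these special cases), then strip
    punctuation."""
    text = text.lower()
    text = URL_RE.sub(" urltoken ", text)
    text = EMAIL_RE.sub(" emailtoken ", text)
    return text.translate(PUNCT_TABLE)


def tokenize(text):
    """Split normalized text into whitespace-separated tokens."""
    return normalize(text).split()


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [t for t in tokens if t not in STOPWORDS]


def preprocess(text):
    """Tokenization, normalization, and stopword removal in one call."""
    return remove_stopwords(tokenize(text))


def tfidf_vectors(docs):
    """Return (vocabulary, vectors), one TF-IDF vector per document."""
    tokenized = [preprocess(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(t for toks in tokenized for t in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1  # avoid division by zero for empty emails
        vectors.append(
            [(counts[t] / total) * math.log(n_docs / df[t]) for t in vocab]
        )
    return vocab, vectors


emails = ["Free prize now", "Meeting at noon", "Free meeting"]
vocab, vectors = tfidf_vectors(emails)
```

Each row of `vectors` lines up with `vocab`, so the result can be passed to a classifier as a feature matrix. Terms that occur in every document get a weight of zero under this unsmoothed formula.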

Accuracy and Precision:
image

Precision Scores:
image

The Extra Trees Classifier gives the best precision score: 0.9973
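
For reference, accuracy and precision can be computed from predictions as sketched below. The labels and predictions are made-up toy data, not results from this PR, and the convention that `1` marks the positive class is an assumption. Precision is the share of emails predicted positive that really are positive, `TP / (TP + FP)`.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary task."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def accuracy(y_true, y_pred, positive=1):
    """Fraction of all predictions that are correct."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    return (tp + tn) / total if total else 0.0


def precision(y_true, y_pred, positive=1):
    """Fraction of positive predictions that are correct: TP / (TP + FP)."""
    tp, fp, _, _ = confusion_counts(y_true, y_pred, positive)
    return tp / (tp + fp) if (tp + fp) else 0.0


# Toy data for illustration only (assumed: 1 = positive class).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

acc = accuracy(y_true, y_pred)
prec = precision(y_true, y_pred)
```

Selecting a model by precision favors classifiers that rarely flag a negative email as positive, at the possible cost of missing some positives, so it is worth reading alongside accuracy.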

Resolves issue #79

Colab: https://colab.research.google.com/drive/1hlenQVeOThQZwZq6Hsr_VTI3v5soCBuV?usp=sharing
