Skip to content

Demonstrates how to use spark to implement a Bayes text classifier

License

Notifications You must be signed in to change notification settings

AEDWIP/Spark-Naive-Bayes-text-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Spark-Naive-Bayes-text-classification

Demonstrates how to use spark to implement a Bayes text classifier

This code example demonstrates how to use spark to

  1. create machine learning pipeline
  2. train a Naive Bayes text classifier
  3. evaluate and explore the learned model
  4. make predictions

The example uses the data set from http://nlp.stanford.edu/IR-book/html/htmledition/naive-bayes-text-classification-1.html

The spark example does not produce the same results as published in the paper

Overview

See the JUnit test file src/test/java/com/santacruzintegration/spark/NaiveBayesStanfordExampleTest.java The unit test simply runs the code. It does not have any asserts or other invarient tests. I.E. all tests will always pass even thou the results do not match the published results

Running the code.

$ mvn clean test

About

Demonstrates how to use spark to implement a Bayes text classifier

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages