Spark-Naive-Bayes-text-classification

Demonstrates how to use Spark to implement a Naive Bayes text classifier.

This code example demonstrates how to use Spark to:

create a machine learning pipeline
train a Naive Bayes text classifier
evaluate and explore the learned model
make predictions
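
For readers who want to see the arithmetic behind those four steps without a Spark cluster, here is a minimal plain-Java sketch of a multinomial Naive Bayes classifier with add-one smoothing. It is not the code in this repository and does not use the Spark API. The class name `NaiveBayesSketch`, its method names, and the labels `china` / `not-china` are hypothetical. The toy documents in `main` are assumed to match the worked example in the chapter linked below, so check them against that source before relying on them.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Illustration only: multinomial Naive Bayes with add-one (Laplace) smoothing. */
class NaiveBayesSketch {
    private final Map<String, Integer> docsPerLabel = new HashMap<>();
    private final Map<String, Map<String, Integer>> termCounts = new HashMap<>();
    private final Map<String, Integer> tokensPerLabel = new HashMap<>();
    private final Set<String> vocabulary = new TreeSet<>();
    private int totalDocs = 0;

    /** "Pipeline" step: lower-case the text and split it on whitespace. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().toLowerCase().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    /** Training step: accumulate document and term counts for one labeled document. */
    void train(String label, String text) {
        totalDocs++;
        docsPerLabel.merge(label, 1, Integer::sum);
        Map<String, Integer> counts = termCounts.computeIfAbsent(label, k -> new HashMap<>());
        for (String token : tokenize(text)) {
            counts.merge(token, 1, Integer::sum);
            tokensPerLabel.merge(label, 1, Integer::sum);
            vocabulary.add(token);
        }
    }

    /** Explore the model: fraction of training documents carrying this label. */
    double prior(String label) {
        return docsPerLabel.getOrDefault(label, 0) / (double) totalDocs;
    }

    /** Explore the model: smoothed P(term | label); term must already be lower-case. */
    double conditional(String term, String label) {
        Map<String, Integer> counts = termCounts.get(label);
        int count = (counts == null) ? 0 : counts.getOrDefault(term, 0);
        int total = tokensPerLabel.getOrDefault(label, 0);
        return (count + 1.0) / (total + vocabulary.size());
    }

    /** Log of prior times the product of conditionals; unseen terms are skipped. */
    double logScore(String label, String text) {
        double score = Math.log(prior(label));
        for (String token : tokenize(text)) {
            if (vocabulary.contains(token)) {
                score += Math.log(conditional(token, label));
            }
        }
        return score;
    }

    /** Prediction step: return the label with the highest log score. */
    String predict(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String label : docsPerLabel.keySet()) {
            double score = logScore(label, text);
            if (best == null || score > bestScore) {
                best = label;
                bestScore = score;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        NaiveBayesSketch nb = new NaiveBayesSketch();
        // Assumed toy data; verify against the linked chapter.
        nb.train("china", "Chinese Beijing Chinese");
        nb.train("china", "Chinese Chinese Shanghai");
        nb.train("china", "Chinese Macao");
        nb.train("not-china", "Tokyo Japan Chinese");

        String testDoc = "Chinese Chinese Chinese Tokyo Japan";
        System.out.println("P(china) = " + nb.prior("china"));
        System.out.println("P(chinese | china) = " + nb.conditional("chinese", "china"));
        System.out.println("P(tokyo | not-china) = " + nb.conditional("tokyo", "not-china"));
        System.out.println("prediction = " + nb.predict(testDoc));
    }
}
```

Printing the priors and conditional probabilities, as `main` does, is one way to compare a learned model term by term against hand-computed or published values.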

The example uses the data set from http://nlp.stanford.edu/IR-book/html/htmledition/naive-bayes-text-classification-1.html

The Spark example does not reproduce the results published there.

Overview

See the JUnit test file src/test/java/com/santacruzintegration/spark/NaiveBayesStanfordExampleTest.java. The unit test simply runs the code. It does not contain any assertions or other invariant checks, i.e. all tests always pass even though the results do not match the published results.
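
If assertions were added later, a tolerance-based comparison is one option, since probabilities are floating-point values. The sketch below is plain Java rather than JUnit so that it runs without dependencies. The class `ResultCheckSketch`, the helper `assertClose`, and the numbers in `main` are hypothetical placeholders, not values taken from this repository or from the published example.

```java
/** Illustration only: a tolerance check that fails loudly when a result drifts. */
class ResultCheckSketch {

    /** Throws AssertionError when actual differs from expected by more than tolerance. */
    static void assertClose(String name, double expected, double actual, double tolerance) {
        if (Double.isNaN(actual) || Math.abs(expected - actual) > tolerance) {
            throw new AssertionError(name + ": expected " + expected + " but got " + actual);
        }
    }

    public static void main(String[] args) {
        // Hypothetical placeholder values; a real test would use the model's output.
        double expectedPrior = 0.75;
        double actualPrior = 3.0 / 4.0;
        assertClose("prior", expectedPrior, actualPrior, 1e-9);
        System.out.println("check passed");
    }
}
```

In a JUnit 4 test the same check could be written with `org.junit.Assert.assertEquals(expected, actual, delta)`.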

Running the code.

$ mvn clean test
