
BulkWhiz/es-analyzer-plugin

Elasticsearch analysis plugin

This plugin joins every possible two-word combination (pair) in a document.

The plugin exports a token filter called wordjoin_stemmer.

The filter stems every token in the stream, then joins both the original word and its stemmed form with every subsequent word in the document. This is useful on e-commerce websites, where documents are small and a user might search for two words written as a single word.
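As a rough illustration of the joining behaviour described above, here is a minimal standalone sketch in plain Java. It is not the plugin's code: the class name WordJoinSketch, the methods toyStem and joinPairs, and the one-rule stemmer (strip a trailing "s") are all hypothetical stand-ins, and the sketch assumes each word is joined with the original form of every later word.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class WordJoinSketch {

    // Hypothetical toy stemmer: strips one trailing "s" from longer words.
    // The plugin's real stemming rules are not shown here; this is a stand-in.
    static String toyStem(String token) {
        if (token.length() > 3 && token.endsWith("s")) {
            return token.substring(0, token.length() - 1);
        }
        return token;
    }

    // For each token, join its original and stemmed forms with every later token.
    // A LinkedHashSet drops duplicates (when the stem equals the original)
    // while keeping the order in which pairs were produced.
    static List<String> joinPairs(List<String> tokens) {
        Set<String> joined = new LinkedHashSet<>();
        for (int i = 0; i < tokens.size(); i++) {
            String original = tokens.get(i);
            String stemmed = toyStem(original);
            for (int j = i + 1; j < tokens.size(); j++) {
                String next = tokens.get(j);
                joined.add(original + next);
                joined.add(stemmed + next);
            }
        }
        return new ArrayList<>(joined);
    }

    public static void main(String[] args) {
        // Prints [babywipes, babypack, wipespack, wipepack]
        System.out.println(joinPairs(List.of("baby", "wipes", "pack")));
    }
}
```

With these assumptions, a document containing "baby wipes pack" would also match a query such as "babywipes" or "wipepack".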

Stemming happens in two steps.

To use the plugin, follow these steps:

  • clone the repo to your local machine
  • make sure Maven is installed on the machine
  • mvn -Dtest=EsAnalysisTests test (this runs the tests to make sure everything works; always rerun them after changing anything in the plugin)
  • mvn package (this packages the project and outputs a zip file under the target/releases folder in the project directory)
  • sudo /usr/share/elasticsearch/bin/elasticsearch-plugin install file:///home/ubuntu/es-analyzer-plugin-7.0.0.zip (this installs the plugin using the elasticsearch-plugin executable that ships with Elasticsearch; the path to the executable varies depending on how you installed Elasticsearch on the machine)
  • if the plugin is already installed, you will need to remove it first
