This repository is maintained by the KNIME Team Rakete.
KNIME Analytics Platform - Text Processing Integration is designed to read, enrich, manipulate, and extract textual data, and transform it into numerical representations, such as document or term vectors. Once numerical vectors are created, regular KNIME nodes can be applied, for example, for predictive modeling, clustering analysis, or visualization.
This repository contains the source code for KNIME - Text Processing Integration. The code is organized as follows:
- org.knime.ext.textprocessing: Text Processing integration nodes and data types
- org.knime.ext.textprocessing.dl4j: Text Processing Word2Vec integration
- org.knime.ext.textprocessing.models: External models for tokenization and tagging (NER, POS, etc.)
You can find instructions on how to work with our code or develop extensions for KNIME Analytics Platform in the knime-sdk-setup repository on BitBucket or GitHub.