This repository has been archived by the owner on Jul 30, 2022. It is now read-only.

Commit

Bumped version to 0.1.0. Updated README.md.
martijndeb committed Aug 16, 2014
1 parent 9c7b7c9 commit 6c96201
Showing 2 changed files with 10 additions and 4 deletions.
10 changes: 8 additions & 2 deletions README.md
@@ -27,7 +27,7 @@ haxe -main my.namespace.Application -lib haxe-linguisticts --interp
```

## Supported languages ##
-Since the beginning English (as main language), Dutch and German have been supported. Want to contribute? Take a peek at the Dutch (nl) implementation and send a pull request.
+Since the beginning English (as main language), Dutch and German have been supported as first class citizens, I added basic support for the Frisian language as a second class citizen. Want to contribute? Take a peek at the Dutch (nl) implementation and send a pull request.

## Tokenizing ##
Basic tokenizers are present for all current supported languages.
@@ -101,4 +101,10 @@ Quickest way is to use haxe in interpreter mode
```
cd src/
haxe --main tests.TestCaseRunner --interp
-```
+```
+
+# Roadmap #
+- Nested tokenization allowing tokens to have a parent and children. (Opening doors for a Sentence or quotation token).
+- Applying tags to tokens and being able to filter them based on a token. (This differs from classification, but a classification could be a tag).
+- Stemming of languages. (Porter stemmer would be sufficient).
+- Parts of speech tagging. (This would require stemming and POS dictionaries).
4 changes: 2 additions & 2 deletions haxelib.json
@@ -4,8 +4,8 @@
"license": "MIT",
"tags": ["cross", "linguistics", "language", "nlp", "analysis", "classification"],
"description": "Linguistical analysis and natural language processing library for Haxe",
-"version": "0.0.3",
-"releasenote": "Adds German as a first citizen language. Adds BasicStringBuilder utility for use with the BasicTokenizer. Externalized token filtering from tokenizers to ITokenFilter instances.",
+"version": "0.1.0",
+"releasenote": "Adds Frisian language support, BasicStringBuilder, separation of filtering and tokenizing, StopwordTokenFilter. Updated tests.",
"contributors": ["sexybiggetje"],
"dependencies": {
}
