Problems in wikipedia-wordlist #1

Open
nisargjhaveri opened this issue Aug 9, 2015 · 1 comment

Comments

@nisargjhaveri
Contributor

There seems to be a problem with wikipedia-wordlist. I'm not sure exactly what it is, but it looks like either all matras have been removed from the words, or the list doesn't contain any words with matras.

It also contains some words with Latin or other non-Gujarati characters, which are not really proper Gujarati words.
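
A quick way to check the first suspicion is to measure what share of the entries contain at least one matra. The snippet below is only a sketch: it assumes the wordlist is a plain UTF-8 text file with one word per line, and it treats the Gujarati dependent vowel signs in U+0ABE..U+0ACC as "matras". The function names are made up for illustration and are not part of the repository.

```python
# Gujarati dependent vowel signs (matras) occupy U+0ABE..U+0ACC.
# Assumption: this range is what "matra" means in the report above.
MATRA_START, MATRA_END = 0x0ABE, 0x0ACC


def has_matra(word):
    """Return True if the word contains at least one Gujarati vowel sign."""
    return any(MATRA_START <= ord(ch) <= MATRA_END for ch in word)


def matra_ratio(words):
    """Return the fraction of non-empty words that contain a matra."""
    words = [w for w in words if w]
    if not words:
        return 0.0
    return sum(has_matra(w) for w in words) / len(words)


def matra_ratio_of_file(path):
    """Hypothetical helper: read one word per line from a UTF-8 file."""
    with open(path, encoding="utf-8") as handle:
        return matra_ratio(line.strip() for line in handle)
```

A ratio at or near zero on a large list would support the idea that the matras were stripped during extraction, since most Gujarati words contain at least one.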

@kartikm
Owner

kartikm commented Aug 10, 2015

They were taken from http://dumps.wikimedia.org/backup-index.html, and I guess they need a lot of filtering :/
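
As a rough idea of what that filtering could look like, here is a minimal sketch that keeps only entries written entirely in the Gujarati Unicode block (U+0A80..U+0AFF) and containing at least one Gujarati letter. It assumes one word per line, and the function names are hypothetical. Note that this strict check also drops words containing ZWJ/ZWNJ or punctuation, which may or may not be what the wordlist wants.

```python
import unicodedata

# Gujarati Unicode block.
GUJARATI_START, GUJARATI_END = 0x0A80, 0x0AFF


def is_gujarati_word(word):
    """True if every character is in the Gujarati block and at least one
    of them is a letter (so digit-only or sign-only entries are rejected)."""
    if not word:
        return False
    if not all(GUJARATI_START <= ord(ch) <= GUJARATI_END for ch in word):
        return False
    return any(unicodedata.category(ch) == "Lo" for ch in word)


def filter_wordlist(lines):
    """Strip, NFC-normalize, and de-duplicate entries, keeping only
    Gujarati-script words in their original order."""
    seen = set()
    kept = []
    for line in lines:
        word = unicodedata.normalize("NFC", line.strip())
        if is_gujarati_word(word) and word not in seen:
            seen.add(word)
            kept.append(word)
    return kept
```

This only addresses the entries with Latin or other foreign characters. If the matras were already lost when the words were extracted from the dump, no filter on the finished list can restore them, and the extraction step itself would need to be redone.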
