Skip to content
This repository has been archived by the owner on Sep 27, 2023. It is now read-only.

Currently "sentiment-basic" assumes text is Spanish, ignores "language" param #2

Open
drevicko opened this issue Apr 28, 2017 · 3 comments

Comments

@drevicko
Copy link

The calculation of sentiment of the analyse function appears to assume that the language is Spanish (see lines 79 and 84 of the analyse method).

Also, the "language" parameter, though extracted on line (116)[https://github.com/MixedEmotions/UPM_sentiment_and_emotion/blob/master/sentiment-basic/sentiment-basic.py#L116] is never used.

@militarpancho
Copy link
Contributor

The repo was outdated. We changed it in this commit: 5427b02#diff-478193974a5b639c4e5549d70bf999b9L112, adding support for several language using TextBlob

@drevicko
Copy link
Author

drevicko commented Apr 28, 2017

ok, that looks a lot better.

There is still a potential problem with pos tagging though, unless the pos tagger is multilingual, though the one bundled (unigram_spanish.pickle) looks like a spanish pos tagger.

However the pos tags don't appear to be used anywhere in the code except _compare_synsets(), which itself is not used anywhere, so it looks like it should work as expected.

Feel free to close this if you're happy with things as they are.

@militarpancho
Copy link
Contributor

As you point, pos_tagger is not used. I will remove in next commits. Thanks for the info 👍

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants