Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build or find Semantic Similarity Dataset for Persian #12

Open
sehsanm opened this issue Dec 3, 2018 · 2 comments
Open

Build or find Semantic Similarity Dataset for Persian #12

sehsanm opened this issue Dec 3, 2018 · 2 comments
Assignees
Milestone

Comments

@sehsanm
Copy link
Owner

sehsanm commented Dec 3, 2018

See Persian Word Embedding Evaluation Benchmarks for semantic Relatedness dataset

Also see: J. Camacho-Collados, M. T. Pilehvar, N. Collier, and R. Navigli,
“Semeval-2017 task 2: Multilingual and cross-lingual semantic
word similarity,” in Proceedings of the 11th International
Workshop on Semantic Evaluation (SemEval 2017). Vancouver,
Canada, 2017.

The data must be stored in data/wordsim folder

@zahramajd
Copy link
Collaborator

zahramajd commented Dec 17, 2018

I just uploaded Semantic Similarity Dataset file to OneDrive, column 1 and column 2 are the pair words and column 3 is the score of their relatedness or similarity. (@kibamin please consider this format)
Unfortunately, I made a mistake and forgot to convert it to .csv, @sehsanm please give me the access to remove and change these files on OneDrive.

@sehsanm
Copy link
Owner Author

sehsanm commented Dec 19, 2018

@zahramajd Please put the file in the repository(data/similarity) and not onedrive (only corpus and models will go there as they are large)
I will delete the file from the OneDrive folder

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants