Document Embedding: add Word Embedding output? #846

Closed
ajdapretnar opened this issue May 9, 2022 · 1 comment
Labels
enhancement feast This may require a few weeks of work

Comments

@ajdapretnar
Collaborator

Is your feature request related to a problem? Please describe.
Word embeddings could be used in semantic analysis or vocabulary exploration.

Describe the solution you'd like
It would be nice if Document Embedding could also output embedded words. This would simply mean skipping the aggregation step and returning a list of word types with their corresponding embeddings.
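
To illustrate the difference between the current aggregated output and the requested one, here is a minimal sketch. It is not the widget's code: the toy vectors, the function names `word_embeddings` and `document_embedding`, and the choice of mean as the aggregation are all assumptions made for this example.

```python
from statistics import fmean

# Hypothetical toy word vectors; a real embedder would supply these.
TOY_VECTORS = {
    "cat": [1.0, 0.0],
    "dog": [0.8, 0.2],
    "car": [0.0, 1.0],
}


def word_embeddings(tokens, vectors):
    """Return one (type, vector) pair per distinct known token, with no aggregation."""
    types = sorted(set(tokens))
    return [(t, vectors[t]) for t in types if t in vectors]


def document_embedding(tokens, vectors):
    """Aggregate a document's known word vectors into one vector (mean assumed here)."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None  # no known words, so nothing to aggregate
    return [fmean(dim) for dim in zip(*known)]


tokens = ["cat", "dog", "cat", "bus"]  # "bus" has no vector and is skipped
per_word = word_embeddings(tokens, TOY_VECTORS)    # proposed additional output
per_doc = document_embedding(tokens, TOY_VECTORS)  # aggregated output, one vector per document
```

The proposed output would correspond to `per_word`: one row per word type, rather than one row per document.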

Describe alternatives you've considered
/

Additional context
Adding this kind of output would make the name of the widget a little strange.
The output could then be used to semantically label documents with the most suitable label, i.e. to replicate QDA in an automated way. The new QDA widget would have two inputs, Corpus and Word Embeddings, and would use them to find the word closest to each document.
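
A rough sketch of how such labelling might work, assuming cosine similarity as the closeness measure. The issue does not specify a measure, and the label vectors, document vector, and function names below are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def closest_label(doc_vector, label_vectors):
    """Return the label word whose embedding is most similar to the document embedding."""
    return max(label_vectors, key=lambda word: cosine(doc_vector, label_vectors[word]))


# Hypothetical inputs: candidate label words with embeddings, and one document embedding.
LABEL_VECTORS = {"animals": [0.9, 0.1], "vehicles": [0.1, 0.9]}
doc_vector = [0.7, 0.3]

best = closest_label(doc_vector, LABEL_VECTORS)
```

This only works if the word and document embeddings come from the same embedder, so that both live in the same vector space.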

@ajdapretnar ajdapretnar added enhancement feast This may require a few weeks of work labels Oct 5, 2022
@ajdapretnar
Collaborator Author

Actually, this can already be achieved with the following workflow: Word Cloud (Word Counts output) → Corpus (set Word as the text feature) → Document Embedding.
