Replies: 3 comments 6 replies
-
@tholor @brandenchan can you please help @nlstrait out @PiffPaffM adding you for this part -
|
Beta Was this translation helpful? Give feedback.
-
Hi @nlstrait, happy to try and help out. Just want to align some terminology first because I'm not fully sure what you mean with regards to semantic search vs QA search. The tasks that we are interested in are Open Domain Questions answering (ODQA), Document Retrieval and Document Reranking. In ODQA, you extract a text answer span from a large collection of documents, generally using a Retriever + Reader. In Document Retrieval, you might only want the documents returned, not specific text answer spans. In Document Ranking, you get the candidate documents provided by the Retriever, and rerank them so that the most relevant answers appear at the top of your list. Is Document Ranking what you are looking for? |
Beta Was this translation helpful? Give feedback.
-
Thanks also for your feedback regarding Haystack's potential as a general NLP framework. You're right we focus a lot on ODQA, but that's only because this was our first big task. Thanks to community input and contributions, we now have a lot more components, including the Ranker, Summarizer, Translator. With pipelines you can piece these together as it suits you. Interested to hear what tasks and tutorials you feel are missing! |
Beta Was this translation helpful? Give feedback.
-
Hi there!
I am very excited to have discovered Haystack, but I am running into one major issue for my use case: I am struggling to construct a pipeline for pure semantic search (rather than QA search) that still enables me to pull out segments with the help of a Reader. Put another way, I've been unable to find a Reader for the simple semantic search case. Currently, the only examples of pure semantic search abandon Readers altogether, but for my application, a simple Retriever is not enough.
I've looked through models on Hugging Face, but looking at the 'Tasks' tags, none of them seem to fit my specification. I am relatively new to the ML and NLP communities and could easily be missing some useful terminology or understanding... Do you have any suggestions for Readers for pure semantic search?
As an aside: from a new-comers perspective, Haystack appears to be overemphasizing QA while having the latent potential of being more general-purpose. This seems like more an issue with marketing/branding than functionality, though the scarcity of general-purpose, non-QA tutorials is functionally limiting in a sense.
Thanks in advance for any feedback, and thank you for helping support this awesome tool and community!
Beta Was this translation helpful? Give feedback.
All reactions