Replies: 7 comments 4 replies
-
In my view it is a nice idea. It would help in building scalable pipelines (for example, scaling an individual component and putting it behind a load balancer with caching support). There are multiple ways to achieve this; one quick hack I see is to add an operation mode. I am exploring this space for Obsei with frameworks like Ray, Temporal, Airflow, etc., but recently came across Apache Beam's Pipeline I/O implementation (https://beam.apache.org/documentation/io/built-in/), and from a design point of view I liked it. So the main idea is to add communication (REST, binary protocol, streaming, etc.) as part of the edge, and each node has to follow whatever its edge requires; a rough sketch of what I mean is below. But yes, it could make the tool quite bloated. I am curious to get more views around this topic :)
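To make the edge idea concrete, here is a minimal Python sketch. The names `Edge`, `LocalEdge`, `RestEdge`, and `process` are made up for illustration, not an existing Haystack or Beam API:

```python
# Hypothetical sketch: the edge owns the transport, nodes stay transport-agnostic.
from abc import ABC, abstractmethod
import requests

class Edge(ABC):
    """Moves a payload from one node to the next; transport is an edge concern."""
    @abstractmethod
    def send(self, payload: dict) -> dict: ...

class LocalEdge(Edge):
    """In-process edge: directly invokes the next node."""
    def __init__(self, next_node):
        self.next_node = next_node

    def send(self, payload: dict) -> dict:
        return self.next_node.process(payload)

class RestEdge(Edge):
    """REST edge: the next node runs remotely, e.g. behind a load balancer."""
    def __init__(self, url: str):
        self.url = url

    def send(self, payload: dict) -> dict:
        resp = requests.post(self.url, json=payload, timeout=30)
        resp.raise_for_status()
        return resp.json()
```

Swapping `LocalEdge` for `RestEdge` in the pipeline definition would then move a node out of process without touching the node's code; a streaming or binary-protocol edge would just implement the same `send` contract.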
-
I agree with the URI-first implementation. That would be the fastest way to make it happen!
-
Yep, definitely a good idea and direction. We are exploring Ray for parallelizing our pipelines and allowing distributed execution (#688). For our deployments, I could see a mix of Ray and Kubernetes to distribute our pipeline nodes to the right machines, but we are still in the phase of sorting out a good design here; a rough sketch of the Ray direction is below. I am open to any ideas and experience sharing regarding the above frameworks. From my perspective, Airflow is more suitable for batch jobs than real-time search queries. I am not sure how Apache Beam fits in, but I have also mostly heard of people using it for bigger batch jobs with Spark & Co.
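To illustrate the Ray direction: a heavy node could run as a Ray actor so it gets scheduled onto a machine with the right hardware while the rest of the pipeline stays lightweight. Only the `ray` calls below are real API; `ReaderActor` and its placeholder inference are made up:

```python
import ray

ray.init()  # or ray.init(address="auto") to join an existing cluster

# Hypothetical actor wrapping a heavy reader model. Ray can schedule it onto
# a GPU machine; drop num_gpus=1 if you are testing on a CPU-only machine.
@ray.remote(num_gpus=1)
class ReaderActor:
    def __init__(self, model_name: str):
        # Load the expensive model once, inside the actor process.
        self.model_name = model_name  # placeholder for real model loading

    def answer(self, query: str, contexts: list) -> list:
        # Placeholder inference; a real reader would run the model here.
        return [{"query": query, "context": c, "score": 0.0} for c in contexts]

reader = ReaderActor.remote("deepset/roberta-base-squad2")
# .remote() returns a future immediately; ray.get() blocks until it resolves.
future = reader.answer.remote("Who wrote Faust?", ["Goethe wrote Faust."])
print(ray.get(future))
```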
-
Moved it here to Discussions, as we want a clearer separation in the future between this kind of "early ideation & discussion" and actual issues/bugs/features.
-
On the DataTalks.Club community, I asked a question about developing a scalable model-serving ML pipeline to @hanneshapke (author of "Building Machine Learning Pipelines"). He suggested an interesting approach: in his words, a "combination between Apache Beam and Kubernetes".
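To sketch what that combination could look like (my illustration, not his code): inference can be wrapped in a Beam `DoFn`, and the pipeline can then run on a Beam runner deployed on Kubernetes, e.g. Flink. The model loading here is just a placeholder:

```python
import apache_beam as beam

class InferenceFn(beam.DoFn):
    """Runs model inference per element; the model loads once per worker."""
    def setup(self):
        # Placeholder: in practice, load your reader/QA model here.
        self.model = lambda text: {"input": text, "score": 0.0}

    def process(self, element):
        yield self.model(element)

# Add pipeline options to target a Flink/Dataflow runner on Kubernetes.
with beam.Pipeline() as p:
    (p
     | "ReadQueries" >> beam.Create(["who wrote faust?", "what is haystack?"])
     | "Infer" >> beam.ParDo(InferenceFn())
     | "Print" >> beam.Map(print))
```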
-
I recently saw on Hugging Face that there is an API endpoint for hosted Readers. It could be an option for companies that don't need on-premise deployment and can publish their models to HF (our case, lol). A rough example of calling it is below. Reference:
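For illustration, calling the hosted Inference API for an extractive QA model could look roughly like this. The payload follows the standard question-answering format; double-check the docs for the exact schema of the model you deploy:

```python
import requests

API_URL = "https://api-inference.huggingface.co/models/deepset/roberta-base-squad2"
HEADERS = {"Authorization": "Bearer <your-hf-api-token>"}  # token from your HF account

# Extractive QA payload: the hosted model plays the Reader role.
payload = {
    "inputs": {
        "question": "Why outsource the Reader?",
        "context": "The Reader needs GPU-class hardware, unlike the rest of the pipeline.",
    }
}

resp = requests.post(API_URL, headers=HEADERS, json=payload, timeout=30)
print(resp.json())  # e.g. {"answer": ..., "score": ..., "start": ..., "end": ...}
```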
-
@guillim @tholor @oryx1729 I am currently trying to revamp Obsei and hence working in a similar direction. What do you think about it? I feel some portion of it could be relevant for Haystack as well. I am copying the ticket content here; I would also love to hear feedback on my proposed design.
-
Question
Hello, I am wondering if there is a way to outsource the Reader, or any other highly resource-consuming component.
Additional context
The point is that a server with 2 GB of RAM will fit 99% of pipelines and most use cases. The Reader, however, requires very specific hardware/RAM and can't scale on such a machine. So instead of having to scale the whole Haystack server, we could simply have an external service running only the Reader. This external dedicated service could then be connected to the pipeline in pipelines.yaml, for example via a node like the sketch below.
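Purely hypothetical (not an existing Haystack node): a custom `RemoteReader` component that forwards the reading step over HTTP to the dedicated service, so only that service needs beefy hardware. The `/predict` endpoint is assumed, and the base-class import path may differ across Haystack versions:

```python
import requests
from haystack.nodes.base import BaseComponent  # import path may vary by version

class RemoteReader(BaseComponent):
    """Hypothetical node that delegates reading to an external dedicated service."""
    outgoing_edges = 1

    def __init__(self, url: str):
        super().__init__()
        self.url = url  # e.g. "http://reader-service:8000/predict" (assumed endpoint)

    def run(self, query: str, documents: list):
        payload = {
            "query": query,
            "documents": [doc.to_dict() for doc in documents],
        }
        resp = requests.post(self.url, json=payload, timeout=60)
        resp.raise_for_status()
        # Return the answers produced by the remote service on the default edge.
        return {"answers": resp.json()}, "output_1"
```

Such a node could then be declared in pipelines.yaml in place of a local FARMReader, with only `url` as a parameter, keeping the main Haystack server on cheap hardware.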
Curious to read your thoughts