Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[META] Support application building using ingest and search pipelines in OpenSearch Dashboards #245

Open
14 of 20 tasks
ohltyler opened this issue Jul 25, 2024 · 0 comments
Open
14 of 20 tasks
Assignees
Labels

Comments

@ohltyler
Copy link
Member

ohltyler commented Jul 25, 2024

This is a meta issue capturing all of the details for the plugin's initial release targeted for 2.17.

Overview

Background & problem statement

Over the last several OpenSearch releases, there have been many enhancements made in ingest and search pipelines to support more flexible ML-related use cases, allowing customers to build exactly what they need. But with flexibility, comes complication and difficulty in configuration. It can be hard for users to see how their data flows and is transformed through these pipelines, as well as uncovering where certain problems may lie.

Solution

The [Flow Framework] plugin lets users build out custom search applications using ingest and search pipelines in a user-friendly UI. Users can iteratively build and test these pipelines, and visually see how data is transformed at each step. They can test out different processors, test out different configurations within individual processors, and finally, run concrete ingest and search queries to see how it performs. When users are satisfied with their solution, they can export their produced configuration via a Flow Framework workflow template. This template can then be exported and imported into different clusters, letting users scale and deploy their solution across their workloads/fleets.

Under the hood, this plugin uses the Flow Framework plugin to maintain configuration, created resources, and automatic provisioning and deprovisioning of such resources as users iteratively build out and test their applications.

Next steps

In the future, we would like to expand the plugin's functionality in several different ways. These include:

  1. More processor support: onboarding and integrating with more existing ingest / search request / search pipeline processors can increase the usability and flexibility of the plugin
  2. Add drag-and-drop support: rather than a form-based, guardrailed approach to configuration, the plugin could utilize the (currently read-only) ReactFlow workspace, into a customizable, drag-and-drop experience for building out their flows.
  3. Advanced / DAG-based flows: currently, the flows are tied to ingest and search pipelines, which only allow a single flow of data transformations. Allowing users to build DAG-based flows and parallel executions could greatly enhance the flexibility provided to users.

Checklist

For the 2.17 release, these are the following remaining issues to be completed:

@peterzhuamazon peterzhuamazon added Meta and removed Meta labels Aug 12, 2024
@getsaurabh02 getsaurabh02 added the Roadmap:Search Project-wide roadmap label label Aug 12, 2024
@minalsha minalsha added Roadmap:Vector Database/GenAI Project-wide roadmap label and removed Roadmap:Search Project-wide roadmap label labels Sep 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: 2.17 (First RC 09/03, Release 09/17)
Status: New
Development

No branches or pull requests

4 participants