[Feature/multi_tenancy] Add SearchDataObject interface, Client, and Connector Implementations #2559

dbwiddis · 2024-06-16T04:11:34Z

Description

Adds search method to SdkClient interface
Implements search method on Local and Remote clients
Migrates Connector Search action to use SdkClient for the searching
Updates Delete and Update connector actions which search model index to use new API

Issues Resolved

Continuation of PR #2459

Check List

New functionality includes testing.
- All tests pass
New functionality has been documented.
- New functionality has javadoc added
Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Daniel Widdis <[email protected]>

dhrubo-os · 2024-06-17T19:19:20Z

common/src/main/java/org/opensearch/sdk/SearchDataObjectRequest.java

+     * <p>
+     * For data storage implementations other than OpenSearch, an index may be referred to as a table
+     *
+     * @param indices             the indices to search for the object


How this will work? After searching in multiple indices are we going to combine the results and then send it back through SearchDataObjectResponse?

In practice in ML commons, there's a single index (the system index for connectors, models, etc.).

In general it can be an empty list (search all indices) or multiple indices. The search request really just packages up the entire JSON of the search query and the full response as well, parsing it back out from the XContent.

So the search response will be identical (some metadata/stats and then a list of hits).

dhrubo-os · 2024-06-17T19:20:22Z

plugin/src/main/java/org/opensearch/ml/action/connector/DeleteConnectorTransportAction.java

+
+            SearchDataObjectRequest searchDataObjectRequest = new SearchDataObjectRequest.Builder()
+                .indices(ML_MODEL_INDEX)
+                .searchSourceBuilder(sourceBuilder)


I think we should also add tenant id in the sourceBuilder?

Yes, have you already added it to all the others? I lost track with the multiple PRs at the same time.

No, I haven't made any changes with search functionalities yet.

I think @arjunkumargiri is adding tenant id in https://github.com/opensearch-project/ml-commons/pull/2553/files

It's not currently in of the other requests; probably needs to be its own standalone field.

yeah, I'll address that accordingly after that PR is being merged.

dhrubo-os · 2024-06-18T00:26:07Z

plugin/src/main/java/org/opensearch/ml/action/connector/DeleteConnectorTransportAction.java

+                                );
+                        }
+                    } else {
+                        Throwable cause = st.getCause() == null ? st : st.getCause();


May be we should have a recursive method to identify the root cause as we know the exception can be wrapped with other exception. What do you think about this method:

private Throwable getRootCause(Throwable throwable) { Throwable cause = throwable; while (cause.getCause() != null && cause != cause.getCause()) { cause = cause.getCause(); } return cause; }

See unwrapCause() here. This seems to do what you're asking for. https://github.com/opensearch-project/OpenSearch/blob/802f2e6e4b21f27ddc6c01e7fc6f6cdcd69138d3/libs/core/src/main/java/org/opensearch/ExceptionsHelper.java#L127-L145

Yes! Should we refactor the code accordingly then?

Possibly? But possibly not. I intentionally only unwrapped one level. Tracing the code, we know we're using actionGet so looking at the code we either throw the original exception (if it's a RTE) or it's a wrapped UncategorizedExecutionException containing the cause, which is what we're extracting here. So that built in recursive method is a bit too deep.

While a util method is helpful here, a ternary one-liner is also pretty brief so I'm struggling to see the huge benefit of a refactor.

So I'm open to it, but not thinking it's a high priority.

Client interface Search API and implementations

4aecac9

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis requested review from b4sjoo, dhrubo-os, jngz-es, model-collapse, rbhavna, ylwu-amzn, zane-neo, Zhangxunmt, austintlee, HenryL27 and sam-herman as code owners June 16, 2024 04:11

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 04:11 — with GitHub Actions Failure

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 04:11 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 04:11 — with GitHub Actions Failure

dbwiddis force-pushed the search-api branch from c226861 to 8128022 Compare June 16, 2024 05:57

dbwiddis temporarily deployed to ml-commons-cicd-env June 16, 2024 05:57 — with GitHub Actions Inactive

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 06:49 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 06:49 — with GitHub Actions Failure

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 06:49 — with GitHub Actions Error

dbwiddis force-pushed the search-api branch from 8128022 to 097a687 Compare June 16, 2024 17:41

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 17:41 — with GitHub Actions Failure

dbwiddis temporarily deployed to ml-commons-cicd-env June 16, 2024 23:47 — with GitHub Actions Inactive

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 23:48 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 23:48 — with GitHub Actions Failure

dbwiddis had a problem deploying to ml-commons-cicd-env June 16, 2024 23:48 — with GitHub Actions Error

dbwiddis force-pushed the search-api branch from 804ca81 to 0da3167 Compare June 17, 2024 01:33

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 01:33 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 01:33 — with GitHub Actions Failure

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 01:33 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 01:33 — with GitHub Actions Failure

Update search call in Update and Delete Connector actions using new API

f002571

Signed-off-by: Daniel Widdis <[email protected]>

dbwiddis force-pushed the search-api branch from 0da3167 to f002571 Compare June 17, 2024 03:04

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 03:04 — with GitHub Actions Error

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 03:04 — with GitHub Actions Failure

dbwiddis temporarily deployed to ml-commons-cicd-env June 17, 2024 03:05 — with GitHub Actions Inactive

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 03:57 — with GitHub Actions Failure

dbwiddis had a problem deploying to ml-commons-cicd-env June 17, 2024 03:57 — with GitHub Actions Error

dhrubo-os reviewed Jun 17, 2024

View reviewed changes

dhrubo-os reviewed Jun 18, 2024

View reviewed changes

dhrubo-os approved these changes Jun 18, 2024

View reviewed changes

dhrubo-os merged commit 9a92eeb into opensearch-project:feature/multi_tenancy Jun 18, 2024
6 of 12 checks passed

dbwiddis deleted the search-api branch June 18, 2024 04:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature/multi_tenancy] Add SearchDataObject interface, Client, and Connector Implementations #2559

[Feature/multi_tenancy] Add SearchDataObject interface, Client, and Connector Implementations #2559

dbwiddis commented Jun 16, 2024 •

edited

Loading

dhrubo-os Jun 17, 2024

dbwiddis Jun 17, 2024

dhrubo-os Jun 17, 2024

dbwiddis Jun 17, 2024

dhrubo-os Jun 17, 2024

dbwiddis Jun 17, 2024

dhrubo-os Jun 17, 2024

dhrubo-os Jun 18, 2024

dbwiddis Jun 18, 2024 •

edited

Loading

dhrubo-os Jun 18, 2024

dbwiddis Jun 18, 2024

[Feature/multi_tenancy] Add SearchDataObject interface, Client, and Connector Implementations #2559

[Feature/multi_tenancy] Add SearchDataObject interface, Client, and Connector Implementations #2559

Conversation

dbwiddis commented Jun 16, 2024 • edited Loading

Description

Issues Resolved

Check List

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dbwiddis Jun 18, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dbwiddis commented Jun 16, 2024 •

edited

Loading

dbwiddis Jun 18, 2024 •

edited

Loading