@chishui the Sagemaker rate limit for endpoint requests is 10,000 per second, and we're ingesting only 250 documents.
CPU, GPU, and memory usage are very low during the execution, and Sagemaker doesn't register any 4xx or 5xx errors.
Describe the bug
I'm trying to use a model hosted on a Sagemaker endpoint, in the same AWS account as the OpenSearch cluster, to perform neural search.
While ingesting data into the index, I see the following error for many documents:
I don't see anything in the OpenSearch error logs, and I don't see any 4xx or 5xx errors in Sagemaker.
The error only happens with larger bulk requests, which in this case means 250 records. When I ingest only 20 records, it works.
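To make the 250 versus 20 comparison easy to repeat, here is a minimal sketch that splits the records into fixed-size batches and builds one `_bulk` request body per batch. The helper names `chunked` and `bulk_body`, and the index name passed in, are hypothetical and not part of the original report. The batch size of 20 is only the size observed to work above, not a recommended setting.

```python
import json
from typing import Any, Dict, Iterable, Iterator, List


def chunked(items: Iterable[Any], size: int) -> Iterator[List[Any]]:
    """Yield consecutive batches of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    batch: List[Any] = []
    for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly shorter, batch
        yield batch


def bulk_body(index_name: str, docs: Iterable[Dict[str, Any]]) -> str:
    """Build a newline-delimited JSON body for the _bulk API.

    Each document becomes two lines: an `index` action line and the
    document source. The body must end with a newline.
    """
    lines: List[str] = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index_name}}))
        lines.append(json.dumps(doc))
    return "\n".join(lines) + "\n"
```

Sending each body from `chunked(records, 20)` as its own bulk request shows whether the failures depend on the number of documents per request rather than on the total volume.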
I also took some of the documents that failed and ingested them separately, and that worked. So the issue is not with the documents or with the Sagemaker model.
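For anyone reproducing this, a small sketch of how the failed documents can be picked out of a bulk response so they can be re-ingested one by one. It assumes the standard `_bulk` response shape, where each entry in `items` has a single action key (such as `index`) holding `_id`, `status`, and an optional `error`. The function name `failed_items` is hypothetical.

```python
from typing import Any, Dict, List


def failed_items(bulk_response: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return one record per failed item in a _bulk response.

    `position` is the item's index in the response, which matches the
    order of the documents in the request.
    """
    failures: List[Dict[str, Any]] = []
    for position, item in enumerate(bulk_response.get("items", [])):
        # Each item is a dict with a single key naming the action.
        for action, result in item.items():
            status = result.get("status", 200)
            if "error" in result or status >= 400:
                failures.append(
                    {
                        "position": position,
                        "action": action,
                        "_id": result.get("_id"),
                        "status": status,
                        "error": result.get("error"),
                    }
                )
    return failures
```

Because `position` follows request order, the failed source documents can be looked up in the original batch and sent again individually.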
Related component
Plugins
To Reproduce
First, deploy the bge-base-en-v1.5 embedding model in Sagemaker using this python script:
Expected behavior
All documents are expected to have the following status after ingestion:
Additional Details
Plugins
Neural Search plugin
Host/Environment (please complete the following information):
I'm running it on the AWS managed OpenSearch service, version 2.11.