Hi everyone,
What kind of issue is this?
Issue description
We use Spark to manipulate an array of distinct objects in an Elasticsearch index.
The field in the Elasticsearch index is mapped as:
When we call Spark's explode function on a dataset read from Elasticsearch, the connector generates the following query:
The "exists" part of the query is generated to distinguish calls to explode from calls to explode_outer, because explode drops null elements whereas explode_outer keeps them.
But because the field is mapped as nested and the generated query is not a nested query, it never matches any document, so the dataset is always empty.
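To make the difference concrete, here is a minimal Python sketch of the two query shapes. It is not the connector's actual output: the path `items`, the sub-field `items.name`, and both helper functions are hypothetical names chosen for illustration. The first shape is a plain "exists" filter of the kind described above; the second wraps the same clause in an Elasticsearch `nested` query so that it is evaluated against the nested documents.

```python
import json

# Hypothetical names, used only for illustration.
NESTED_PATH = "items"
NESTED_FIELD = "items.name"


def exists_query(field):
    """Plain `exists` clause, evaluated against top-level documents."""
    return {"exists": {"field": field}}


def nested_exists_query(path, field):
    """Same `exists` clause wrapped in a `nested` query for `path`."""
    return {"nested": {"path": path, "query": exists_query(field)}}


# Shape that finds nothing when the field is mapped as nested.
plain = {"query": {"bool": {"filter": [exists_query(NESTED_FIELD)]}}}

# Shape that targets the nested documents instead.
wrapped = {
    "query": {
        "bool": {"filter": [nested_exists_query(NESTED_PATH, NESTED_FIELD)]}
    }
}

print(json.dumps(plain, indent=2))
print(json.dumps(wrapped, indent=2))
```

Sending both bodies to the `_search` endpoint of the affected index should show whether only the wrapped form returns hits.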
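Steps to reproduce
Create an index with a nested mapped field
Put a document with a valued nested field
Read the index from Spark into a dataset
Call Spark explode(field) on the nested field on the dataset
The dataset is empty because the generated query does not match any document
Version Info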
OS: Linux
JVM: 1.8
Hadoop/Spark: Spark 3.3.0
ES-Hadoop: elasticsearch-spark-30_2.12:8.2.2
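For anyone trying to reproduce this, the first two steps could look roughly like the request bodies built below. This is only a sketch: the index name `orders`, the field name `items`, and its `name` sub-field are assumptions made up for the example, not the mapping from the affected index.

```python
import json

# Hypothetical index and field names, not taken from the report.
INDEX = "orders"
FIELD = "items"

# Step 1: index mapping with the field declared as `nested`.
mapping = {
    "mappings": {
        "properties": {
            FIELD: {
                "type": "nested",
                "properties": {"name": {"type": "keyword"}},
            }
        }
    }
}

# Step 2: a document whose nested field holds two objects.
document = {FIELD: [{"name": "a"}, {"name": "b"}]}


def describe(method, path, body):
    """Render a request as text so it can be pasted into a REST console."""
    return f"{method} {path}\n{json.dumps(body, indent=2)}"


print(describe("PUT", f"/{INDEX}", mapping))
print(describe("PUT", f"/{INDEX}/_doc/1", document))
```

The remaining steps need a running cluster and a Spark session, so they are not sketched here.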