spark-sql-integration

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md

README.md

Spark SQL Application

This application execute sql query and store the result in OpenSearch index in following format

"stepId":"<emr-step-id>",
"applicationId":"<spark-application-id>"
"schema": "json blob",
"result": "json blob"

Prerequisites

Spark 3.3.1
Scala 2.12.15
flint-spark-integration

Usage

To use this application, you can run Spark with Flint extension:

./bin/spark-submit \
    --class org.opensearch.sql.SQLJob \
    --jars <flint-spark-integration-jar> \
    sql-job.jar \
    <spark-sql-query> \
    <opensearch-index> \
    <opensearch-host> \
    <opensearch-port> \
    <opensearch-scheme> \
    <opensearch-auth> \
    <opensearch-region> \

Result Specifications

Following example shows how the result is written to OpenSearch index after query execution.

Let's assume sql query result is

+------+------+
|Letter|Number|
+------+------+
|A     |1     |
|B     |2     |
|C     |3     |
+------+------+

OpenSearch index document will look like

{
  "_index" : ".query_execution_result",
  "_id" : "A2WOsYgBMUoqCqlDJHrn",
  "_score" : 1.0,
  "_source" : {
    "result" : [
    "{'Letter':'A','Number':1}",
    "{'Letter':'B','Number':2}",
    "{'Letter':'C','Number':3}"
    ],
    "schema" : [
      "{'column_name':'Letter','data_type':'string'}",
      "{'column_name':'Number','data_type':'integer'}"
    ],
    "stepId" : "s-JZSB1139WIVU",
    "applicationId" : "application_1687726870985_0003"
  }
}

Build

To build and run this application with Spark, you can run:

sbt clean sparkSqlApplicationCosmetic/publishM2

Test

To run tests, you can use:

sbt test

Scalastyle

To check code with scalastyle, you can run:

sbt scalastyle

Code of Conduct

This project has adopted an Open Source Code of Conduct.

Security

If you discover a potential security issue in this project we ask that you notify AWS/Amazon Security via our vulnerability reporting page. Please do not create a public GitHub issue.

License

See the LICENSE file for our project's licensing. We will ask you to confirm the licensing of your contribution.

Copyright

Copyright OpenSearch Contributors. See NOTICE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

spark-sql-integration

spark-sql-integration

README.md

Spark SQL Application

Prerequisites

Usage

Result Specifications

Build

Test

Scalastyle

Code of Conduct

Security

License

Copyright

Files

spark-sql-integration

Directory actions

More options

Directory actions

More options

Latest commit

History

spark-sql-integration

Folders and files

parent directory

README.md

Spark SQL Application

Prerequisites

Usage

Result Specifications

Build

Test

Scalastyle

Code of Conduct

Security

License

Copyright