Upgrade Spark 3.5.1 #525
* Support Spark 3.4.1
  Signed-off-by: Peng Huo <[email protected]>
* Ignore FlintSparkWindowingFunctionITSuite and IcebergIT
  Signed-off-by: Peng Huo <[email protected]>
---------
Signed-off-by: Peng Huo <[email protected]>

* Fix IcebergIT and Refactor SessionCatalog
  Signed-off-by: Peng Huo <[email protected]>
* update format
  Signed-off-by: Peng Huo <[email protected]>
* fix UT
  Signed-off-by: Peng Huo <[email protected]>
* address comments
  Signed-off-by: Peng Huo <[email protected]>
---------
Signed-off-by: Peng Huo <[email protected]>
Signed-off-by: Peng Huo <[email protected]>
…earch-project#349)
* enable Iceberg IT
  Signed-off-by: Peng Huo <[email protected]>
* push down read-padding on char type
  Signed-off-by: Peng Huo <[email protected]>
---------
Signed-off-by: Peng Huo <[email protected]>

* Bump Spark version
  Signed-off-by: Chen Dai <[email protected]>
* Ignore broken IT temporarily
  Signed-off-by: Chen Dai <[email protected]>
* Fix broken IT
  Signed-off-by: Chen Dai <[email protected]>
---------
Signed-off-by: Chen Dai <[email protected]>
LGTM
Leave this PR as it is for author to decide whether to squash or not.
Thanks for the change. Is there any data you can share for the items below?
Performance test
Direct SparkSQL query performance
Flint index building performance, including skipping/CV/MV
Query acceleration performance, including skipping/CV
This reverts commit d6e71fa.
Description
Merge feature branch spark-3.5.1.
Test
Perf Test - direct query
Compared to Spark 3.3 (EMR-S 6.10), there is no significant difference observed.
Perf Test - direct query OpenSearch index
SELECT COUNT(*) FROM dev.default.logs-181998
SELECT COUNT(*) FROM dev.default.logs-181998 WHERE status <> 0
SELECT COUNT(*), AVG(size) FROM dev.default.logs-181998
SELECT AVG(CAST(size AS BIGINT)) FROM dev.default.logs-181998
SELECT MIN(@timestamp), MAX(@timestamp) FROM dev.default.logs-181998
SELECT status, COUNT(*) FROM dev.default.logs-181998 WHERE status <> 0 GROUP BY status ORDER BY COUNT(*) DESC
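For anyone who wants to reproduce the comparison, here is a minimal sketch of a timing harness for the queries listed above. It is not the procedure used for this PR: `time_queries`, `run_query`, `repeats`, and `clock` are hypothetical names, and taking the median over a few repeats is an assumption about how one might reduce noise.

```python
import statistics
import time
from typing import Callable, Dict, Iterable


def time_queries(
    run_query: Callable[[str], object],
    queries: Iterable[str],
    repeats: int = 3,
    clock: Callable[[], float] = time.perf_counter,
) -> Dict[str, float]:
    """Return the median wall-clock seconds per query over `repeats` runs.

    `run_query` is a caller-supplied function (hypothetical) that executes
    one SQL string and blocks until the result is fully materialized.
    """
    results: Dict[str, float] = {}
    for sql in queries:
        samples = []
        for _ in range(repeats):
            start = clock()
            run_query(sql)
            samples.append(clock() - start)
        # The median is less sensitive than the mean to a slow first (cold) run.
        results[sql] = statistics.median(samples)
    return results
```

With PySpark, `run_query` could be something like `lambda q: spark.sql(q).collect()`, assuming an existing `SparkSession` named `spark`. The hyphenated table name in the queries above would likely need backtick quoting when passed to Spark SQL. Running the same harness against both Spark versions gives per-query numbers that can be compared side by side.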
Issues Resolved
#352
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.