Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Persistence Component: Bulk Load Support for Big Query #2294

Merged
merged 69 commits into from
Oct 19, 2023

Conversation

kumuwu
Copy link
Contributor

@kumuwu kumuwu commented Sep 22, 2023

What type of PR is this?

  • Improvement

What does this PR do / why is it needed ?

  • Added support for bulk load for Big Query
  • Added unit tests and end-to-end tests with Big Query
  • Modified H2 digest MD5 algorithm to provide consistency
  • Modified capabilities interface
  • Implement bulk load batch id as ever increasing integer and changed the original batch id to event id
  • Fixed bug in Snowflake bulk load stats: return stats also when the status is failed

Which issue(s) this PR fixes:

Fixes #

Other notes for reviewers:

Does this PR introduce a user-facing change?

prasar-ashutosh and others added 30 commits August 18, 2023 14:52
…FROM/VALID_TRHOUGH field if the name matches with source specified VALID_FROM/VALID_TRHOUGH fields
… zhlizh-bulk-load

# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/logicalplan/datasets/StagedFilesDatasetProperties.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/planner/BulkLoadPlanner.java
kumuwu and others added 10 commits September 19, 2023 11:25
… zhlizh-big-query-bulk-load

# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/common/FileFormat.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/planner/BulkLoadPlanner.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-core/src/main/java/org/finos/legend/engine/persistence/components/relational/sqldom/common/Clause.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/main/java/org/finos/legend/engine/persistence/components/relational/h2/logicalplan/datasets/H2StagedFilesDatasetPropertiesAbstract.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/java/org/finos/legend/engine/persistence/components/ingestmode/bulkload/BulkLoadTest.java
# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/planner/BulkLoadPlanner.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/java/org/finos/legend/engine/persistence/components/ingestmode/bulkload/BulkLoadTest.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table1.csv
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table2.csv
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table3.csv
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table4.csv
…legend-engine into zhlizh-big-query-bulk-load

# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/planner/BulkLoadPlanner.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/java/org/finos/legend/engine/persistence/components/ingestmode/bulkload/BulkLoadTest.java
… zhlizh-big-query-bulk-load

# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-logical-plan/src/main/java/org/finos/legend/engine/persistence/components/planner/BulkLoadPlanner.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/java/org/finos/legend/engine/persistence/components/ingestmode/bulkload/BulkLoadTest.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table3.csv
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-h2/src/test/resources/data/bulk-load/expected/expected_table4.csv
@kumuwu kumuwu requested a review from a team as a code owner September 22, 2023 07:27
@kumuwu kumuwu changed the title Persistence Component: Bulk Load Support forBig Query Persistence Component: Bulk Load Support for Big Query Sep 22, 2023
@github-actions
Copy link

github-actions bot commented Sep 22, 2023

Test Results

     693 files  +     693       693 suites  +693   58m 39s ⏱️ + 58m 39s
11 446 tests +11 446  11 062 ✔️ +11 062  384 💤 +384  0 ±0 
14 338 runs  +14 338  13 870 ✔️ +13 870  468 💤 +468  0 ±0 

Results for commit f611912. ± Comparison against base commit a8d72b3.

♻️ This comment has been updated with latest results.

akphi
akphi previously approved these changes Oct 16, 2023
@rafaelbey rafaelbey merged commit bc6cc1a into finos:master Oct 19, 2023
3 checks passed
@kumuwu kumuwu deleted the zhlizh-big-query-bulk-load branch January 19, 2024 03:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants