Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Persistence Library: Dry run mode support for Bulk Load and Capture Data Errors #2642

Merged
merged 39 commits into from
Mar 13, 2024

Conversation

prasar-ashutosh
Copy link
Contributor

@prasar-ashutosh prasar-ashutosh commented Feb 20, 2024

What type of PR is this?

  • Improvement

What does this PR do / why is it needed ?

  1. Dry run mode support: Ability to run ingest in dry run which runs validation on the data without actually loading the data to target table
    1.1 Supported Formats: CSV, AVRO, JSON
    1.2 Supported Sink: H2, Snowflake
    1.2 Supported Error Categories: Data type Conversion , Null Check , File not found, parsing error, others
  2. Capture Fail on Duplicates Data errors
  3. Capture Data Version errors
  4. Bug Fix: Append a hash to the temporary table names to provide concurrent safety

Which issue(s) this PR fixes:

Fixes #

Other notes for reviewers:

Does this PR introduce a user-facing change?

NO

@prasar-ashutosh prasar-ashutosh requested a review from a team as a code owner February 20, 2024 07:13
Copy link

github-actions bot commented Feb 20, 2024

Test Results

     748 files  +  2       748 suites  +2   1h 4m 41s ⏱️ -34s
12 285 tests +28  12 120 ✔️ +28  165 💤 ±0  0 ±0 
15 318 runs  +28  15 143 ✔️ +28  175 💤 ±0  0 ±0 

Results for commit bea2326. ± Comparison against base commit 3037b89.

♻️ This comment has been updated with latest results.

kumuwu and others added 7 commits February 27, 2024 16:48
# Conflicts:
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-physical-plan/src/main/java/org/finos/legend/engine/persistence/components/executor/Executor.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-bigquery/src/main/java/org/finos/legend/engine/persistence/components/relational/bigquery/executor/BigQueryExecutor.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-core/src/main/java/org/finos/legend/engine/persistence/components/relational/RelationalSink.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-core/src/main/java/org/finos/legend/engine/persistence/components/relational/api/RelationalIngestorAbstract.java
#	legend-engine-xts-persistence/legend-engine-xt-persistence-component/legend-engine-xt-persistence-component-relational-core/src/main/java/org/finos/legend/engine/persistence/components/relational/executor/RelationalExecutor.java
@prasar-ashutosh prasar-ashutosh changed the title Persistence Library: Dry run mode support for Snowflake While loading CSV files Persistence Library: Dry run mode support Feb 28, 2024
@prasar-ashutosh prasar-ashutosh changed the title Persistence Library: Dry run mode support Persistence Library: Dry run mode support for CSV loading in Snowflake Feb 28, 2024
@prasar-ashutosh prasar-ashutosh changed the title Persistence Library: Dry run mode support for CSV loading in Snowflake Persistence Library: Dry run mode support for Bulk Load Mar 13, 2024
@prasar-ashutosh prasar-ashutosh changed the title Persistence Library: Dry run mode support for Bulk Load Persistence Library: Dry run mode support for Bulk Load and Capture Merge Errors Mar 13, 2024
@prasar-ashutosh prasar-ashutosh changed the title Persistence Library: Dry run mode support for Bulk Load and Capture Merge Errors Persistence Library: Dry run mode support for Bulk Load and Capture Data Errors Mar 13, 2024
@epsstan epsstan merged commit 20de9ff into finos:master Mar 13, 2024
3 checks passed
lisaknbur pushed a commit to goldmansachs/legend-engine that referenced this pull request Mar 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants