generated from finos/software-project-blueprint
-
Notifications
You must be signed in to change notification settings - Fork 237
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Persistence Component: Bulk Load Support for Big Query (#2294)
* Bug Fix: Bitemporal milestoning Derive Main schema removes the VALID_FROM/VALID_TRHOUGH field if the name matches with source specified VALID_FROM/VALID_TRHOUGH fields * Bug Fix: Bitemporal milestoning Schema Evolution must ignore user provided validity fields * Adding code for concurrent safety feature * Adding test for Multi Ingest Mode with concurrent Safety * Adding tests for concurrent safety * Code Clean up * Bug Fix: Bitemporal temp tables must be deleted after usage * Update readme and code review comments * Fix typo * Fix typos in readme * Bug Fix: Empty Batch Handling in Unitemp Snapshot * Bug Fix: Code review comments * Implement StagedFilesDatasetReference * Implement StagedFilesSelection * Support for Empty Batch Handling in Unitemporal Snapshot * Support for FailEmptyBatch strategy in Unitemporal Snapshot * Enrich datasets to add additionalDatasetproperties every where * Add tests for Empty Data handling * Support ICEBERG_TABLE_2022 for Iceberg tables * Implement bulk load for h2 * Add missing datasetAdditionalProperties in BigQueryHelper * Implement executor flow and add end-to-end test * Add test * Add support for digest udf in h2 and add tests * Clean up * Add file format and validation for file format and add tests * Add missing statement * Fix typo in year * Fix comments * Add H2 MD5 * Change file format interface * Change stats * Change stats - make snowflake always return stats no matter success or failure * Implement bulk load in using copy and insert + modify interface for capabilities * Add Support for metadata for BulkLoad Task * Refactor Digest Generation Strategy * Implement bulk load for big query * Addressed Code Review Comments * Clean up * Add basic tests for bulk load for big query * Refactor Code to rename AppendLog to BulkLoad * Add default bulkLoad Batchid * Refactor Append Log table name * Resolve conflicts * Add digest udf and more tests * Fix digest problem * Change H2 digest algo * Fix tests * Fix typo * Refactor file format and load options * Refactor dataset, selection, reference logic * Fix other comments * Fix big query bulk load ingestor flow and add end-to-end tests * Add rows with error handling and test * Address comments * Bulk Load Batch ID and Task ID & PK Validation (#11) * Add PK validation in bulk load * Resolve conflict * Remove unnecessary delete * Introduce bulk load batch id and bulk load task id * Rename variable * Address Code Review Comments --------- Co-authored-by: Ashutosh <[email protected]> Co-authored-by: prasar-ashutosh <[email protected]>
- Loading branch information
1 parent
fb7e2bb
commit bc6cc1a
Showing
70 changed files
with
2,218 additions
and
273 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -19,5 +19,5 @@ public enum FileFormat | |
CSV, | ||
JSON, | ||
AVRO, | ||
PARQUET | ||
PARQUET; | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -37,5 +37,6 @@ public enum FunctionName | |
GENERATE_ARRAY, | ||
PARSE_DATETIME, | ||
OBJECT_CONSTRUCT, | ||
TO_VARIANT; | ||
TO_VARIANT, | ||
TO_JSON; | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.