
[HUDI-7436] Fix the conditions for determining whether the records need to be rewritten #10727

Open
wants to merge 1 commit into base: master
Conversation

@ThinkerLei (Contributor) commented on Feb 22, 2024

Change Logs

Fix the conditions for determining whether the records need to be rewritten

Impact

low

Risk level (write none, low, medium or high below)

low

Documentation Update

Describe any necessary documentation update if there is any new feature, config, or user-facing change

  • The config description must be updated if new configs are added or the default value of an existing config is changed
  • Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
    ticket number here and follow the instruction to make
    changes to the website.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@ThinkerLei (Contributor Author)

@xiarixiaoyao PTAL when you have free time, thanks a lot~

@ThinkerLei changed the title from "HUDI-7436:Fix the conditions for determining whether the records need to be rewritten" to "[HUDI-7436] Fix the conditions for determining whether the records need to be rewritten" on Feb 22, 2024
  boolean needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size()
-     || SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType() == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;
+     || !(SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType()
+         == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE);
Contributor

Yes, we need to avoid unnecessary rewrite overhead. Thanks for your fix.

@xiarixiaoyao (Contributor)

@ThinkerLei pls fix checkstyle thanks

@ThinkerLei (Contributor Author)

@ThinkerLei pls fix checkstyle thanks

Thanks for your comment, it has been modified

@ThinkerLei (Contributor Author)

@xiarixiaoyao @danny0405 The failed CI has nothing to do with my modifications; could anyone help me trigger the CI again?

The github-actions bot added the size:XS (PR with lines of changes in <= 10) label on Feb 26, 2024
@@ -202,7 +202,9 @@ private Option<Function<HoodieRecord, HoodieRecord>> composeSchemaEvolutionTrans
   Schema newWriterSchema = AvroInternalSchemaConverter.convert(mergedSchema, writerSchema.getFullName());
   Schema writeSchemaFromFile = AvroInternalSchemaConverter.convert(writeInternalSchema, newWriterSchema.getFullName());
   boolean needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size()
-      || SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType() == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;
+      || !(SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType()
+          == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE);
Contributor

I'm confused here. If the schema is not compatible, the merging should fail here, correct?

Contributor

Can incompatible Avro schemas be blended together?

Contributor Author

@danny0405 @yihua Thanks for your comment. What you said makes sense to me. Is it reasonable to use needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size() && SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType() == SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE; ?

Contributor

This is the original logic, right?

Contributor Author

This is the original logic, right?

The original logic is boolean needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size() || SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType() == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;

Contributor

So there is no need to rewrite if the schema is compatible and the field counts are equal?

Contributor Author

I don't have much experience with this; @xiarixiaoyao, do you have any suggestions?

Contributor Author

Alternatively, we could find another way to determine whether the newWriterSchema is consistent with the writeSchemaFromFile. If they are consistent, then there would be no need for rewriting.

Contributor

The original logic has certain performance issues.
If the read and write schemas are compatible, I think we do not need to rewrite the entire record, since we can correctly read the old Parquet file with the new schema.

@danny0405 @ThinkerLei
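As an aside for readers following the thread, here is a minimal sketch of the point above in plain Avro (not code from this PR; the class name CompatibleReadDemo, the schemas, and the default value are illustrative assumptions): a record encoded with the old writer schema can be decoded directly against a compatible new schema, with the added field falling back to its default, so no per-record rewrite is needed.

import java.io.ByteArrayOutputStream;
import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.avro.generic.GenericData;
import org.apache.avro.generic.GenericDatumReader;
import org.apache.avro.generic.GenericDatumWriter;
import org.apache.avro.generic.GenericRecord;
import org.apache.avro.io.BinaryDecoder;
import org.apache.avro.io.BinaryEncoder;
import org.apache.avro.io.DecoderFactory;
import org.apache.avro.io.EncoderFactory;

// Sketch only: the record name, fields a/b/c, and the default value are assumptions for illustration.
public class CompatibleReadDemo {
  public static void main(String[] args) throws Exception {
    Schema oldSchema = SchemaBuilder.record("rec").fields()
        .requiredString("a").requiredInt("b").endRecord();
    Schema newSchema = SchemaBuilder.record("rec").fields()
        .requiredString("a").requiredInt("b")
        .name("c").type().longType().longDefault(0L)   // newly added column with a default
        .endRecord();

    // A record written with the old schema.
    GenericRecord oldRecord = new GenericData.Record(oldSchema);
    oldRecord.put("a", "k1");
    oldRecord.put("b", 42);

    // Encode it with the old (writer) schema.
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    BinaryEncoder encoder = EncoderFactory.get().binaryEncoder(out, null);
    new GenericDatumWriter<GenericRecord>(oldSchema).write(oldRecord, encoder);
    encoder.flush();

    // Decode it directly against the new (reader) schema; field "c" falls back to its default.
    BinaryDecoder decoder = DecoderFactory.get().binaryDecoder(out.toByteArray(), null);
    GenericRecord evolved =
        new GenericDatumReader<GenericRecord>(oldSchema, newSchema).read(null, decoder);
    System.out.println(evolved);   // {"a": "k1", "b": 42, "c": 0} -- no rewrite of the record
  }
}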

@hudi-bot

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

@danny0405 (Contributor)

Nice ping for @xiarixiaoyao ~

-     || SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType() == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;
+     && SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType()
+         == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;

Contributor

So when the column counts are equal, there is no need to rewrite regardless of whether the schema is compatible?

@xiarixiaoyao (Contributor) commented on Mar 14, 2024

@danny0405 @ThinkerLei
I thought about it again.
sameCols.size() == colNamesFromWriteSchema.size() only happens in the following scenario:
the table has new columns, while the old columns have not been changed (no renames or type changes).
e.g.:

write schema: a string, b int, c long
read schema: a string, b int, c long, d int

In this case
sameCols.size() == colNamesFromWriteSchema.size(),
and the write schema is equivalent to a pruned read schema.

However, some versions of Avro, such as Avro 1.8.x, may report errors when using pruned schemas to read Avro files (Avro 1.10.x has no such problem).

Therefore, even if sameCols.size() == colNamesFromWriteSchema.size(), we still need to check the compatibility of the read and write schemas. If they are compatible, we can directly use this write schema to read the Avro data.

Maybe we can use the following logic to avoid an unnecessary rewrite:

boolean needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size()
    || !(SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType()
        == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE);
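A minimal, self-contained sketch of the scenario above in plain Avro (the class name SchemaPairCheckDemo, the record/field names, and the default on "d" are illustrative assumptions): the read schema only adds a defaulted column, so the compatibility check reports COMPATIBLE, which is the case where the rewrite can be skipped.

import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.avro.SchemaCompatibility;

// Sketch only: record/field names and the default on "d" are assumptions for illustration.
public class SchemaPairCheckDemo {
  public static void main(String[] args) {
    Schema writeSchema = SchemaBuilder.record("rec").fields()
        .requiredString("a").requiredInt("b").requiredLong("c")
        .endRecord();
    Schema readSchema = SchemaBuilder.record("rec").fields()
        .requiredString("a").requiredInt("b").requiredLong("c")
        .name("d").type().intType().intDefault(0)   // new column with a default value
        .endRecord();

    SchemaCompatibility.SchemaPairCompatibility result =
        SchemaCompatibility.checkReaderWriterCompatibility(readSchema, writeSchema);
    System.out.println(result.getType());   // COMPATIBLE

    // The compatibility half of the suggested condition: rewrite only when the pair is NOT compatible.
    boolean rewriteBecauseIncompatible =
        result.getType() != SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE;
    System.out.println(rewriteBecauseIncompatible);   // false -> the rewrite can be skipped
  }
}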

Contributor

@danny0405
This actually raises an additional question.
Currently, when we read a MOR table, we pass the full schema when reading the Avro logs. Even if we only query one column, if the table has 100 Avro logs, using the full schema to read the data and build the record map (BitCaskDiskMap) consumes a lot of memory, and the performance is not good.
Now that our current version of Avro has been upgraded to 1.10.x, we can pass pruned schemas directly when reading the logs. This way, both the speed of reading the logs and building the record map and the memory consumption are much better.
Forgive me that I cannot paste test screenshots due to company information-security reasons.

Presto reading Hudi logs:

With the full schema, we see the following log:
Total size in bytes of MemoryBasedMap in ExternalSpillableMap => 712,956,000
final query time: 35672 ms

With the pruned schema:
Total size in bytes of MemoryBasedMap in ExternalSpillableMap => 45,500,000
final query time: 13373 ms
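A minimal sketch of such a projected read in plain Avro 1.10+ (the class name PrunedReadDemo, the file path, and the schemas are illustrative assumptions, and it reads a plain Avro container file rather than a Hudi log block): the pruned reader schema is passed to the datum reader so only the queried column is decoded per record.

import java.io.File;
import org.apache.avro.Schema;
import org.apache.avro.file.DataFileReader;
import org.apache.avro.generic.GenericDatumReader;
import org.apache.avro.generic.GenericRecord;

// Sketch only: the path /tmp/sample.avro and both schemas are assumptions for illustration.
public class PrunedReadDemo {
  public static void main(String[] args) throws Exception {
    Schema fullSchema = new Schema.Parser().parse(
        "{\"type\":\"record\",\"name\":\"rec\",\"fields\":["
            + "{\"name\":\"a\",\"type\":\"string\"},"
            + "{\"name\":\"b\",\"type\":\"int\"},"
            + "{\"name\":\"c\",\"type\":\"long\"}]}");
    Schema prunedSchema = new Schema.Parser().parse(
        "{\"type\":\"record\",\"name\":\"rec\",\"fields\":["
            + "{\"name\":\"a\",\"type\":\"string\"}]}");

    // Writer schema = the full schema the data was written with; reader schema = the pruned projection.
    GenericDatumReader<GenericRecord> datumReader =
        new GenericDatumReader<>(fullSchema, prunedSchema);
    try (DataFileReader<GenericRecord> fileReader =
             new DataFileReader<>(new File("/tmp/sample.avro"), datumReader)) {
      for (GenericRecord record : fileReader) {
        System.out.println(record.get("a"));   // only column "a" is materialized per record
      }
    }
  }
}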

Contributor

Good point for optimization. We introduced some changes like the dynamic read schema based on the write schema in release 1.x for the HoodieFileGroupReader, but I'm not sure whether it is applied automatically for all the read paths; cc @yihua for confirming this.

And anyway, I think we should have such an optimization in the 0.x branch and master for the legacy HoodieMergedLogRecordReader, which will still be beneficial to engines like Flink and Hive.

@xiarixiaoyao do you have interest in contributing this?

Contributor Author

@danny0405 @ThinkerLei I thought about it again. ... Maybe we can use the following logic to avoid an unnecessary rewrite:

boolean needToReWriteRecord = sameCols.size() != colNamesFromWriteSchema.size()
    || !(SchemaCompatibility.checkReaderWriterCompatibility(newWriterSchema, writeSchemaFromFile).getType()
        == org.apache.avro.SchemaCompatibility.SchemaCompatibilityType.COMPATIBLE);

This was the logic of my initial fix. Do I still need to make changes on top of this PR? cc @danny0405 @xiarixiaoyao

Contributor

Yeah, let's change it back; we'd better have some test cases.

Contributor

Good point for optimization ... @xiarixiaoyao do you have interest in contributing this?

will try

Contributor

@xiarixiaoyao This info is valuable. Basically using pruned schema to read Avro records is supported on Avro 1.10 and above, not on lower versions. I see that Spark 3.2 and above and all Flink versions use Avro 1.10 and above. So for these integrations and others that rely on Avro 1.10 and above, we should use pruned schema to read log records to improve performance. I'll check the new file group reader.

Labels: size:XS (PR with lines of changes in <= 10)
Projects: None yet
5 participants