[AWS::Glue::Crawler] - [Enhancement] - https://github.com/aws/aws-cdk/issues/31133 #2120
Open
1 of 2 tasks
Labels
enhancement
New feature or request
Name of the resource
AWS::Glue::Crawler
Resource name
No response
Description
Posting : aws/aws-cdk#31133
Describe the feature
Hello
If I set up my crawler from GUI console I can choose Hudi S3 Table as a data source to be crawled. This is not yet possible neither on CFN or CDK. CDK already has iceberg and delta lake as sources for the crawler but no Hudi : https://docs.aws.amazon.com/cdk/api/v2/docs/aws-cdk-lib.aws_glue.CfnCrawler.html
Use Case
I would like to use the serde Hudi and not parquet when crawling my hudi S3 folders, although available via GUI it is still not present in CDK or CFN
Proposed Solution
P1. Add at least to CloudFormation HudiTarget to complete the data lake frameworks formats (given that Iceberg and Delta Target are present in CFN)
P2. Add to CDK the target type
Other Information
No response
Acknowledgements
CDK version used
"aws-cdk-lib": "^2.115.0",
Environment details (OS name and version, etc.)
MACBookPro M1
Other Details
No response
The text was updated successfully, but these errors were encountered: