You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm using this package io.github.spark-redshift-community:spark-redshift_2.12:4.2.0 as a dependency in the context of AWS EMR job trying to save a dataframe to Redshift.
Hi there
I'm using this package io.github.spark-redshift-community:spark-redshift_2.12:4.2.0 as a dependency in the context of AWS EMR job trying to save a dataframe to Redshift.
Sadly this attempt fails with following stacktrace:
https://gist.github.com/marek-babic/0110160bdd0ba11533b6f425559d2f1c
I know that the dataframe is in healthy state as show() and printSchema() output what I expect and the schema matches the one from Redshift table.
The code looks like so (where the capital letter vars are set appropriately):
I tried to save the dataframe to S3 just by running:
which worked, so the permissions in AWS are correct.
Any ideas why this could be happening?
Thanks
Marek
The text was updated successfully, but these errors were encountered: