Skip to content

Commit

Permalink
Fix issue NVIDIA-Merlin#1226
Browse files Browse the repository at this point in the history
Since the datasets contains many null values, Int32 ( nullable integer ) should be used instead
  • Loading branch information
ibraheemalayan authored May 4, 2024
1 parent eb1e541 commit f8e108e
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion merlin/datasets/ecommerce/aliccp/dataset.py
Original file line number Diff line number Diff line change
Expand Up @@ -445,6 +445,6 @@ def _convert_data(
current = []

tmp_files = glob(os.path.join(tmp_dir, f"{data_type}_*.parquet"))
dtypes = {f.name: "int32" for f in _Features().features}
dtypes = {f.name: "Int32" for f in _Features().features}
merlin.io.Dataset(tmp_files, dtypes=dtypes).to_parquet(out_dir)
shutil.rmtree(tmp_dir)

0 comments on commit f8e108e

Please sign in to comment.