[BUG] pinned_host_vector can cause abrupt program termination #14165
Labels: 1 - On Deck, bug, libcudf, Spark
Describe the bug
pinned_host_vector can throw from within its destructor, causing the application process to terminate abruptly. For example, this is causing abnormal termination of Spark executors when a GPU illegal access occurs during a Parquet read. Since the executor process is abruptly terminated via a low-level abort(), there's no chance to convey a useful message from the driver to the executor or otherwise try to handle, at the application level, the cudf::fatal_cuda_error exception that is being thrown.
Steps/Code to reproduce bug
Compile and run the following program:
which will produce the following output:
Expected behavior
Exceptions should not be thrown from destructors, where they cause the application to be terminated abruptly with no chance to shut down gracefully.
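To illustrate the expected behavior, here is a minimal sketch of a destructor that contains its own failures. It is not libcudf code and not necessarily the fix that should be adopted: `guarded_buffer`, `release_or_throw`, and `run_failing_read` are hypothetical names, and the "device error" is simulated with a plain `std::runtime_error`. Since C++11, destructors are implicitly `noexcept`, so any exception that escapes one results in `std::terminate`. Catching inside the destructor lets the original exception keep propagating to the caller.

```cpp
#include <cstdio>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a deallocation call that can fail, for example
// because the device is already in an error state. Not a libcudf or CUDA API.
inline void release_or_throw(bool device_faulted)
{
  if (device_faulted) { throw std::runtime_error("simulated fatal device error"); }
}

// Hypothetical RAII owner whose destructor never lets an exception escape.
class guarded_buffer {
 public:
  guarded_buffer(bool device_faulted, int* failures)
    : device_faulted_{device_faulted}, failures_{failures}
  {
  }
  guarded_buffer(guarded_buffer const&)            = delete;
  guarded_buffer& operator=(guarded_buffer const&) = delete;

  ~guarded_buffer()
  {
    try {
      release_or_throw(device_faulted_);
    } catch (std::exception const& e) {
      // Record and log the failure instead of propagating it.
      ++(*failures_);
      std::fprintf(stderr, "release failed in destructor: %s\n", e.what());
    } catch (...) {
      ++(*failures_);
    }
  }

 private:
  bool device_faulted_;
  int* failures_;  // counts release failures swallowed by the destructor
};

// Simulates a read that fails while a guarded_buffer is alive, and returns the
// message of the exception that actually reaches the caller.
inline std::string run_failing_read(int* failures)
{
  try {
    guarded_buffer buf{true, failures};
    throw std::runtime_error("simulated illegal memory access");
  } catch (std::exception const& e) {
    return e.what();
  }
}
```

With this shape, the release failure during stack unwinding is logged and counted, and the caller still sees the original error, so the application gets a chance to report it and shut down on its own terms.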