You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I successfully reproduced the issue on Kaggle T4 x2 GPUs, but the error did not occur when I ran the same code in Google Colab with the v4 runtime (as mentioned in the tutorial notebook). In Kaggle, one GPU’s memory is fully occupied, and the code params = params_lib.load_and_format_params(ckpt_path) does not automatically utilize the free GPU. This suggests a memory allocation management issue. To resolve this and avoid the error, please refer to the solution provided in the code below, as well as the linked gist file for more details.
I'm following
colabs/fine_tuning_tutorial.ipynb
, but still ran out of memory on stepparams = params_lib.load_and_format_params(ckpt_path)
error message:
I thought it shouldn't be running out of memory with a 2b model on T4x2. How can I solve this issue?
The text was updated successfully, but these errors were encountered: