-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: Tensor Size Mismatch During Training #2
Comments
Hi Xiao Xiong, |
I also have a similar issue. When I run bash ioi_sweep.sh, it returns me that src/modeling/modeling_fpt2.py", line 1192, in write tok_embeds = tok_embeds + corr_x[0] * (1 - z_tokens) |
@XiaoXiong-Sherry @ZekaiZhaostats I also have this issue when I run ioi_sweep.sh with two GPU device, the dims of |
Thank you all for the suggestions! I ran it with two GPUs and indeed, I got the error originally reported. It seems to have gone undetected since we ran our GPT-2 experiments on one GPU. Based on the suggestion by @TianBian95, I have now introduced a fix to the |
I am encountering the following error while executing run_scripts/gt_sweep.sh:
File "Edge-Pruning/src/modeling/modeling_fpt2.py", line 1293, in forward hidden_states, embeds, z_nodes_sum = self.write(inputs_embeds, position_embeds, corr_x=corr_x)
File "Edge-Pruning/src/modeling/modeling_fpt2.py", line 1184, in write tok_embeds = tok_embeds + corr_x[0] * (1 - z_tokens)
RuntimeError: The size of tensor a (32) must match the size of tensor b (64) at non-singleton dimension 0
Thank you for your assistance!
The text was updated successfully, but these errors were encountered: