targets were set to all -100 in stage2_sft stage due to cur_len != expected_len #39

hchc007 · 2024-08-19T13:23:29Z

Hi,

In stage2_sft.py (line 292), all targets are set to -100 because cur_len isn't updated to match expected_len. This looks like a bug. Could you please help verify it? Thanks!

    if cur_len < tokenizer.model_max_length:
        if cur_len != expected_len:
            for k in range(total_len):
                target[k] = IGNORE_TOKEN_ID
            rank0_print(
                f"WARNING: tokenization mismatch: {cur_len} vs. {total_len}."
                f" #turn = {len(turns) - 1}. (ignored)"
            )
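To illustrate the effect, here is a minimal sketch of the same check on a plain Python list. It is not the repository's code: `mask_if_mismatch` and `is_fully_masked` are hypothetical helpers, the value -100 for `IGNORE_TOKEN_ID` is taken from the issue title, and the loop runs over `len(target)` instead of `total_len` for simplicity.

```python
IGNORE_TOKEN_ID = -100  # assumed value, taken from the issue title


def mask_if_mismatch(target, cur_len, expected_len, model_max_length):
    """Hypothetical helper mirroring the quoted check on a plain list."""
    if cur_len < model_max_length and cur_len != expected_len:
        # Any length disagreement overwrites every label in the sample.
        for k in range(len(target)):
            target[k] = IGNORE_TOKEN_ID
    return target


def is_fully_masked(target):
    """True when no position would contribute to the loss."""
    return all(t == IGNORE_TOKEN_ID for t in target)


# Lengths agree: labels are left untouched.
kept = mask_if_mismatch([5, 6, 7, 8], cur_len=4, expected_len=4, model_max_length=16)

# Lengths disagree by one: every label is overwritten.
dropped = mask_if_mismatch([5, 6, 7, 8], cur_len=3, expected_len=4, model_max_length=16)

print(is_fully_masked(kept), is_fully_masked(dropped))
```

If cur_len never matches expected_len, every sample takes the second path, so no token is supervised. Counting how many samples satisfy `is_fully_masked` over a batch would be one way to confirm whether that is happening.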
