Bug in calculating loss when using DPO #30

zeta-zl · 2024-04-22T14:00:27Z

I have encountered what appears to be a bug in the loss calculation in dataloader.py (lines 167 to 174). The issue arises when using the dpo method.

pi_logratios = idk_loss_current - forget_loss_current
ref_logratios = idk_loss_oracle - forget_loss_oracle

beta = 0.1
loss = -F.logsigmoid(beta * (pi_logratios - ref_logratios)).mean()
print(loss.item())
loss = -pi_logratios.mean()
loss = -idk_loss_current.mean()

The final assignment, loss = -idk_loss_current.mean(), overwrites the DPO loss computed two lines earlier, which appears to contradict the expected result described in the paper.
Thank you for your attention to this matter.
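
To illustrate the concern, here is a minimal plain-Python sketch of the DPO objective from the snippet above, compared with the value the final assignment leaves in loss. It is not the repository's code: log_sigmoid and dpo_loss are hypothetical helpers standing in for F.logsigmoid on tensors, and all numeric inputs are made-up placeholders.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x)) for a single float."""
    return min(x, 0.0) - math.log1p(math.exp(-abs(x)))


def dpo_loss(pi_logratios, ref_logratios, beta=0.1):
    """Mean of -log(sigmoid(beta * (pi - ref))) over the batch."""
    terms = [
        -log_sigmoid(beta * (p - r))
        for p, r in zip(pi_logratios, ref_logratios)
    ]
    return sum(terms) / len(terms)


# Hypothetical per-example values (assumptions, not taken from the repository).
idk_loss_current = [-1.2, -0.8]
forget_loss_current = [-2.0, -1.5]
idk_loss_oracle = [-1.0, -1.0]
forget_loss_oracle = [-1.0, -1.0]

pi_logratios = [a - b for a, b in zip(idk_loss_current, forget_loss_current)]
ref_logratios = [a - b for a, b in zip(idk_loss_oracle, forget_loss_oracle)]

# The DPO objective computed on the first assignment to loss.
loss = dpo_loss(pi_logratios, ref_logratios, beta=0.1)

# What the last assignment in the snippet leaves in loss instead.
overwritten = -sum(idk_loss_current) / len(idk_loss_current)

print(loss, overwritten)
```

With these placeholder inputs the two quantities differ, so whichever assignment runs last determines what is actually optimized.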


molereddy · 2024-04-22T20:39:16Z

It was clarified in an earlier issue that the "dpo" loss is deprecated and isn't used in the results: see #20 (comment).

zeta-zl closed this as completed Apr 23, 2024
