You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am running experiments using sampled softmax: I am using new method (topk_encoder) to evaluate when am using sampled softmax. However, when am not using sampled softmax, I expected both the below methods to return the same metrics, however they do not.
Once this is run, now please run the following and compare the metric values coming from model.evaluate() above and topk_model.evaluate below. The results do not match.
Please note that the metrics results change each time we rerun topk_model.evaluate(). I added shuffle=False in the loader, but I still get metrics values different.
The text was updated successfully, but these errors were encountered:
rnyak
changed the title
[BUG] trainer.evaluate() and topk_model.evaluate() do not give same metric (case of no sampled softmax)
[BUG] model.evaluate() and topk_model.evaluate() do not give same metric (case of no sampled softmax)
Jun 14, 2023
Bug description
I am running experiments using sampled softmax: I am using new method (topk_encoder) to evaluate when am using sampled softmax. However, when am not using sampled softmax, I expected both the below methods to return the same metrics, however they do not.
Steps/Code to reproduce bug
Original way of evaluation:
New way of evaluation:
Expected behavior
In the case of no sampled softmax, expected both the above codes to provide the same result.
Environment details
Additional context
REPRODUCIBLE EXAMPLE (from Ronay):
Once this is run, now please run the following and compare the metric values coming from
model.evaluate()
above andtopk_model.evaluate
below. The results do not match.Please note that the
metrics
results change each time we reruntopk_model.evaluate()
. I addedshuffle=False
in the loader, but I still get metrics values different.The text was updated successfully, but these errors were encountered: