
There is a training loop in the trainer which returns loss and accuracy for the last batch #1

Open
burlachenkok opened this issue Oct 2, 2021 · 1 comment

Comments

@burlachenkok

return losses[-1], accs[-1]

I understand that, to save computing power, we measure accuracy and loss on the fly as the model is being updated, but the current implementation, which just returns the loss and accuracy of the last batch, may be slightly incorrect.

I suggest replacing it with:
```python
return sum(losses) / len(losses), sum(accs) / len(accs)
```

It is still an approximate computation of loss and accuracy, because each batch is evaluated at a different point of the model's parameters.
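For context, here is a minimal sketch of the kind of episode training loop being discussed, with the proposed averaging applied at the end. The loop body, argument names, and the PyTorch-style `model` / `optimizer` / `loss_fn` objects are assumptions for illustration, not the project's actual code; only the return statements mirror the snippets above.

```python
def train_episode(model, optimizer, loss_fn, batches):
    # Hypothetical episode training loop (PyTorch-style objects assumed).
    losses, accs = [], []
    for x, y in batches:
        optimizer.zero_grad()
        logits = model(x)
        loss = loss_fn(logits, y)
        loss.backward()
        optimizer.step()

        # Metrics are recorded on the fly, so each entry corresponds to a
        # different set of model parameters.
        losses.append(loss.item())
        accs.append((logits.argmax(dim=1) == y).float().mean().item())

    # Current behaviour: metrics of the last batch only.
    # return losses[-1], accs[-1]

    # Proposed behaviour: average over all batches of the episode.
    return sum(losses) / len(losses), sum(accs) / len(accs)
```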

@universome
Collaborator

To be honest, this design choice feels subjective. In the current implementation, we return a noisy estimate of the final training loss / final training accuracy of the episode. You propose to return the average training loss / average training accuracy instead. Since the model has random performance at the beginning of each episode, such a measure would be polluted by the bad scores received at the start of training, especially if num_train_steps_per_episode is small (as in our case). To me, evaluating a model based on its final performance feels more natural, but you can use whatever measure you are comfortable with.
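A small numeric illustration of the trade-off described above (the numbers are made up): when an episode has only a few training steps, the mean is dominated by the near-random scores from the start of the episode, while the last value tracks end-of-episode performance but from a single noisy batch.

```python
# Hypothetical per-step training accuracies of one short episode
# (made-up numbers; the model starts near chance and improves).
accs = [0.10, 0.35, 0.60, 0.78, 0.82]

last = accs[-1]               # 0.82: noisy, but reflects end-of-episode performance
mean = sum(accs) / len(accs)  # 0.53: pulled down by the early near-random steps
print(last, mean)
```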
