Compare loss on enwik8 #16

Bachstelze · 2022-12-04T15:57:29Z

The validation loss went to 1.3564 after 1 epoch of training with a small model on enwik8.
How can this be evaluated and compared with respect to other models?

conceptofmind · 2023-01-12T19:12:12Z

@Bachstelze That is difficult to properly evaluate at a smaller scale without sufficient pre-training. It would likely require a single go over something like OpenWebText or Wikitext if you wanted to get a more detailed analysis. ColossalAI trained a PaLM model based on Lucid's repository on Wikitext and it may be worth checking out their results: https://github.com/hpcaitech/PaLM-colossalai

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compare loss on enwik8 #16

Compare loss on enwik8 #16

Bachstelze commented Dec 4, 2022

conceptofmind commented Jan 12, 2023 •

edited

Loading

Compare loss on enwik8 #16

Compare loss on enwik8 #16

Comments

Bachstelze commented Dec 4, 2022

conceptofmind commented Jan 12, 2023 • edited Loading

conceptofmind commented Jan 12, 2023 •

edited

Loading