Training settings in the paper differ from the code #15

aapanaetov · 2022-10-24T02:49:08Z

Hi! Thank you for publishing such an amazing work! In the paper you decay learning rate after 6e5 steps while in HQ_Dictionary.yaml it is set to 4e5 steps. Schedule steps, learning rate and loss weights in the configs for both HQ dictionary and RestoreFormer are different from the paper. Which settings should I use to reproduce your excellent results?

wzhouxiff · 2022-10-26T03:48:02Z

Please follow the setting described in the paper. Note that the learning rate set in the config is not the actual learning rate. It will be divided by the number of gpus used. The learning rate described in the paper is the one after dividing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training settings in the paper differ from the code #15

Training settings in the paper differ from the code #15

aapanaetov commented Oct 24, 2022 •

edited

Loading

wzhouxiff commented Oct 26, 2022

Training settings in the paper differ from the code #15

Training settings in the paper differ from the code #15

Comments

aapanaetov commented Oct 24, 2022 • edited Loading

wzhouxiff commented Oct 26, 2022

aapanaetov commented Oct 24, 2022 •

edited

Loading