Implement Pop-Art #6

Kaixhin · 2016-04-13T08:20:49Z

Learning functions across many orders of magnitudes introduces Preserving Outputs Precisely, while Adaptively Rescaling Targets (Pop-Art). In summary it normalises outputs across orders of magnitudes and gets rid of the clipping (i.e. counting) rewards heuristic for Atari games. The normalisation is also better for non-stationary problems, i.e., any decent real world problem.

The below is a picture of extra notes from the authors, next to their poster at NIPS 2016:

Kaixhin added enhancement help wanted labels Apr 13, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Pop-Art #6

Implement Pop-Art #6

Kaixhin commented Apr 13, 2016 •

edited

Loading

Implement Pop-Art #6

Implement Pop-Art #6

Comments

Kaixhin commented Apr 13, 2016 • edited Loading

Kaixhin commented Apr 13, 2016 •

edited

Loading