Cannot reproduce the results of IQL on antmaze #163

Shenzhi-Wang · 2022-04-03T09:50:51Z

I ran examples/iql/antmaze_finetune.py, but the results are poor: the return oscillates between 0 and 1 (as shown in the figure below), which is very different from the result figures in examples/iql/README.md.

anair13 · 2022-04-03T19:09:02Z

I think you just need to smooth the curve. Each epoch contains one rollout, which either succeeds or fails, so the raw per-epoch return is always 0 or 1. Can you average the returns over a moving window and plot it again? Our results were plotted with https://github.com/rail-berkeley/rlkit/blob/master/rlkit/visualization/plot_util.py
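
For reference, here is a minimal sketch of the moving-window averaging suggested above. It is not the code in rlkit's plot_util.py. The helper name `moving_average`, the window size of 20, and the synthetic 0/1 returns are all assumptions made for illustration; in practice the returns would come from your own training log.

```python
import numpy as np


def moving_average(values, window):
    """Return the mean of each full window of `window` consecutive values."""
    values = np.asarray(values, dtype=float)
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    kernel = np.ones(window) / window
    # mode="valid" keeps only positions where the window fully overlaps the data,
    # so the output has len(values) - window + 1 entries.
    return np.convolve(values, kernel, mode="valid")


# Synthetic stand-in for per-epoch returns (assumption): one rollout per epoch,
# each either failing (0) or succeeding (1), with success getting more likely.
rng = np.random.default_rng(0)
success_prob = np.linspace(0.1, 0.9, 200)
returns = (rng.random(200) < success_prob).astype(float)

# Window size 20 is an arbitrary choice for this sketch.
smoothed = moving_average(returns, window=20)
```

Plotting `smoothed` instead of `returns` gives an estimate of the success rate over the last 20 epochs rather than a curve that jumps between 0 and 1. A larger window gives a smoother curve but reacts more slowly to changes.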
