
Different results of halfcheetah #1

Open · rainbow979 opened this issue Jan 17, 2023 · 7 comments

@rainbow979

Hello,

I tried the same config as the repo and got the same good performance as the paper. However, when I tried the halfcheetah env, the testing score was much lower than the results from the paper. I only changed returns_scale from 400 to 800, since halfcheetah has higher discounted returns. The training loss is shown in the figure below; the blue line is halfcheetah and the purple line is hopper. I am wondering if there are other hyperparameters that need to be changed.
[attached figure: training loss curves; blue = halfcheetah, purple = hopper]
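For context, here is a minimal sketch of how one could estimate a suitable returns_scale for a given D4RL dataset. It assumes the standard gym/d4rl dataset API; the discount value and the helper name are illustrative, not taken from this repo:

```python
# Minimal sketch (assumptions: standard gym + d4rl APIs, discount = 0.99).
# Computes the largest discounted episode return in a D4RL dataset, which
# gives a rough lower bound on the returns_scale needed to keep the
# conditioning returns <= 1.
import gym
import d4rl  # noqa: F401  (importing d4rl registers the offline envs)
import numpy as np

def max_discounted_return(env_name, discount=0.99):
    dataset = gym.make(env_name).get_dataset()
    rewards = dataset["rewards"]
    # Episode boundaries are either terminal states or timeouts.
    dones = np.logical_or(dataset["terminals"], dataset["timeouts"])

    best, ret, disc = float("-inf"), 0.0, 1.0
    for r, done in zip(rewards, dones):
        ret += disc * float(r)
        disc *= discount
        if done:
            best = max(best, ret)
            ret, disc = 0.0, 1.0
    return best

if __name__ == "__main__":
    for name in ("halfcheetah-medium-v2", "hopper-medium-v2"):
        print(name, max_discounted_return(name))
```

If the maximum discounted return is close to (or above) the chosen returns_scale, the conditioning values saturate near 1, which is one plausible reason the same scale does not transfer from hopper to halfcheetah.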

@WangYong-Design

Have you kept track of how long it takes to run an experiment on GPU?

@Looomo

Looomo commented Jun 11, 2023

Hi, I was trying to reproduce the results of DD, but I couldn't: the results on each dataset differ significantly from those presented in Table 1 of the paper. (I have only tried 4 datasets: walker2d-medium-replay-v2, walker2d-medium-v2, halfcheetah-medium-replay-v2, and halfcheetah-medium-v2.) Did you change anything in this repo, or did you just run the downloaded code directly? Thanks!
Also, did you find the code for the length-K history conditioning in this repo?

@RenMing-Huang

same question

1 similar comment
@SpaceLearner

same question

@xishuxishu

Hello, have you solved this problem?

@wangerlie

wangerlie commented Nov 12, 2024

Hello @rainbow979,
I also tried to reproduce the results. I ran the code directly without ANY changes, but I just couldn't get the same result on 'hopper-medium-expert-v2'. I am wondering how to reproduce the result on 'hopper-medium-expert-v2'; have you changed any hyperparameters? Thanks a lot.

@wangerlie

I am also wondering: do you get the same warning as the one below? I don't think it's reasonable; there may be a problem in the normalization code.
[screenshot of the warning]
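The exact warning is not visible here, but if it concerns returns exceeding 1 after normalization, a quick check along these lines (illustrative helper, not the repo's API) shows whether returns_scale is simply too small for the dataset:

```python
# Minimal sketch (illustrative names, not from the repo): check how the
# discounted returns behave once divided by returns_scale.
import numpy as np

def check_return_scaling(discounted_returns, returns_scale):
    normed = np.asarray(discounted_returns, dtype=float) / returns_scale
    print(f"max normalized return: {normed.max():.3f}")
    print(f"fraction of returns > 1 after scaling: {(normed > 1.0).mean():.2%}")

# Dummy values standing in for the dataset's discounted returns:
check_return_scaling([350.0, 420.0, 780.0, 810.0], returns_scale=800.0)
```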
