-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to run Atari environments? #1
Comments
Hi Aj, Do the implementations run for you if you run Cartpole-v0? It would be nice to know that this works on other machines. Also, main.py runs DQN (with some extras), a2c.py runs an actor-advantage-critic algorithm (using replay mem rather than distributed rep) and NEC.py runs the NEC agent. -Will EDIT: Just remembered that a2c.py should be set up to work with Atari envs. Have a look at that if you want to look at adapting the others. |
Hi Will, thanks a lot for the help 👍 - I'm just working through your code and the paper now. Cartpole seems to work well - I haven’t checked against my A3C implementations, but from memory I think it looks better,
I'll try to get it working for the Atari envs too :) If you're interested there's a fairly clean implement in PyTorch Looks like a fun project 👍 All the best - Aj PS - I've only read the paper quickly, but it seems there's no need for the actor-critic stuff in a2c? |
Hi Will, I was wondering whether you got this working for 2D pixel inputs, i.e. Atari. If so did you manage to get anywhere close to DMs published results, (I guess they do a lot of model searching/hyper-parameter tuning) ? All the best, Aj |
Hi Aj, Code is a bit of a mess, so will probably tidy it up before sharing. |
Update: |
Great, thanks very much for your work on it :) I guess if it does'nt perform SOTA on Atari, (or you can't tune it as well as DM), you'll find some environments where it is strong in - you know the Wolpert and Macready NFL thm,
|
Hi @EndingCredits,
this is really cool that you got the
NEC
working 👍Have you tried to run your code on the Atari environments, in Open AI gym?
I tried to train on
Pong
, but I got this error,I guess it might be related to
TF v1.0
, does this repo use an earlier version?Thank a lot for your help,
Aj
The text was updated successfully, but these errors were encountered: