Skip to content

Commit

Permalink
add comment, GLIE
Browse files Browse the repository at this point in the history
  • Loading branch information
zsdonghao committed Jul 24, 2017
1 parent 9a95269 commit e46a3a8
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions example/tutorial_frozenlake_dqn.py
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
from tensorlayer.layers import *
import matplotlib.pyplot as plt

""" Q-Network Q(a, s) - TD Learning, Off-Policy, e-Greedy Exploration
""" Q-Network Q(a, s) - TD Learning, Off-Policy, e-Greedy Exploration (GLIE)
Q(S, A) <- Q(S, A) + alpha * (R + lambda * Q(newS, newA) - Q(S, A))
delta_w = R + lambda * Q(newS, newA)
Expand Down Expand Up @@ -90,7 +90,7 @@ def to_one_hot(i, n_classes=None):
s = s1
## Reduce chance of random action if an episode is done.
if d == True:
e = 1./((i/50) + 10) # reduce e
e = 1./((i/50) + 10) # reduce e, GLIE: Greey in the limit with infinite Exploration
break

## Note that, the rewards here with random action
Expand Down

0 comments on commit e46a3a8

Please sign in to comment.