MountainCarContinuous v0
Name: MountainCarContinuous-v0
Category: Classic Control
Environment Page
Algorithms Page
An underpowered car must climb a hill.
This environment corresponds to the continuous version of the mountain car environment described in Andrew Moore's PhD thesis.
Observation type: Box(2)

| Num | Observation  | Min  | Max |
|-----|--------------|------|-----|
| 0   | Car Position | -1.0 | 1.0 |
| 1   | Car Velocity | -1.0 | 1.0 |
Action type: Box(1)

| Num | Action |
|-----|--------|
| 0   | Push car to the left (negative value) or to the right (positive value) |
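To make the two spaces concrete, here is a minimal pure-Python sketch of a box-shaped space. The `Box` class below is a hypothetical stand-in written for illustration, not the Gym class itself. The observation bounds are the ones listed above; the action bounds of [-1, 1] are an assumption, since this page does not state them.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Box:
    """Hypothetical stand-in for a box space: one closed interval per dimension."""
    low: Sequence[float]
    high: Sequence[float]

    def contains(self, x: Sequence[float]) -> bool:
        # A point belongs to the box if it has the right length and every
        # coordinate lies within its [low, high] interval.
        return len(x) == len(self.low) and all(
            lo <= v <= hi for lo, v, hi in zip(self.low, x, self.high)
        )

    def clip(self, x: Sequence[float]) -> List[float]:
        # Project each coordinate onto its interval.
        return [min(max(v, lo), hi) for lo, v, hi in zip(self.low, x, self.high)]


# Observation bounds as listed on this page: [car position, car velocity].
observation_space = Box(low=[-1.0, -1.0], high=[1.0, 1.0])
# Assumption: action bounds are not given on this page; [-1, 1] is illustrative.
action_space = Box(low=[-1.0], high=[1.0])

print(observation_space.contains([-0.5, 0.0]))  # True
print(action_space.clip([2.5]))                 # [1.0]
```

Clipping an out-of-range action before applying it is one common way to keep a continuous controller inside the declared action space.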
Reward: 100 for reaching the target at the top of the hill on the right-hand side, minus the sum of squared actions from start to goal.
Note that this reward differs from most published work, where the goal is to reach the target as fast as possible, which favours a bang-bang strategy.
This reward function raises an exploration challenge: an agent that has not yet reached the target learns that not moving is better than moving, and it may then never reach the target.
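The exploration challenge can be illustrated with a small sketch of the episode return. The function `episode_return` and the `action_cost` scaling factor are hypothetical names introduced here; the default of 0.1 is an assumption and is not stated on this page. The episode lengths are likewise arbitrary illustrative values.

```python
from typing import Iterable

GOAL_BONUS = 100.0  # bonus for reaching the target, as stated on this page


def episode_return(
    actions: Iterable[float],
    reached_goal: bool,
    action_cost: float = 0.1,  # assumption: scaling of the squared-action penalty
) -> float:
    """Goal bonus (if earned) minus the scaled sum of squared actions."""
    penalty = action_cost * sum(a * a for a in actions)
    return (GOAL_BONUS if reached_goal else 0.0) - penalty


# An agent that never moves pays no penalty and earns no bonus.
idle = episode_return([0.0] * 200, reached_goal=False)
# An agent that pushes hard but fails pays the penalty and earns no bonus.
failed = episode_return([1.0, -1.0] * 100, reached_goal=False)
# An agent that pushes hard and succeeds earns the bonus minus its effort.
solved = episode_return([1.0, -1.0] * 50, reached_goal=True)

print(idle > failed)    # True: without the bonus, standing still looks best
print(solved > idle)    # True: reaching the target is still worth the effort
```

Until the bonus has been observed at least once, every push only lowers the return, which is why a learner can settle on doing nothing.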
Starting state: position between -0.6 and -0.4, zero velocity.
Episode termination: position equals 0.5.
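The starting state and termination condition can be sketched as follows. The function names are hypothetical. Drawing the start position uniformly is an assumption (the page only says "between"), and the check uses `>=` rather than strict equality on the assumption that a discrete-time simulation can step past 0.5 without landing on it exactly.

```python
import random
from typing import Optional, Tuple

GOAL_POSITION = 0.5  # target position, as stated on this page


def sample_start_state(rng: Optional[random.Random] = None) -> Tuple[float, float]:
    """Return (position, velocity) for a new episode."""
    rng = rng or random.Random()
    # Assumption: the start position is drawn uniformly from [-0.6, -0.4].
    position = rng.uniform(-0.6, -0.4)
    velocity = 0.0  # the car always starts at rest
    return position, velocity


def is_terminal(position: float) -> bool:
    """True once the car has reached (or passed) the target position."""
    return position >= GOAL_POSITION


position, velocity = sample_start_state(random.Random(0))
print(-0.6 <= position <= -0.4, velocity)  # True 0.0
print(is_terminal(position))               # False
```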
Solved requirement: get a reward over 90 (I'm not sure this is doable); this threshold should be tuned.