AUTOGENERATED: DO NOT EDIT FILE DIRECTLY
This environment is an empty room with moving obstacles. The goal of the agent is to reach the green goal square without colliding with any obstacle. A large penalty is subtracted if the agent collides with an obstacle and the episode finishes. This environment is useful to test Dynamic Obstacle Avoidance for mobile robots with Reinforcement Learning in Partial Observability.
"get to the green goal square"
Num | Name | Action |
---|---|---|
0 | left | Turn left |
1 | right | Turn right |
2 | forward | Move forward |
3 | pickup | Unused |
4 | drop | Unused |
5 | toggle | Unused |
6 | done | Unused |
(OBJECT_IDX, COLOR_IDX, STATE)
OBJECT_TO_IDX
and COLOR_TO_IDX
mapping can be found in
minigrid/minigrid.pySTATE
refers to the door state with 0=open, 1=closed and 2=lockedA reward of '1' is given for success, and '0' for failure. A '-1' penalty is subtracted if the agent collides with an obstacle.
The episode ends if any one of the following conditions is met:
max_steps
).MiniGrid-Dynamic-Obstacles-5x5-v0
MiniGrid-Dynamic-Obstacles-Random-5x5-v0
MiniGrid-Dynamic-Obstacles-6x6-v0
MiniGrid-Dynamic-Obstacles-Random-6x6-v0
MiniGrid-Dynamic-Obstacles-8x8-v0
MiniGrid-Dynamic-Obstacles-16x16-v0