AUTOGENERATED: DO NOT EDIT FILE DIRECTLY
Classic four room reinforcement learning environment. The agent must navigate in a maze composed of four rooms interconnected by 4 gaps in the walls. To obtain a reward, the agent must reach the green goal square. Both the agent and the goal square are randomly placed in any of the four rooms.
"reach the goal"
Num | Name | Action |
---|---|---|
0 | left | Turn left |
1 | right | Turn right |
2 | forward | Move forward |
3 | pickup | Unused |
4 | drop | Unused |
5 | toggle | Unused |
6 | done | Unused |
(OBJECT_IDX, COLOR_IDX, STATE)
OBJECT_TO_IDX
and COLOR_TO_IDX
mapping can be found in
minigrid/minigrid.pySTATE
refers to the door state with 0=open, 1=closed and 2=lockedA reward of '1' is given for success, and '0' for failure.
The episode ends if any one of the following conditions is met:
max_steps
).MiniGrid-FourRooms-v0