AUTOGENERATED: DO NOT EDIT FILE DIRECTLY
This environment has multiple objects of assorted types and colors. The agent receives a textual string as part of its observation telling it which object to pick up. Picking up the wrong object terminates the episode with zero reward.
"{syntax} {color} {type}"
{syntax} is one of the following: "get a", "go get a", "fetch a", "go fetch a", "you must fetch a".
{color} is the color of the object. Can be "red", "green", "blue", "purple", "yellow" or "grey".
{type} is the type of the object. Can be "key" or "ball".
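
As a rough illustration of the mission template above, the following sketch enumerates every mission string it can produce and parses one back into its color and type. The helper names `all_missions` and `parse_mission` are hypothetical and are not part of the environment's API; only the vocabulary lists come from the description above.

```python
import itertools
import re

# Vocabulary copied from the mission description above.
SYNTAXES = ["get a", "go get a", "fetch a", "go fetch a", "you must fetch a"]
COLORS = ["red", "green", "blue", "purple", "yellow", "grey"]
TYPES = ["key", "ball"]


def all_missions():
    """Enumerate every string the template "{syntax} {color} {type}" can produce."""
    return [f"{s} {c} {t}" for s, c, t in itertools.product(SYNTAXES, COLORS, TYPES)]


# Hypothetical pattern: one named group per template field, separated by single spaces.
_MISSION_RE = re.compile(
    "(?P<syntax>" + "|".join(SYNTAXES) + ") "
    "(?P<color>" + "|".join(COLORS) + ") "
    "(?P<type>" + "|".join(TYPES) + ")"
)


def parse_mission(mission):
    """Return (color, type) for a mission string, or raise ValueError if it does not fit."""
    match = _MISSION_RE.fullmatch(mission)
    if match is None:
        raise ValueError(f"not a Fetch mission string: {mission!r}")
    return match["color"], match["type"]


print(len(all_missions()))
print(parse_mission("go fetch a purple key"))
```

An agent that only needs the target can discard the syntax prefix, since all five phrasings ask for the same thing.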
| Num | Name | Action |
|---|---|---|
| 0 | left | Turn left |
| 1 | right | Turn right |
| 2 | forward | Move forward |
| 3 | pickup | Pick up an object |
| 4 | drop | Unused |
| 5 | toggle | Unused |
| 6 | done | Unused |
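
The action table above can be mirrored as an integer enum, which may make agent code easier to read than bare numbers. This is only a sketch: the class name `FetchAction` and the tuple `USED_ACTIONS` are hypothetical names chosen for illustration, and the numbering is taken directly from the table.

```python
from enum import IntEnum


class FetchAction(IntEnum):
    """Discrete actions, numbered as in the table above (hypothetical class name)."""
    LEFT = 0     # Turn left
    RIGHT = 1    # Turn right
    FORWARD = 2  # Move forward
    PICKUP = 3   # Pick up an object
    DROP = 4     # Unused in this environment
    TOGGLE = 5   # Unused in this environment
    DONE = 6     # Unused in this environment


# Actions the table describes as having an effect in this environment.
USED_ACTIONS = (
    FetchAction.LEFT,
    FetchAction.RIGHT,
    FetchAction.FORWARD,
    FetchAction.PICKUP,
)

print([int(a) for a in USED_ACTIONS])
```

Restricting a policy's output to `USED_ACTIONS` is one possible design choice, since the remaining three actions are listed as unused.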
Each tile is encoded as a 3-dimensional tuple: (OBJECT_IDX, COLOR_IDX, STATE).
The OBJECT_TO_IDX and COLOR_TO_IDX mappings can be found in minigrid/minigrid.py.
STATE refers to the door state, with 0=open, 1=closed and 2=locked.

A reward of '1' is given for success, and '0' for failure.
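
To make the (OBJECT_IDX, COLOR_IDX, STATE) tile encoding more concrete, here is a minimal decoding sketch. The two dictionaries are illustrative copies written out for this example and are an assumption; the authoritative OBJECT_TO_IDX and COLOR_TO_IDX mappings are the ones in minigrid/minigrid.py and should be checked there. The function `decode_tile` is a hypothetical helper, not part of the library.

```python
# Illustrative copies (assumption): verify against minigrid/minigrid.py before relying on them.
OBJECT_TO_IDX = {
    "unseen": 0, "empty": 1, "wall": 2, "floor": 3, "door": 4, "key": 5,
    "ball": 6, "box": 7, "goal": 8, "lava": 9, "agent": 10,
}
COLOR_TO_IDX = {"red": 0, "green": 1, "blue": 2, "purple": 3, "yellow": 4, "grey": 5}

# Door states as documented above.
DOOR_STATES = {0: "open", 1: "closed", 2: "locked"}

# Inverted lookups used for decoding.
IDX_TO_OBJECT = {idx: name for name, idx in OBJECT_TO_IDX.items()}
IDX_TO_COLOR = {idx: name for name, idx in COLOR_TO_IDX.items()}


def decode_tile(tile):
    """Turn an (OBJECT_IDX, COLOR_IDX, STATE) triple into readable names.

    The third element is only interpreted for doors; for every other
    object type this sketch returns None in its place.
    """
    obj_idx, color_idx, state = (int(x) for x in tile)
    obj = IDX_TO_OBJECT[obj_idx]
    color = IDX_TO_COLOR[color_idx]
    door_state = DOOR_STATES[state] if obj == "door" else None
    return obj, color, door_state


print(decode_tile((5, 0, 0)))
print(decode_tile((4, 4, 2)))
```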
The episode ends if any one of the following conditions is met:
1. The agent picks up the correct object.
2. The agent picks up the wrong object.
3. Timeout (see max_steps).

N: number of objects to be generated.
MiniGrid-Fetch-5x5-N2-v0
MiniGrid-Fetch-6x6-N2-v0
MiniGrid-Fetch-8x8-N3-v0
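
The reward and termination rules described earlier can be summarized in a small sketch. This is not the environment's implementation: `fetch_outcome` and `Outcome` are hypothetical names, objects are represented as (color, type) pairs for illustration, and treating a timeout as a failure with reward 0 is an assumption based on the "'0' for failure" wording.

```python
from typing import NamedTuple, Optional, Tuple

# Hypothetical representation of an object: (color, type), e.g. ("red", "key").
Obj = Tuple[str, str]


class Outcome(NamedTuple):
    reward: float
    done: bool


def fetch_outcome(picked: Optional[Obj], target: Obj,
                  step_count: int, max_steps: int) -> Outcome:
    """Sketch of the documented rules for a single step.

    picked is the object picked up on this step, or None if nothing was picked up.
    """
    if picked is not None:
        # Picking up any object ends the episode; only the target earns reward 1.
        return Outcome(1.0 if picked == target else 0.0, True)
    if step_count >= max_steps:
        # Timeout: assumed here to count as a failure with reward 0.
        return Outcome(0.0, True)
    # Nothing picked up and steps remain: the episode continues.
    return Outcome(0.0, False)


print(fetch_outcome(("red", "key"), ("red", "key"), step_count=3, max_steps=100))
print(fetch_outcome(("blue", "ball"), ("red", "key"), step_count=3, max_steps=100))
```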