|
@@ -295,13 +295,10 @@ Registered configurations:
|
|
|
- `MiniGrid-RedBlueDoors-6x6-v0`
|
|
|
- `MiniGrid-RedBlueDoors-8x8-v0`
|
|
|
|
|
|
-The purpose of this environment is to test memory.
|
|
|
The agent is randomly placed within a room with one red and one blue door
|
|
|
facing opposite directions. The agent has to open the red door and then open
|
|
|
-the blue door, in that order. The agent, when facing one door, cannot see
|
|
|
-the door behind him. Hence, the agent needs to remember whether or not he has
|
|
|
-previously opened the other door in order to reliably succeed at completing
|
|
|
-the task.
|
|
|
+the blue door, in that order. Note that, surprisingly, this environment is
|
|
|
+solvable without memory.
|
|
|
|
|
|
### Memory environment
|
|
|
|
|
@@ -310,8 +307,6 @@ Registered configurations:
|
|
|
- `MiniGrid-MemoryS13Random-v0`
|
|
|
- `MiniGrid-MemoryS13-v0`
|
|
|
- `MiniGrid-MemoryS11-v0`
|
|
|
-- `MiniGrid-MemoryS9-v0`
|
|
|
-- `MiniGrid-MemoryS7-v0`
|
|
|
|
|
|
This environment is a memory test. The agent starts in a small room
|
|
|
where it sees an object. It then has to go through a narrow hallway
|