瀏覽代碼

Corrected reward ranges for environments

Maxime Chevalier-Boisvert 7 年之前
父節點
當前提交
2b1d180dda
共有 4 個文件被更改,包括 3 次插入5 次删除
  1. 1 1
      gym_minigrid/envs/gotodoor.py
  2. 1 1
      gym_minigrid/envs/gotoobject.py
  3. 0 2
      gym_minigrid/envs/lockedroom.py
  4. 1 1
      gym_minigrid/envs/putnear.py

+ 1 - 1
gym_minigrid/envs/gotodoor.py

@@ -14,7 +14,7 @@ class GoToDoorEnv(MiniGridEnv):
     ):
         assert size >= 5
         super().__init__(gridSize=size, maxSteps=10*size)
-        self.reward_range = (-1, 1)
+        self.reward_range = (0, 1)
 
     def _genGrid(self, width, height):
         # Create the grid

+ 1 - 1
gym_minigrid/envs/gotoobject.py

@@ -14,7 +14,7 @@ class GoToObjectEnv(MiniGridEnv):
     ):
         self.numObjs = numObjs
         super().__init__(gridSize=size, maxSteps=5*size)
-        self.reward_range = (-1, 1)
+        self.reward_range = (0, 1)
 
     def _genGrid(self, width, height):
         assert width == height

+ 0 - 2
gym_minigrid/envs/lockedroom.py

@@ -38,8 +38,6 @@ class LockedRoom(MiniGridEnv):
             'image': self.observation_space
         })
 
-        self.reward_range = (-1, 1)
-
     def _genGrid(self, width, height):
         # Create the grid
         grid = Grid(width, height)

+ 1 - 1
gym_minigrid/envs/putnear.py

@@ -14,7 +14,7 @@ class PutNearEnv(MiniGridEnv):
     ):
         self.numObjs = numObjs
         super().__init__(gridSize=size, maxSteps=5*size)
-        self.reward_range = (-1, 1)
+        self.reward_range = (0, 1)
 
     def _genGrid(self, width, height):
         # Create a grid surrounded by walls