浏览代码

Updated README

Maxime Chevalier-Boisvert 7 年之前
父节点
当前提交
6092631168
共有 2 个文件被更改,包括 22 次插入0 次删除
  1. 19 0
      README.md
  2. 3 0
      basicrl/envs.py

+ 19 - 0
README.md

@@ -71,6 +71,25 @@ You can view the result of training using the `enjoy.py` script:
 python3 basicrl/enjoy.py --env-name MiniGrid-Empty-6x6-v0 --load-dir ./trained_models/acktr
 ```
 
+## Features
+
+The environment is partially observable and uses a compact and efficient
+encoding, with just 3 inputs per visible grid cell. It is also easy to
+produce pixels for observations if desired.
+
+Each cell/tile in the grid world contains one object, each object has an
+associated discrete color. The objects currently supported are walls, doors,
+locked doors, keys, balls,boxes and a goal square. The basic version of the
+environment has just 4 possible actions: turn left, turn right, move
+forward and pickup/toggle to interact with objects. The agent can carry
+one carryable item at a time (eg: ball or key). By default, only sparse
+rewards for reaching the goal square are provided.
+
+Design choices were made to try to keep everything as simple as possible.
+Extending the environment with new object types and dynamics should be
+very easy. If you wish to do this, you should take a look at
+the [gym_minigrid/minigrid.py](gym_minigrid/minigrid.py) source file.
+
 ## Included Environments
 
 The environments listed below are implemented in the [gym_minigrid/envs](/gym_minigrid/envs) directory.

+ 3 - 0
basicrl/envs.py

@@ -14,6 +14,7 @@ except ImportError:
 
 try:
     import gym_minigrid
+    from gym_minigrid.wrappers import *
 except:
     pass
 
@@ -34,6 +35,8 @@ def make_env(env_id, seed, rank, log_dir):
         if len(obs_shape) == 3 and obs_shape[2] == 3:
             env = WrapPyTorch(env)
 
+        env = StateBonus(env)
+
         return env
 
     return _thunk