浏览代码

Added four room QA env to README

Maxime Chevalier-Boisvert 7 年之前
父节点
当前提交
45d30eb98a
共有 2 个文件被更改,包括 17 次插入0 次删除
  1. 17 0
      README.md
  2. 二进制
      figures/four-room-qa.png

+ 17 - 0
README.md

@@ -126,3 +126,20 @@ This environment has multiple objects of assorted types and colors. The
 agent receives a textual string as part of its observation telling it
 which object to pick up. Picking up the wrong object produces a negative
 reward.
+
+### Four Room Question Answering
+
+Registered configurations:
+- `MiniGrid-FourRoomQA-v0`
+
+<p align="center">
+<img src="/figures/fourroomqa-env.png">
+</p>
+
+This environment is inspired by the
+[Embodied Question Answering](https://arxiv.org/abs/1711.11543) paper.
+There are four colored rooms, and the agent is placed at a random position.
+Multiple objects of different types and colors are also placed at random
+positions in random rooms. A question and answer pair is generated, the
+question is given to the agent as an observation, and the agent has a limited
+number of time steps to explore the environment and produce a response.

二进制
figures/four-room-qa.png