Maxime Chevalier-Boisvert 3 years ago
parent
commit
399c04d73b
1 file changed, 2 additions and 2 deletions
+ 2 - 2
README.md

@@ -264,8 +264,8 @@ Registered configurations:
 
 This environment has multiple objects of assorted types and colors. The
 agent receives a textual string as part of its observation telling it
-which object to pick up. Picking up the wrong object produces a negative
-reward.
+which object to pick up. Picking up the wrong object terminates the
+episode with zero reward.
 
 ### Go-to-door environment
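
The wording change to the pick-up environment in the hunk above can be illustrated with a minimal sketch. This is not the repository's code: `pickup_outcome` and `success_reward` are hypothetical names, and the assumption that a correct pick-up also ends the episode with a positive reward is not stated in the diff.

```python
from typing import Tuple


def pickup_outcome(
    picked: str,
    target: str,
    success_reward: float = 1.0,  # hypothetical value, not taken from the README
) -> Tuple[float, bool]:
    """Return (reward, done) for a pick-up action.

    Models the revised README wording: picking up the wrong object
    terminates the episode with zero reward instead of a negative one.
    """
    if picked == target:
        # Assumption: the correct object ends the episode with a positive reward.
        return success_reward, True
    # Wrong object: the episode ends and the reward is zero.
    return 0.0, True
```

Under the old wording, the final branch would have returned a negative reward rather than `0.0`.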