Maxime Chevalier-Boisvert 3 years ago
parent
commit
399c04d73b
1 changed file with 2 additions and 2 deletions
README.md (+2 -2)

@@ -264,8 +264,8 @@ Registered configurations:
 
 This environment has multiple objects of assorted types and colors. The
 agent receives a textual string as part of its observation telling it
-which object to pick up. Picking up the wrong object produces a negative
-reward.
+which object to pick up. Picking up the wrong object terminates the
+episode with zero reward.
 
 ### Go-to-door environment