Maxime Chevalier-Boisvert e5f35ea056 Randomized agent position in playground environment 7 years ago
__init__.py c4049b13ed Added Playground-v0 environment for experiments 7 years ago
doorkey.py 25fe4664fa Modified environments so they all produce observations in a dict 7 years ago
empty.py 25fe4664fa Modified environments so they all produce observations in a dict 7 years ago
fetch.py 2cdc42ac43 Modified reward range for fetch environment 7 years ago
fourroomqa.py 25fe4664fa Modified environments so they all produce observations in a dict 7 years ago
gotodoor.py c4049b13ed Added Playground-v0 environment for experiments 7 years ago
gotoobject.py c4049b13ed Added Playground-v0 environment for experiments 7 years ago
lockedroom.py 2b1d180dda Corrected reward ranges for environments 7 years ago
multiroom.py 25fe4664fa Modified environments so they all produce observations in a dict 7 years ago
playground_v0.py e5f35ea056 Randomized agent position in playground environment 7 years ago
putnear.py 2b1d180dda Corrected reward ranges for environments 7 years ago