Maxime Chevalier-Boisvert
|
c07125ebeb
Cleaned up and simplified _genGrid functions
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
c4049b13ed
Added Playground-v0 environment for experiments
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
2b1d180dda
Corrected reward ranges for environments
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
25fe4664fa
Modified environments so they all produce observations in a dict
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
114caa944a
Fixes based on changes in OpenAI Gym 0.9.6
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
c46ade2f4f
Fixed bug in fetch environment
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
723359da33
Removed waitEnds flag
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
73e6d3d2f1
Made adjustments to GoToObject based on GoToDoor env
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
3d0c94f876
Removed "advice" from observations. Randomized GoToDoor room size.
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
360927639b
Added wrapper for one-hot string encoding. Fixed bugs in goto env.
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
116bfb2d4b
Completed implementation of goto env
|
vor 6 Jahren |
Maxime Chevalier-Boisvert
|
67acc5ff18
Started work on goto env
|
vor 6 Jahren |