Maxime Chevalier-Boisvert
|
340c03a446
Cleaned up and simplified _genGrid functions
|
6 anni fa |
Maxime Chevalier-Boisvert
|
2b1d180dda
Corrected reward ranges for environments
|
6 anni fa |
Maxime Chevalier-Boisvert
|
25fe4664fa
Modified environments so they all produce observations in a dict
|
6 anni fa |
Maxime Chevalier-Boisvert
|
4d84ecd45f
Eliminated source of non-determinism
|
6 anni fa |
Maxime Chevalier-Boisvert
|
114caa944a
Fixes based on changes in OpenAI Gym 0.9.6
|
6 anni fa |
Maxime Chevalier-Boisvert
|
99f583af9e
Completed LockedRoom environment
|
6 anni fa |