Maxime Chevalier-Boisvert
|
e270e76ee5
Added object position tracking
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
c99822121e
Added reward penalty based on number of time steps taken
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
146fd10741
Made reward_range the same for all environments, rewards are in [0, 1]
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
852476db7c
Finished renaming MiniGrid methods for PEP8 conformance
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
bfd0f76513
Faster visibility algorithm. Method renamings.
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
15e83a570a
Made gen_obs a public method, renamed public methods.
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
ec9e19efe7
Renamed fields to match PEP8 convention
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
340c03a446
Cleaned up and simplified _genGrid functions
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
2b1d180dda
Corrected reward ranges for environments
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
25fe4664fa
Modified environments so they all produce observations in a dict
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
114caa944a
Fixes based on changes in OpenAI Gym 0.9.6
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
aa65f2f84f
Changed observation_space for putnear
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
4267b1d39e
Completed PutNear environment
|
6 gadi atpakaļ |
Maxime Chevalier-Boisvert
|
87a0befdbf
Added ability to drop/put down objects. Started work on PutNear env.
|
6 gadi atpakaļ |