Maxime Chevalier-Boisvert
|
146fd10741
Made reward_range the same for all environments, rewards are in [0, 1]
|
6 years ago |
Maxime Chevalier-Boisvert
|
852476db7c
Finished renaming MiniGrid methods for PEP8 conformance
|
6 years ago |
Maxime Chevalier-Boisvert
|
bfd0f76513
Faster visibility algorithm. Method renamings.
|
6 years ago |
Maxime Chevalier-Boisvert
|
ec9e19efe7
Renamed fields to match PEP8 convention
|
6 years ago |
Maxime Chevalier-Boisvert
|
340c03a446
Cleaned up and simplified _genGrid functions
|
6 years ago |
Maxime Chevalier-Boisvert
|
e5f35ea056
Randomized agent position in playground environment
|
6 years ago |
Maxime Chevalier-Boisvert
|
c4049b13ed
Added Playground-v0 environment for experiments
|
6 years ago |