Maxime Chevalier-Boisvert
|
c99822121e
Added reward penalty based on number of time steps taken
|
6 years ago |
Maxime Chevalier-Boisvert
|
636226f90e
Renamed wait action to done
|
6 years ago |
Maxime Chevalier-Boisvert
|
146fd10741
Made reward_range the same for all environments, rewards are in [0, 1]
|
6 years ago |
Maxime Chevalier-Boisvert
|
852476db7c
Finished renaming MiniGrid methods for PEP8 conformance
|
6 years ago |
Maxime Chevalier-Boisvert
|
bfd0f76513
Faster visibility algorithm. Method renamings.
|
6 years ago |
Maxime Chevalier-Boisvert
|
ec9e19efe7
Renamed fields to match PEP8 convention
|
6 years ago |
Maxime Chevalier-Boisvert
|
3c52268b00
Progress on RoomGrid-v0 env
|
6 years ago |
Maxime Chevalier-Boisvert
|
340c03a446
Cleaned up and simplified _genGrid functions
|
6 years ago |
Maxime Chevalier-Boisvert
|
c4049b13ed
Added Playground-v0 environment for experiments
|
6 years ago |
Maxime Chevalier-Boisvert
|
2b1d180dda
Corrected reward ranges for environments
|
6 years ago |
Maxime Chevalier-Boisvert
|
25fe4664fa
Modified environments so they all produce observations in a dict
|
6 years ago |
Maxime Chevalier-Boisvert
|
114caa944a
Fixes based on changes in OpenAI Gym 0.9.6
|
6 years ago |
Maxime Chevalier-Boisvert
|
6db3f6bb87
Added code to automatically use flat obs wrapper when needed
|
6 years ago |
Maxime Chevalier-Boisvert
|
2fdde6eb6b
Removed pytorch_rl dependency on OpenAI baselines to make install easier
|
6 years ago |
Maxime Chevalier-Boisvert
|
723359da33
Removed waitEnds flag
|
6 years ago |
Maxime Chevalier-Boisvert
|
3d0c94f876
Removed "advice" from observations. Randomized GoToDoor room size.
|
6 years ago |
Maxime Chevalier-Boisvert
|
f2dd5adfc3
Added color names list to minigrid.py
|
6 years ago |
Maxime Chevalier-Boisvert
|
1a80488ad0
Fixed bug, improved reward function in GoToDoor env
|
6 years ago |
Maxime Chevalier-Boisvert
|
c75d108362
Added more configs for GoToDoor environment
|
6 years ago |
Maxime Chevalier-Boisvert
|
c4f68f309b
Implemented GoToDoor environment
|
6 years ago |