Commit History

Autor SHA1 Mensaxe Data
  Maxime Chevalier-Boisvert c99822121e Added reward penalty based on number of time steps taken %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 636226f90e Renamed wait action to done %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 146fd10741 Made reward_range the same for all environments, rewards are in [0, 1] %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 852476db7c Finished renaming MiniGrid methods for PEP8 conformance %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert bfd0f76513 Faster visibility algorithm. Method renamings. %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert ec9e19efe7 Renamed fields to match PEP8 convention %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 3c52268b00 Progress on RoomGrid-v0 env %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 340c03a446 Cleaned up and simplified _genGrid functions %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c4049b13ed Added Playground-v0 environment for experiments %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2b1d180dda Corrected reward ranges for environments %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 25fe4664fa Modified environments so they all produce observations in a dict %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 114caa944a Fixes based on changes in OpenAI Gym 0.9.6 %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 6db3f6bb87 Added code to automatially use flat obs wrapper when needed %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2fdde6eb6b Removed pytorch_rl dependency on OpenAI baselines to make install easier %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 723359da33 Removed waitEnds flag %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 3d0c94f876 Removed "advice" from observations. Randomized GoToDoor room size. %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert f2dd5adfc3 Added color names list to minigrid.py %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 1a80488ad0 Fixed bug, improved reward function in GoToDoor env %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c75d108362 Added more configs for GoToDoor environment %!s(int64=6) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c4f68f309b Implemented GoToDoor environment %!s(int64=6) %!d(string=hai) anos