Commit History

Author                     SHA1        Message  Date
StringTheory               d53edeb5a2  Initial commit  2 years ago
Maxime Chevalier-Boisvert  c99822121e  Added reward penalty based on number of time steps taken  7 years ago
Maxime Chevalier-Boisvert  146fd10741  Made reward_range the same for all environments, rewards are in [0, 1]  7 years ago
Maxime Chevalier-Boisvert  852476db7c  Finished renaming MiniGrid methods for PEP8 conformance  7 years ago
Maxime Chevalier-Boisvert  bfd0f76513  Faster visibility algorithm. Method renamings.  7 years ago
Maxime Chevalier-Boisvert  ec9e19efe7  Renamed fields to match PEP8 convention  7 years ago
Maxime Chevalier-Boisvert  340c03a446  Cleaned up and simplified _genGrid functions  7 years ago
Maxime Chevalier-Boisvert  2cdc42ac43  Modified reward range for fetch environment  7 years ago
Maxime Chevalier-Boisvert  25fe4664fa  Modified environments so they all produce observations in a dict  7 years ago
Maxime Chevalier-Boisvert  114caa944a  Fixes based on changes in OpenAI Gym 0.9.6  7 years ago
Maxime Chevalier-Boisvert  c46ade2f4f  Fixed bug in fetch environment  7 years ago
Maxime Chevalier-Boisvert  3d0c94f876  Removed "advice" from observations. Randomized GoToDoor room size.  7 years ago
Maxime Chevalier-Boisvert  32535cc145  Changed reward range for fetch environment  7 years ago
Maxime Chevalier-Boisvert  e02777d81b  Added function to encode observations to FetchEnv  7 years ago
Maxime Chevalier-Boisvert  16698be044  Added 5x5 config for the fetch environment  7 years ago
Maxime Chevalier-Boisvert  5daf219a68  Added tests. Moved envs into own source files.  7 years ago