Commit History

Autor SHA1 Mensaxe Data
  Maxime Chevalier-Boisvert f2dd5adfc3 Added color names list to minigrid.py %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 1a80488ad0 Fixed bug, improved reward function in GoToDoor env %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c75d108362 Added more configs for GoToDoor environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c4f68f309b Implemented GoToDoor environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 28df92e70d Fixed issues with run_tests.py, grid encode/decode %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert e7e870ce2d Fixed issues with wrappers %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 360927639b Added wrapper for one-hot string encoding. Fixed bugs in goto env. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 116bfb2d4b Completed implementation of goto env %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 67acc5ff18 Started work on goto env %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert a05dd9456f Split out simple_envs.py %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert d4d83a9bf6 Added warning if cuda is disabled, to avoid silent failure %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 32535cc145 Changed reward range for fetch environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert e02777d81b Added function to encode observations to FetchEnv %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 16698be044 Added 5x5 config for the fetch environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 80b3178610 Moved rl code to pytorch-rl. Fixed warnings. Fixed issue w/ flat obs. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c080bf08d8 Added _randPos utility function %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert f47e47272e Added animation for door-key curriculum %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert dbecad9ad0 Added randomization to DoorKey envs %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 95448c1ebd Added check in run_tests.py %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 25cc3b5253 Made it so agent can see what it is carrying in observations %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 5daf219a68 Added tests. Moved envs into own source files. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 4c0fb7cf53 Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 7edf1575f1 Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 0d9c084e68 Merge pull request #2 from zach-nervana/master %!s(int64=7) %!d(string=hai) anos
  Zach Dwiel 4cfe559b0e include environments in install %!s(int64=7) %!d(string=hai) anos
  Zach Dwiel 31132832e3 update setup.py to actually install the package %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 6092631168 Updated README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 23d1b6b98d Added smaller DoorKey environments, exploration bonus wrapper %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 18480bfef3 Changed empty env size %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert f2824c7687 Implemented count-based exploration system (intrinsic motivation) %!s(int64=7) %!d(string=hai) anos