Commit History

Autor SHA1 Mensaxe Data
  Maxime Chevalier-Boisvert b6ffc9ba38 Added colored floor tile the agent can walk over %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 636226f90e Renamed wait action to done %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 041225e96b Added position randomization to RedBlueDoors env. Updated README. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2b243b906f Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 146fd10741 Made reward_range the same for all environments, rewards are in [0, 1] %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert bfd0f76513 Faster visibility algorithm. Method renamings. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 06676a4c74 Fixed issue with agent_sees introduced by visibility changes %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 0a9d6d8631 Enabling Travis automated tests %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 63289b521c Split toggle into pickup/drop/toggle actions %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2abfef3b02 Removed FourRooomQA environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 3096a3b7f4 Updated README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2cdc42ac43 Modified reward range for fetch environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 24f7678f57 Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 7c41bdff21 Added MiniGrid-MultiRoom-N2-S4-v0 environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert ea0e67e005 Added environments to README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 2fdde6eb6b Removed pytorch_rl dependency on OpenAI baselines to make install easier %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 21c0eaa8c7 Renamed pytorch-rl to pytorch_rl for Python importability %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert a99538576b Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c8b35cb515 Updated README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert ad4358543a Added go to door environment to README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 16698be044 Added 5x5 config for the fetch environment %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 80b3178610 Moved rl code to pytorch-rl. Fixed warnings. Fixed issue w/ flat obs. %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 4c0fb7cf53 Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 7edf1575f1 Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 6092631168 Updated README %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 23d1b6b98d Added smaller DoorKey environments, exploration bonus wrapper %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 18480bfef3 Changed empty env size %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert f2824c7687 Implemented count-based exploration system (intrinsic motivation) %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert c23608ec9a Update README.md %!s(int64=7) %!d(string=hai) anos
  Maxime Chevalier-Boisvert 491c79586b Update README.md %!s(int64=7) %!d(string=hai) anos