Maxime Chevalier-Boisvert
|
7acd1ea326
Added timeout to place_obj and place_agent
|
%!s(int64=6) %!d(string=hai) anos |
Maxime Chevalier-Boisvert
|
290ab259e4
Modified RedBlueDoor env to enforce door opening sequence
|
%!s(int64=6) %!d(string=hai) anos |
Maxime Chevalier-Boisvert
|
c99822121e
Added reward penalty based on number of time steps taken
|
%!s(int64=6) %!d(string=hai) anos |
Maxime Chevalier-Boisvert
|
041225e96b
Added position randomization to RedBlueDoors env. Updated README.
|
%!s(int64=6) %!d(string=hai) anos |
Lucas Willems
|
ec123f87cf
Add a RedBlueDoors environment
|
%!s(int64=6) %!d(string=hai) anos |