Historia zmian

Autor SHA1 Wiadomość Data
  Maxime Chevalier-Boisvert c99822121e Added reward penalty based on number of time steps taken 7 lat temu
  Maxime Chevalier-Boisvert 041225e96b Added position randomization to RedBlueDoors env. Updated README. 7 lat temu
  Lucas Willems ec123f87cf Add a RedBlueDoors environment 7 lat temu