Lucas Willems | 867477f48c | Increase max_steps in DoorKey to make it learnable (#15) | 6 years ago
Maxime Chevalier-Boisvert | 7acd1ea326 | Added timeout to place_obj and place_agent | 6 years ago
Maxime Chevalier-Boisvert | c53358bf8f | Fixed bug in environment string representation | 6 years ago
Maxime Chevalier-Boisvert | b6ffc9ba38 | Added colored floor tile the agent can walk over | 6 years ago
Maxime Chevalier-Boisvert | e270e76ee5 | Added object position tracking | 6 years ago
Lucas Willems | ea6416989f | Add `_rand_float` method (#10) | 6 years ago
Lucas Willems | c125ca7998 | Minimum reward when success is now 0.1 (#9) | 6 years ago
Maxime Chevalier-Boisvert | 290ab259e4 | Modified RedBlueDoor env to enforce door opening sequence | 6 years ago
Maxime Chevalier-Boisvert | c99822121e | Added reward penalty based on number of time steps taken | 6 years ago
Maxime Chevalier-Boisvert | 636226f90e | Renamed wait action to done | 6 years ago
Maxime Chevalier-Boisvert | 0ad70c61dd | Added _rand_subset method to MiniGridEnv | 6 years ago
Maxime Chevalier-Boisvert | 40f29632c2 | Added ability to render observations at any resolution | 6 years ago
Maxime Chevalier-Boisvert | 041225e96b | Added position randomization to RedBlueDoors env. Updated README. | 6 years ago
Maxime Chevalier-Boisvert | 216fe87800 | Merge pull request #8 from lcswillems/master | 6 years ago
Lucas Willems | 82d080fa8b | Merge remote-tracking branch 'upstream/master' | 6 years ago
Lucas Willems | ec123f87cf | Add a RedBlueDoors environment | 6 years ago
Maxime Chevalier-Boisvert | 2b243b906f | Update README.md | 6 years ago
Maxime Chevalier-Boisvert | 342b3c96f1 | Fixed bug in place_agent | 6 years ago
Maxime Chevalier-Boisvert | 5d88cb4376 | Doors can now be closed back after they are opened | 6 years ago
Maxime Chevalier-Boisvert | 4f4992265b | Fixed issue wrt agent view size pointed out by Anirudh | 6 years ago
Maxime Chevalier-Boisvert | 2190ce0b59 | Full map correctly highlights cells visible to the agent | 6 years ago
Maxime Chevalier-Boisvert | 076e074b0a | Changed timestep limits based on feedback from Rosemary Ke | 6 years ago
Maxime Chevalier-Boisvert | 6b513362e6 | Added gen_obs_grid method which outputs visibility mask | 6 years ago
Maxime Chevalier-Boisvert | a6ece826ce | Removed old visibility code | 6 years ago
Maxime Chevalier-Boisvert | 76b43b7534 | Added DIR_TO_VEC array. Agent position is now a numpy array. | 6 years ago
Maxime Chevalier-Boisvert | d5d117bb78 | Added goal_pos variable to MultiRoomEnv | 6 years ago
Maxime Chevalier-Boisvert | 146fd10741 | Made reward_range the same for all environments, rewards are in [0, 1] | 6 years ago
Maxime Chevalier-Boisvert | fe469bb1cc | Added MiniGrid._rand_color method | 6 years ago
Maxime Chevalier-Boisvert | 30c97ffc57 | Added _rand_bool() method to MiniGrid | 6 years ago
Maxime Chevalier-Boisvert | d562fb8f82 | Fixup. Default random seed should be fixed, deterministic. | 6 years ago