Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								0e496fe636
							
							Fixed visdom visualization code
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								e5f35ea056
							
							Randomized agent position in playground environment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								1e5d26e4c0
							
							Fixed issue with environment seeding
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								c4049b13ed
							
							Added Playground-v0 environment for experiments
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								5a6461ff2e
							
							Added GRU to policy, made model larger.
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								17eba78e33
							
							Fixed pyqt5 package version
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								2cdc42ac43
							
							Modified reward range for fetch environment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								2b1d180dda
							
							Corrected reward ranges for environments
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								24f7678f57
							
							Update README.md
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								27e33995ab
							
							Fixed wrappers.py following changes in OpenAI gym
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								25fe4664fa
							
							Modified environments so they all produce observations in a dict
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								7c41bdff21
							
							Added MiniGrid-MultiRoom-N2-S4-v0 environment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								4d84ecd45f
							
							Eliminated source of non-determinism
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								d7b381ce47
							
							Added "in" operator for Grid objects
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								dcc9732274
							
							Added env.agentSees(x, y) method
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								50bd721381
							
							Removed inaccurate comment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								bb6dc6196e
							
							Added test
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								114caa944a
							
							Fixes based on changes in OpenAI Gym 0.9.6
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								ea0e67e005
							
							Added environments to README
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								99f583af9e
							
							Completed LockedRoom environment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								d70e134948
							
							Eliminated WrapPyTorch
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								c46ade2f4f
							
							Fixed bug in fetch environment
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								cd33e57ae6
							
							Updated default arguments for RL code
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								aa65f2f84f
							
							Changed observation_space for putnear
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								16085191ab
							
							Refactored handling of recurrent policies for simplicity
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								6db3f6bb87
							
							Added code to automatially use flat obs wrapper when needed
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								2fdde6eb6b
							
							Removed pytorch_rl dependency on OpenAI baselines to make install easier
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								ca85d1086d
							
							Added recurrent MLP policy
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								c4257fd34c
							
							Minor change
						 | 
						7 年之前 | 
					
				
					
						
							
								   Maxime Chevalier-Boisvert
							
						 | 
						
							
							
								780c75e2cd
							
							RL code updates from upstream repos, removed Atari dependencies
						 | 
						7 年之前 |