Maxime Chevalier-Boisvert
|
5a6461ff2e
Added GRU to policy, made model larger.
|
6 years ago |
Maxime Chevalier-Boisvert
|
27e33995ab
Fixed wrappers.py following changes in OpenAI gym
|
6 years ago |
Maxime Chevalier-Boisvert
|
d70e134948
Eliminated WrapPyTorch
|
6 years ago |
Maxime Chevalier-Boisvert
|
16085191ab
Refactored handling of recurrent policies for simplicity
|
6 years ago |
Maxime Chevalier-Boisvert
|
2fdde6eb6b
Removed pytorch_rl dependency on OpenAI baselines to make install easier
|
6 years ago |
Maxime Chevalier-Boisvert
|
ca85d1086d
Added recurrent MLP policy
|
6 years ago |
Maxime Chevalier-Boisvert
|
21c0eaa8c7
Renamed pytorch-rl to pytorch_rl for Python importability
|
6 years ago |