Commit History

Author SHA1 Message Date
  Martin Thoma 001350bae4 Add q-learning and improve value iteration pseudocode 9 years ago