Commit History

Author SHA1 Message Date
  Martin Thoma ddd08a2a45 Improve pseudocode 9 years ago
  Martin Thoma 45e56d0320 Improve pseudocode 9 years ago
  Martin Thoma ea63ce4d57 Not learning rate but discount factor 10 years ago
  Martin Thoma 001350bae4 Add q-learning and improve value iteration pseudocode 10 years ago
  Martin Thoma 807b9268d0 Improve quality of description 10 years ago
  Martin Thoma 940436c883 Update pseudocode to include cost function as parameter 10 years ago
  Martin Thoma d65f5d2933 Add pseudocode for policy- and value-iteration 10 years ago