Commit History

Author SHA1 Message Date
  Martin Thoma c0bbfa6811 Add q-lambda 9 years ago
  Martin Thoma 93fc9e52ed Add sarsa lambda pseudocode 9 years ago
  Martin Thoma ea63ce4d57 Not learning rate but discount factor 9 years ago
  Martin Thoma 2ec24b14b8 Add XOR problem graphic 9 years ago
  Martin Thoma 0085bb50d5 Add return value 9 years ago
  Martin Thoma 1c54ccd821 Fix error in label correction; extend for banch-and-bound 9 years ago
  Martin Thoma 001350bae4 Add q-learning and improve value iteration pseudocode 9 years ago
  Martin Thoma 807b9268d0 Improve quality of description 9 years ago
  Martin Thoma b9e2162ab8 Add dynamic programming and label correction algorithm 9 years ago
  Martin Thoma 940436c883 Update pseudocode to include cost function as parameter 9 years ago
  Martin Thoma 23462814aa Slant text to arrow 9 years ago
  Martin Thoma f4674abc32 Make Kalman filter formulas more memorizable 9 years ago
  Martin Thoma d65f5d2933 Add pseudocode for policy- and value-iteration 9 years ago
  Martin Thoma fc8c41330a Add agent environment diagram for RL 9 years ago
  Martin Thoma 2a2e2d1a88 Rename MDP, POMDP 9 years ago
  Martin Thoma 73be14bd67 Mention requirements 9 years ago
  Martin Thoma 37aa7c4ecb Update README 9 years ago
  Martin Thoma 5400ce8e04 Add elevation chart 9 years ago
  Martin Thoma 41896e3668 Add pomdp scheme 9 years ago
  Martin Thoma 868f09c3ae Add MDP schema 9 years ago
  Martin Thoma 8753f03ecb Update triangle 9 years ago
  Martin Thoma d85550b33b Improve quality of Triangle-9-point-circle-circumscribed-circle 9 years ago
  Martin Thoma 8a46e994b5 Add kalman filter 10 years ago
  Martin Thoma 362ee691b1 Add footnote example 10 years ago
  Martin Thoma 52eab59d82 Improve quality 10 years ago
  Martin Thoma 5937b8ac49 Improve quality 10 years ago
  Martin Thoma e56992d7ca Improve quality 10 years ago
  Martin Thoma 3ec36a9361 Improve quality 10 years ago
  Martin Thoma 7f5f40a3a6 Improve quality 10 years ago
  Martin Thoma e8b15c5d2e Improve quality 10 years ago