浏览代码

Add return value

Martin Thoma 9 年之前
父节点
当前提交
0085bb50d5

二进制
source-code/Pseudocode/Policy-Iteration/Policy-Iteration.png


+ 2 - 1
source-code/Pseudocode/Policy-Iteration/Policy-Iteration.tex

@@ -37,9 +37,10 @@
                     \State $\pi(x) \gets \arg \min_a \{Q(x, a)\}$
                 \EndFor
             \EndWhile
+            \Return $\pi$
         \EndProcedure
         \end{algorithmic}
-    \caption{Policy Iteration}
+    \caption{Policy Iteration: Learning a policy $\pi: \mathcal{X} \rightarrow \mathcal{A}$}
     \label{alg:policy-iteration}
     \end{algorithm}
 \end{preview}