Commit 8783cf02 by Lenovo

Add one more sentence

parent 0a7f9c76
...@@ -91,10 +91,13 @@ do not explicitly utilize exploration strategies
 We proved that the 2048 game is acyclic between non-absorbing states.
 This is the fundamental reason why explicit exploration cannot be
 employed in the 2048 game.
+Compared to explicit exploration strategies,
+backward learning, restart, and optimistic initialization
+are more suitable for acyclic MDPs or MDPs with acyclic structures.
 \end{abstract}
 \begin{IEEEkeywords}
-Acyclicity, 2048 game, ergodicity, backward learning.
+Acyclicity, 2048 game, ergodicity, exploration, backward learning.
 \end{IEEEkeywords}
......