![]() |
Reinforcement Learning and Computer
Go (RLGO) |
| Uncertainty |
Probability and expectation
We often use terms such as probability
and expectation when
discussing Go. But what does this really mean?
When we refer to the probability of an event occurring, we are
normally describing the question of whether a binary observation will
become 1 at any point in the future. But once we view the probability
as a question, it is clear that we must specify more information: who
is to play, and what policies will be followed by us and them, and what
timescale we are interested in. Without these specifications the term probability is ill-defined!
When we refer to the expectation of a value, we are normally
describing the expected outcome
of that value. Again, we can view this as answering a question - after
all this is the definition of a question! However, we must again
specify more information: who is to play, and what policies will be
followed by us and them, and what timescale we are interested in.
Without these specifications the term expectation
is ill-defined!
In general, the question framework forces us to be more precise with
our descriptions of uncertainty than when we use the vague but familiar
terms probability and expectation.