
Measure theory
Given a Markov transition matrix and an invariant distribution on the states, we can impose a probability measure on the set of subshifts. For example, consider the Markov chain given on the left on the states , with invariant distribution
. If we "forget" the distinction between
, we project this space of subshifts on
into another space of subshifts on
, and this projection also projects the probability measure down to a probability measure on the subshifts on
.
The hidden part of a hidden Markov model, whose observable states is non-Markovian.
The curious thing is that the probability measure on the subshifts on is not created by a Markov chain on
, not even multiple orders. Intuitively, this is because if one observes a long sequence of
, then one would become increasingly sure that the
, meaning that the observable part of the system can be affected by something infinitely in the past.
Conversely, there exists a space of subshifts on 6 symbols, projected to subshifts on 2 symbols, such that any Markov measure on the smaller subshift has a preimage measure that is not Markov of any order (Example 2.6).