One Hundred [1] Solved [2] Exercises [3] for the subject:
Stochastic Processes I [4]

Takis Konstantopoulos [5]

[1] More or less.
[2] Most of them.
[3] Some of these exercises are taken verbatim from Grinstead and Snell; some from other standard sources; some are original; and some are mere repetitions of things explained in my lecture notes.
[4] The subject covers the basic theory of Markov chains in discrete time and simple random walks on the integers.
[5] Thanks to Andrei Bejan for writing solutions for many of them.
1.
In the Dark Ages, Harvard, Dartmouth, and Yale admitted only male students. As-
sume that, at that time, 80 percent of the sons of Harvard men went to Harvard and
the rest went to Yale, 40 percent of the sons of Yale men went to Yale, and the rest
split evenly between Harvard and Dartmouth; and of the sons of Dartmouth men, 70
percent went to Dartmouth, 20 percent to Harvard, and 10 percent to Yale. (i) Find
the probability that the grandson of a man from Harvard went to Harvard. (ii) Modify
the above by assuming that the son of a Harvard man always went to Harvard. Again,
find the probability that the grandson of a man from Harvard went to Harvard.
Solution. We first form a Markov chain with state space S = {H, D, Y} and the following transition probability matrix:

    P = \begin{pmatrix} .8 & 0 & .2 \\ .2 & .7 & .1 \\ .3 & .3 & .4 \end{pmatrix}.
Note that the columns and rows are ordered: first H, then D, then Y. Recall: the (i, j)-th entry of the matrix P^n gives the probability that the Markov chain starting in state i will be in state j after n steps. Thus, the probability that the grandson of a man from Harvard went to Harvard is the upper-left element of the matrix

    P^2 = \begin{pmatrix} .7 & .06 & .24 \\ .33 & .52 & .15 \\ .42 & .33 & .25 \end{pmatrix}.

It is equal to .7 = .8^2 + .2 × .3 and, of course, one does not need to calculate all elements of P^2 to answer this question.
If all sons of men from Harvard went to Harvard, this would give the following matrix for the new Markov chain with the same set of states:

    P = \begin{pmatrix} 1 & 0 & 0 \\ .2 & .7 & .1 \\ .3 & .3 & .4 \end{pmatrix}.

The upper-left element of P^2 is 1, which is not surprising, because the offspring of Harvard men enter this very institution only.
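As a quick numerical check (a small Python sketch of mine, not part of the original solution; it assumes numpy is available):

    import numpy as np

    # Transition matrix from Exercise 1; states ordered H, D, Y.
    P = np.array([[.8, 0, .2],
                  [.2, .7, .1],
                  [.3, .3, .4]])

    P2 = P @ P          # two-step transition probabilities
    print(P2[0, 0])     # probability Harvard -> Harvard in two generations: 0.7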
2.
Consider an experiment of mating rabbits. We watch the evolution of a particular
gene that appears in two types, G or g. A rabbit has a pair of genes, either GG (dom-
inant), Gg (hybrid–the order is irrelevant, so gG is the same as Gg) or gg (recessive).
In mating two rabbits, the offspring inherits a gene from each of its parents with equal
probability. Thus, if we mate a dominant (GG) with a hybrid (Gg), the offspring is
dominant with probability 1/2 or hybrid with probability 1/2.
Start with a rabbit of given character (GG, Gg, or gg) and mate it with a hybrid. The offspring produced is again mated with a hybrid, and the process is repeated through a number of generations, always mating with a hybrid.
(i) Write down the transition probabilities of the Markov chain thus defined.
(ii) Assume that we start with a hybrid rabbit. Let µ_n be the probability distribution of the character of the rabbit of the n-th generation. In other words, µ_n(GG), µ_n(Gg), µ_n(gg) are the probabilities that the n-th generation rabbit is GG, Gg, or gg, respectively. Compute µ_1, µ_2, µ_3. Can you do the same for µ_n for general n?
Solution. (i) The set of states is S = {GG, Gg, gg} with the following transition probabilities:

          GG    Gg    gg
    GG    .5    .5    0
    Gg    .25   .5    .25
    gg    0     .5    .5

We can rewrite the transition matrix in the following form:

    P = 2^{-1} \begin{pmatrix} 1 & 1 & 0 \\ 1/2 & 1 & 1/2 \\ 0 & 1 & 1 \end{pmatrix}.
(ii) The elements of the second row of the matrix P^n give the probabilities that, in this experiment, a hybrid produces a dominant, hybrid or recessive descendant in the (n−1)-th generation, respectively (reading the row from left to right). We first find

    P^2 = 2^{-2} \begin{pmatrix} 1.5 & 2 & 0.5 \\ 1 & 2 & 1 \\ 0.5 & 2 & 1.5 \end{pmatrix},
    P^3 = 2^{-3} \begin{pmatrix} 2.5 & 4 & 1.5 \\ 2 & 4 & 2 \\ 1.5 & 4 & 2.5 \end{pmatrix},
    P^4 = 2^{-4} \begin{pmatrix} 4.5 & 8 & 3.5 \\ 4 & 8 & 4 \\ 3.5 & 8 & 4.5 \end{pmatrix},
so that

    µ_i(GG) = .25, µ_i(Gg) = .5, µ_i(gg) = .25, i = 1, 2, 3.

Actually the probabilities are the same for any i ∈ N. If you had obtained this result before 1858, when Gregor Mendel started to breed garden peas in his monastery garden and analysed the offspring of these matings, you would probably be very famous, because it definitely looks like a law! This is what Mendel found when he crossed mono-hybrids. In a more general setting, this law is known as the Hardy-Weinberg law.
As an exercise, show that

    P^n = 2^{-n} \begin{pmatrix} \frac{3}{2} + (2^{n-2} - 1) & 2^{n-1} & \frac{1}{2} + (2^{n-2} - 1) \\ 2^{n-2} & 2^{n-1} & 2^{n-2} \\ \frac{1}{2} + (2^{n-2} - 1) & 2^{n-1} & \frac{3}{2} + (2^{n-2} - 1) \end{pmatrix}.

Try!
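The claimed formula can be checked numerically; here is a sketch of mine (not part of the original text) comparing it with direct matrix powers:

    import numpy as np

    P = np.array([[.5, .5, 0], [.25, .5, .25], [0, .5, .5]])

    def Pn_formula(n):
        # The closed form claimed above, times 2^{-n}.
        a = 2**(n - 2) - 1
        M = np.array([[1.5 + a, 2**(n - 1), 0.5 + a],
                      [2**(n - 2), 2**(n - 1), 2**(n - 2)],
                      [0.5 + a, 2**(n - 1), 1.5 + a]])
        return M / 2**n

    for n in range(1, 8):
        assert np.allclose(np.linalg.matrix_power(P, n), Pn_formula(n))
    print("formula agrees with matrix powers")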
3.
A certain calculating machine uses only the digits 0 and 1. It is supposed to transmit one of these digits through several stages. However, at every stage, there is a probability p that the digit that enters this stage will be changed when it leaves and a probability q = 1 − p that it won't. Form a Markov chain to represent the process of transmission by taking as states the digits 0 and 1. What is the matrix of transition probabilities?
Now draw a tree and assign probabilities assuming that the process begins in state 0 and moves through two stages of transmission. What is the probability that the machine, after two stages, produces the digit 0 (i.e., the correct digit)?
Solution. Taking as states the digits 0 and 1 we identify the following Markov chain (by specifying states and transition probabilities):

         0   1
    0    q   p
    1    p   q

where p + q = 1. Thus, the transition matrix is as follows:

    P = \begin{pmatrix} q & p \\ p & q \end{pmatrix} = \begin{pmatrix} 1-p & p \\ p & 1-p \end{pmatrix} = \begin{pmatrix} q & 1-q \\ 1-q & q \end{pmatrix}.

It is clear that the probability that the machine will produce 0 if it starts with 0 is p^2 + q^2.
4.
Assume that a man’s profession can be classified as professional, skilled labourer,
or unskilled labourer. Assume that, of the sons of professional men, 80 percent are
professional, 10 percent are skilled labourers, and 10 percent are unskilled labourers.
In the case of sons of skilled labourers, 60 percent are skilled labourers, 20 percent are
professional, and 20 percent are unskilled. Finally, in the case of unskilled labourers,
50 percent of the sons are unskilled labourers, and 25 percent each are in the other
two categories. Assume that every man has at least one son, and form a Markov chain
by following the profession of a randomly chosen son of a given family through several
generations. Set up the matrix of transition probabilities. Find the probability that a
randomly chosen grandson of an unskilled labourer is a professional man.
Solution. The Markov chain in this exercise has the following set of states

    S = {Professional, Skilled, Unskilled}

with the following transition probabilities:

                  Professional  Skilled  Unskilled
    Professional      .8          .1        .1
    Skilled           .2          .6        .2
    Unskilled         .25         .25       .5

so that the transition matrix for this chain is

    P = \begin{pmatrix} .8 & .1 & .1 \\ .2 & .6 & .2 \\ .25 & .25 & .5 \end{pmatrix}

with

    P^2 = \begin{pmatrix} 0.6850 & 0.1650 & 0.1500 \\ 0.3300 & 0.4300 & 0.2400 \\ 0.3750 & 0.3000 & 0.3250 \end{pmatrix},

and thus the probability that a randomly chosen grandson of an unskilled labourer is a professional man is 0.375.
5.
I have 4 umbrellas, some at home, some in the office. I keep moving between home and office. I take an umbrella with me only if it rains. If it does not rain I leave the umbrella behind (at home or in the office). It may happen that all umbrellas are in one place, I am at the other, it starts raining and I must leave, so I get wet.
1. If the probability of rain is p, what is the probability that I get wet?
2. Current estimates show that p = 0.6 in Edinburgh. How many umbrellas should I have so that, if I follow the strategy above, the probability I get wet is less than 0.1?
Solution. To solve the problem, consider a Markov chain taking values in the set S = {i : i = 0, 1, 2, 3, 4}, where i represents the number of umbrellas in the place where I am currently at (home or office). If i = 1 and it rains then I take the umbrella, move to the other place, where there are already 3 umbrellas, and, including the one I bring, I next have 4 umbrellas. Thus,

    p_{1,4} = p,

because p is the probability of rain. If i = 1 but it does not rain then I do not take the umbrella, I go to the other place and find 3 umbrellas. Thus,

    p_{1,3} = 1 − p = q.
Continuing in the same manner, I form a Markov chain with the following diagram:

    [transition diagram on the states 0, 1, 2, 3, 4: state 0 jumps to 4 with probability 1, and each state i ≥ 1 jumps to 5 − i with probability p and to 4 − i with probability q]

But this does not look very nice. So let's redraw it:

    [the same diagram, redrawn with the states rearranged so that the chain moves along a line]
Let us find the stationary distribution. By equating fluxes, we have:

    π(2) = π(3) = π(1) = π(4),
    π(0) = π(4)q.

Also,

    Σ_{i=0}^{4} π(i) = 1.

Expressing all probabilities in terms of π(4) and inserting in this last equation, we find π(4)q + 4π(4) = 1, or

    π(4) = 1/(q + 4) = π(1) = π(2) = π(3),   π(0) = q/(q + 4).

I get wet every time I happen to be in state 0 and it rains. The chance I am in state 0 is π(0). The chance it rains is p. Hence

    P(WET) = π(0) · p = qp/(q + 4).

With p = 0.6, i.e. q = 0.4, we have

    P(WET) ≈ 0.0545,

less than 6%. That's nice.
If I want the chance to be less than 1% then, clearly, I need more umbrellas. So, suppose I have N umbrellas. Set up the Markov chain as above. It is clear that

    π(N) = π(N − 1) = ··· = π(1),   π(0) = π(N)q.

Inserting in Σ_{i=0}^{N} π(i) = 1 we find

    π(N) = 1/(q + N) = π(N − 1) = ··· = π(1),   π(0) = q/(q + N),

and so

    P(WET) = pq/(q + N).

We want P(WET) < 1/100, or q + N > 100pq, or

    N > 100pq − q = 100 × 0.4 × 0.6 − 0.4 = 23.6.

So to reduce the chance of getting wet from 6% to less than 1% I need 24 umbrellas instead of 4. That's too much. I'd rather get wet.
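Both numbers are easy to reproduce; a Python sketch of mine (not part of the original text):

    p, q = 0.6, 0.4

    def p_wet(N, p):
        q = 1 - p
        return p * q / (q + N)    # pq/(q+N), from the stationary distribution

    print(p_wet(4, p))            # ~0.0545 with 4 umbrellas

    N = 1
    while p_wet(N, p) >= 0.01:    # smallest N with P(WET) < 1%
        N += 1
    print(N)                      # 24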
6.
Suppose that ξ_0, ξ_1, ξ_2, . . . are independent random variables with common probability function h(k) = P(ξ_0 = k), where k belongs, say, to the integers. Let S = {1, . . . , N}. Let X_0 be another random variable, independent of the sequence (ξ_n), taking values in S, and let f : S × Z → S be a certain function. Define new random variables X_1, X_2, . . . by

    X_{n+1} = f(X_n, ξ_n), n = 0, 1, 2, . . .

(i) Show that the X_n form a Markov chain.
(ii) Find its transition probabilities.
Solution. (i) Fix a time n ≥ 1. Suppose that you know that X_n = x. The goal is to show that PAST = (X_0, . . . , X_{n−1}) is independent of FUTURE = (X_{n+1}, X_{n+2}, . . .). The variables in the PAST are functions of

    X_0, ξ_0, . . . , ξ_{n−2}.

The variables in the FUTURE are functions of

    x, ξ_n, ξ_{n+1}, . . .

But X_0, ξ_0, . . . , ξ_{n−2} are independent of ξ_n, ξ_{n+1}, . . .. Therefore, the PAST and the FUTURE are independent.
(ii)

    P(X_{n+1} = y | X_n = x) = P(f(X_n, ξ_n) = y | X_n = x)
                             = P(f(x, ξ_n) = y | X_n = x)
                             = P(f(x, ξ_n) = y)
                             = P(f(x, ξ_0) = y) = P(ξ_0 ∈ A_{x,y}),

where

    A_{x,y} := {ξ : f(x, ξ) = y}.
7.
Discuss the topological properties of the graphs of the following Markov chains:

    (a) P = \begin{pmatrix} 0.5 & 0.5 \\ 0.5 & 0.5 \end{pmatrix}
    (b) P = \begin{pmatrix} 0.5 & 0.5 \\ 1 & 0 \end{pmatrix}
    (c) P = \begin{pmatrix} 1/3 & 0 & 2/3 \\ 0 & 1 & 0 \\ 0 & 1/5 & 4/5 \end{pmatrix}
    (d) P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}
    (e) P = \begin{pmatrix} 1/2 & 1/2 & 0 \\ 0 & 1/2 & 1/2 \\ 1/3 & 1/3 & 1/3 \end{pmatrix}
Solution. Draw the transition diagram for each case.
(a) Irreducible? YES, because there is a path from every state to any other state. Aperiodic? YES, because the times n for which p^{(n)}_{1,1} > 0 are 1, 2, 3, 4, 5, . . . and their gcd is 1.
(b) Irreducible? YES, because there is a path from every state to any other state. Aperiodic? YES, because the times n for which p^{(n)}_{1,1} > 0 are 1, 2, 3, 4, 5, . . . and their gcd is 1.
(c) Irreducible? NO, because starting from state 2 it remains at 2 forever. However, it can be checked that all states have period 1, simply because p_{i,i} > 0 for all i = 1, 2, 3.
(d) Irreducible? YES, because there is a path from every state to any other state. Aperiodic? NO, because the times n for which p^{(n)}_{1,1} > 0 are 2, 4, 6, . . . and their gcd is 2.
(e) Irreducible? YES, because there is a path from every state to any other state. Aperiodic? YES, because the times n for which p^{(n)}_{1,1} > 0 are 1, 2, 3, 4, 5, . . . and their gcd is 1.
8.
Consider the knight's tour on a chess board: A knight selects one of the next positions at random, independently of the past.
(i) Why is this process a Markov chain?
(ii) What is the state space?
(iii) Is it irreducible? Is it aperiodic?
(iv) Find the stationary distribution. Give an interpretation of it: what does it mean,
physically?
(v) Which are the most likely states in steady-state? Which are the least likely ones?
Solution. (i) Part of the problem is to set it up correctly in mathematical terms.
When we say that the "knight selects one of the next positions at random independently of the past" we mean that the next position X_{n+1} is a function of the current position X_n and a random choice ξ_n of a neighbour. Hence the problem is in the same form as the one above. Hence (X_n) is a Markov chain.
(ii) The state space is the set of the squares of the chess board. There are 8 × 8 = 64
squares. We can label them by a pair of integers. Hence the state space is
    S = {(i_1, i_2) : 1 ≤ i_1 ≤ 8, 1 ≤ i_2 ≤ 8} = {1, 2, . . . , 8} × {1, 2, . . . , 8}.
(iii) The best way to see if it is irreducible is to take a knight and move it on a chess board. You will, indeed, realise that you can find a path that takes the knight from any square to any other square. Hence every state communicates with every other state, i.e. the chain is irreducible.
To see what the period is, find the period of a specific state, e.g. (1, 1). You can see that, if you start the knight from (1, 1), you can return it to (1, 1) only in an even number of steps. Hence the period is 2. So the answer is that the chain is not aperiodic.
(iv) You have no chance of solving a set of 64 equations with 64 unknowns, unless you make an educated guess. First, there is a lot of symmetry. So squares (states) that are symmetric with respect to the centre of the chess board must have the same probability under the stationary distribution. So, for example, states (1, 1), (8, 1), (1, 8), (8, 8) have the same probability. And so on. Second, you should realise that (1, 1) must be less likely than a square closer to the centre, e.g. (4, 4). The reason is that (1, 1) has fewer next states (exactly 2) than (4, 4) (which has 8 next states). So let us make the guess that if x = (i_1, i_2), then π(x) is proportional to the number N(x) of the possible next states of the square x:

    π(x) = CN(x).

But we must SHOW that this choice is correct. Let us say that y is a NEIGHBOUR of x if y is a possible next state of x (if it is possible to move the knight from x to y in one step). So we must show that such a π satisfies the balance equations:

    π(x) = Σ_{y∈S} π(y) p_{y,x}.
Equivalently, by cancelling C from both sides, we wonder whether

    N(x) = Σ_{y∈S} N(y) p_{y,x}

holds true. But the summands on the right are zero unless x is a NEIGHBOUR of y:

    N(x) = Σ_{y∈S: x neighbour of y} N(y) p_{y,x}.

But the rule of motion is to choose one of the neighbours with equal probability:

    p_{y,x} = 1/N(y), if x is a neighbour of y; 0, otherwise.

This means that the previous equation becomes

    N(x) = Σ_{y∈S: x neighbour of y} N(y) · (1/N(y)) = Σ_{y∈S: x neighbour of y} 1 = Σ_{y∈S: y neighbour of x} 1,

where in the last equality we used the obvious fact that x is a neighbour of y if and only if y is a neighbour of x (symmetry of the relation), and so the last sum equals, indeed, N(x). So our guess is correct!
Therefore, all we have to do is count the neighbours of each square x. Here we go (the entry at square x is N(x)):

    2 3 4 4 4 4 3 2
    3 4 6 6 6 6 4 3
    4 6 8 8 8 8 6 4
    4 6 8 8 8 8 6 4
    4 6 8 8 8 8 6 4
    4 6 8 8 8 8 6 4
    3 4 6 6 6 6 4 3
    2 3 4 4 4 4 3 2

We have

    2 × 4 + 3 × 8 + 4 × 20 + 6 × 16 + 8 × 16 = 336.

So C = 1/336, and

    π(1, 1) = 2/336, π(1, 2) = 3/336, π(1, 3) = 4/336, . . . , π(4, 4) = 8/336, . . . ,
etc.
Meaning of π. If we start with

    P(X_0 = x) = π(x), x ∈ S,

then, for all times n ≥ 1,

    P(X_n = x) = π(x), x ∈ S.
(v) The corner ones are the least likely: 2/336. The 16 middle ones are the most likely:
8/336.
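Both the neighbour counts and the stationary distribution can be confirmed numerically; the sketch below is mine (not from the original text) and assumes numpy:

    import numpy as np

    moves = [(1,2),(2,1),(-1,2),(-2,1),(1,-2),(2,-1),(-1,-2),(-2,-1)]
    squares = [(i, j) for i in range(8) for j in range(8)]
    idx = {s: k for k, s in enumerate(squares)}

    def neighbours(s):
        return [(s[0]+a, s[1]+b) for a, b in moves
                if 0 <= s[0]+a < 8 and 0 <= s[1]+b < 8]

    N = np.array([len(neighbours(s)) for s in squares])
    print(N.sum())                      # 336

    P = np.zeros((64, 64))
    for s in squares:
        for t in neighbours(s):
            P[idx[s], idx[t]] = 1 / N[idx[s]]

    pi = N / N.sum()                    # guessed stationary distribution
    assert np.allclose(pi @ P, pi)      # the balance equations hold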
9.
Consider a Markov chain with two states 1, 2. Suppose that p_{1,2} = a, p_{2,1} = b. For which values of a and b do we obtain an absorbing Markov chain?
Solution. One of them (or both) should be zero. Because, if they are both positive, the chain will keep moving between 1 and 2 forever, and neither state is absorbing.
10.
Smith is in jail and has 3 dollars; he can get out on bail if he has 8 dollars. A guard agrees to make a series of bets with him. If Smith bets A dollars, he wins A dollars with probability 0.4 and loses A dollars with probability 0.6. Find the probability that he wins 8 dollars before losing all of his money if (a) he bets 1 dollar each time (timid strategy); (b) he bets, each time, as much as possible but not more than necessary to bring his fortune up to 8 dollars (bold strategy). (c) Which strategy gives Smith the better chance of getting out of jail?
Solution. (a) The Markov chain (X_n, n = 0, 1, . . .) representing the evolution of Smith's money has diagram

    [diagram: states 0, 1, . . . , 8; from each state i = 1, . . . , 7 the chain moves to i + 1 with probability 0.4 and to i − 1 with probability 0.6; states 0 and 8 are absorbing]
Let ϕ(i) be the probability that the chain reaches state 8 before reaching state 0, starting from state i. In other words, if S_j is the first n ≥ 0 such that X_n = j,

    ϕ(i) = P_i(S_8 < S_0) = P(S_8 < S_0 | X_0 = i).

Using first-step analysis (viz. the Markov property at time n = 1), we have

    ϕ(i) = 0.4ϕ(i + 1) + 0.6ϕ(i − 1), i = 1, 2, . . . , 7,
    ϕ(0) = 0,
    ϕ(8) = 1.

We solve this system of linear equations and find

    (ϕ(1), ϕ(2), . . . , ϕ(8)) = (0.0203, 0.0508, 0.0964, 0.1649, 0.2677, 0.4219, 0.6531, 1).
E.g., the probability that the chain reaches state 8 before reaching state 0, starting from state 3, is the third component of this vector and is equal to 0.0964. Note that ϕ(i) is increasing in i, which was expected.
(b) Now the chain is

    [diagram: from 3 the chain moves to 6 (probability 0.4) or to 0 (probability 0.6); from 6 to 8 (0.4) or to 4 (0.6); from 4 to 8 (0.4) or to 0 (0.6); states 0 and 8 are absorbing]

and the equations are:

    ϕ(3) = 0.4ϕ(6)
    ϕ(6) = 0.4ϕ(8) + 0.6ϕ(4)
    ϕ(4) = 0.4ϕ(8)
    ϕ(0) = 0
    ϕ(8) = 1.

We solve and find

    ϕ(3) = 0.256, ϕ(4) = 0.4, ϕ(6) = 0.64.

(c) By comparing the values of ϕ(3) we find that the bold strategy gives Smith a better chance to get out of jail.
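For the timid strategy, the seven-equation system can be solved mechanically; an illustrative Python sketch of mine (numpy assumed):

    import numpy as np

    # Unknowns phi(1..7); phi(i) = 0.4 phi(i+1) + 0.6 phi(i-1), phi(0)=0, phi(8)=1.
    A = np.zeros((7, 7))
    b = np.zeros(7)
    for i in range(1, 8):
        A[i-1, i-1] = 1.0
        if i + 1 <= 7:
            A[i-1, i] = -0.4
        else:
            b[i-1] += 0.4          # boundary term from phi(8) = 1
        if i - 1 >= 1:
            A[i-1, i-2] = -0.6     # phi(0) = 0 contributes nothing
    phi = np.linalg.solve(A, b)
    print(np.round(phi, 4))        # [0.0203 0.0508 0.0964 0.1649 0.2677 0.4219 0.6531]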
11.
A Markov chain with state space {1, 2, 3} has transition probability matrix
    P = \begin{pmatrix} 1/3 & 1/3 & 1/3 \\ 0 & 1/2 & 1/2 \\ 0 & 0 & 1 \end{pmatrix}.
Show that state 3 is absorbing and, starting from state 1, find the expected time until
absorption occurs.
Solution. Let ψ(i) be the expected time to reach state 3 starting from state i, where i ∈ {1, 2, 3}. We have

    ψ(3) = 0
    ψ(2) = 1 + (1/2)ψ(2) + (1/2)ψ(3)
    ψ(1) = 1 + (1/3)ψ(1) + (1/3)ψ(2) + (1/3)ψ(3).

We solve and find

    ψ(3) = 0, ψ(2) = 2, ψ(1) = 5/2.
12.
A fair coin is tossed repeatedly and independently. Find the expected number of tosses till the pattern HTH appears.
Solution. Call HTH our target. Consider a chain that starts from a state called ∅ (nothing) and is eventually absorbed at HTH. If we first toss H then we move to state H, because this is the first letter of our target. If we toss a T then we move back to ∅, having expended 1 unit of time. Being in state H we either move to a new state HT, if we bring T, and we are 1 step closer to the target, or, if we bring H, we move back to H: we have expended 1 unit of time, but the new H can be the beginning of a target. When in state HT we either move to HTH, and we are done, or, if T occurs, then we move to ∅. The transition diagram is

    [diagram: ∅ → H, H → HT and HT → HTH each with probability 1/2; ∅ → ∅, H → H and HT → ∅ with probability 1/2; HTH absorbing]

Rename the states ∅, H, HT, HTH as 0, 1, 2, 3, respectively. Let ψ(i) be the expected number of steps to reach HTH starting from i. We have

    ψ(2) = 1 + (1/2)ψ(0)
    ψ(1) = 1 + (1/2)ψ(1) + (1/2)ψ(2)
    ψ(0) = 1 + (1/2)ψ(0) + (1/2)ψ(1).

We solve and find ψ(0) = 10.
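A quick Monte Carlo sanity check of the answer (a Python sketch of mine, not part of the original solution):

    import random

    def tosses_until_HTH():
        last3, n = "", 0
        while last3[-3:] != "HTH":
            last3 = (last3 + random.choice("HT"))[-3:]
            n += 1
        return n

    trials = 100_000
    est = sum(tosses_until_HTH() for _ in range(trials)) / trials
    print(est)   # should be close to 10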
13.
Consider a Markov chain with states S = {0, . . . , N} and transition probabilities p_{i,i+1} = p, p_{i,i−1} = q, for 1 ≤ i ≤ N − 1, where p + q = 1, 0 < p < 1; at the boundaries, assume p_{0,1} = p, p_{0,0} = q, and p_{N,N−1} = q, p_{N,N} = p.
1. Draw the graph (= transition diagram).
2. Is the Markov chain irreducible?
3. Is it aperiodic?
4. What is the period of the chain?
5. Find the stationary distribution.
Solution. 1. The transition diagram is:

    [diagram: states 0, 1, . . . , N on a line; each state i moves one step to the right with probability p and one step to the left with probability q, with self-loops of probability q at 0 and p at N]

2. Yes: it is possible to go from any state to any other state.
3. Yes, because p_{0,0} > 0.
4. One.
5. We write balance equations by equating fluxes:

    π(i)q = π(i − 1)p,

as long as 1 ≤ i ≤ N. Hence

    π(i) = (p/q)π(i − 1) = (p/q)^2 π(i − 2) = ··· = (p/q)^i π(0), 0 ≤ i ≤ N.
Since

    π(0) + π(1) + ··· + π(N − 1) + π(N) = 1,

we find

    π(0) [1 + (p/q) + (p/q)^2 + ··· + (p/q)^N] = 1,

which gives

    π(0) = [1 + (p/q) + (p/q)^2 + ··· + (p/q)^N]^{−1} = ((p/q) − 1)/((p/q)^{N+1} − 1),

as long as p ≠ q. Hence, if p ≠ q,

    π(i) = ((p/q) − 1)/((p/q)^{N+1} − 1) · (p/q)^i, 0 ≤ i ≤ N.
If p = q = 1/2, then

    π(0) = [1 + (p/q) + (p/q)^2 + ··· + (p/q)^N]^{−1} = 1/(N + 1),

and so

    π(i) = 1/(N + 1), for all i.

Thus, in this case, π(i) is the uniform distribution on the set of states.
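A numerical sanity check (a Python sketch of mine, using the boundary self-loop convention above; numpy assumed):

    import numpy as np

    def stationary_check(p, N):
        q = 1 - p
        P = np.zeros((N + 1, N + 1))
        P[0, 0], P[0, 1] = q, p            # self-loops at the boundaries
        P[N, N], P[N, N - 1] = p, q
        for i in range(1, N):
            P[i, i + 1], P[i, i - 1] = p, q
        r = p / q
        pi = np.array([r**i for i in range(N + 1)])
        pi /= pi.sum()                     # pi(i) proportional to (p/q)^i
        assert np.allclose(pi @ P, pi)     # balance equations hold
        return pi

    print(stationary_check(0.3, 5))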
14.
A. Assume that an experiment has m equally probable outcomes. Show that the expected number of independent trials before the first occurrence of k consecutive occurrences of one of these outcomes is

    (m^k − 1)/(m − 1).

Hint: Form an absorbing Markov chain with states 1, 2, . . . , k, with state i representing the length of the current run. The expected time until a run of k is 1 more than the expected time until absorption for the chain started in state 1.
B. It has been found that, in the decimal expansion of π = 3.14159 . . ., starting with the 24,658,601st digit, there is a run of nine 7's. What would your result say about the expected number of digits necessary to find such a run if the digits are produced randomly?
Solution. A. Let the outcomes be a, b, c, . . . (m of them in total). Following the hint, we track the length of the current run: the states are

    1, 2, . . . , k,

where state j means that the last j trials all produced the same outcome. The first trial always starts a run of length 1. From state j < k, the next trial extends the run with probability 1/m (it must match the previous outcome); otherwise it starts a new run of length 1. Let ψ(j) be the expected number of steps till state k is reached, starting from state j:

    ψ(j) := E_j S_k.

The quantity we want is 1 + ψ(1), the extra 1 accounting for the first trial. We have

    ψ(j) = 1 + (1 − 1/m)ψ(1) + (1/m)ψ(j + 1), j = 1, . . . , k − 1, ψ(k) = 0.

Solving these, we find

    1 + ψ(1) = 1 + m + m^2 + ··· + m^{k−1} = (m^k − 1)/(m − 1).
B. So to get 10 consecutive sixes by rolling a die, you need more than 12 million rolls on the average (12,093,235 rolls, to be exact).
C. The digits of π are not random. If they were, we would expect to have to pick (10^9 − 1)/9 digits before we see nine consecutive sevens. That's about 111 million digits. The actual position (about 25 million digits) is less than a quarter of the expected one.
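A simulation sketch (mine, not from the original text) checking the formula (m^k − 1)/(m − 1) for small m and k:

    import random

    def trials_until_run(m, k):
        run_len, last, n = 0, None, 0
        while run_len < k:
            x = random.randrange(m)
            n += 1
            run_len = run_len + 1 if x == last else 1
            last = x
        return n

    m, k = 3, 3
    est = sum(trials_until_run(m, k) for _ in range(200_000)) / 200_000
    print(est, (m**k - 1) / (m - 1))   # both should be close to 13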
15.
A rat runs through the maze shown below. At each step it leaves the room it is in by choosing at random one of the doors out of the room.

    [maze with six rooms, labelled 1–6: rooms 1 and 2 each have a single door, leading to room 3; room 3 has four doors, to rooms 1, 2, 4 and 5; rooms 4 and 5 each have two doors, to rooms 3 and 6; room 6 has two doors, to rooms 4 and 5]

(a) Give the transition matrix P for this Markov chain. (b) Show that it is irreducible but not aperiodic. (c) Find the stationary distribution. (d) Now suppose that a piece of mature cheddar is placed on a deadly trap in Room 5. The rat starts in Room 1. Find the expected number of steps before reaching Room 5 for the first time, starting in Room 1. (e) Find the expected time to return to Room 1.
Solution.
(a) The transition matrix P for this Markov chain is as follows:

    P = \begin{pmatrix} 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 1/4 & 1/4 & 0 & 1/4 & 1/4 & 0 \\ 0 & 0 & 1/2 & 0 & 0 & 1/2 \\ 0 & 0 & 1/2 & 0 & 0 & 1/2 \\ 0 & 0 & 0 & 1/2 & 1/2 & 0 \end{pmatrix}.
(b) The chain is irreducible, because it is possible to go from any state to any other state. However, it is not aperiodic: for any even n, p^{(n)}_{6,1} is zero, and for any odd n, p^{(n)}_{6,3} is zero (why?). This means that there is no power of P that would have all its entries strictly positive.
(c) The stationary distribution is

    π = (1/12, 1/12, 4/12, 2/12, 2/12, 2/12).

You should carry out the calculations and check that this is correct.
(d) Let

    ψ(i) = E(number of steps to reach state 5 | X_0 = i).

We have

    ψ(5) = 0
    ψ(6) = 1 + (1/2)ψ(5) + (1/2)ψ(4)
    ψ(4) = 1 + (1/2)ψ(6) + (1/2)ψ(3)
    ψ(3) = 1 + (1/4)ψ(1) + (1/4)ψ(2) + (1/4)ψ(4) + (1/4)ψ(5)
    ψ(1) = 1 + ψ(3)
    ψ(2) = 1 + ψ(3).

We solve and find ψ(1) = 7.
(e) We find from π that the mean recurrence time (i.e. the expected time to return) for Room 1 is 1/π(1) = 12.
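Both answers are easy to reproduce with a linear solve; a Python sketch of mine (numpy assumed):

    import numpy as np

    P = np.array([[0, 0, 1, 0, 0, 0],
                  [0, 0, 1, 0, 0, 0],
                  [.25, .25, 0, .25, .25, 0],
                  [0, 0, .5, 0, 0, .5],
                  [0, 0, .5, 0, 0, .5],
                  [0, 0, 0, .5, .5, 0]])

    pi = np.array([1, 1, 4, 2, 2, 2]) / 12
    assert np.allclose(pi @ P, pi)         # stationary distribution checks out

    # Expected hitting times of room 5 (index 4): (I - Q) psi = 1 on the other rooms.
    others = [0, 1, 2, 3, 5]
    Q = P[np.ix_(others, others)]
    psi = np.linalg.solve(np.eye(5) - Q, np.ones(5))
    print(psi[0])                          # 7.0, expected steps from room 1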
16.
Show that if P is the transition matrix of an irreducible chain with finitely many states, then Q := (1/2)(I + P) is the transition matrix of an irreducible and aperiodic chain. (Note that I stands for the identity matrix, i.e. the matrix which has 1 everywhere on its diagonal and 0 everywhere else.)
Show that P and (1/2)(I + P) have the same stationary distributions.
Discuss, physically, how the two chains are related.
Solution. Let p_{ij} be the entries of P. Then the entries q_{ij} of Q are

    q_{ij} = (1/2)p_{ij}, if i ≠ j,
    q_{ii} = (1/2)(1 + p_{ii}).

The graph of the new chain has more arrows than the original one. Hence it is also irreducible. But the new chain also has a self-loop at each i, because q_{ii} > 0 for all i. Hence it is aperiodic.
Let π be a stationary distribution for P. Then

    πP = π.

We must show that

    πQ = π.

But

    πQ = (1/2)(πI + πP) = (1/2)(π + π) = π.

The physical meaning of the new chain is that it represents a slowing down of the original one. Indeed, all outgoing probabilities have been halved, while the probability of staying at the same state has been increased. The chain performs the same transitions as the original one but stays longer at each state.
17.
Two players, A and B, play the game of matching pennies: at each time n, each player has a penny and must secretly turn the penny to heads or tails. The players then reveal their choices simultaneously. If the pennies match (both heads or both tails), Player A wins the penny. If the pennies do not match (one heads and one tails), Player B wins the penny. Suppose the players have between them a total of 5 pennies. If at any time one player has all of the pennies, then, to keep the game going, he gives one back to the other player and the game will continue. (a) Show that this game can be formulated as a Markov chain. (b) Is the chain regular (irreducible + aperiodic)? (c) If Player A starts with 3 pennies and Player B with 2, what is the probability that A will lose his pennies first?
Solution. (a) The problem is easy: The probability that two pennies match is 1/2. The probability they do not match is 1/2. Let x be the number of pennies that A has. Then with probability 1/2 he will next have x + 1 pennies and with probability 1/2 he will next have x − 1 pennies. The exception is when x = 0, in which case he gets, for free, a penny from B and he next has 1 penny. Also, if x = 5 he gives a penny to B and he next has 4 pennies. Thus the chain on the states {0, 1, . . . , 5} moves from x to x ± 1 with probability 1/2 each, for 1 ≤ x ≤ 4, while p_{0,1} = p_{5,4} = 1.
(b) The chain is clearly irreducible. But the period is 2. Hence it is not regular.
(c) To do this, modify the chain and make it stop once one of the players loses his pennies. After all, we are NOT interested in the behaviour of the chain after this time. The modification is an absorbing chain, in which 0 and 5 are absorbing states. We then want to compute the absorption probability ϕ(3), where

    ϕ(i) = P_i(hit 0 before 5).
Apply first-step analysis:

    ϕ(0) = 1
    ϕ(1) = (1/2)ϕ(0) + (1/2)ϕ(2)
    ϕ(2) = (1/2)ϕ(1) + (1/2)ϕ(3)
    ϕ(3) = (1/2)ϕ(2) + (1/2)ϕ(4)
    ϕ(4) = (1/2)ϕ(3) + (1/2)ϕ(5)
    ϕ(5) = 0.

Six equations with six unknowns. Solve and find: ϕ(3) = 2/5.
Alternatively, observe, from Thales' theorem [Thales' theorem (proved around the year 600 BCE) says that if the lines L, L' are parallel, then DE/BC = AE/AC = AD/AB], that ϕ must be a straight line:

    ϕ(x) = ax + b.

From ϕ(0) = 1, ϕ(5) = 0, we find a = −1/5, b = 1, i.e.

    ϕ(i) = 1 − (i/5),

which agrees with the above.
18.
A process moves on the integers 1, 2, 3, 4, and 5. It starts at 1 and, on each successive
step, moves to an integer greater than its present position, moving with equal proba-
bility to each of the remaining larger integers. State five is an absorbing state. Find
the expected number of steps to reach state five.
Solution. A Markov chain is defined and its transition probability matrix is as follows:

    P = \begin{pmatrix} 0 & 1/4 & 1/4 & 1/4 & 1/4 \\ 0 & 0 & 1/3 & 1/3 & 1/3 \\ 0 & 0 & 0 & 1/2 & 1/2 \\ 0 & 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 & 1 \end{pmatrix}.

We apply first-step analysis for the function

    ψ(i) := E_i S_5, 1 ≤ i ≤ 5,

where S_5 = inf{n ≥ 0 : X_n = 5}. One of the equations is ψ(5) = 0 (obviously). Another is

    ψ(1) = 1 + (1/4)ψ(2) + (1/4)ψ(3) + (1/4)ψ(4) + (1/4)ψ(5).
It's up to you to write the remaining equations and solve to find

    ψ(1) = 1 + 1/2 + 1/3 + 1/4 ≈ 2.0833.
19.
Generalise the previous exercise, by replacing 5 by a general positive integer n. Find
the expected number of steps to reach state n, when starting from state 1. Test your
conjecture for several different values of n. Can you conjecture an estimate for the
expected number of steps to reach state n, for large n?
Solution. The answer here is

    E_1 S_n = Σ_{k=1}^{n−1} 1/k.

We here recognise the harmonic series:

    Σ_{k=1}^{n} 1/k ∼ log n,

for large n, in the sense that the difference of the two sides converges to a constant. So,

    E_1 S_n ∼ log n,

when n is large.
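A two-line numerical check (a Python sketch of mine; the difference tends to Euler's constant, about 0.577):

    import math

    for n in [10, 100, 1000, 10**6]:
        h = sum(1/k for k in range(1, n))          # harmonic sum up to n-1
        print(n, h, math.log(n), h - math.log(n))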
20.
A gambler plays a game in which on each play he wins one dollar with probability p and loses one dollar with probability q = 1 − p. The Gambler's Ruin Problem is the problem of finding

    ϕ(x) := the probability of winning an amount b before losing everything, starting with state x
          = P_x(S_b < S_0).

1. Show that this problem may be considered to be an absorbing Markov chain with states 0, 1, 2, . . . , b, with 0 and b absorbing states.
2. Write down the equations satisfied by ϕ(x).
3. If p = q = 1/2, show that

    ϕ(x) = x/b.

4. If p ≠ q, show that

    ϕ(x) = ((q/p)^x − 1)/((q/p)^b − 1).
Solution. 1. If the current fortune is x, the next fortune will be either x + 1 or x − 1, with probability p or q, respectively, as long as x is neither b nor 0. We assume independence between games, so the next fortune will not depend on the previous ones; whence the Markov property. If the fortune reaches 0 then the gambler must stop playing. So 0 is absorbing. If it reaches b then the gambler has reached the target, hence the play stops again. So both 0 and b are absorbing states. The transition diagram is:

    [diagram: states 0, 1, 2, . . . , b on a line; each interior state x moves to x + 1 with probability p and to x − 1 with probability q; 0 and b are absorbing]

2. The equations are:

    ϕ(0) = 0
    ϕ(b) = 1
    ϕ(x) = pϕ(x + 1) + qϕ(x − 1), x = 1, 2, . . . , b − 1.
3. If p = q = 1/2, we have

    ϕ(x) = (ϕ(x + 1) + ϕ(x − 1))/2, x = 1, 2, . . . , b − 1.

This means that the point (x, ϕ(x)) in the plane is in the middle of the segment with endpoints (x − 1, ϕ(x − 1)) and (x + 1, ϕ(x + 1)). Hence the graph of the function ϕ(x) must be on a straight line (Thales' theorem):

    [figure omitted]

In other words,

    ϕ(x) = Ax + B.

We determine the constants A, B from ϕ(0) = 0, ϕ(b) = 1. Thus, ϕ(x) = x/b.
4. If p ≠ q, then this nice linear property does not hold. However, if we substitute the given function into the equations, we see that they are satisfied.
21.
Consider the Markov chain with transition matrix

    P = \begin{pmatrix} 1/2 & 1/3 & 1/6 \\ 3/4 & 0 & 1/4 \\ 0 & 1 & 0 \end{pmatrix}.

(a) Show that this is irreducible and aperiodic.
(b) The process is started in state 1; find the probability that it is in state 3 after two steps.
(c) Find the matrix which is the limit of P^n as n → ∞.
Solution.

    [transition diagram: 1 → 1 (1/2), 1 → 2 (1/3), 1 → 3 (1/6); 2 → 1 (3/4), 2 → 3 (1/4); 3 → 2 (1)]

(a) Draw the transition diagram and observe that there is a path from every state to any other state. Hence it is irreducible. Now consider a state, say state i = 1, and the times n at which p^{(n)}_{1,1} > 0. These times are 1, 2, 3, 4, 5, . . . and their gcd is 1. Hence it is aperiodic. So the chain is regular.
(b)

    P_1(X_2 = 3) = p^{(2)}_{1,3} = Σ_{i=1}^{3} p_{1,i} p_{i,3} = p_{1,1}p_{1,3} + p_{1,2}p_{2,3} + p_{1,3}p_{3,3}
                 = (1/2)·(1/6) + (1/3)·(1/4) + (1/6)·0 = 1/12 + 1/12 = 1/6.
(c) The limit exists because the chain is regular. It is given by

    lim_{n→∞} P^n = \begin{pmatrix} π(1) & π(2) & π(3) \\ π(1) & π(2) & π(3) \\ π(1) & π(2) & π(3) \end{pmatrix}

where π = (π(1), π(2), π(3)) is the stationary distribution, which is found by solving the balance equations

    πP = π,

together with

    π(1) + π(2) + π(3) = 1.

The balance equations are equivalent to

    π(1)(1/6) + π(1)(1/3) = π(2)(3/4)
    π(3) = π(2)(1/4) + π(1)(1/6).

Solving the last 3 equations with 3 unknowns we find

    π(1) = 3/6, π(2) = 2/6, π(3) = 1/6.

Hence

    lim_{n→∞} P^n = \begin{pmatrix} 3/6 & 2/6 & 1/6 \\ 3/6 & 2/6 & 1/6 \\ 3/6 & 2/6 & 1/6 \end{pmatrix}.
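Numerically, P^n converges quickly to this matrix; a one-line check of mine (numpy assumed):

    import numpy as np

    P = np.array([[1/2, 1/3, 1/6],
                  [3/4, 0, 1/4],
                  [0, 1, 0]])
    print(np.linalg.matrix_power(P, 50))   # each row approaches (3/6, 2/6, 1/6)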
22.
Show that a Markov chain with transition matrix
    P = \begin{pmatrix} 1 & 0 & 0 \\ 1/4 & 1/2 & 1/4 \\ 0 & 0 & 1 \end{pmatrix}

has more than one stationary distribution. Find the matrix that P^n converges to, as n → ∞, and verify that it is not a matrix all of whose rows are the same.
You should work out this exercise by direct methods, without appealing to the general limiting theory of Markov chains; see lecture notes.
Solution. The transition diagram is:

    [diagram: states 1 and 3 are absorbing; state 2 has a self-loop of probability 1/2 and moves to 1 or to 3 with probability 1/4 each]
Write the balance equations πP = π:

    (π(1) π(2) π(3)) \begin{pmatrix} 1 & 0 & 0 \\ 1/4 & 1/2 & 1/4 \\ 0 & 0 & 1 \end{pmatrix} = (π(1) π(2) π(3))

or

    π(1)·1 + π(2)·(1/4) + π(3)·0 = π(1)   (1)
    π(1)·0 + π(2)·(1/2) + π(3)·0 = π(2)   (2)
    π(1)·0 + π(2)·(1/4) + π(3)·1 = π(3),  (3)

together with the normalisation condition Σ_i π(i) = 1, i.e.

    π(1) + π(2) + π(3) = 1,   (4)

and solve for π(1), π(2), π(3). Equation (1) gives

    π(2) = 0.

Equation (2) gives (1/2)π(2) = π(2), i.e. π(2) = 0 again; nothing new. Equation (3) gives (1/4)π(2) + π(3) = π(3), i.e. π(2) = 0 once more. Equation (4) gives

    π(1) + π(3) = 1.

Therefore, equations (1)–(4) are EQUIVALENT TO:

    π(2) = 0, π(1) + π(3) = 1.
Hence we can set π(1) to ANY value we like between 0 and 1, say, π(1) = p, and then let π(3) = 1 − p. Thus there is not just one stationary distribution but infinitely many: for each value of p ∈ [0, 1], any π of the form

    π = (p, 0, 1 − p)

is a stationary distribution.
To find the limit of P^n as n → ∞, we compute the entries of the matrix P^n. Notice that the (i, j)-entry of P^n equals

    p^{(n)}_{i,j} = P_i(X_n = j).
If i = 1 we have

    P_1(X_n = 1) = 1, P_1(X_n = 2) = 0, P_1(X_n = 3) = 0,

because state 1 is absorbing. Similarly, state 3 is absorbing:

    P_3(X_n = 1) = 0, P_3(X_n = 2) = 0, P_3(X_n = 3) = 1.

We thus know the first and third rows of P^n:

    P^n = \begin{pmatrix} 1 & 0 & 0 \\ p^{(n)}_{2,1} & p^{(n)}_{2,2} & p^{(n)}_{2,3} \\ 0 & 0 & 1 \end{pmatrix}.
We now compute the missing entries of the second row by simple observations, based on the fact that the chain, started in state 2, will remain at 2 for some time and then will leave it and either go to 1 or 3:

    P_2(X_n = 2) = P_2(chain has stayed in state 2 for n consecutive steps) = (1/2)^n.

    P_2(X_n = 1) = Σ_{m=1}^{n} P_2(X_{m−1} = 2, X_m = 1)
                 = Σ_{m=1}^{n} (1/2)^{m−1} · (1/4)
                 = (1 − (1/2)^n)/(1 − (1/2)) · (1/4) = (1 − 0.5^n)/2.

    P_2(X_n = 3) = 1 − P_2(X_n = 2) − P_2(X_n = 1) = (1 − 0.5^n)/2.
Therefore,

    P^n = \begin{pmatrix} 1 & 0 & 0 \\ (1 − 0.5^n)/2 & 0.5^n & (1 − 0.5^n)/2 \\ 0 & 0 & 1 \end{pmatrix}.

Since 0.5^n → 0 as n → ∞, we have

    P^n → \begin{pmatrix} 1 & 0 & 0 \\ 1/2 & 0 & 1/2 \\ 0 & 0 & 1 \end{pmatrix}, as n → ∞.
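Here the limit has distinct rows, reflecting the two absorbing states; a quick numerical check of mine (numpy assumed):

    import numpy as np

    P = np.array([[1, 0, 0],
                  [1/4, 1/2, 1/4],
                  [0, 0, 1]])
    print(np.linalg.matrix_power(P, 60))   # rows: (1,0,0), (1/2,0,1/2), (0,0,1)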
23.
Toss a fair die repeatedly. Let S_n denote the total of the outcomes through the n-th toss. Show that there is a limiting value for the proportion of the first n values of S_n that are divisible by 7, and compute the value for this limit.
Hint: The desired limit is a stationary distribution for an appropriate Markov chain with 7 states.
Solution. An integer k ≥ 1 is divisible by 7 if it leaves remainder 0 when divided by 7. When we divide an integer k ≥ 1 by 7, the possible remainders are

    0, 1, 2, 3, 4, 5, 6.

Let X_1, X_2, . . . be the outcomes of a fair die tossing. These are i.i.d. random variables uniformly distributed in {1, 2, 3, 4, 5, 6}. We are asked to consider the sum

    S_n = X_1 + ··· + X_n.

Clearly, S_n is an integer. We are interested in the remainder of S_n when divided by 7. Call this R_n. So:

    R_n := the remainder of the division of S_n by 7.

Note that the random variables R_1, R_2, R_3, . . . form a Markov chain, because if we know the value of R_n, all we have to do to find the next value R_{n+1} is to add X_{n+1} to R_n, divide by 7, and take the remainder of this division, as in elementary-school arithmetic:

    R_{n+1} = the remainder of the division of R_n + X_{n+1} by 7.

We need to find the transition probabilities

    p_{i,j} := P(R_{n+1} = j | R_n = i) = P(the remainder of the division of i + X_{n+1} by 7 equals j)

for this Markov chain, for all i, j ∈ {0, 1, 2, 3, 4, 5, 6}. But X_{n+1} takes values in {1, 2, 3, 4, 5, 6} with equal probabilities 1/6. If to an i we add an x chosen from {1, 2, 3, 4, 5, 6} and then divide by 7, we are going to obtain any j ≠ i in {0, 1, 2, 3, 4, 5, 6}. Therefore,

    p_{i,j} = 1/6, for all i and all j ≠ i (and p_{i,i} = 0).
We are asked to consider the proportion of the first n values of S_n that are divisible by 7, namely the quantity

    (1/n) Σ_{k=1}^{n} 1(R_k = 0).

This quantity has a limit, by the Strong Law of Large Numbers for Markov chains, and the limit is the stationary distribution at state 0:

    P( lim_{n→∞} (1/n) Σ_{k=1}^{n} 1(R_k = 0) = π(0) ) = 1.
Therefore we need to compute π for the Markov chain (R_n). This is very easy. From symmetry, all states i must have the same π(i). Therefore

    π(i) = 1/7, i = 0, 1, 2, 3, 4, 5, 6.

Hence

    P( lim_{n→∞} (1/n) Σ_{k=1}^{n} 1(R_k = 0) = 1/7 ) = 1.

In other words, if you toss a fair die 10,000 times then for approximately 10,000/7 ≈ 1429 values of n you had a sum S_n that was divisible by 7, and this is true with probability very close to 1.
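A simulation sketch of mine (not from the original text) illustrating the 1/7 proportion:

    import random

    n, s, hits = 10_000, 0, 0
    for _ in range(n):
        s += random.randint(1, 6)
        hits += (s % 7 == 0)
    print(hits / n)   # close to 1/7 = 0.1428...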
24.
(i) Consider a Markov chain on the vertices of a triangle: the chain moves from one vertex to another with probability 1/2. Find the probability that, in n steps, the chain returns to the vertex it started from.
(ii) Suppose that we alter the probabilities as follows:

    p_{12} = p_{23} = p_{31} = 2/3, p_{21} = p_{32} = p_{13} = 1/3.

Answer the same question as above.
Solution. (i) The transition matrix is

    P = (1/2) \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}.

The characteristic polynomial is

    det(xI − P) = x^3 − (3/4)x − (1/4) = (x − 1)(x + 1/2)^2,

whose roots are

    x_1 := 1, x_2 := −1/2 (a double root).

Therefore,

    p^{(n)}_{11} = C_1 x_1^n + (C_2 + C_3 n) x_2^n,

where C_1, C_2, C_3 are constants. Since, clearly, p^{(0)}_{11} = 1, p^{(1)}_{11} = p_{11} = 0 and p^{(2)}_{11} = 1/2, we have

    C_1 + C_2 = 1
    C_1 − (C_2 + C_3)/2 = 0
    C_1 + (C_2 + 2C_3)/4 = 1/2.

Solving, we find C_1 = 1/3, C_2 = 2/3, C_3 = 0. So

    p^{(n)}_{11} = 1/3 + (2/3)(−1/2)^n.
(ii) We now have

    P = (1/3) \begin{pmatrix} 0 & 2 & 1 \\ 1 & 0 & 2 \\ 2 & 1 & 0 \end{pmatrix}.

The characteristic polynomial is

    det(xI − P) = x^3 − (2/3)x − (1/3) = (1/3)(3x^3 − 2x − 1) := (1/3) f(x).
Checking the candidate rational roots, we are lucky, because we see that 1 is a zero:

    f(1) = 3 − 2 − 1 = 0.

So we divide f(x) by x − 1. Since

    3x^2(x − 1) = 3x^3 − 3x^2,

we have

    f(x) − 3x^2(x − 1) = 3x^2 − 2x − 1.

Since

    3x(x − 1) = 3x^2 − 3x,

we have

    3x^2 − 2x − 1 − 3x(x − 1) = x − 1.

Therefore,

    f(x) = 3x^2(x − 1) + 3x^2 − 2x − 1
         = 3x^2(x − 1) + 3x(x − 1) + (x − 1)
         = (3x^2 + 3x + 1)(x − 1).
So the other roots of f(x) = 0 are the roots of 3x^2 + 3x + 1 = 0. The discriminant of this quadratic is

    3^2 − 4 · 3 · 1 = −3 < 0,

so the roots are complex:

    x_1 = −1/2 + i√3/6, x_2 = −1/2 − i√3/6.
Letting x_3 = 1 (the first root we found), we now have

    p^{(n)}_{11} = C_1 x_1^n + C_2 x_2^n + C_3.

We need to determine the constants C_1, C_2, C_3. But we have

    1 = p^{(0)}_{11} = C_1 + C_2 + C_3
    0 = p^{(1)}_{11} = C_1 x_1 + C_2 x_2 + C_3 x_3
    4/9 = p^{(2)}_{11} = C_1 x_1^2 + C_2 x_2^2 + C_3 x_3^2.

Solving for the constants, we find

    p^{(n)}_{11} = 1/3 + (2/3)(−1/√3)^n cos(nπ/6).
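Both closed forms can be checked against direct matrix powers; an illustrative Python sketch of mine (numpy assumed):

    import numpy as np

    P1 = np.array([[0, .5, .5], [.5, 0, .5], [.5, .5, 0]])
    P2 = np.array([[0, 2/3, 1/3], [1/3, 0, 2/3], [2/3, 1/3, 0]])

    for n in range(10):
        f1 = 1/3 + (2/3) * (-1/2)**n
        f2 = 1/3 + (2/3) * (-1/np.sqrt(3))**n * np.cos(n * np.pi / 6)
        assert np.isclose(np.linalg.matrix_power(P1, n)[0, 0], f1)
        assert np.isclose(np.linalg.matrix_power(P2, n)[0, 0], f2)
    print("both formulas agree with matrix powers")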
25.
A certain experiment is believed to be described by a two-state Markov chain with
the transition matrix P, where
    P = \begin{pmatrix} 0.5 & 0.5 \\ p & 1 − p \end{pmatrix},

and the parameter p is not known. When the experiment is performed many times, the chain ends in state one approximately 20 percent of the time and in state two approximately 80 percent of the time. Compute a sensible estimate for the unknown parameter p and explain how you found it.
Solution. If X_k is the position of the chain at time k, we are being told that when we perform the experiment (i.e. watch the chain), say, n times, we see that approximately 20% of the time the chain is in state 1:

    (1/n) Σ_{k=1}^{n} 1(X_k = 1) ≈ 0.2   (5)

We know, from the Strong Law (= Theorem) of Large Numbers, that

    P( lim_{n→∞} (1/n) Σ_{k=1}^{n} 1(X_k = 1) = π(1) ) = 1,   (6)
where π = (π(1), π(2)) is the stationary distribution. Combining the observation (5) with the Law of Large Numbers (6), we obtain

    π(1) ≈ 0.2.

We can easily compute π, because

    π(1) · (1/2) = π(2) · p,

and, of course,

    π(1) + π(2) = 1,

whence

    π(1) = 2p/(1 + 2p).

Solving 2p/(1 + 2p) = 0.2, we find p = 1/8.
26.
Here is a trick to try on your friends. Shuffle a deck of cards and deal out one card at a time. Count the face cards each as ten. Ask your friend to look at one of the first ten cards; if this card is a six, she is to look at the card that turns up six cards later; if this card is a three, she is to look at the card that turns up three cards later, and so forth. Eventually she will reach a point where she is to look at a card that turns up x cards later but there are not x cards left. You then tell her the last card that she looked at even though you did not know her starting point. You tell her you do this by watching her, and she cannot disguise the times that she looks at the cards. In fact you just do the same procedure and, even though you do not start at the same point as she does, you will most likely end at the same point. Why?
Solution. Let X_n denote the value of the n-th card of the experiment when you start from the x-th card from the top. Let Y_n denote the value of the n-th card of another experiment when you start from the y-th card from the top. You use exactly the same deck, with the cards in the same order, in both experiments. If, for some n and some m, we have

    X_n = Y_m,

then X_{n+1} = Y_{m+1}, X_{n+2} = Y_{m+2}, etc. The point is that the event

    {∃ m, n such that X_n = Y_m}

has a large probability. In fact, it has probability close to 1.
27.
You have N books on your shelf, labelled 1, 2, . . . , N. You pick a book j with probability 1/N. Then you place it on the left of all others on the shelf. You repeat the process, independently. Construct a Markov chain which takes values in the set of all N! permutations of the books.
(i) Discuss the state space of the Markov chain. Think how many elements it has and how its elements are represented.
(ii) Show that the chain is regular (irreducible and aperiodic) and find its stationary distribution.
Hint: You can guess the stationary distribution before computing it.
Solution. (i) The state space is

    S = {all functions σ : {1, 2, . . . , N} → {1, 2, . . . , N} which are one-to-one and onto}.

These σ are called permutations and there are N! of them:

    |S| = N!

Each σ can be represented by the list of its values:

    σ = (σ(1), σ(2), . . . , σ(N)),

i.e. σ(i) is its value at i.
(ii) Let us find the transition probabilities. If σ is the current state and we pick the j-th book and place it in front, then the next state is the same σ if j = 1, or

    (σ(j), σ(1), σ(2), . . . , σ(j − 1), σ(j + 1), . . .),

if j ≠ 1. There are N possible next states and each occurs with probability 1/N. If we denote the next state obtained when picking the j-th book by σ^{(j)}, then we have

    p_{σ,σ^{(j)}} = 1/N, j = 1, . . . , N.

(For example, σ^{(1)} = σ.) And, of course, p_{σ,τ} = 0 if τ is not of the form σ^{(j)} for some j. The chain is aperiodic because p_{σ,σ} = 1/N > 0 for all σ. It is irreducible because, clearly, it can move from any state (i.e. any arrangement of books) to any other. Hence it is regular.
It does not require a lot of thought to see that there is complete symmetry! Therefore all states must have the same probability under the stationary distribution, i.e.

    π(σ) = 1/N!, for all σ ∈ S.

You can easily verify that

    π(σ) = Σ_τ π(τ) p_{τ,σ}, for all σ ∈ S,

i.e. the balance equations are satisfied, and so our educated guess was correct.
28.
In unprofitable times corporations sometimes suspend dividend payments. Suppose that after a dividend has been paid the next one will be paid with probability 0.9, while after a dividend is suspended the next one will be suspended with probability 0.6. In the long run, what is the fraction of dividends that will be paid?
Solution. We here have a Markov chain with two states:
State 1: "dividend paid"
State 2: "dividend suspended"
We are given the following transition probabilities:

    p_{1,1} = 0.9, p_{2,2} = 0.6.

Hence

    p_{1,2} = 0.1, p_{2,1} = 0.4.

Let π be the stationary distribution. In the long run, the fraction of dividends that will be paid equals π(1). But

    π(1) × 0.1 = π(2) × 0.4

and

    π(1) + π(2) = 1,

whence

    π(1) = 4/5.

So, in the long run, 80% of the dividends will be paid.
29.
Five white balls and five black balls are distributed in two urns in such a way that each urn contains five balls. At each step we draw one ball from each urn and exchange them. Let X_n be the number of white balls in the left urn at time n.
(a) Compute the transition probability for X_n.
(b) Find the stationary distribution and show that it corresponds to picking five balls at random to be in the left urn.
Solution. Clearly, (X_0, X_1, X_2, . . .) is a Markov chain with state space

    S = {0, 1, 2, 3, 4, 5}.
(a) If, at some point of time, X_n = x (i.e. the number of white balls in the left urn is x), then there are 5 − x black balls in the left urn, while the right urn contains x black and 5 − x white balls. Clearly,

    p_{x,x+1} = P(X_{n+1} = x + 1 | X_n = x)
              = P(pick a white ball from the right urn and a black ball from the left urn)
              = ((5 − x)/5) × ((5 − x)/5),

as long as x < 5. On the other hand,

    p_{x,x−1} = P(X_{n+1} = x − 1 | X_n = x)
              = P(pick a white ball from the left urn and a black ball from the right urn)
              = (x/5) × (x/5),

as long as x > 0. When 0 < x < 5, we have

    p_{x,x} = 1 − p_{x,x+1} − p_{x,x−1},

because there is no chance that the number of balls changes by more than 1 ball. Summarising, the answer is:

    p_{x,y} = ((5 − x)/5)^2,               if 0 ≤ x ≤ 4, y = x + 1,
            = (x/5)^2,                     if 1 ≤ x ≤ 5, y = x − 1,
            = 1 − ((5 − x)/5)^2 − (x/5)^2, if 1 ≤ x ≤ 4, y = x,
            = 0,                           in all other cases.

If you want, you may draw the transition diagram; on it, I did not indicate the p_{x,x}.
(b) To compute the stationary distribution, cut the diagram between states x and x − 1 and equate the two flows, as usual:

    π(x) p_{x,x−1} = π(x − 1) p_{x−1,x},

i.e.

    π(x) (x/5)^2 = π(x − 1) ((5 − (x − 1))/5)^2,

which gives

    π(x) = ((6 − x)/x)^2 π(x − 1).
We thus have

    π(1) = (5/1)^2 π(0) = 25π(0)
    π(2) = (4/2)^2 π(1) = (4/2)^2 (5/1)^2 π(0) = 100π(0)
    π(3) = (3/3)^2 π(2) = (3/3)^2 (4/2)^2 (5/1)^2 π(0) = 100π(0)
    π(4) = (2/4)^2 π(3) = (2/4)^2 (3/3)^2 (4/2)^2 (5/1)^2 π(0) = 25π(0)
    π(5) = (1/5)^2 π(4) = (1/5)^2 (2/4)^2 (3/3)^2 (4/2)^2 (5/1)^2 π(0) = π(0).
We find π(0) by normalisation:

    π(0) + π(1) + π(2) + π(3) + π(4) + π(5) = 1,
    π(0) = 1/(1 + 25 + 100 + 100 + 25 + 1) = 1/252.

Putting everything together, we have

    π(0) = 1/252, π(1) = 25/252, π(2) = 100/252, π(3) = 100/252, π(4) = 25/252, π(5) = 1/252.

This is the answer for the stationary distribution.
We are also asked to interpret π(x) as follows:

    From a lot of 10 balls (= 5 black + 5 white), pick 5 at random and place them in the left urn (place the rest in the right urn), and consider the chance that amongst the 5 balls x are white.

We know how to answer this problem: it is a hypergeometric distribution:

    Chance that amongst the 5 balls x are white = \binom{5}{x} \binom{5}{5−x} / \binom{10}{5} = \binom{5}{x}^2 / 252, x = 0, . . . , 5.

This is PRECISELY the distribution obtained above. Hence π(x) IS A HYPERGEOMETRIC DISTRIBUTION.
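A Python sketch of mine (not from the original text) confirming that the hypergeometric distribution satisfies the balance equations:

    import numpy as np
    from math import comb

    P = np.zeros((6, 6))
    for x in range(6):
        if x < 5:
            P[x, x + 1] = ((5 - x) / 5)**2
        if x > 0:
            P[x, x - 1] = (x / 5)**2
        P[x, x] = 1 - P[x].sum()

    pi = np.array([comb(5, x)**2 for x in range(6)]) / comb(10, 5)
    assert np.allclose(pi @ P, pi)
    print(pi * 252)   # [1. 25. 100. 100. 25. 1.]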
30.
An auto insurance company classifies its customers in three categories: poor, satisfactory and preferred. No one moves from poor to preferred or from preferred to poor in one year. 40% of the customers in the poor category become satisfactory; 30% of those in the satisfactory category move to preferred, while 10% become poor; 20% of those in the preferred category are downgraded to satisfactory.
(a) Write the transition matrix for the model.
(b) What is the limiting fraction of drivers in each of these categories? (Clearly state which theorem you are applying in order to compute this.)
Solution. (a) The transition probabilities for this Markov chain with three states are as follows:

                    POOR  SATISFACTORY  PREFERRED
    POOR             0.6      0.4          0
    SATISFACTORY     0.1      0.6          0.3
    PREFERRED        0        0.2          0.8

so that the transition probability matrix is

    P = \begin{pmatrix} 0.6 & 0.4 & 0 \\ 0.1 & 0.6 & 0.3 \\ 0 & 0.2 & 0.8 \end{pmatrix}.
(b) We will find the limiting fraction of drivers in each of these categories from the components of the stationary distribution vector π, which satisfies the following equation:

    π = πP.

The former is equivalent to the following system of linear equations:

    π(1) = 0.6π(1) + 0.1π(2)
    π(2) = 0.4π(1) + 0.6π(2) + 0.2π(3)
    π(3) = 0.3π(2) + 0.8π(3)
    1 = π(1) + π(2) + π(3).   (7)

This has the following solution: π = (1/11, 4/11, 6/11).
Thus, the limiting fraction of drivers in the POOR category is 1/11, in the SATISFACTORY category 4/11, and in the PREFERRED category 6/11. By the way, the proportions of the drivers in each category after 15 years approximate these numbers to two significant digits (you can check this by calculating P^15 and looking at its rows).
31.
The President of the United States tells person A his or her intention to run or not to run in the next election. Then A relays the news to B, who in turn relays the message to C, and so forth, always to some new person. We assume that there is a probability a that a person will change the answer from yes to no when transmitting it to the next person and a probability b that he or she will change it from no to yes. We choose as states the message, either yes or no. The transition probabilities are

    p_{yes,no} = a, p_{no,yes} = b.

The initial state represents the President's choice. Suppose a = 0.5, b = 0.75.
(a) Assume that the President says that he or she will run. Find the expected length of time before the first time the answer is passed on incorrectly.
(b) Find the mean recurrence time for each state. In other words, find the expected amount of time r_i, for i = yes and i = no, required to return to that state.
(c) Write down the transition probability matrix P and find lim_{n→∞} P^n.
(d) Repeat (b) for general a and b.
(e) Repeat (c) for general a and b.
Solution. (a) The expected length of time before the first answer is passed on incorrectly (i.e. before the message becomes that the President will not run in the next election) equals the mean of a geometrically distributed random variable with parameter p_{yes,no} = a = 0.5. Thus, the expected length of time before the first answer is passed on incorrectly is 1/0.5 = 2.
What we found can be viewed as the mean first passage time from the state yes to the state no. By making the corresponding ergodic Markov chain with transition matrix

    P = \begin{pmatrix} 0.5 & 0.5 \\ 0.75 & 0.25 \end{pmatrix}   (8)

absorbing (with absorbing state no), check that the mean time until absorption is 2. This is nothing but the mean first passage time from yes to no in the original Markov chain.
(b) We use the following result to find the mean recurrence time for each state:

    for an ergodic Markov chain, the mean recurrence time for state i is r_i = E_i T_i = 1/π(i), where π(i) is the i-th component of the stationary distribution for the transition probability matrix.

The transition probability matrix (8) has the following stationary distribution:

    π = (.6, .4),

from which we find that the mean recurrence time for the state yes is 5/3 and for the state no is 5/2.
(c) The transition probability matrix is specified in (8); it has no zero entries and the corresponding chain is irreducible and aperiodic. For such a chain

    lim_{n→∞} P^n = \begin{pmatrix} π(1) & π(2) \\ π(1) & π(2) \end{pmatrix}.

Thus,

    lim_{n→∞} P^n = \begin{pmatrix} 0.6 & 0.4 \\ 0.6 & 0.4 \end{pmatrix}.
(d) We apply the same arguments as in (b) and find that the transition probability matrix

    P = \begin{pmatrix} 1 − a & a \\ b & 1 − b \end{pmatrix}

has the following fixed probability vector:

    π = (b/(a + b), a/(a + b)),

so that the mean recurrence time for the state yes is 1 + a/b and for the state no is 1 + b/a.
(e) Suppose a ≠ 0 and b ≠ 0, to avoid absorbing states (and also suppose that a and b are not both equal to 1, which would make the chain periodic). Then the corresponding Markov chain is regular. Thus,

    lim_{n→∞} P^n = \begin{pmatrix} b/(a + b) & a/(a + b) \\ b/(a + b) & a/(a + b) \end{pmatrix}.
32.
A fair die is rolled repeatedly and independently. Show, by the results of Markov chain theory, that the mean time between occurrences of a given number is 6.
Solution. We construct a Markov chain with the states 1, 2, . . . , 6 and transition probabilities p_{ij} = 1/6 for each i, j = 1, 2, . . . , 6. Such a Markov chain has a transition probability matrix with all its entries equal to 1/6. The chain is irreducible and aperiodic and its stationary distribution is nothing but

    π = (1/6, 1/6, 1/6, 1/6, 1/6, 1/6).

Since the mean recurrence time of state i equals 1/π(i), this means that the mean time between occurrences of a given number is 6.
33.
Give an example of a three-state irreducible-aperiodic Markov chain that is not reversible.
Solution.
We will see how to choose transition probabilities in such a way that the chain is not reversible.
If our three-state chain were a reversible chain, the detailed balance equations would hold, i.e.

    π(1)p_{12} = π(2)p_{21}
    π(1)p_{13} = π(3)p_{31}
    π(2)p_{23} = π(3)p_{32}.

From this it is easy to see that if the detailed balance equations hold, then necessarily

    p_{13} p_{32} p_{21} = p_{12} p_{23} p_{31}.

So, choose the transition probabilities in such a way that this does not hold. For instance,

    p_{13} = 0.7, p_{32} = 0.2, p_{21} = 0.3, p_{12} = 0.2, p_{23} = 0.2, p_{31} = 0.1.

These specify an ergodic Markov chain which is not reversible.
Another solution is: Consider the Markov chain with three states {1, 2, 3} and deterministic transitions 1 → 2 → 3 → 1. Clearly, the Markov chain in reverse time moves like 1 → 3 → 2 → 1, and so its law is not the same. (We can tell the arrow of time by running the film backwards.) Strictly speaking, this deterministic chain is periodic; to make it aperiodic, let it stay put with some small probability ε and move as above with probability 1 − ε. The time-reversal argument is unchanged.
34.
Let P be the transition matrix of an irreducible-aperiodic Markov chain. Let π be its stationary distribution. Suppose the Markov chain starts with P(X_0 = i) = π(i), for all i ∈ S.
(a) [Review question] Show that P(X_n = i) = π(i) for all i ∈ S and all n.
(b) Fix N ≥ 1 and consider the process X*_0 = X_N, X*_1 = X_{N−1}, . . . Show that it is Markov.
(c) Let P* be the transition probability matrix of X* (it is called the reverse transition matrix). Find its entries p*_{i,j}.
(d) Show that P and P* have the same stationary distribution π.
Solution. (a) By definition, π(i) satisfies

    π(i) = Σ_j π(j) p_{j,i}, i ∈ S.
If P(X_0 = i) = π(i), then

    P(X_1 = i) = Σ_j P(X_0 = j, X_1 = i)
               = Σ_j P(X_0 = j) P(X_1 = i | X_0 = j)
               = Σ_j π(j) p_{j,i} = π(i).

Hence P(X_1 = i) = π(i). Repeating the argument, we find P(X_2 = i) = π(i), and so on: P(X_n = i) = π(i) for all n.
(b) Fix n and consider the future of X* after n: this is X*_{n+1}, X*_{n+2}, . . .. Consider also the past of X* before n: this is X*_{n−1}, X*_{n−2}, . . .. But

    (X*_{n+1}, X*_{n+2}, . . .) = (X_{N−n−1}, X_{N−n−2}, . . .)

is the past of X before time N − n. And

    (X*_{n−1}, X*_{n−2}, . . .) = (X_{N−n+1}, X_{N−n+2}, . . .)

is the future of X after time N − n. Since X is Markov, these are independent, conditional on X_{N−n}. But X_{N−n} = X*_n. Hence, given X*_n, the future of X* after n is independent of the past of X* before n, and this is true for all n, and so X* is also Markov.
(c) Here we assume that P(X_0 = i) = π(i). Hence, by (a), P(X_n = i) = π(i) for all n. We have

    p*_{i,j} := P(X*_{n+1} = j | X*_n = i) = P(X_{N−n−1} = j | X_{N−n} = i)
             = P(X_{N−n} = i | X_{N−n−1} = j) P(X_{N−n−1} = j) / P(X_{N−n} = i)
             = p_{j,i} π(j) / π(i).
(d) We need to check that, for all i ∈ S,

    π(i) = Σ_k π(k) p*_{k,i}.   (9)

This is a matter of algebra: Σ_k π(k) p*_{k,i} = Σ_k π(k) · p_{i,k} π(i)/π(k) = π(i) Σ_k p_{i,k} = π(i).
35.
Consider a random walk on the following graph consisting of two nested dodecagons:

    [figure: two nested dodecagons whose corresponding vertices are joined by an edge, so that every vertex has exactly three neighbours]

(a) Explain why it is reversible (this is true for any random walk on a graph).
(b) Find the stationary distribution.
(c) Show that the mean recurrence time (mean time to return) to any state is the same for all states, and compute this time.
(d) Let X_n be the position of the chain at time n (it takes values in a set of 24 elements). Let Z_n = 1 if X_n is in the inner dodecagon and Z_n = 2 if X_n is in the outer dodecagon. Is (Z_n) Markov?
Solution. (a) Our chain has 24 states; label the outer vertices 1, . . . , 12 and the inner ones 13, . . . , 24. From each of the states we jump to any of the three neighbouring states with equal probability 1/3 (each undirected edge of the graph combines two directed edges-arrows). The chain is reversible: by symmetry, the stationary distribution assigns the same probability to each state (see (b)), and then the detailed balance equations π(x)p_{x,y} = π(y)p_{y,x} hold, since for neighbouring x, y both sides equal π(x)/3. The same argument works for any random walk on a graph, with π(x) proportional to the number of neighbours of x.
(b) The stationary distribution exists, and because of the symmetry the stationary vector has all components equal; since the number of the components is 24, the stationary vector is

    π = (1/24, 1/24, . . . , 1/24) ∈ R^{24}.

(c) The mean recurrence time for state i is 1/π(i) = 24, i = 1, 2, . . . , 24.
(d) Observe first that

    P(Z_{n+1} = 2 | X_n = i) = 1/3, as long as i = 13, . . . , 24,

and

    P(Z_{n+1} = 1 | X_n = i) = 1/3, as long as i = 1, . . . , 12.

We now verify that (Z_n) is Markov. (We shall argue directly. Alternatively, see the section on "functions of Markov chains" in my lecture notes.) By the definition of conditional probability,

    P(Z_{n+1} = 2 | Z_n = 1, Z_{n−1} = w, . . .)
    = Σ_{i=13}^{24} P(Z_{n+1} = 2 | X_n = i, Z_n = 1, Z_{n−1} = w, . . .) P(X_n = i | Z_n = 1, Z_{n−1} = w, . . .).
Due to the fact that (X_n) is Markov, when we know that X_n = i, the future after n is independent of the past before n. But Z_{n+1} belongs to the future after n, while Z_{n−1} = w, . . . belongs to the past before n. Hence, for i = 13, . . . , 24,

    P(Z_{n+1} = 2 | X_n = i, Z_n = 1, Z_{n−1} = w, . . .) = P(Z_{n+1} = 2 | X_n = i) = 1/3.

Hence

    P(Z_{n+1} = 2 | Z_n = 1, Z_{n−1} = w, . . .) = Σ_{i=13}^{24} (1/3) P(X_n = i | Z_n = 1, Z_{n−1} = w, . . .) = 1/3,

because, obviously,

    Σ_{i=13}^{24} P(X_n = i | Z_n = 1, Z_{n−1} = w, . . .) = 1.
(If Z_n = 1 then X_n is in the inside dodecagon.) Thus,

    P(Z_{n+1} = 2 | Z_n = 1, Z_{n−1} = w, . . .) = P(Z_{n+1} = 2 | Z_n = 1).

Similarly, we can show

    P(Z_{n+1} = 1 | Z_n = 2, Z_{n−1} = w, . . .) = P(Z_{n+1} = 1 | Z_n = 2).

Hence, no matter what the value of Z_n is, the future of Z after n is independent of the past of Z before n. Hence Z is Markov as well.
36.
Consider a Markov chain on the set {1, 2, 3} with transition probabilities

    p_{12} = p_{23} = p_{31} = p, p_{13} = p_{32} = p_{21} = q = 1 − p,

where 0 < p < 1. Determine whether the Markov chain is reversible.
Solution. If p = 1/2 then the chain is a random walk on a graph; so it is reversible. If p ≠ 1/2 then Kolmogorov's loop criterion requires that

    p_{12} p_{23} p_{31} = p_{13} p_{32} p_{21}.

But this is equivalent to

    p^3 = q^3,

which is not true (unless p = 1/2). Hence the chain is not reversible if p ≠ 1/2.
37.
Consider a Markov chain whose transition diagram is as below:
(i) Which (if any) states are inessential?
(ii) Which (if any) states are absorbing?
(iii) Find the communication classes.
(iv) Is the chain irreducible?
(v) Find the period of each essential state. Verify that essential states that belong to the same communication class have the same period.
(vi) Are there any aperiodic communication classes?
(vii) Will your answers to questions (i)–(vi) change if we replace the positive transition probabilities by other positive probabilities, and why?
Solution. (i) The inessential states are 1, 2, 3, 5, 6, because each of them leads to a state from which it is not possible to return.
(ii) 4 is the only absorbing state.
(iii) As usual, let $[i]$ denote the class of state $i$, i.e. $[i] = \{j \in S : j \leftrightarrow i\}$. We have:
$[1] = \{1\}$, $[2] = \{2\}$, $[3] = \{3\}$, $[4] = \{4\}$, $[5] = [6] = \{5, 6\}$, $[7] = [8] = \{7, 8\}$, $[9] = [10] = [11] = \{9, 10, 11\}$.
Therefore there are 7 communication classes:
$$\{1\}, \ \{2\}, \ \{3\}, \ \{4\}, \ \{5, 6\}, \ \{7, 8\}, \ \{9, 10, 11\}.$$
(iv) No, because there is more than one communication class.
(v) Recall that, for each essential state $i$, its period $d(i)$ is the gcd of all $n$ such that $p^{(n)}_{i,i} > 0$. So:
$$d(4) = \gcd\{1, 2, 3, \ldots\} = 1, \qquad d(7) = d(8) = \gcd\{1, 2, 3, \ldots\} = 1, \qquad d(9) = d(10) = d(11) = \gcd\{3, 6, 9, \ldots\} = 3.$$
Observe that $d(7) = d(8) = 1$ and $d(9) = d(10) = d(11) = 3$: essential states in the same communication class have the same period.
(vi) Yes: $\{4\}$ and $\{7, 8\}$ are aperiodic communication classes (each has period 1).
(vii) No, the answers will not change. These questions depend only on whether, for each $i, j$, $p_{i,j}$ is positive or zero.
38.
Consider a Markov chain, with state space $S$ the set of all positive integers, whose transition diagram is as follows:
[Figure: states $1, 2, 3, 4, \ldots$; state 1 leads to 3 with probability 1; state 2 moves with probabilities $1/2$; from state 3 onwards the chain steps right with probability $2/3$ and left with probability $1/3$.]
(i) Which states are essential and which inessential?
(ii) Which states are transient and which recurrent?
(iii) Discuss the asymptotic behaviour of the chain, i.e. find the limit, as $n \to \infty$, of $P_i(X_n = j)$ for each $i$ and $j$.
Solution. (i) The states $3, 4, 5, \ldots$ communicate with one another. So they are all essential. However, state 1 leads to 3 but 3 does not lead to 1. Hence 1 is inessential. Likewise, 2 is inessential.
(ii) Every inessential state is transient. Hence both 1 and 2 are transient. On the other hand, the Markov chain will eventually take values only in the set $\{3, 4, 5, \ldots\}$. We observe that the chain on this set is the same type of chain we discussed in the gambler's ruin problem, with $p = 2/3$, $q = 1/3$. Since $p > q$ the chain is transient. Therefore all states of the given chain are transient.
(iii) Since the states are transient, we have that $X_n \to \infty$ as $n \to \infty$, with probability 1. Therefore,
$$P_i(X_n = j) \to 0, \quad \text{as } n \to \infty,$$
for all $i$ and $j$.
39.
Consider the following Markov chain, which is motivated by the "umbrellas problem" (see an earlier exercise, though it's not necessary). Here, $p + q = 1$, $0 < p < 1$.
[Figure: chain on states $0, 1, 2, 3, 4, \ldots$; from 0 the chain moves to 1 with probability 1; thereafter the up/down probabilities alternate between $p$ and $q$, as spelled out in part (iv) of the solution.]
(i) Is the chain irreducible?
(ii) Does it have a stationary distribution?
Hint: Write the balance equations, together with the normalisation condition, and draw your conclusions.
(iii) Find the period $d(i)$ of each state $i$.
(iv) Decide which states are transient and which recurrent.
Hint: Let $\tau_j$ be the first hitting time of state $j$. Let $N \geq 1$. As in the gambler's ruin problem, let $\varphi(i) := P_i(\tau_N < \tau_0)$. What is $\varphi(0)$? What is $\varphi(N)$? For $1 < i < N$, how does $\varphi(i)$ relate to $\varphi(i-1)$ and $\varphi(i+1)$? Solve the equations you thus obtain to find $\varphi(i)$. Let $N \to \infty$. What do you conclude?
Solution. (i) Yes, because all states communicate with one another. (There is just one communication class.)
(ii) Let us write the balance equations in the form of equating flows (see handout). Cutting between consecutive states, we have
$$\pi(0) \cdot 1 = \pi(1)\, q, \qquad \pi(1)\, p = \pi(2)\, p, \qquad \pi(2)\, q = \pi(3)\, q, \qquad \cdots$$
Let $\pi(1) = c$. Then $\pi(0) = cq$ and
$$\pi(1) = \pi(2) = \pi(3) = \cdots = c.$$
The normalisation condition is $\sum_{i=0}^\infty \pi(i) = 1$. This implies that $c = 0$. Hence $\pi(i) = 0$ for all $i$. This is NOT a probability distribution. Hence there is no stationary distribution.
(iii) We only have to find the period of one state, since all states communicate with one another. Pick state 0. We have $d(0) = \gcd\{2, 4, 6, \ldots\} = 2$. Hence $d(i) = 2$ for all $i$.
(iv) Let $\varphi(i) := P_i(\tau_N < \tau_0)$. We have
$$\varphi(0) = 0, \qquad \varphi(N) = 1.$$
Indeed, if $X_0 = 0$ then $\tau_0 = 0$ and so $\varphi(0) = P_0(\tau_N < 0) = 0$. On the other hand, if $X_0 = N$ then $\tau_N = 0$ and $\tau_0 \geq 1$, so $\varphi(N) = P_N(0 < \tau_0) = 1$.
Now, from first-step analysis, for each $i \in [1, N-1]$, we have
$$\varphi(i) = p_{i,i+1}\,\varphi(i+1) + p_{i,i-1}\,\varphi(i-1).$$
But $p_{i,i+1} = p$, $p_{i,i-1} = q$ if $i$ is odd, and $p_{i,i+1} = q$, $p_{i,i-1} = p$ if $i$ is even and positive. So
$$p[\varphi(i+1) - \varphi(i)] = q[\varphi(i) - \varphi(i-1)], \quad i \text{ odd},$$
$$q[\varphi(i+1) - \varphi(i)] = p[\varphi(i) - \varphi(i-1)], \quad i \text{ even}.$$
Hence
$$\varphi(2) - \varphi(1) = \frac{q}{p}[\varphi(1) - \varphi(0)] = \frac{q}{p}\varphi(1),$$
$$\varphi(3) - \varphi(2) = \frac{p}{q}[\varphi(2) - \varphi(1)] = \varphi(1),$$
$$\varphi(4) - \varphi(3) = \frac{q}{p}[\varphi(3) - \varphi(2)] = \frac{q}{p}\varphi(1),$$
$$\varphi(5) - \varphi(4) = \frac{p}{q}[\varphi(4) - \varphi(3)] = \varphi(1),$$
and, in general,
$$\varphi(i) - \varphi(i-1) = \frac{q}{p}\varphi(1) \ (i \text{ even}), \qquad \varphi(i) - \varphi(i-1) = \varphi(1) \ (i \text{ odd}).$$
Next, use the "fundamental theorem of (discrete) calculus":
$$\varphi(i) = [\varphi(i) - \varphi(i-1)] + [\varphi(i-1) - \varphi(i-2)] + \cdots + [\varphi(2) - \varphi(1)] + \varphi(1).$$
If $i$ is even then, amongst $1, 2, \ldots, i$, there are $i/2$ even numbers and $i/2$ odd numbers, so
$$\varphi(i) = \left(\frac{q}{p}\cdot\frac{i}{2} + \frac{i}{2}\right)\varphi(1), \quad i \text{ even}.$$
Suppose $N$ is even. Use $\varphi(N) = 1$ to get that, if both $i$ and $N$ are even,
$$\varphi(i) = \frac{\frac{q}{p}\cdot\frac{i}{2} + \frac{i}{2}}{\frac{q}{p}\cdot\frac{N}{2} + \frac{N}{2}} = P_i(\tau_N < \tau_0).$$
Taking the limit as $N \to \infty$, we find
$$P_i(\tau_0 = \infty) = 0, \quad i \text{ even}.$$
This implies that $P_i(\tau_0 < \infty) = 1$. The same conclusion holds for $i$ odd. (After all, all states communicate with one another.) Therefore all states are recurrent.
40.
Suppose that $X_1, X_2, \ldots$ are i.i.d. random variables with values, say, in $\mathbb{Z}$ and common distribution $p(i) := P(X_1 = i)$, $i \in \mathbb{Z}$.
(i) Explain why the sequence has the Markov property.
(ii) Let $A$ be a subset of the integers such that $\sum_{i \in A} p(i) > 0$. Consider the first hitting time $\tau_A$ of $A$ and the random variable $Z := X_{\tau_A}$. Show that the distribution of $Z$ is the conditional distribution of $X_1$ given that $X_1 \in A$.
Hint: Clearly, $\{Z = i\} = \bigcup_{n=1}^\infty \{Z = i, \tau_A = n\}$, and the events in this union are disjoint; therefore the probability of the union is the sum of the probabilities of the events comprising it.
Solution. (i) As explained in the beginning of the lectures.
(ii) Since $\tau_A$ is the FIRST time that $A$ is hit, it means that
$$\tau_A = n \iff X_1 \notin A, \ X_2 \notin A, \ \ldots, \ X_{n-1} \notin A, \ X_n \in A.$$
Therefore, with $Z = X_{\tau_A}$ and $i \in A$,
$$P(Z = i) = \sum_{n=1}^\infty P(X_{\tau_A} = i, \tau_A = n) = \sum_{n=1}^\infty P(X_n = i, X_1 \notin A, \ldots, X_{n-1} \notin A, X_n \in A)$$
$$= \sum_{n=1}^\infty P(X_n = i, X_1 \notin A, \ldots, X_{n-1} \notin A) = \sum_{n=1}^\infty p(i)\, P(X_1 \notin A)^{n-1} \quad \text{[geometric series]}$$
$$= p(i)\, \frac{1}{1 - P(X_1 \notin A)} = \frac{p(i)}{P(X_1 \in A)}.$$
If $i \notin A$ then, obviously, $P(Z = i) = 0$. So it is clear that $P(Z = i) = P(X_1 = i \mid X_1 \in A)$, for all $i$, from the definition of conditional probability.
41.
Consider a random walk on the following infinite graph:
[Figure: an infinite 3-regular tree; the graph continues ad infinitum in the same manner.]
Here, each state has exactly 3 neighbouring states (i.e. its degree is 3) and so the probability of moving to any one of them is $1/3$.
(i) Let 0 be the "central" state. (Actually, a closer look shows that no state deserves to be central, for they are all equivalent. So we just arbitrarily pick one and call it central.) Having done that, let $D(i)$ be the distance of a state $i$ from 0, i.e. the number of "hops" required to reach 0 starting from $i$. So $D(0) = 0$, each neighbour $i$ of 0 has $D(i) = 1$, etc. Let $X_n$ be the position of the chain at time $n$. Observe that the process $Z_n = D(X_n)$ has the Markov property. (See lecture notes for a criterion!) The question is: find its transition probabilities.
(ii) Using the results from the gambler's ruin problem, show that $(Z_n)$ is transient.
(iii) Use (ii) to explain why $(X_n)$ is also transient.
Solution. (i) First draw a figure:
[Figure: the tree drawn so that states with the same distance from 0 belong to the same circle.]
Next observe that if $Z_n = k \geq 1$ (i.e. if the distance from 0 is $k$) then, no matter where $X_n$ is actually located, the distance $Z_{n+1}$ of the next state $X_{n+1}$ from 0 will either be $k + 1$ with probability $2/3$ or $k - 1$ with probability $1/3$. And, of course, if $Z_n = 0$ then $Z_{n+1} = 1$. So
$$P(Z_{n+1} = k+1 \mid Z_n = k) = 2/3, \quad k \geq 1,$$
$$P(Z_{n+1} = k-1 \mid Z_n = k) = 1/3, \quad k \geq 1,$$
$$P(Z_{n+1} = 1 \mid Z_n = 0) = 1.$$
(ii) Since $2/3 > 1/3$, the chain $(Z_n)$ is transient.
(iii) We have that $Z_n \to \infty$ as $n \to \infty$, with probability 1. This means that for any $k$ there is a time $n_0$ such that for all $n \geq n_0$ we have $D(X_n) \geq k$, and this happens with probability 1. So, with probability 1, the chain $(X_n)$ visits states with distance from 0 less than $k$ only finitely many times. This means that the chain $(X_n)$ is transient.
42.
A company requires $N$ employees to function properly. If an employee becomes sick then he or she is replaced by a new one. It takes 1 week for a new employee to be recruited and to start working. Time here is measured in weeks.
(i) If at the beginning of week $n$ there are $X_n$ employees working and $Y_n$ of them get sick during week $n$, show that at the beginning of week $n + 1$ there will be
$$X_{n+1} = N - Y_n$$
employees working.
(ii) Suppose that each employee becomes sick independently with probability $p$. Show that
$$P(Y_n = y \mid X_n = x) = \binom{x}{y} p^y (1-p)^{x-y}, \quad y = 0, 1, \ldots, x.$$
(iii) Show that $(X_n)$ is a Markov chain with state space $S = \{0, 1, \ldots, N\}$ and derive its transition probabilities.
(iv) Write the balance equations for the stationary distribution $\pi$ of the chain.
(v) What is the mean number of employees working in steady state?
Do this without using (iv), by assuming that $X$ is in steady state [i.e. that $X_0$ (and therefore each $X_n$) has distribution $\pi$] and by taking expectations in the equation you derived in (i).
Solution. (i) This is elementary: since every time an employee gets sick he or she is replaced by a new one, but it takes 1 week for the new employee to start working, those who got sick during week $n-1$ are replaced by new ones who start working sometime during week $n$; so, by the end of week $n$, the number of employees would be brought up to $N$, provided nobody got sick during week $n$. We then subtract the $Y_n$ employees who got sick during week $n$ to obtain the desired equation.
(ii) Again, this is easy: if $X_n = x$, at most $x$ employees can get sick. Each one gets sick with probability $p$, independently of the others, so the total number $Y_n$ of sick employees has the Binomial$(x, p)$ distribution.
(iii) We have that $Y_n$ depends only on $X_n$ and not on $X_{n-1}, X_{n-2}, \ldots$, and therefore $P(X_{n+1} = j \mid X_n = i, X_{n-1} = i_1, X_{n-2} = i_2, \ldots) = P(X_{n+1} = j \mid X_n = i)$. Hence $X$ is Markov. We are asked to derive $p_{i,j} = P(X_{n+1} = j \mid X_n = i)$ for all $i, j \in S$. If $X_n = i$ then $Y_n \leq i$ and so $X_{n+1} \geq N - i$, so the only possible values $j$ for which $p_{i,j} > 0$ are $j = N - i, \ldots, N$. In fact, $P(X_{n+1} = j \mid X_n = i) = P(Y_n = N - j \mid X_n = i)$ and so, using the formula of (ii),
$$p_{i,j} = \begin{cases} \binom{i}{N-j}\, p^{N-j} (1-p)^{i-N+j}, & j = N-i, \ldots, N \\ 0, & \text{otherwise} \end{cases} \qquad i = 0, 1, \ldots, N.$$
(iv) The balance equations are:
$$\pi(j) = \sum_{i=0}^N \pi(i)\, p_{i,j} = \sum_{i=N-j}^N \pi(i) \binom{i}{N-j} p^{N-j} (1-p)^{i-N+j}.$$
(v) If $X_0$ has distribution $\pi$ then $X_n$ has distribution $\pi$ for all $n$. So $EX_n \equiv \mu$ does not depend on $n$. Now, if $X_n = x$, $Y_n$ is Binomial$(x, p)$ and therefore $E(Y_n \mid X_n = x) = px$. So
$$EY_n = \sum_{x=0}^N px\, P(X_n = x) = pEX_n = p\mu.$$
Since $EX_{n+1} = N - EY_n$ we have
$$\mu = N - p\mu,$$
whence
$$\mu = \frac{N}{1 + p}.$$
This is the mean number of employees in steady state. So, for example, if $p = 10\%$, then $\mu \approx 0.91 N$.
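A quick simulation sketch (my addition, not in the original text) corroborates the steady-state mean $N/(1+p)$; the values $N = 100$, $p = 0.1$ below are arbitrary choices.

```python
import random

# Illustrative check of Exercise 42(v): simulate X_{n+1} = N - Binomial(X_n, p)
# and compare the long-run average with N / (1 + p).  N and p are arbitrary.
random.seed(1)
N, p = 100, 0.10
x, total, n_samples = N, 0, 50_000
for _ in range(n_samples):
    sick = sum(random.random() < p for _ in range(x))  # Y_n ~ Binomial(x, p)
    x = N - sick                                       # X_{n+1} = N - Y_n
    total += x
print(total / n_samples, "vs", N / (1 + p))            # both should be ~ 90.9
```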
43.
(i) Let $X$ be the number of heads in $n$ i.i.d. coin tosses where the probability of heads is $p$. Find the generating function $\varphi(z) := Ez^X$ of $X$.
(ii) Let $Y$ be a random variable with $P(Y = k) = (1-p)^{k-1} p$, $k = 1, 2, \ldots$. Find the generating function of $Y$.
Solution. (i) The random variable $X$, defined as the number of heads in $n$ i.i.d. coin tosses where the probability of heads is $p$, is binomially distributed:
$$P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}.$$
Thus,
$$\varphi(z) := Ez^X = \sum_{k=0}^n P(X = k) z^k = \sum_{k=0}^n \binom{n}{k} (1-p)^{n-k} (pz)^k = ((1-p) + zp)^n = (q + zp)^n, \quad \text{where } q = 1 - p.$$
(ii) The random variable $Y$, defined by
$$P(Y = k) = (1-p)^{k-1} p, \quad k = 1, 2, \ldots,$$
has the following generating function:
$$\varphi(z) := Ez^Y = \sum_{k=1}^\infty P(Y = k) z^k = \sum_{k=1}^\infty (1-p)^{k-1} p z^k = \frac{p}{1-p} \sum_{k=1}^\infty [(1-p)z]^k = \frac{p}{1-p}\left(\frac{1}{1 - z(1-p)} - 1\right) = \frac{pz}{1 - zq}, \quad \text{where } q = 1 - p.$$
44.
A random variable $X$ with values in $\{0, 1, 2, \ldots\} \cup \{\infty\}$ has generating function $\varphi(z) = Ez^X$.
(i) Express $P(X = 0)$ in terms of $\varphi$.
(ii) Express $P(X = \infty)$ in terms of $\varphi$.
(iii) Express $EX$ and $\operatorname{var} X$ in terms of $\varphi$.
Solution. (i) $\varphi(0) = \sum_{k=0}^\infty P(X = k) z^k \big|_{z=0} = P(X = 0)$; thus $P(X = 0) = \varphi(0)$.
(ii) The following must hold: $\sum_{k=0}^\infty P(X = k) + P(X = \infty) = 1$. This may be rewritten as $\varphi(1) + P(X = \infty) = 1$, from which we get
$$P(X = \infty) = 1 - \varphi(1).$$
(iii) By the definition of the expected value of a discrete random variable,
$$EX = \sum_{k=0}^\infty k\, P(X = k).$$
Now note that
$$\varphi'(z) = \sum_{k=0}^\infty k\, P(X = k)\, z^{k-1},$$
so that $\varphi'(1)$ gives nothing but $EX$. We conclude that
$$EX = \varphi'(1).$$
Let $p_k := P(X = k)$. Now we take the second derivative of $\varphi(z)$:
$$\varphi''(z) = \sum_{k=2}^\infty k(k-1)\, p_k\, z^{k-2},$$
so that
$$\varphi''(1) = \sum_{k=2}^\infty (k^2 p_k - k p_k) = \sum_{k=0}^\infty k^2 p_k - \sum_{k=0}^\infty k p_k = EX^2 - EX = EX^2 - \varphi'(1),$$
from which we get that $EX^2 = \varphi'(1) + \varphi''(1)$. But this is enough for $\operatorname{var} X$, since
$$\operatorname{var} X = EX^2 - (EX)^2 = \varphi'(1) + \varphi''(1) - \varphi'(1)^2.$$
45.
A random variable $X$ with values in $\{1, 2, \ldots\} \cup \{\infty\}$ has generating function
$$\varphi(z) = \frac{1 - \sqrt{1 - 4pqz^2}}{2qz},$$
where $p, q \geq 0$ and $p + q = 1$.
(i) Compute $P(X = \infty)$. (Consider all possible values of $p$.)
(ii) For those values of $p$ for which $P(X = \infty) = 0$, compute $EX$.
Solution. (i) As was found above, $P(X = \infty) = 1 - \varphi(1)$, and in particular
$$P(X = \infty) = 1 - \varphi(1) = 1 - \frac{1 - \sqrt{1 - 4pq}}{2q} = 1 - \frac{1 - |p - q|}{2q} = \begin{cases} 1 - \dfrac{p}{q}, & p < q \\[4pt] 0, & p \geq q \end{cases}$$
(ii) It follows that $P(X = \infty) = 0$ for $p \geq \tfrac{1}{2}$. The expected value of $X$ is given by
$$EX = \varphi'(1) = \begin{cases} \dfrac{1}{p - q}, & p > \tfrac{1}{2} \\[4pt] \infty, & p = \tfrac{1}{2} \end{cases}$$
and we are done.
46.
You can go up a staircase by climbing 1 or 2 steps at a time. There are $n$ steps in total. In how many ways can you climb all the steps?
Hint 1: If $n = 3$, you can reach the 3rd step by climbing 1 at a time, or 2 first and 1 next, or 1 first and 2 next, i.e. there are 3 ways.
Hint 2: If $w_m$ is the number of ways to climb $m$ steps, how is $w_m$ related to $w_{m-1}$ and $w_{m-2}$?
Hint 3: Consider the generating function $\sum_m z^m w_m$.
Solution. Just before being at step $m$ you are either at step $m - 1$ or at step $m - 2$. Hence
$$w_m = w_{m-1} + w_{m-2}, \quad m \geq 2. \qquad (10)$$
Here, step 0 means being at the bottom of the stairs. So
$$w_0 = 1, \quad w_1 = 1.$$
So
$$w_2 = w_1 + w_0 = 2, \quad w_3 = w_2 + w_1 = 3, \quad w_4 = w_3 + w_2 = 5, \quad w_5 = w_4 + w_3 = 8, \quad w_6 = w_5 + w_4 = 13, \quad w_7 = w_6 + w_5 = 21, \quad \ldots$$
How do we find a formula for $w_n$? Here is where generating functions come to the rescue. Let
$$W(s) = \sum_{m \geq 0} w_m s^m$$
be the generating function of $(w_m, m \geq 0)$. Then the generating function of $(w_{m+1}, m \geq 0)$ is
$$\sum_{m \geq 0} w_{m+1} s^m = s^{-1}(W(s) - w_0),$$
and the generating function of $(w_{m+2}, m \geq 0)$ is
$$\sum_{m \geq 0} w_{m+2} s^m = s^{-2}(W(s) - w_0 - s w_1).$$
From the recursion
$$w_{m+2} = w_{m+1} + w_m, \quad m \geq 0$$
(obtained from (10) by replacing $m$ by $m + 2$) we have (and this is where linearity is used) that the generating function of $(w_{m+2}, m \geq 0)$ equals the sum of the generating functions of $(w_{m+1}, m \geq 0)$ and $(w_m, m \geq 0)$, namely,
$$s^{-2}(W(s) - w_0 - s w_1) = s^{-1}(W(s) - w_0) + W(s). \qquad (11)$$
Since $w_0 = w_1 = 1$, we can solve for $W(s)$ and find
$$W(s) = \frac{1}{1 - s - s^2} = \frac{-1}{s^2 + s - 1}.$$
Essentially, what generating functions have done for us is to transform the LINEAR recursion (10) into the ALGEBRAIC equation (11). This is something you have learnt in your introductory Mathematics courses. The tools and recipes associated with LINEARITY are indispensable for anyone who does anything of value. Thus, keep them always in your bag of tricks.
The question we ask is:
Which sequence $(w_n, n \geq 0)$ has generating function $W(s)$?
We start by noting that the polynomial $s^2 + s - 1$ has two roots:
$$a = (\sqrt{5} - 1)/2, \qquad b = -(\sqrt{5} + 1)/2.$$
Hence $s^2 + s - 1 = (s - a)(s - b)$, and so, by simple algebra,
$$W(s) = \frac{1}{b - a}\left(\frac{1}{s - a} - \frac{1}{s - b}\right).$$
Write this as
$$W(s) = \frac{1}{b - a}\left(\frac{b}{bs - ab} - \frac{a}{as - ab}\right).$$
Noting that $ab = -1$, we further have
$$W(s) = \frac{b}{b - a}\cdot\frac{1}{1 + bs} - \frac{a}{b - a}\cdot\frac{1}{1 + as}.$$
But $\frac{1}{1 + bs} = \sum_{n=0}^\infty (-bs)^n$ and $\frac{1}{1 + as} = \sum_{n=0}^\infty (-as)^n$, and so $W(s)$ is the generating function of
$$w_n = \frac{b}{b - a}(-b)^n - \frac{a}{b - a}(-a)^n, \quad n \geq 0.$$
This can also be written as
$$w_n = \frac{(1 + \sqrt{5})^{n+1} - (1 - \sqrt{5})^{n+1}}{2^{n+1}\sqrt{5}},$$
which is always an integer (why?).
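A short check (my addition, not in the original): the closed form agrees with the recursion (10), whose solutions are the shifted Fibonacci numbers.

```python
# Sketch: compare the closed form of Exercise 46 with the recursion
# w_m = w_{m-1} + w_{m-2}, w_0 = w_1 = 1 (the shifted Fibonacci numbers).
def w_closed(n):
    s5 = 5 ** 0.5
    return round(((1 + s5) ** (n + 1) - (1 - s5) ** (n + 1)) / (2 ** (n + 1) * s5))

w = [1, 1]
for m in range(2, 15):
    w.append(w[-1] + w[-2])

print(w)                                  # 1, 1, 2, 3, 5, 8, 13, 21, ...
print([w_closed(n) for n in range(15)])   # should agree with the recursion
```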
47.
Consider a branching process starting with $Z_0 = 1$ and branching mechanism
$$p_1 = 1 - p, \qquad p_2 = p.$$
(Each individual gives birth to 1 or 2 children with probability $1 - p$ or $p$, respectively.) Let $Z_n$ be the size of the $n$-th generation. Compute the probabilities $P(Z_n = k)$ for all possible values of $k$, the generating function $\varphi_n(z) = Ez^{Z_n}$, and the mean size of the $n$-th generation $m_n = EZ_n$. Do the computations in whichever order is convenient for you.
Solution. The mean number of offspring of a typical individual is
$$m := (1 - p) + 2p = 1 + p.$$
Therefore
$$EZ_n = m^n = (1 + p)^n.$$
Let $q = 1 - p$. To compute $P(Z_2 = 4)$, we consider all possibilities of having 4 children in the second generation. There is only one: the first generation has 2 members and each of them has 2 children. Therefore $P(Z_2 = 4) = p \cdot p \cdot p = p^3$.
To compute $P(Z_2 = 3)$, the first generation must have 2 members, of whom one has 2 children and the other 1, and so $P(Z_2 = 3) = pqp + ppq = 2p^2 q$.
For $P(Z_2 = 2)$, either the single first-generation member has 2 children, or there are 2 first-generation members with 1 child each, and so $P(Z_2 = 2) = qp + pq^2$.
And for $P(Z_2 = 1)$ there is only one possibility (one child, who has one child), and so $P(Z_2 = 1) = q^2$.
You can continue in this manner to compute $P(Z_3 = k)$, etc.
The generating function of the branching mechanism is
$$\varphi(z) = p_1 z + p_2 z^2 = qz + pz^2.$$
So $\varphi_1(z) = Ez^{Z_1} = \varphi(z)$. Next, we have $\varphi_2(z) = \varphi_1(\varphi(z))$ and so
$$\varphi_2(z) = \varphi(\varphi(z)) = q\varphi(z) + p\varphi(z)^2 = p^3 z^4 + 2p^2 q z^3 + (qp + pq^2) z^2 + q^2 z.$$
Similarly, $\varphi_3(z) = \varphi_2(\varphi(z))$ and so
$$\varphi_3(z) = p^3 \varphi(z)^4 + 2p^2 q\, \varphi(z)^3 + (qp + pq^2)\varphi(z)^2 + q^2 \varphi(z)$$
$$= p^7 z^8 + 4p^6 q z^7 + p(2(qp + pq^2)p^3 + 4p^4 q^2) z^6 + p(2q^2 p^3 + 4(qp + pq^2)p^2 q) z^5$$
$$+ (qp^3 + p(4q^3 p^2 + (qp + pq^2)^2)) z^4 + (2q^2 p^2 + 2pq^2(qp + pq^2)) z^3 + (q(qp + pq^2) + pq^4) z^2 + q^3 z.$$
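As a check (my addition, not in the original solution), one can expand $\varphi_2(z) = \varphi(\varphi(z))$ symbolically and read off the probabilities $P(Z_2 = k)$ computed above.

```python
import sympy as sp

# Sketch: reproduce the second-generation probabilities of Exercise 47 by
# expanding phi_2(z) = phi(phi(z)), with phi(z) = q z + p z^2, q = 1 - p.
z, p = sp.symbols('z p')
q = 1 - p
phi = q * z + p * z**2
phi2 = sp.expand(phi.subs(z, phi))
poly = sp.Poly(phi2, z)
for k in range(1, 5):
    print("P(Z_2 =", k, ") =", sp.factor(poly.coeff_monomial(z**k)))
# Printed in terms of p; they correspond to q^2, qp + pq^2, 2p^2 q, p^3.
```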
48.
Consider a branching process with $Z_0 = 1$ and branching mechanism
$$p_0 = \frac{1}{10}, \qquad p_1 = \frac{7}{10}, \qquad p_2 = \frac{2}{10}.$$
(i) Compute the probability of ultimate extinction.
(ii) Compute the mean size of the $n$-th generation.
(iii) Compute the standard deviation of the size of the $n$-th generation.
Solution. (i) The generating function of the branching mechanism is
$$\varphi(z) = \frac{1}{10} z^0 + \frac{7}{10} z^1 + \frac{2}{10} z^2 = \frac{1}{10}(1 + 7z + 2z^2).$$
The probability $\varepsilon$ of ultimate extinction is the smallest positive $z$ such that
$$\varphi(z) = z.$$
We have to solve
$$1 + 7z + 2z^2 = 10z.$$
Its solutions are $1$ and $1/2$. Therefore,
$$\varepsilon = 1/2.$$
(ii) The mean number of offspring of an individual is
$$m = \frac{7}{10} \times 1 + \frac{2}{10} \times 2 = \frac{11}{10}.$$
Therefore the mean size of the $n$-th generation is
$$EZ_n = m^n = (11/10)^n.$$
(iii) As in Exercise 44 above, we have that
$$\varphi'(1) = EX, \qquad \varphi''(1) = EX^2 - EX, \qquad \operatorname{var} X = EX^2 - (EX)^2 = \varphi''(1) + EX - (EX)^2.$$
Since $\varphi_n(z) = \varphi_{n-1}(\varphi(z))$, we have
$$\varphi_n'(z) = \varphi_{n-1}'(\varphi(z))\, \varphi'(z),$$
$$\varphi_n''(z) = \varphi_{n-1}''(\varphi(z))\, \varphi'(z)^2 + \varphi_{n-1}'(\varphi(z))\, \varphi''(z).$$
Setting $z = 1$ and using that $\varphi(1) = 1$, we have
$$\varphi_n''(1) = \varphi_{n-1}''(1)\, \varphi'(1)^2 + \varphi_{n-1}'(1)\, \varphi''(1).$$
But $\varphi'(1) = m$, $\varphi_{n-1}'(1) = m^{n-1}$, and so
$$\varphi_n''(1) = \varphi_{n-1}''(1)\, m^2 + m^{n-1}\varphi''(1).$$
Iterating this we find
$$\varphi_n''(1) = \varphi''(1) \sum_{k=n-1}^{2n-2} m^k.$$
We here have $m = 11/10$, $\varphi''(1) = 4/10$. But then
$$\sigma_n^2 = \operatorname{var} Z_n = \varphi_n''(1) + EZ_n - (EZ_n)^2 = \varphi''(1) \sum_{k=n-1}^{2n-2} m^k + m^n - m^{2n}$$
$$= \varphi''(1)\, m^{n-1}\, \frac{m^n - 1}{m - 1} + m^n - m^{2n} = \frac{4}{10}\, m^{n-1}\, \frac{m^n - 1}{1/10} - m^n(m^n - 1)$$
$$= 4m^{n-1}(m^n - 1) - m^n(m^n - 1) = (4 - m)\, m^{n-1}(m^n - 1) = \frac{29}{10}\left(\frac{11}{10}\right)^{n-1}\left[\left(\frac{11}{10}\right)^n - 1\right].$$
Of course, the standard deviation is the square root of this number.
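The following sketch (my addition, not part of the original solution) iterates $\varphi$ numerically: $\varphi_n(0)$ increases to the extinction probability $\varepsilon = 1/2$, and the moment formulas derived above are easy to evaluate.

```python
# Sketch: numerical check of Exercise 48.
def phi(z):
    return (1 + 7 * z + 2 * z**2) / 10

x = 0.0
for _ in range(200):
    x = phi(x)                         # phi_n(0) -> extinction probability
print("extinction probability ~", x)   # converges to 1/2

m = 1.1                                # mean offspring number, m = 11/10
n = 10
mean = m ** n                          # E Z_n = m^n
var = (4 - m) * m ** (n - 1) * (m ** n - 1)  # formula derived above
print("EZ_10 =", mean, "  var Z_10 =", var)
```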
49.
Consider the same branching process as above, but now start with $Z_0 = N$, an arbitrary positive integer. Answer the same questions.
Solution. (i) The process behaves as the superposition of $N$ i.i.d. copies of the previous process. It becomes extinct if and only if each of the $N$ copies becomes extinct and so, by independence, the extinction probability is
$$\varepsilon^N = (1/2)^N.$$
(ii) The $n$-th generation of the new process is the sum of the populations of the $n$-th generations of each of the $N$ constituent processes. Therefore the mean size of the $n$-th generation is
$$N m^n = N(11/10)^n.$$
(iii) For the same reason (variances of independent copies add), the standard deviation of the size of the $n$-th generation is $\sqrt{N}\,\sigma_n$.
50.
Show that a branching process cannot have a stationary distribution $\pi$ with $\pi(i) > 0$ for some $i > 0$.
Solution. If the mean number $m$ of offspring is $\leq 1$ then we know that the process becomes extinct for sure, i.e. it is absorbed at state 0. Hence the only stationary distribution satisfies
$$\pi(0) = 1, \qquad \pi(i) = 0, \quad i \geq 1.$$
If the mean number $m$ of offspring is $> 1$ then we know that the extinction probability is $\varepsilon < 1$, i.e. $P_1(\tau_0 = \infty) = 1 - \varepsilon > 0$. But we showed in Part (i) of Problem 8 above that $P_i(\tau_0 = \infty) = 1 - \varepsilon^i > 0$ for all $i \geq 1$. Hence all states $i \geq 1$ are transient, and a stationary distribution assigns zero mass to transient states. So, in this case too, no stationary distribution can have $\pi(i) > 0$ for some $i > 0$.
51.
Consider the following Markov chain, which is motivated by the "umbrellas problem" (see earlier exercise). Here, $p + q = 1$, $0 < p < 1$.
[The transition diagram is the same as in Exercise 39.]
Is it positive recurrent?
Solution. We showed in another problem that the chain is irreducible and recurrent. Let us now see if it is positive recurrent; in other words, let us see if $E_i T_i < \infty$ for some (and thus all) $i$.
As we said in the lectures, this is equivalent to having $\pi(i) > 0$ for all $i$, where $\pi$ is a solution to the balance equations. We solved the balance equations in the past and found that $\pi(i) = c$ for all $i \geq 1$, where $c$ is a constant. But there is no $c > 0$ for which $\sum_{i=0}^\infty \pi(i) = 1$. And so the chain is not positive recurrent; it is null recurrent.
52.
Consider a Markov chain with state space $\{0, 1, 2, \ldots\}$ and transition probabilities
$$p_{i,i-1} = 1, \quad i = 1, 2, 3, \ldots$$
$$p_{0,i} = p_i, \quad i = 0, 1, 2, 3, \ldots$$
where $p_i > 0$ for all $i$ and $\sum_{i \geq 0} p_i = 1$.
(i) Is the chain irreducible?
(ii) What is the period of state 0?
(iii) What is the period of state $i$, for all values of $i$?
(iv) Under what condition is the chain positive recurrent?
(v) If the chain is positive recurrent, what is the mean number of steps required for it to return to state $i$ if it starts from $i$?
Solution.
[Figure: from each state $i \geq 1$ the chain moves down to $i - 1$ with probability 1; from 0 it jumps to $i$ with probability $p_i$.]
(i) Yes it is: it is possible to move from any state to any other state.
(ii) It is 1.
(iii) Same.
(iv) We write the balance equations:
$$\pi(i) = \pi(i+1) + \pi(0)\, p_i, \quad i \geq 0.$$
Solving this, we find
$$\pi(i) = \pi(0)(1 - p_0 - \cdots - p_{i-1}), \quad i \geq 1.$$
The normalising condition gives
$$1 = \sum_{i=0}^\infty \pi(i) = \pi(0) \sum_{i=0}^\infty (1 - p_0 - \cdots - p_{i-1}).$$
This can be satisfied if and only if
$$\sum_{i=0}^\infty (1 - p_0 - \cdots - p_{i-1}) < \infty.$$
This is the condition for positive recurrence.
Note that, since $p_0 + \cdots + p_{i-1} = P_0(X_1 \leq i - 1)$, the condition can be written as
$$\sum_{i=0}^\infty P_0(X_1 \geq i) < \infty.$$
But
$$\sum_{i=0}^\infty P_0(X_1 \geq i) = \sum_{i=0}^\infty E_0 \mathbf{1}(X_1 \geq i) = E_0 \sum_{i=0}^{X_1} 1 = E_0(X_1 + 1),$$
so the condition is equivalent to
$$E_0 X_1 < \infty.$$
(v)
$$E_i T_i = \frac{1}{\pi(i)}.$$
53.
Consider a simple symmetric random walk $S_n = \xi_1 + \cdots + \xi_n$, started from $S_0 = 0$. Find the following probabilities:
(i) $P(S_4 = k)$, for all possible values of $k$.
(ii) $P(S_n \geq 0, \ n = 1, 2, 3, 4)$.
(iii) $P(S_n \neq 0, \ n = 1, 2, 3, 4)$.
(iv) $P(S_n \leq 2, \ n = 1, 2, 3, 4)$.
(v) $P(|S_n| \leq 2, \ n = 1, 2, 3, 4)$.
Solution. (i) We have
$$P(S_4 = k) = \binom{4}{\frac{4+k}{2}}\, 2^{-4}, \quad k = -4, -2, 0, 2, 4,$$
and so
$$P(S_4 = -4) = P(S_4 = 4) = 1/16, \quad P(S_4 = -2) = P(S_4 = 2) = 4/16, \quad P(S_4 = 0) = 6/16.$$
(ii) Since the random walk is symmetric, all paths of the same length are equally likely. There are 6 paths comprising the event $\{S_n \geq 0, \ n = 1, 2, 3, 4\}$ and so $P(S_n \geq 0, \ n = 1, 2, 3, 4) = 6/16$.
(iii) There are 3 paths comprising the event $\{S_n > 0, \ n = 1, 2, 3, 4\}$ and, by symmetry, another 3 comprising $\{S_n < 0, \ n = 1, 2, 3, 4\}$. Hence $P(S_n \neq 0, \ n = 1, 2, 3, 4) = 6/16$. (This agrees with Exercise 59(iv): the probability of no equalisation in the first $2m$ steps equals the probability of equalisation at $2m$, here $u_4 = 6/16$.)
(iv) There are 2 paths violating the condition $\{S_n \leq 2, \ n = 1, 2, 3, 4\}$. Hence $P(S_n \leq 2, \ n = 1, 2, 3, 4) = (16 - 2)/16 = 14/16$.
(v) There are 4 paths violating the condition $\{|S_n| \leq 2, \ n = 1, 2, 3, 4\}$. Hence $P(|S_n| \leq 2, \ n = 1, 2, 3, 4) = (16 - 4)/16 = 12/16$.
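Since there are only $2^4 = 16$ equally likely paths, all five answers (including the corrected count in (iii)) can be verified by brute force; the sketch below is my addition.

```python
from itertools import product

# Sketch: enumerate all 2^4 equally likely paths of the SSRW (Exercise 53).
paths = list(product([1, -1], repeat=4))

def sums(path):
    s, out = 0, []
    for step in path:
        s += step
        out.append(s)       # partial sums S_1, S_2, S_3, S_4
    return out

print(sum(sums(p)[-1] == 0 for p in paths), "/16")                  # (i)  6/16
print(sum(all(s >= 0 for s in sums(p)) for p in paths), "/16")      # (ii) 6/16
print(sum(all(s != 0 for s in sums(p)) for p in paths), "/16")      # (iii) 6/16
print(sum(all(s <= 2 for s in sums(p)) for p in paths), "/16")      # (iv) 14/16
print(sum(all(abs(s) <= 2 for s in sums(p)) for p in paths), "/16") # (v) 12/16
```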
54.
Consider a simple random walk $S_n = \xi_1 + \cdots + \xi_n$, started from $S_0 = 0$, with
$$P(\xi_1 = 1) = p, \quad P(\xi_1 = -1) = q, \quad p + q = 1.$$
(i) Show that
$$E(S_m \mid S_n) = \begin{cases} \dfrac{m}{n}\, S_n, & \text{if } m \leq n \\[4pt] S_n + (p - q)(m - n), & \text{if } m > n \end{cases}$$
(ii) Are you surprised by the fact that the answer for $m \leq n$ does not depend on $p$?
Solution. (i) If $m > n$ then $S_m = S_n + (S_m - S_n)$, so
$$E(S_m \mid S_n) = S_n + E(S_m - S_n \mid S_n).$$
But $S_m - S_n = \xi_{n+1} + \cdots + \xi_m$. Since $\xi_{n+1}, \ldots, \xi_m$ are independent of $S_n$, we have
$$E(S_m - S_n \mid S_n) = E(S_m - S_n) = \sum_{k=n+1}^m E\xi_k = (m - n)(p - q).$$
Thus,
$$E(S_m \mid S_n) = S_n + (p - q)(m - n), \quad \text{if } m > n.$$
If $m \leq n$, then
$$E(S_m \mid S_n) = E\left(\sum_{k=1}^m \xi_k \ \Big|\ S_n\right) = \sum_{k=1}^m E(\xi_k \mid S_n).$$
Now notice that, for all $k = 1, \ldots, n$,
$$E(\xi_k \mid S_n) = E(\xi_1 \mid S_n),$$
because the random variables $\xi_1, \ldots, \xi_n$ are i.i.d. and $S_n$ is a symmetric function of them (interchanging two does not change the sum). Hence, for all $m = 1, \ldots, n$,
$$E(S_m \mid S_n) = m\, E(\xi_1 \mid S_n).$$
This is true even for $m = n$. But, in this case, $E(S_m \mid S_n) = E(S_n \mid S_n) = S_n$, so that $E(\xi_1 \mid S_n) = S_n/n$. Thus,
$$E(S_m \mid S_n) = \frac{m}{n}\, S_n, \quad \text{if } m \leq n.$$
(ii) At first sight, yes, you should be surprised. But look (think) again...
55.
Consider a simple random walk $S_n$ again, which does not necessarily start from 0, and define the processes:
$$X_n = S_{2n}, \quad n \geq 0$$
$$Y_n = S_{2n+1}, \quad n \geq 0$$
$$Z_n = e^{S_n}, \quad n \geq 0$$
(i) Show that each of them is Markov and identify their state spaces.
(ii) Compute their transition probabilities.
Solution. (i) The first two are Markov because they are subsequences of a Markov chain sampled at deterministic times. The third is Markov because $x \mapsto e^x$ is a bijection from $\mathbb{R}$ into $(0, \infty)$. The state space of the first two is $\mathbb{Z}$. The state space of the third is the set $S = \{e^k : k \in \mathbb{Z}\} = \{\ldots, e^{-2}, e^{-1}, 1, e, e^2, e^3, \ldots\}$.
(ii) For the first one we have
$$P(X_{n+1} = j \mid X_n = i) = P(S_{2n+2} = j \mid S_{2n} = i) = P(i + \xi_{2n+1} + \xi_{2n+2} = j) = P(\xi_1 + \xi_2 = j - i).$$
Hence, given $i$, the only possible values of $j$ are $i - 2$, $i$, $i + 2$. For all other values of $j$ the transition probability is zero. We have
$$P(X_{n+1} = i + 2 \mid X_n = i) = P(\xi_1 = \xi_2 = 1) = p^2$$
$$P(X_{n+1} = i - 2 \mid X_n = i) = P(\xi_1 = \xi_2 = -1) = q^2$$
$$P(X_{n+1} = i \mid X_n = i) = P(\xi_1 = 1, \xi_2 = -1 \text{ or } \xi_1 = -1, \xi_2 = 1) = 2pq.$$
The second process has the same transition probabilities.
For the third process we have
$$P(Z_{n+1} = e^{k+1} \mid Z_n = e^k) = P(S_{n+1} = k + 1 \mid S_n = k) = p,$$
$$P(Z_{n+1} = e^{k-1} \mid Z_n = e^k) = P(S_{n+1} = k - 1 \mid S_n = k) = q.$$
56.
Consider a simple random walk $S_n$ again, and suppose it starts from 0. As usual, $P(\xi_1 = 1) = p$, $P(\xi_1 = -1) = q = 1 - p$. Compute $Ee^{\alpha S_n}$ for $\alpha \in \mathbb{R}$.
Solution. We have $S_n = \xi_1 + \cdots + \xi_n$. By independence,
$$Ee^{\alpha S_n} = E\left[e^{\alpha\xi_1} \cdots e^{\alpha\xi_n}\right] = E\left[e^{\alpha\xi_1}\right] \cdots E\left[e^{\alpha\xi_n}\right] = \left(E\left[e^{\alpha\xi_1}\right]\right)^n = (pe^\alpha + qe^{-\alpha})^n.$$
57.
(i) Explain why $P(\lim_{n\to\infty} S_n = \infty) = 1$ if $p > q$ and, similarly, $P(\lim_{n\to\infty} S_n = -\infty) = 1$ if $p < q$.
(ii) What can you say about the asymptotic behaviour of $S_n$ as $n \to \infty$ when $p = q$?
Solution. (i) The Strong Law of Large Numbers (SLLN) says that
$$P\left(\lim_{n\to\infty} S_n/n = p - q\right) = 1,$$
because $E\xi_1 = p - q$. If $p > q$, then the SLLN implies that
$$P\left(\lim_{n\to\infty} S_n/n > 0\right) = 1.$$
But
$$\left\{\lim_{n\to\infty} S_n/n > 0\right\} \subset \left\{\lim_{n\to\infty} S_n = \infty\right\}.$$
Since the event on the left has probability 1, so does the event on the right, i.e.
$$P\left(\lim_{n\to\infty} S_n = \infty\right) = 1, \quad \text{if } p > q.$$
If, on the other hand, $p < q$, then $p - q < 0$, and so the SLLN implies that
$$P\left(\lim_{n\to\infty} S_n/n < 0\right) = 1.$$
But
$$\left\{\lim_{n\to\infty} S_n/n < 0\right\} \subset \left\{\lim_{n\to\infty} S_n = -\infty\right\}.$$
Since the event on the left has probability 1, so does the event on the right, i.e.
$$P\left(\lim_{n\to\infty} S_n = -\infty\right) = 1, \quad \text{if } p < q.$$
(ii) If $p = q$, then $p - q = 0$, and the fact that $S_n/n$ converges to 0 by itself tells us little about the sequence $S_n$. Still, we may conclude that
$$P(S_n \text{ has no limit as } n \to \infty) = 1, \quad \text{if } p = q.$$
Stronger conclusions are possible, as we saw in the lectures.
58.
For a simple symmetric random walk let $f_n$ be the probability of first return to 0 at time $n$. Compute $f_n$ for $n = 1, \ldots, 6$, first by applying the general formula and then by path counting (i.e. by considering the possible paths that contribute to the event).
Solution. Obviously, $f_n = 0$ if $n$ is odd. Recall the formula
$$f_{2k} = \binom{1/2}{k}(-1)^{k-1}, \quad k \in \mathbb{N}.$$
With $k = 1, 2, 3, 4$, we have
$$f_2 = \binom{1/2}{1} = \frac{1}{2}$$
$$f_4 = -\binom{1/2}{2} = -\frac{(1/2)(1/2 - 1)}{2} = \frac{1}{8}$$
$$f_6 = \binom{1/2}{3} = \frac{(1/2)(1/2 - 1)(1/2 - 2)}{6} = \frac{1}{16}$$
$$f_8 = -\binom{1/2}{4} = -\frac{(1/2)(1/2 - 1)(1/2 - 2)(1/2 - 3)}{24} = \frac{5}{128}.$$
To do path counting, we consider, e.g., the last case. The possible paths contributing to the event {first return at time 8} are the 5 positive ones as well as their reflections. Each path consists of 8 segments, so it has probability $2^{-8}$. There are 10 paths in all, so
$$f_8 = 10/2^8 = 5/128.$$
59.
Consider a simple symmetric random walk starting from 0. Equalisation at time $n$ means that $S_n = 0$, and its probability is denoted by $u_n$.
(i) Show that, for $m \geq 1$, $f_{2m} = u_{2m-2} - u_{2m}$.
(ii) Using part (i), find a closed-form expression for the sum $f_2 + f_4 + \cdots + f_{2m}$.
(iii) Using part (i), show that $\sum_{k=1}^\infty f_{2k} = 1$. (One can also obtain this statement from the fact that $F(x) = 1 - (1 - x)^{1/2}$.)
(iv) Show that the probability of no equalisation in the first $2m$ steps equals the probability of equalisation at $2m$.
60.
A fair coin is tossed repeatedly and independently. Find the expected number of tosses required until the pattern HTHH appears.
Solution. It is easy to see that the Markov chain described by the following transition diagram captures exactly what we are looking for:
[Transition diagram on the states $\varnothing$, H, HT, HTH, HTHH, recording the longest suffix of the tosses so far that is a prefix of HTHH; each arrow has probability 1/2.]
Rename the states $\varnothing$, H, HT, HTH, HTHH as 0, 1, 2, 3, 4, respectively, and let $\psi_i$ be the average number of steps required for state 4 to be reached if the starting state is $i$. Writing first-step (backwards) equations, we have
$$\psi_0 = 1 + \tfrac{1}{2}\psi_0 + \tfrac{1}{2}\psi_1$$
$$\psi_1 = 1 + \tfrac{1}{2}\psi_1 + \tfrac{1}{2}\psi_2$$
$$\psi_2 = 1 + \tfrac{1}{2}\psi_0 + \tfrac{1}{2}\psi_3$$
$$\psi_3 = 1 + \tfrac{1}{2}\psi_2 + \tfrac{1}{2}\psi_4.$$
(From state HTH, a T sends us back to HT, since the last two tosses then read HT.) Also, obviously, $\psi_4 = 0$. Solving, we find
$$\psi_3 = 8, \quad \psi_2 = 14, \quad \psi_1 = 16, \quad \psi_0 = 18.$$
So the answer is: "it takes, on average, 18 coin tosses to see the pattern HTHH for the first time".
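A simulation sketch (my addition, not part of the original solution) corroborates the answer of 18 tosses.

```python
import random

# Sketch: simulate Exercise 60; the empirical mean should be close to 18.
random.seed(2)
total, trials = 0, 100_000
for _ in range(trials):
    seq, n = "", 0
    while not seq.endswith("HTHH"):
        seq += random.choice("HT")   # one fair coin toss
        n += 1
    total += n
print(total / trials)                # ~ 18
```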
61.
Show that the stationary distribution for the Ehrenfest chain is Binomial.
Solution. The Ehrenfest chain has state space
$$S = \{0, 1, \ldots, n\}$$
and transition probabilities
$$p_{i,i+1} = 1 - \frac{i}{n}, \qquad p_{i,i-1} = \frac{i}{n}, \qquad i = 0, \ldots, n.$$
From the transition diagram we immediately deduce that the detailed balance equations must hold, so, if $\pi$ denotes the stationary distribution,
$$\pi(i)\, p_{i,i-1} = \pi(i-1)\, p_{i-1,i}, \quad 1 \leq i \leq n,$$
or
$$\pi(i) = \frac{n - i + 1}{i}\, \pi(i-1), \quad 1 \leq i \leq n,$$
iterating which gives
$$\pi(i) = \frac{n - i + 1}{i} \cdot \frac{n - i + 2}{i - 1} \cdots \frac{n - 1}{2} \cdot \frac{n}{1}\, \pi(0) = \frac{n!}{(n-i)!\, i!}\, \pi(0),$$
which is immediately recognisable as the Binomial distribution (with parameters $n$ and $1/2$, upon normalising: $\pi(0) = 2^{-n}$).
62.
A Markov chain has transition probability matrix
$$P = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1/3 & 2/3 \\ 1 & 0 & 0 & 0 \\ 0 & 1/2 & 1/2 & 0 \end{pmatrix}$$
Draw the transition diagram.
Are there any absorbing states?
Which are the communicating classes?
Can you find a stationary distribution?
What are the periods of the states?
Are there any inessential states?
Which states are recurrent?
Which states are transient?
Which states are positive recurrent?
Solution.
[Transition diagram on states 1, 2, 3, 4, with arrows $1 \to 2$ (probability 1), $2 \to 3$ (1/3), $2 \to 4$ (2/3), $3 \to 1$ (1), $4 \to 2$ (1/2), $4 \to 3$ (1/2).]
There are no absorbing states because there is no state $i$ for which $p_{i,i} = 1$.
All states communicate with one another. Therefore there is only one communicating class, $\{1, 2, 3, 4\}$, the whole state space. (We refer to this by saying that the chain is irreducible.)
Yes, of course we can. We can ALWAYS find a stationary distribution if the state space is FINITE. It can be found by solving the system of equations (known as balance equations)
$$\pi P = \pi,$$
which, in explicit form, yield
$$\pi(1) = \pi(3)$$
$$\pi(2) = \pi(1) + \tfrac{1}{2}\pi(4)$$
$$\pi(3) = \tfrac{1}{3}\pi(2) + \tfrac{1}{2}\pi(4)$$
$$\pi(4) = \tfrac{2}{3}\pi(2)$$
Solving these, along with the normalisation condition $\pi(1) + \pi(2) + \pi(3) + \pi(4) = 1$, we find
$$\pi(1) = \pi(3) = \pi(4) = 2/9, \qquad \pi(2) = 1/3.$$
Since the chain is irreducible, the periods of all the states are the same. So let us take a particular state, say state 4, and consider the set
$$\{n \geq 1 : p^{(n)}_{4,4} > 0\}.$$
The first few elements of this set are
$$\{2, 4, 5, \ldots\}$$
(e.g. $4 \to 2 \to 4$ in two steps, $4 \to 3 \to 1 \to 2 \to 4$ in four). We immediately deduce that the greatest common divisor of the set is 1. Therefore the period of state 4 is 1. And so each state has period 1. (We refer to this by saying that the chain is aperiodic.)
Since all states communicate with one another, there are no inessential states.
Since $\pi(i) > 0$ for all $i$, all states are recurrent.
Since all states are recurrent, there are no transient states.
Since $\pi(i) > 0$ for all $i$, all states are positive recurrent.
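A quick numerical check of the stationary vector (my addition, not in the original), by power iteration; since the chain is finite, irreducible and aperiodic, $\pi P^n$ converges to the stationary distribution from any starting vector.

```python
# Sketch: power iteration for the 4-state chain of Exercise 62.
P = [[0, 1, 0, 0],
     [0, 0, 1/3, 2/3],
     [1, 0, 0, 0],
     [0, 1/2, 1/2, 0]]

pi = [0.25] * 4                      # any probability vector will do
for _ in range(2000):
    pi = [sum(pi[i] * P[i][j] for i in range(4)) for j in range(4)]
print([round(x, 4) for x in pi])     # ~ [0.2222, 0.3333, 0.2222, 0.2222]
```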
63.
In tennis, the winner of a game is the first player to win four points, unless the score is 4–3, in which case the game must continue until one player wins by two points. Suppose that the game has reached the point where one player is trying to get two points ahead to win, and that the server will independently win each point with probability 0.6. What is the probability that the server will win the game if the score is tied 3-3? If she is ahead by one point? Behind by one point?
Solution. Say that a score $x$-$y$ means that the server has $x$ points and the other player $y$. If the current score is 3-3, the next score is either 4-3 or 3-4. In either case, the game continues until one of the players is ahead by 2 points. So let $i$ represent the difference $x - y \in \{-2, -1, 0, 1, 2\}$ and model the situation by a Markov chain on these states: $i$ increases by 1 with probability 0.6, decreases by 1 with probability 0.4, and $\pm 2$ are absorbing.
Let $\varphi_i$ be the probability that the server wins, i.e. that state 2 is reached before state $-2$. First-step equations yield:
$$\varphi_i = 0.6\,\varphi_{i+1} + 0.4\,\varphi_{i-1}, \quad -1 \leq i \leq 1.$$
In other words,
$$\varphi_1 = 0.6\,\varphi_2 + 0.4\,\varphi_0$$
$$\varphi_0 = 0.6\,\varphi_1 + 0.4\,\varphi_{-1}$$
$$\varphi_{-1} = 0.6\,\varphi_0 + 0.4\,\varphi_{-2}$$
Of course,
$$\varphi_{-2} = 0, \qquad \varphi_2 = 1.$$
Solving, we find
$$\varphi_0 = \frac{0.6^2}{1 - 2 \times 0.6 \times 0.4} \approx 0.69, \qquad \varphi_1 \approx 0.88, \qquad \varphi_{-1} \approx 0.42.$$
64.
Consider a simple random walk with $p = 0.7$, starting from zero. Find the probability that state 2 is reached before state $-3$. Compute the mean number of steps until the random walk reaches state 2 or state $-3$ for the first time.
Solution. Let $\varphi_i$ be the probability that state 2 is reached before state $-3$, starting from state $i$. By writing first-step equations we have:
$$\varphi_i = p\,\varphi_{i+1} + q\,\varphi_{i-1}, \quad -3 < i < 2.$$
In other words,
$$\varphi_1 = 0.7\,\varphi_2 + 0.3\,\varphi_0$$
$$\varphi_0 = 0.7\,\varphi_1 + 0.3\,\varphi_{-1}$$
$$\varphi_{-1} = 0.7\,\varphi_0 + 0.3\,\varphi_{-2}$$
$$\varphi_{-2} = 0.7\,\varphi_{-1} + 0.3\,\varphi_{-3}$$
We also have, of course,
$$\varphi_{-3} = 0, \qquad \varphi_2 = 1.$$
By solving these equations we find:
$$\varphi_i = \frac{(q/p)^{i+3} - 1}{(q/p)^5 - 1}, \quad -3 \leq i \leq 2.$$
Therefore
$$\varphi_0 = \frac{(3/7)^3 - 1}{(3/7)^5 - 1} \approx 0.93.$$
Next, let $t_i$ be the mean number of steps until the random walk reaches state 2 or state $-3$ for the first time, starting from state $i$. By writing first-step equations we have:
$$t_i = 1 + p\,t_{i+1} + q\,t_{i-1}, \quad -3 < i < 2.$$
In other words,
$$t_1 = 1 + 0.7\,t_2 + 0.3\,t_0$$
$$t_0 = 1 + 0.7\,t_1 + 0.3\,t_{-1}$$
$$t_{-1} = 1 + 0.7\,t_0 + 0.3\,t_{-2}$$
$$t_{-2} = 1 + 0.7\,t_{-1} + 0.3\,t_{-3}$$
We also have, of course,
$$t_2 = 0, \qquad t_{-3} = 0.$$
By solving these equations we find:
$$t_i = \frac{5}{p - q}\cdot\frac{(q/p)^{i+3} - 1}{(q/p)^5 - 1} - \frac{i + 3}{p - q}, \quad -3 \leq i \leq 2.$$
Therefore
$$t_0 = \frac{5}{0.4}\cdot\frac{(3/7)^3 - 1}{(3/7)^5 - 1} - \frac{3}{0.4} \approx 4.18.$$
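The numbers can be corroborated by solving the two first-step linear systems directly; the sketch below (my addition, not the author's) uses numpy.

```python
import numpy as np

# Sketch: solve the first-step equations of Exercise 64 on states -3..2.
p, q = 0.7, 0.3
states = list(range(-3, 3))            # -3, -2, -1, 0, 1, 2
n = len(states)
A, b_phi, b_t = np.zeros((n, n)), np.zeros(n), np.zeros(n)
for r, i in enumerate(states):
    if i in (-3, 2):                   # absorbing boundary states
        A[r, r] = 1.0
        b_phi[r] = 1.0 if i == 2 else 0.0
    else:                              # phi(i) = p*phi(i+1) + q*phi(i-1)
        A[r, r] = 1.0
        A[r, states.index(i + 1)] = -p
        A[r, states.index(i - 1)] = -q
        b_t[r] = 1.0                   # t(i) = 1 + p*t(i+1) + q*t(i-1)
print(np.linalg.solve(A, b_phi)[states.index(0)])   # ~ 0.935
print(np.linalg.solve(A, b_t)[states.index(0)])     # ~ 4.18
```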
65.
A gambler has £9 and has the opportunity of playing a game in which the probability is 0.4 that he wins an amount equal to his stake, and probability 0.6 that he loses his stake. He is allowed to decide how much to stake at each game (in multiples of 10p). How should he choose the stakes to maximise his chances of increasing his capital to £10?
66.
Let $\xi_1, \xi_2, \ldots$ be i.i.d. r.v.'s with values in, say, $\mathbb{Z}$ and $P(\xi_1 = x) = p(x)$, $x \in \mathbb{Z}$. Let $A \subset \mathbb{Z}$ be such that $P(\xi_1 \in A) > 0$. Let $T_A = \inf\{n \geq 1 : \xi_n \in A\}$. Show that $P(\xi_{T_A} = x) = p(x)/\sum_{a \in A} p(a)$, $x \in A$.
Solution. Let $x \in A$.
$$P(\xi_{T_A} = x) = \sum_{n=1}^\infty P(\xi_n = x, \xi_1 \notin A, \ldots, \xi_{n-1} \notin A) = \sum_{n=1}^\infty p(x)\, P(\xi_1 \notin A)^{n-1} = p(x)\, \frac{1}{1 - P(\xi_1 \notin A)} = \frac{p(x)}{\sum_{a \in A} p(a)}.$$
67.
For a simple symmetric random walk starting from 0, compute $ES_n^4$.
Solution. We have that $S_n = \xi_1 + \cdots + \xi_n$, where $\xi_1, \ldots, \xi_n$ are i.i.d. with $P(\xi_1 = 1) = P(\xi_1 = -1) = 1/2$. When we expand the fourth power of the sum we have
$$S_n^4 = \left[\xi_1^4 + \cdots + \xi_n^4\right] + \left[\xi_1^2\xi_2^2 + \cdots + \xi_{n-1}^2\xi_n^2\right] + \left[\xi_1^3\xi_2 + \cdots + \xi_{n-1}^3\xi_n\right] + \left[\xi_1^2\xi_2\xi_3 + \cdots + \xi_{n-2}^2\xi_{n-1}\xi_n\right] + \left[\xi_1\xi_2\xi_3\xi_4 + \cdots + \xi_{n-3}\xi_{n-2}\xi_{n-1}\xi_n\right].$$
After taking expectations, we see that the expectation of each term in the last three groups is zero, because $E\xi_i = 0$ and because of independence. There are $n$ terms in the first group and $3(n^2 - n)$ terms in the second one. Hence
$$ES_n^4 = n\, E\xi_1^4 + 3(n^2 - n)\, E\xi_1^2\, E\xi_2^2.$$
But $\xi_1^2 = \xi_1^4 = 1$. So the answer is:
$$ES_n^4 = n + 3(n^2 - n) = 3n^2 - 2n.$$
68.
For a simple random walk, compute $E(S_n - ES_n)^4$ and observe that it is less than $Cn^2$ for some constant $C$.
Solution. Write
$$S_n - ES_n = \widehat S_n = \sum_{k=1}^n \widehat\xi_k,$$
where
$$\widehat\xi_k = \xi_k - E\xi_k = \xi_k - (p - q).$$
Notice that
$$E\widehat\xi_k = 0,$$
and repeat the computation of $E\widehat S_n^4$ as above but with $\widehat\xi_k$ in place of $\xi_k$:
$$\widehat S_n^4 = \left[\widehat\xi_1^4 + \cdots + \widehat\xi_n^4\right] + \left[\widehat\xi_1^2\widehat\xi_2^2 + \cdots + \widehat\xi_{n-1}^2\widehat\xi_n^2\right] + \left[\widehat\xi_1^3\widehat\xi_2 + \cdots + \widehat\xi_{n-1}^3\widehat\xi_n\right] + \left[\widehat\xi_1^2\widehat\xi_2\widehat\xi_3 + \cdots + \widehat\xi_{n-2}^2\widehat\xi_{n-1}\widehat\xi_n\right] + \left[\widehat\xi_1\widehat\xi_2\widehat\xi_3\widehat\xi_4 + \cdots + \widehat\xi_{n-3}\widehat\xi_{n-2}\widehat\xi_{n-1}\widehat\xi_n\right].$$
After taking expectations, we see that the expectation of each term in the last three groups is zero. Hence
$$E\widehat S_n^4 = n\, E\widehat\xi_1^4 + 3(n^2 - n)\, E\widehat\xi_1^2\, E\widehat\xi_2^2.$$
This is of the form $c_1 n^2 + c_2 n$, and clearly this is less than $Cn^2$, where $C = c_1 + |c_2|$.
69.
For a simple symmetric random walk let $T_0$ be the time of first return to 0. Compute $P(T_0 = n)$ for $n = 1, \ldots, 6$, first by applying the general formula and then by path counting.
Solution. Obviously, $P(T_0 = n) = 0$ if $n$ is odd. Recall the formula
$$P(T_0 = 2k) = \binom{1/2}{k}(-1)^{k-1}, \quad k \in \mathbb{N}.$$
With $k = 1, 2, 3, 4$, we have
$$P(T_0 = 2) = \binom{1/2}{1} = \frac{1}{2}$$
$$P(T_0 = 4) = -\binom{1/2}{2} = -\frac{(1/2)(1/2 - 1)}{2} = \frac{1}{8}$$
$$P(T_0 = 6) = \binom{1/2}{3} = \frac{(1/2)(1/2 - 1)(1/2 - 2)}{6} = \frac{1}{16}$$
$$P(T_0 = 8) = -\binom{1/2}{4} = -\frac{(1/2)(1/2 - 1)(1/2 - 2)(1/2 - 3)}{24} = \frac{5}{128}.$$
To do path counting, we consider, e.g., the last case. The possible paths contributing to the event $\{T_0 = 8\}$ are the 5 positive ones as well as their reflections. Each path consists of 8 segments, so it has probability $2^{-8}$. There are 10 paths, so
$$P(T_0 = 8) = 10/2^8 = 5/128.$$
70.
Show that the formula $P(M_n \geq x) = P(|S_n| \geq x) - \tfrac{1}{2}P(|S_n| = x)$ can also be derived by summing over $y$ the formula $P(M_n < x, S_n = y) = P(S_n = y) - P(S_n = 2x - y)$, $x > y$.
Solution.
71.
How would you modify the formula we derived for $Es^{T_a}$ for a simple random walk starting from 0 in order to make it valid for all $a$, positive or negative? Here $T_a$ is the first hitting time of $a$.
Solution. For a simple random walk $S_n = \xi_1 + \cdots + \xi_n$, with $P(\xi_i = +1) = p$, $P(\xi_i = -1) = q$, we found
$$Es^{T_1} = \frac{1 - \sqrt{1 - 4pqs^2}}{2qs} =: \psi(p, s),$$
where $T_1$ is the first time that the RW hits 1 (starting from 0), and we use the notation $\psi(p, s)$ just to denote this function as a function of $p$ and $s$. We argued that when $a > 0$, the random variable $T_a$ is the sum of $a$ i.i.d. copies of $T_1$, and so
$$Es^{T_a} = \psi(p, s)^a.$$
Let us now look at $T_{-1}$. Since the distribution of $T_{-1}$ is the same as that of $T_1$ but for a RW with $p$ and $q$ interchanged, we have
$$Es^{T_{-1}} = \psi(q, s).$$
Now, if $a < 0$, the random variable $T_a$ is the sum of $|a|$ i.i.d. copies of $T_{-1}$. Hence
$$Es^{T_a} = \psi(q, s)^{-a}.$$
72.
Consider a simple random walk starting from 0 and let $T_a$ be the first time that state $a$ will be visited. Find a formula for $P(T_a = n)$.
Solution. Let us consider the case $a > 0$, the other being similar. We have
$$Es^{T_a} = \left(\frac{1 - \sqrt{1 - 4pqs^2}}{2qs}\right)^a,$$
whence
$$(2q)^a\, E(s^{a + T_a}) = \left(1 - \sqrt{1 - 4pqs^2}\right)^a.$$
We write power series for the right and left hand sides separately.
$$\text{RHS} = \sum_{r=0}^a \binom{a}{r}\left(-\sqrt{1 - 4pqs^2}\right)^r = \sum_{r=0}^a \binom{a}{r}(-1)^r \sum_{n=0}^\infty \binom{r/2}{n}(-4pqs^2)^n = \sum_{n=0}^\infty \left[\sum_{r=0}^a \binom{a}{r}\binom{r/2}{n}(-1)^{n+r}(4pq)^n\right] s^{2n},$$
$$\text{LHS} = (2q)^a \sum_{m=0}^\infty P(T_a = m)\, s^{a+m}.$$
Equating powers of $s$ on both sides, we see that we need $a + m = 2n$, i.e. $m = 2n - a$:
$$\text{LHS} = (2q)^a \sum_{n \geq a} P(T_a = 2n - a)\, s^{2n}.$$
We conclude:
$$P(T_a = 2n - a) = \frac{1}{(2q)^a}\left[\sum_{r=1}^a \binom{a}{r}\binom{r/2}{n}(-1)^{n+r}(4pq)^n\right], \quad n \geq a.$$
(The $r = 0$ term vanishes for $n \geq 1$, which is why the sum may start at $r = 1$.)
73.
Consider a simple random walk starting from 0 and let $T_a$ be the first time that state $a$ will be visited. Derive the formulæ for $P(T_a < \infty)$ in detail.
Solution. We have
$$P(T_a < \infty) = \lim_{s \uparrow 1} Es^{T_a}.$$
First consider the case $a > 0$. We have that, for all values of $p$,
$$Es^{T_a} = \left(Es^{T_1}\right)^a = \left(\frac{1 - \sqrt{1 - 4pqs^2}}{2qs}\right)^a, \quad |s| < 1.$$
So
$$P(T_a < \infty) = \lim_{s \uparrow 1}\left(\frac{1 - \sqrt{1 - 4pqs^2}}{2qs}\right)^a = \left(\frac{1 - \sqrt{1 - 4pq}}{2q}\right)^a = \left(\frac{1 - |p - q|}{2q}\right)^a.$$
If $p \geq q$ then $|p - q| = p - q$ and simple algebra gives $P(T_a < \infty) = 1$. If $p < q$ then $|p - q| = q - p$ and simple algebra gives $P(T_a < \infty) = (p/q)^a$. Next consider the case $a < 0$. By interchanging the roles of $p$ and $q$ we have
$$P(T_a < \infty) = \left(\frac{1 - |q - p|}{2p}\right)^{-a}.$$
If $p \leq q$ then $|q - p| = q - p$ and simple algebra gives $P(T_a < \infty) = 1$. If $p > q$ then $|q - p| = p - q$ and simple algebra gives $P(T_a < \infty) = (q/p)^{-a}$.
74.
Show that for a symmetric simple random walk any state is visited infinitely many times with probability 1.
Solution.
75.
Derive the expectation of the running maximum $M_n$ for a SSRW starting from 0:
$$EM_n = E|S_n| + \tfrac{1}{2}P(S_n = 0) - \tfrac{1}{2},$$
where $P(S_n = 0) = \binom{n}{n/2} 2^{-n}$ for even $n$ (and $= 0$ for odd $n$). Conclude that $EM_n / E|S_n| \to 1$ as $n \to \infty$.
Solution. This follows from the formula
$$P(M_n \geq x) = P(|S_n| \geq x) - \tfrac{1}{2}P(|S_n| = x).$$
We have $EM_n = \sum_{x=1}^\infty P(M_n \geq x)$ and $E|S_n| = \sum_{x=1}^\infty P(|S_n| \geq x)$, so:
$$EM_n = E|S_n| - \frac{1}{2}\sum_{x=1}^\infty P(|S_n| = x).$$
The last sum equals $1 - P(S_n = 0) = 1 - \binom{n}{n/2} 2^{-n}$ (for even $n$).
76.
Using the ballot theorem, show that, for a SSRW starting from 0,
$$P(S_1 > 0, \ldots, S_n > 0) = \frac{ES_n^+}{n},$$
where $S_n^+ = \max(S_n, 0)$.
Solution. The ballot theorem says
$$P(S_1 > 0, \ldots, S_n > 0 \mid S_n = x) = x/n, \quad x > 0.$$
Hence
$$P(S_1 > 0, \ldots, S_n > 0) = \sum_{x=1}^\infty P(S_1 > 0, \ldots, S_n > 0 \mid S_n = x)\, P(S_n = x) = \sum_{x=1}^\infty \frac{x}{n}\, P(S_n = x) = \frac{1}{n}\, ES_n^+.$$
77.
For a simple random walk with $p < q$ show that $EM_\infty = \dfrac{p}{q - p}$.
Solution. We have
$$EM_\infty = \sum_{x=1}^\infty P(M_\infty \geq x) = \sum_{x=1}^\infty P(T_x < \infty) = \sum_{x=1}^\infty (p/q)^x = \frac{p/q}{1 - (p/q)} = \frac{p}{q - p}.$$
78.
Consider a SSRW starting from some positive integer $x$, and let $T_0$ be the first $n$ such that $S_n = 0$. Let $M = \max\{S_n : 0 \leq n \leq T_0\}$. Show that $M$ has the same distribution as the integer part of $x/U$ (i.e. the largest integer not exceeding $x/U$), where $U$ is a uniform random variable between 0 and 1.
Solution. Let $T_a$ be the first time that the random walk reaches level $a \geq x$. Then
$$P(M \geq a) = P(T_a < T_0) = x/a.$$
On the other hand, if $[y]$ denotes the largest integer not exceeding the real number $y$, we have, for all $a \geq x$,
$$P([x/U] \geq a) = P(x/U \geq a) = P(U \leq x/a) = x/a.$$
Hence $P([x/U] \geq a) = P(M \geq a)$ for all $a \geq x$ (while both probabilities are 1 for $a < x$). Hence $M$ has the same distribution as $[x/U]$.
79.
Show that $(X_n, n \in \mathbb{Z}_+)$ is Markov if and only if, for all intervals $I = [M, N] \subset \mathbb{Z}_+$, the process inside, $(X_n, n \in I)$, is independent of the process outside, $(X_n, n \notin I)$, conditional on the pair $(X_M, X_N)$ (the process on the boundary).
80.
A deck of cards has 3 Red and 3 Blue cards. At each stage, a card is selected at random. If it is Red, it is removed from the deck. If it is Blue, the card is not removed and we move to the next stage. Find the average number of steps till the process ends.
Solution. The problem can be solved by writing first-step equations for the Markov chain representing the number of Red cards remaining. The transition probabilities are:
$$p(3, 2) = 3/6, \quad p(2, 1) = 2/5, \quad p(1, 0) = 1/4,$$
$$p(i, i) = 1 - p(i, i-1), \ i = 3, 2, 1, \qquad p(0, 0) = 1.$$
Let $\psi(i)$ be the average number of steps till the process ends if the initial state is $i$. Then
$$\psi(3) = 1 + (3/6)\psi(3) + (3/6)\psi(2),$$
$$\psi(2) = 1 + (3/5)\psi(2) + (2/5)\psi(1),$$
$$\psi(1) = 1 + (3/4)\psi(1).$$
Alternatively, observe that, to go from state $i$ to $i - 1$, we essentially toss a coin with probability of success equal to $p(i, i-1)$. Hence the expected number of tosses till success is $1/p(i, i-1)$. Adding these up we have the answer:
$$\psi(3) = (6/3) + (5/2) + (4/1) = 8.5.$$
81.
There are two decks of cards. Deck 1 contains 50 Red cards, 30 Blue cards, and 20 Jokers. Deck 2 contains 10, 80, 10, respectively. At each stage we select a card from a deck. If we select a Red card then we select a card from the other deck at the next stage. If we select a Blue card then we select a card from the same deck at the next stage. If, at any stage, a Joker is selected, then the game ends. Cards are always replaced in the decks. Set up a Markov chain and find, if we first pick a card from a deck chosen at random, how many steps it takes on average for the game to end.
Solution. The obvious Markov chain has three states: 1 (you take a card from Deck 1), 2 (you take a card from Deck 2), and J (you selected a Joker). Since Red sends you to the other deck and Blue keeps you at the same deck, we have:
$$p(1, 2) = 50/100, \quad p(1, 1) = 30/100, \quad p(1, J) = 20/100,$$
$$p(2, 1) = 10/100, \quad p(2, 2) = 80/100, \quad p(2, J) = 10/100,$$
$$p(J, J) = 1.$$
Let $\psi(i)$ be the average number of steps for the game to end when we start from deck $i$. Then
$$\psi(1) = 1 + (30/100)\psi(1) + (50/100)\psi(2),$$
$$\psi(2) = 1 + (80/100)\psi(2) + (10/100)\psi(1).$$
Solve for $\psi(1)$, $\psi(2)$. Since the initial deck is selected at random, the answer is $(\psi(1) + \psi(2))/2$.
82.
Give an example of a Markov chain with a small number of states that
(i) is irreducible
(ii) has exactly two communication classes
(iii) has exactly two inessential and two essential states
(iv) is irreducible and aperiodic
(v) is irreducible and has period 3
(vi) has exactly one stationary distribution but is not irreducible
(vii) has more than one stationary distribution
(viii) has one state with period 3, one with period 2 and one with period 1, as well as
a number of other states
(ix) is irreducible and detailed balance equations are satisfied
(x) has one absorbing state, one inessential state and two other states which form a
closed class
(xi) has exactly 2 transient and 2 recurrent states
(xii) has exactly 3 states, all recurrent, and exactly 2 communication classes
(xiii) has exactly 3 states and 3 communication classes
(xiv) has 3 states, one of which is visited at most finitely many times and the other
two are visited infinitely many times, with probability one
(xv) its stationary distribution is unique and uniform
83.
Give an example of a Markov chain with infinitely many states that
(i) is irreducible and positive recurrent
(ii) is irreducible and null recurrent
(iii) is irreducible and transient
(iv) forms a random walk
(v) has an infinite number of inessential states and an infinite number of essential states which are all positive recurrent
84.
A drunkard starts from the pub (site 0) and moves one step to the right with probability 1. If, at some stage, he is at site $k$, he moves one step to the right with probability $p_k$, one step to the left with probability $q_k$, or stays where he is with the remaining probability. Suppose $p + q = 1$, $0 < p < q$. Show that the drunkard will visit 0 infinitely many times with probability 1.
Solution. Write down the balance equations for the stationary distribution. Observe that, since $\sum_{k=1}^\infty (p/q)^{k(k-1)/2} < \infty$, we have that $\pi(k) > 0$ for all $k$. Hence not only will 0 be visited infinitely many times, but also the expected time, starting from 0, till the first return to 0 is $1/\pi(0) < \infty$.
85.
A Markov chain takes values $1, 2, 3, 4, 5$. From $i$ it can move to any $j > i$ with equal probability. State 5 is absorbing. Starting from 1, how many steps on average will it take till it reaches 5?
Solution. Let $h(i)$ be the mean number of steps to reach 5 from $i$; first-step analysis gives $h(i) = 1 + \frac{1}{5-i}\sum_{j=i+1}^5 h(j)$, with $h(5) = 0$. Hence $h(4) = 1$, $h(3) = 3/2$, $h(2) = 11/6$ and $h(1) = 25/12 \approx 2.08$.
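A two-line computation (my addition, not in the original text) reproduces $25/12 \approx 2.08$ exactly:

```python
from fractions import Fraction

# Sketch: first-step analysis for Exercise 85; h(i) is the mean number of
# steps to reach state 5 from i; from i the chain jumps to each j > i
# with probability 1/(5 - i).
h = {5: Fraction(0)}
for i in range(4, 0, -1):
    h[i] = 1 + sum(h[j] for j in range(i + 1, 6)) / (5 - i)
print(h[1], "~", float(h[1]))   # 25/12 ~ 2.0833
```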
86.
There are $N$ individuals, some infected by a disease (say the disease of curiosity) and some not. At each stage, exactly one uninfected individual is placed in contact with the infected ones. An infected individual infects with probability $p$; so an uninfected individual becomes infected if he or she gets infected by at least one of the infected individuals. Assume that, to start with, there is only one infected person. Build a Markov chain with states $1, 2, \ldots, N$ and argue that $p(k, k+1) = 1 - (1-p)^k$. Show that, on average, it takes about $N + q(1 - q^N)/(1 - q)$ steps for everyone to become infected.
Solution. When there are $k$ infected individuals and one uninfected is brought in contact with them, the chance that the latter is not infected is $(1-p)^k$. So
$$p(k, k) = (1-p)^k, \qquad p(k, k+1) = 1 - (1-p)^k, \qquad k = 1, \ldots, N-1.$$
Of course, $p(N, N) = 1$. The average number of steps to go from $k$ to $k + 1$ is $1/p(k, k+1)$. Hence, starting with 1 infected individual, it takes on average
$$\sum_{k=1}^{N-1} \frac{1}{1 - (1-p)^k}$$
steps for everyone to become infected. (Writing $q = 1 - p$ and expanding $1/(1 - q^k) \approx 1 + q^k$, this sum is $(N - 1) + q(1 - q^{N-1})/(1 - q)$ to first order, which matches the expression in the statement up to lower-order terms.)
87.
Assume, in addition, that at each stage exactly one infected individual is selected for treatment and he or she becomes well with probability $\alpha > 0$, independently of everything else. (i) What are the state space and the transition probabilities? (ii) How many absorbing states are there? (iii) What kind of question would you like to ask here, and how would you answer it?
Solution. (i) Since $p(1, 0)$ equals $\alpha q$, which is positive, we now need to include 0 among the states. So the state space is
$$S = \{0, 1, 2, \ldots, N\}.$$
Writing $q = 1 - p$, the transition probabilities now become
$$p(k, k-1) = \alpha q^k, \qquad p(k, k) = \alpha(1 - q^k) + (1 - \alpha)q^k, \qquad p(k, k+1) = (1 - \alpha)(1 - q^k), \qquad k = 1, \ldots, N-1,$$
$$p(N, N-1) = \alpha q^N, \qquad p(N, N) = 1 - \alpha q^N, \qquad p(0, 0) = 1.$$
(ii) There is only one absorbing state: the state 0. (iii) The question here is: how long will it take for the chain to be absorbed at 0? Letting $g(k)$ be the mean time to absorption starting from $k$, we have
$$g(k) = 1 + p(k, k)\, g(k) + \alpha q^k\, g(k-1) + (1 - \alpha)(1 - q^k)\, g(k+1), \quad 1 \leq k \leq N-1,$$
$$g(N) = 1 + (1 - \alpha q^N)\, g(N) + \alpha q^N\, g(N-1),$$
$$g(0) = 0.$$
There is a unique solution.
88.
Prove that for an irreducible Markov chain with $N$ states it is possible to go from any state to any other state in at most $N - 1$ steps.
Solution. For any two distinct states $i, j$ there is a path that takes you from $i$ to $j$. Cut out any loops from this path and you still have a path that takes you from $i$ to $j$. But this path has distinct states, so it uses at most $N - 1$ arrows.
89.
Consider the general 2-state chain, where $p(1, 2) = a$, $p(2, 1) = b$. Give necessary and sufficient conditions for the chain (i) to be aperiodic, (ii) to possess an absorbing state, (iii) to have at least one stationary distribution, (iv) to have exactly one stationary distribution.
Solution. (i) There must be at least one self-loop. The condition is:
$$a < 1 \ \text{or} \ b < 1.$$
(ii)
$$a = 0 \ \text{or} \ b = 0.$$
(iii) It always does, because it has a finite number of states.
(iv) There must exist exactly one closed (recurrent) class:
$$a > 0 \ \text{or} \ b > 0.$$
90.
Show that the stationary distribution of the random walk on any (undirected) graph whose vertices all have the same degree is uniform.
Solution. We know that
$$\pi(i) = c\, d(i),$$
where $d(i)$ is the degree of $i$ and $c$ is some constant. Indeed, the detailed balance equations
$$\pi(i)\, p(i, j) = \pi(j)\, p(j, i), \quad i \neq j,$$
are trivially satisfied because $p(i, j) = 1/d(i)$, $p(j, i) = 1/d(j)$, by definition. So when $d(i) = d = $ constant, the distribution $\pi$ is uniform.
91.
Consider the random walk on the graph
$$1 - 2 - 3 - 4 - \cdots - (N-1) - N.$$
(i) Find its stationary distribution. (ii) Find the average number of steps to return to state 2, starting from 2. (iii) Repeat for 1. (iv) Find the average number of steps for it to go from $i$ to $N$. (v) Find the average number of steps to go from $i$ to either 1 or $N$. (vi) Find the average number of steps it takes to visit all states at least once.
Solution. Hint for (vi): the time to visit all states at least once is the time to hit the boundary plus the time to then hit the other end of the boundary.
92.
When a bus arrives at the HW campus, the next bus arrives in $1, 2, \ldots, 20$ minutes with equal probability. You arrive at the bus stop, without checking the schedule, at some fixed time. How long, on average, should you wait till the next bus arrives? What is the standard deviation of this time?
Solution. This is based on one of the examples we discussed: let $X_n$ be the time elapsed from time $n$ till the arrival of the next bus. Then $X_n$ is a Markov chain with transition probabilities
$$p(k, k-1) = 1, \quad k > 0,$$
$$p(0, k) = p_k, \quad k > 0,$$
where $p_k = (1/20)\,\mathbf{1}(1 \leq k \leq 20)$. We find that the stationary distribution is
$$\pi(k) = c \sum_{j \geq k} p_j = c \sum_{j=k}^{20} \frac{1}{20} = \frac{c(21 - k)}{20}, \quad 0 \leq k \leq 20,$$
where $c$ is a constant determined by normalisation:
$$1 = \sum_{k=0}^{20} \pi(k) = \frac{c}{20} \sum_{k=0}^{20} (21 - k) = \frac{c}{20} \cdot \frac{21 \times 22}{2}, \qquad c = \frac{20}{231}.$$
Hence
$$\pi(k) = \frac{21 - k}{231}, \quad 0 \leq k \leq 20,$$
and so the average waiting time is
$$\sum_{k=0}^{20} k\, \pi(k) = \sum_{k=0}^{20} k\, \frac{21 - k}{231} = 20/3 = 6'40''.$$
The standard deviation is
$$\sqrt{\sum_{k=0}^{20} k^2\, \pi(k) - (20/3)^2} = \frac{\sqrt{230}}{3} \approx 5'3''.$$
Note: to do the sums without too much work, use the formulae
$$\sum_{k=1}^n k = \frac{n(n+1)}{2}, \qquad \sum_{k=1}^n k^2 = \frac{n(n+1)(2n+1)}{6}, \qquad \sum_{k=1}^n k^3 = \left(\frac{n(n+1)}{2}\right)^2.$$
93.
Build a Markov chain as follows: when in state $k$ ($k = 1, 2, 3, 4, 5, 6$), roll a die $k$ times, take the largest value and move to that state. (i) Compute the transition probabilities and write down the transition probability matrix. (ii) Is the chain aperiodic? (iii) Does it have a unique stationary distribution? (iv) Can you find which state will be visited most frequently on average?
Solution. (i) Let $M_k$ be the maximum of $k$ independent rolls. Then
$$P(M_k \leq \ell) = (\ell/6)^k, \quad k, \ell = 1, \ldots, 6.$$
The transition probability from state $k$ to state $\ell$ is
$$p(k, \ell) = P(M_k = \ell) = (\ell/6)^k - ((\ell - 1)/6)^k, \quad k, \ell = 1, \ldots, 6.$$
The transition probability matrix is
$$\begin{pmatrix}
1/6 & 1/6 & 1/6 & 1/6 & 1/6 & 1/6 \\
1/36 & 3/36 & 5/36 & 7/36 & 9/36 & 11/36 \\
1/216 & 7/216 & 19/216 & 37/216 & 61/216 & 91/216 \\
1/1296 & 15/1296 & 65/1296 & 175/1296 & 369/1296 & 671/1296 \\
1/7776 & 31/7776 & 211/7776 & 781/7776 & 2101/7776 & 4651/7776 \\
1/46656 & 63/46656 & 665/46656 & 3367/46656 & 11529/46656 & 31031/46656
\end{pmatrix}
\approx
\begin{pmatrix}
0.167 & 0.167 & 0.167 & 0.167 & 0.167 & 0.167 \\
0.0278 & 0.0833 & 0.139 & 0.194 & 0.250 & 0.306 \\
0.00463 & 0.0324 & 0.0880 & 0.171 & 0.282 & 0.421 \\
0.000772 & 0.0116 & 0.0502 & 0.135 & 0.285 & 0.518 \\
0.000129 & 0.00399 & 0.0271 & 0.100 & 0.270 & 0.598 \\
0.0000214 & 0.00135 & 0.0143 & 0.0722 & 0.247 & 0.665
\end{pmatrix}$$
(ii) The chain is obviously aperiodic because it has at least one self-loop.
(iii) Yes, it does, because it is finite and irreducible.
(iv) Intuitively, this should be state 6.
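The matrix and the answer to (iv) can be checked numerically (sketch below, my addition); the stationary distribution puts most of its mass on state 6.

```python
import numpy as np

# Sketch: build p(k, l) = (l/6)^k - ((l-1)/6)^k (Exercise 93) and find the
# stationary distribution by power iteration.
P = np.array([[(ell / 6) ** k - ((ell - 1) / 6) ** k for ell in range(1, 7)]
              for k in range(1, 7)])
pi = np.full(6, 1 / 6)
for _ in range(500):
    pi = pi @ P
print(np.round(pi, 4))   # mass concentrates heavily on state 6
```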
94.
Simple queueing system: Someone arrives at a bank at time n with probability α. He
or she waits in a queue (if any) which is served by one bank clerk in a FCFS fashion.
When at the head of the queue, the person requires a service which is distributed like
a random variable S with values in N: P (S = k) = p
k
, k = 1, 2, . . .. Different people
require services which are independent random variables. Consider the quantity W
n
which is the total waiting time at time n: if I take a look at the queue at time n then
W
n
represents the time I have to wait in line till I finish my service. (i) Show that W
n
obeys the recursion
W
n+1
= (W
n
+ S
n
ξ
n
1)
+
,
where the S
n
are i.i.d. random variables distributed like S, independent of the ξ
n
. The
latter are also i.i.d. with P (ξ
n
= 1) = α, P (ξ
n
= 0) = 1 α. Thus ξ
n
= 1 indicates
that there is an arrival at time n. (ii) Show that W
n
is a Markov chain and compute
its tr ansition p robabilities p(k, ), k, = 0, 1, 2, . . ., in terms of the parameters α and
p
k
. (iii) Sup pose that p
1
= 1 β, p
2
= β. Find conditions on α and β so that the
stationary distribution exists. (iv) Give a physical interpretation of this condition. (v)
Find the stationary distribution. (vi) Find the average waiting time in steady-state.
(vii) If α = 4/5 (4 cu s tomers arrive every 5 units of time on the average–heavy traffic),
what is the maximum value of β so that a stationary distribution exists? What is the
average waiting time when β = 0.24?
Solution. (i) If, at time n the waiting time W
n
is nonzero and nobody arrives then
W
n+1
= W
n
1, because, in 1 unit of time the waiting time d ecreases by 1 unit.
If, at time n, somebody arrives and has service time S
n
then, immediately the wait-
ing time becomes W
n
+ S
n
and so, in 1 unit of time this decreases by 1 so that
W
n+1
= W
n
+ S
n
1. Putting things together we arrive at the announced equation.
Notice that the superscript + means maximum with 0, because, if W
n
= 0 and nobody
arrives, then W
n+1
= W
n
= 0.
69
(ii) That the W
n
form a Markov chain with values in Z
+
follows from th e previ-
ous exercise. To find the transition probabilities we argue as follows : Let p(k, ) =
P (W
n+1
= | W
n
= k). First, observe that, for a given k, the cannot be less than
k 1. In fact, p(k, k 1) = 1 α (the probability that nobod y arrives). Next, for
to be equal to k we need that somebody arrives and brings work equal to 1 unit:
p(k, k) = αp
1
. Finally, for general > k we need to have an arrival which brings work
equal to k + 1: p(k, ) = αp
k+1
.
(iii) Here we have
p(k, k 1) = 1 α, p(k, k + 1) = αβ, p(k, k) = α(1 β).
To compute the stationary distribution we write balance equations:
π(k)(1 α) = π(k 1)αβ, k 0.
Iterating this we get
π(k) =
αβ
1 α
k
π(0).
We need to be able to normalise:
X
k=0
αβ
1 α
k
π(0) = 1.
We can do this if and only if the geometric series converges. This happens if and only
if
αβ
1 α
< 1,
(iv) The condition can also be written as
1 + β < 1/α.
The left side is the average service time (1 × (1 β) + 2 × β). Th e right side is the
average time between two successive arrivals. So the condition reads:
Average service time < average time between two successive arrivals.
(v) From the normalisation condition, π(0) = (1 − α(1 + β))/(1 − α). This follows
because
Σ_{k≥0} (αβ/(1 − α))^k = 1/(1 − αβ/(1 − α)) = (1 − α)/(1 − α(1 + β)).
Hence
π(k) = [(1 − α(1 + β))/(1 − α)] (αβ/(1 − α))^k, k ≥ 0.
This is of the form π(k) = (1 − ρ)ρ^k, where ρ = αβ/(1 − α); hence a geometric
distribution.
(vi) The average waiting time in steady-state is
Σ_{k≥0} kπ(k) = Σ_{k≥0} kρ^k (1 − ρ) = ρ/(1 − ρ) = αβ/(1 − α(1 + β)).
(vii) If α = 4/5 then β < (5/4) − 1 = 1/4. So the service time must be such that
P(S = 2) = β < 1/4, and P(S = 1) = 1 − β > 3/4. When β = 0.24 we are OK, since
0.24 < 0.25. In this case, the average waiting time is αβ/(1 − α(1 + β)) = 0.192/0.008 = 24,
which is quite large compared to the maximum value of S.
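These formulas can be checked against a long simulation of the recursion from part (i). A sketch with the numbers of part (vii) (the run length is our own choice, taken large because ρ = 0.96 puts the chain close to criticality and it mixes slowly):

```python
import random
from collections import Counter

alpha, beta = 0.8, 0.24                  # the heavy-traffic numbers of part (vii)
rho = alpha * beta / (1 - alpha)         # = 0.96
rng = random.Random(1)

w, counts, n = 0, Counter(), 2_000_000
for _ in range(n):
    s = 1 if rng.random() < 1 - beta else 2    # P(S=1) = 1-beta, P(S=2) = beta
    xi = 1 if rng.random() < alpha else 0      # arrival with probability alpha
    w = max(w + s * xi - 1, 0)
    counts[w] += 1

print("mean:", sum(k * c for k, c in counts.items()) / n,
      "theory:", alpha * beta / (1 - alpha * (1 + beta)))   # ~ 24
for k in range(4):                       # empirical pi(k) vs (1-rho) rho^k
    print(k, counts[k] / n, (1 - rho) * rho**k)
```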
95.
Let S_n be a simple symmetric random walk with S_0 = 0. Show that |S_n| is Markov.
Solution. If we know the value of S_n, we know two things: its absolute value |S_n|
and its sign. But to determine the value of |S_{n+1}|, knowledge of the sign is irrelevant,
since, by symmetry, |S_{n+1}| = |S_n| ± 1 with probability 1/2 each if S_n ≠ 0, while |S_{n+1}| = 1
if S_n = 0. Hence P(|S_{n+1}| = j | S_n = i) depends only on |i|: if |i| > 0 it is 1/2 for
j = |i| ± 1, and if i = 0 it is 1 for j = 1.
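A quick empirical check of these transition probabilities (a sketch; the walk length is arbitrary): the estimated probability that |S| increases by 1 should be about 1/2 from every positive level, and 1 from level 0.

```python
import random
from collections import Counter

rng = random.Random(0)
ups, visits = Counter(), Counter()   # count up-moves of |S| from each level

s = 0
for _ in range(1_000_000):
    i = abs(s)
    s += 1 if rng.random() < 0.5 else -1
    visits[i] += 1
    ups[i] += (abs(s) == i + 1)

for i in range(5):
    print(i, ups[i] / visits[i])     # ~1.0 at level 0, ~0.5 at levels i > 0
```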
96.
Let S_n be a simple but not symmetric random walk (say it steps up with probability
p ≠ 1/2). Show that |S_n| is not Markov.
Solution. In contrast to the above, P(|S_{n+1}| = j | S_n = i) is not a function of |i|
alone: for i > 0 it equals p when j = i + 1, whereas, given S_n = −i, it equals 1 − p.
So the sign of S_n matters.
97.
Consider a modification of the simple symmetric random walk that takes 1 or 2 steps
up, each with probability 1/4, or a step down with probability 1/2. Let Z_n be the position
at time n. Show that P(Z_n → ∞) = 1.
Solution. From the Strong Law of Large Numbers, Z_n/n converges, as n → ∞, with
probability 1, to the expected value of the step:
(−1) × 0.5 + (+1) × 0.25 + (+2) × 0.25 = 0.25.
Since this is a positive number, it follows that Z_n must converge to infinity with proba-
bility 1.
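A quick illustration by simulation (the horizon of 10^6 steps is arbitrary): Z_n/n should come out near 0.25.

```python
import random

rng = random.Random(0)
z, n = 0, 1_000_000
for _ in range(n):
    u = rng.random()
    z += -1 if u < 0.5 else (1 if u < 0.75 else 2)   # steps -1, +1, +2
print(z / n)   # close to the mean step, 0.25
```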
98.
There are N coloured items and c possible colours. Pick an item at random
and change its colour to one of the other c − 1 colours at random. Keep doing this.
What is the Markov chain describing this experiment? Find its stationary distribution.
(Hint: when c = 2 it is the Ehrenfest chain.)
Solution. The Markov chain has states
x = (x_1, . . . , x_c),
where x_i is the number of items having colour i. Of course, x_1 + ··· + x_c = N. If we
let e_i be the vector with 1 in the i-th position and 0 everywhere else, then we see that
from state x only a transition to a state of the form x − e_i + e_j is possible, and only if x_i > 0.
The transition probability is
p(x, x − e_i + e_j) = (x_i/N) (1/(c − 1)),
because x_i/N is the probability that you pick an item with colour i, and 1/(c − 1) is
the probability that its new colour will be j. To find the stationary distribution π(x)
we just try to see if detailed balance equations hold. If they do then we are happy and
know that we have found it. If they don't, well, we don't give up and try to see how
to satisfy the (full) balance equations. Recall:
Full balance equations: π(x) = Σ_y π(y)p(y, x), for all x.
If the chain is finite (and here it is), we can always find a probability distribution π
that satisfies the full balance equations.
Detailed balance equations: π(x)p(x, y) = π(y)p(y, x), for all x, y.
Even if the chain is finite, it is NOT always the case that the detailed balance equations
hold. If they do, then we should feel lucky!
Since, for c = 2 (the Ehrenfest chain), the stationary distribution is the binomial
distribution, we may GUESS that the stationary distribution here is multinomial:
GUESS: π(x) = π(x_1, . . . , x_c) = (N choose x_1, . . . , x_c) c^{−N} = [N!/(x_1! ··· x_c!)] c^{−N}.
Now
CHECK WHETHER: π(x)p(x, y) = π(y)p(y, x) HOLDS FOR ALL x, y.
If y, x are not related by y = x − e_i + e_j for some distinct colours i, j, then p(x, y) =
p(y, x) = 0, and so the equations hold trivially. Suppose then that y = x − e_i + e_j
for some distinct colours i, j. We have
π(x)p(x, x − e_i + e_j) = [N! c^{−N}/(x_1! ··· x_i! ··· x_j! ··· x_c!)] (x_i/N) (1/(c − 1))
and
π(x − e_i + e_j)p(x − e_i + e_j, x) = [N! c^{−N}/(x_1! ··· (x_i − 1)! ··· (x_j + 1)! ··· x_c!)] ((x_j + 1)/N) (1/(c − 1)).
The two quantities are the same: cancelling x_i in the first and x_j + 1 in the second
leaves, in both cases, the denominator x_1! ··· (x_i − 1)! ··· x_j! ··· x_c!. Hence the detailed
balance equations are satisfied. Hence the multinomial distribution IS THE stationary
distribution.
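For a small instance the detailed balance equations can also be verified exhaustively. A sketch (the instance N = 4, c = 3 and the function names are our own choices):

```python
from itertools import product
from math import factorial

N, c = 4, 3   # a small test instance

def pi(x):
    """Multinomial guess: N!/(x_1! ... x_c!) * c^(-N)."""
    out = factorial(N) * c ** (-N)
    for xi in x:
        out /= factorial(xi)
    return out

def p(x, y):
    """Transition probability of the colour chain."""
    d = [yi - xi for xi, yi in zip(x, y)]
    if sorted(d) != [-1] + [0] * (c - 2) + [1]:
        return 0.0
    i = d.index(-1)                    # the colour that lost an item
    return (x[i] / N) * (1 / (c - 1))

states = [x for x in product(range(N + 1), repeat=c) if sum(x) == N]
assert abs(sum(pi(x) for x in states) - 1) < 1e-12
for x in states:
    for y in states:
        assert abs(pi(x) * p(x, y) - pi(y) * p(y, x)) < 1e-12
print("detailed balance verified for N =", N, "and c =", c)
```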
99.
In the previous problem: if there are 9 balls and 3 colours (Red, Green, Blue) and we
initially start with 3 balls of each colour, how long will it take, on the average, till we
see the same configuration again? (Suppose that 1 step = 1 minute.) If we start with
all balls coloured Red, how long will it take, on the average, till we see the same again?
Solution. Recall that the mean return time to a state x equals 1/π(x). Here,
π(3, 3, 3) = [9!/(3! 3! 3!)] 3^{−9} = 560/6561.
Hence the average number of steps between two successive occurrences of the state
(3, 3, 3) is 6561/560 ≈ 11.72 minutes. Next,
π(9, 0, 0) = [9!/(9! 0! 0!)] 3^{−9} = 1/19683.
Hence the average number of steps between two successive occurrences of the state
(9, 0, 0) is 19683 minutes = 328.05 hours ≈ 13.7 days.
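The first answer can be checked by simulating the colour chain of the previous problem (a sketch; the run length is our own choice):

```python
import random

rng = random.Random(0)
N, c = 9, 3
x = [3, 3, 3]
returns = total = since = 0
for _ in range(2_000_000):
    r = rng.randrange(N)             # pick one of the 9 balls uniformly
    i = 0
    while r >= x[i]:                 # its colour is i with probability x_i/N
        r -= x[i]
        i += 1
    j = rng.choice([k for k in range(c) if k != i])   # repaint it
    x[i] -= 1
    x[j] += 1
    since += 1
    if x == [3, 3, 3]:
        returns += 1
        total += since
        since = 0
print(total / returns, "vs", 6561 / 560)   # both ~ 11.72
```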
100.
Consider a random walk on a star-graph that has one centre vertex 0 and N legs
emanating from 0. Leg i contains ℓ_i vertices (in addition to 0), labelled
v_{i,1}, v_{i,2}, . . . , v_{i,ℓ_i}.
The vertices are in sequence: 0 is connected to v_{i,1}, which is connected to v_{i,2}, etc., till
the end vertex v_{i,ℓ_i}. (i) A particle starts at 0. Find the probability that it reaches the
end of leg i before reaching the end of any other leg.
[Figure: a star graph with centre vertex 0 and five legs; the vertices shown are v_{1,1}, v_{1,2}, v_{2,1}, v_{2,2}, v_{3,1}, v_{3,2}, v_{4,1}, v_{5,1}, v_{5,2}.]
(ii) Suppose N = 3, ℓ_1 = 2, ℓ_2 = 3, ℓ_3 = 600. Play a game as follows: start from 0. If
the end of leg i is reached, you win ℓ_i pounds. Find how much money you are willing
to pay to participate in this game.
Solution. (i) Let ϕ_i(x) be the probability that the end of leg i is reached before the
end of any other leg, starting from vertex x. Clearly,
ϕ_i(v_{i,ℓ_i}) = 1, ϕ_i(v_{k,ℓ_k}) = 0, k ≠ i.
Now, if v_{k,r} is an interior vertex of leg k (i.e. neither 0 nor the end vertex), then
ϕ_i(v_{k,r}) = (1/2) ϕ_i(v_{k,r−1}) + (1/2) ϕ_i(v_{k,r+1}),
where v_{k,0} means the centre 0. This means that the function
r ↦ ϕ_i(v_{k,r})
must be linear for each k (for the same reason that the probability of hitting the left
boundary of an interval before hitting the right one is linear for a simple symmetric
random walk). Hence
ϕ_i(v_{k,r}) = a_{i,k} r + b_{i,k},
where a_{i,k}, b_{i,k} are constants. For any leg we determine the constants in terms of the
values of ϕ_i at the centre 0 and at the end vertex. Thus,
ϕ_i(v_{k,r}) = (ϕ_i(0)/ℓ_k)(ℓ_k − r), k ≠ i,
ϕ_i(v_{i,r}) = r/ℓ_i + (ϕ_i(0)/ℓ_i)(ℓ_i − r).
Now, for vertex 0 we have
ϕ_i(0) = (1/N) Σ_{k=1}^N ϕ_i(v_{k,1})
= (1/N) [ 1/ℓ_i + (ϕ_i(0)/ℓ_i)(ℓ_i − 1) + Σ_{k≠i} (ϕ_i(0)/ℓ_k)(ℓ_k − 1) ]
= (1/N)(1/ℓ_i) + ϕ_i(0) (1/N) Σ_{k=1}^N (1 − 1/ℓ_k),
whence, since 1 − (1/N) Σ_{k=1}^N (1 − 1/ℓ_k) = (1/N) Σ_{k=1}^N (1/ℓ_k),
ϕ_i(0) = (1/ℓ_i) / Σ_{k=1}^N (1/ℓ_k).
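As a check, the harmonic equations can be solved directly as a linear system and compared with this formula. A sketch (using numpy; the leg lengths (2, 3, 4) are a small test case of our own choosing):

```python
import numpy as np

legs = [2, 3, 4]          # small test star
N = len(legs)

idx = {0: 0}              # vertex -> row index; (k, r) stands for v_{k,r}
for k, L in enumerate(legs):
    for r in range(1, L + 1):
        idx[(k, r)] = len(idx)
n = len(idx)

def hit_prob(target):
    """phi_target(0): solve the linear (harmonic) equations."""
    A, b = np.zeros((n, n)), np.zeros(n)
    A[0, 0] = 1.0                            # centre: phi(0) = mean over N neighbours
    for k in range(N):
        A[0, idx[(k, 1)]] -= 1.0 / N
    for k, L in enumerate(legs):
        for r in range(1, L):                # interior: phi = mean of 2 neighbours
            row = idx[(k, r)]
            A[row, row] = 1.0
            A[row, 0 if r == 1 else idx[(k, r - 1)]] -= 0.5
            A[row, idx[(k, r + 1)]] -= 0.5
        end = idx[(k, L)]                    # boundary conditions at the leg ends
        A[end, end] = 1.0
        b[end] = 1.0 if k == target else 0.0
    return np.linalg.solve(A, b)[0]

denom = sum(1.0 / L for L in legs)
for i, L in enumerate(legs):
    print(i, hit_prob(i), "formula:", (1.0 / L) / denom)
```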
(ii) Here
ϕ_3(0) = (1/600)/(1/2 + 1/3 + 1/600), ϕ_2(0) = (1/3)/(1/2 + 1/3 + 1/600), ϕ_1(0) = (1/2)/(1/2 + 1/3 + 1/600).
The average winnings are
600 ϕ_3(0) + 3 ϕ_2(0) + 2 ϕ_1(0) = 3 × (1/2 + 1/3 + 1/600)^{−1} = 1800/501 ≈ 3.6 pounds,
each of the three terms being equal to ℓ_i ϕ_i(0) = (1/2 + 1/3 + 1/600)^{−1} = 600/501 ≈ 1.2
pounds. This is the most you should be willing to pay to participate.
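The arithmetic of part (ii), sketched in plain Python:

```python
legs = [2, 3, 600]                        # the leg lengths of part (ii)
denom = sum(1.0 / L for L in legs)        # 1/2 + 1/3 + 1/600 = 501/600
phi = [(1.0 / L) / denom for L in legs]   # phi_i(0) from part (i)
print(phi)                                # [0.5988..., 0.3992..., 0.001996...]
print(sum(L * f for L, f in zip(legs, phi)))   # 1800/501 = 3.5928...
```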