game_theory_tadelis_solutions

This is page i

Printer: Opaque

Solution M anual

G am e T heory: A n In trodu ction

Steve Tadelis

Jan uary 31, 2013

Th is is page ii

Printer: Opaque

ABSTRACT This Slution Manual is incomplete. It will be updated every 2-3 weeks

to add the solutions to problems as they become available. A complete version is

expected by March 15, 2013.

Th is is page iii

Printer: Opaque

C on ten ts

I R a tio n a l De c isio n Mak in g 2

1 The Single-P erson Decision Problem 3

2 Introducing Uncertain ty and T ime 13

II Static G ames of Complete Information 44

3Preliminaries 45

4 R a tio n a lity and Com mon Knowledge 51

5 P in n in g Down Beliefs: Nash Equ ilibrium 61

6 Mixed Strategies 95

Contents 1

III Dynamic Games of Complete Information 113

7 Preliminaries 115

8 Credibility and Sequen tial Rationalit y 133

9 M ulti-Stage Games 163

10 Repeated G am es 179

11 Strategic B argaining 197

IV Static Gam es of Incomplete Information 206

12 Ba yesian Game s 207

13 Auctions and Com petitiv e Bidding 215

14 Mechanism Design 221

V Dynamic Gam es of Incomplete Information 222

15 Sequen tial Rationalit y with Incomplete Information 223

16 Signaling Games 231

17 Building a Reputation 239

18 Information Transm ission and Cheap Talk 243

Part I

R ational De cision M aking

This is page 3

Printer: Opaque

The Single-P erson D ecision P roblem

1. Think of a simple decision y ou face regularly and formalize it as a decision

problem , carefully listing the actions and outcomes withou t the preference

relation. Then, assign payoﬀs to the outcom es, and dra w the decision tree.

2. Going to the Movies: There are two movie theatres in you r neighbor-

hood: Cineclass, which is located one mile from yo ur home, and Cineblast,

located 3 miles from you r home, eac h showing three ﬁlms.Cineclassisshow-

ing Casablanca, Gone with the Wind and Dr. Str angelove, while Cineblast

is show ing The Matrix, Blade Runner and Aliens. Your problem is to decide

which movie to go to.

(a) Dra w a decision tree tha t represents this problem without assigning

pa y oﬀ values.

Answ er:

4 1. The Single-Person Decision Problem

(b) Imagine that you don’t care about distance and that y our preferences

for movies is alphabetic (i.e., y ou like Aliens the most and The Matrix

the least.) U sin g payoﬀ values 1 throug h 6 complete th e decision tree

y ou drew in part (a). What option wou ld y ou choose?

Answ er:

mile is equal to one unit of pay oﬀ.Updatethepayoﬀs in the decision

tree. Would y our c hoice change?

Answ er:

1. The Single-Person Decision Problem 5

3. Fruit or Candy: A banana costs $050 and a candy costs $025 at the local

cafeteria. You have $1.25 in your pocket and y ou value money. The money-

equivalent value (payoﬀ) you get from eating y our ﬁrst banana is $1.20, and

that of each additiona l banana is half the previous one (the second banana

gives y ou a value of $0.60, the third 0.30, etc.). Similarly, the pa y oﬀ you get

from eating your ﬁrst candy is $0.40, and that of eac h additional candy is

half the previous one ($0.20, 0.10, etc.). Your value from eating bananas is

not aﬀected by ho w ma ny candies you eat and vice v ersa.

(a) What is the set of possible actions you can tak e given your budget of

$1.25?

Answ er: You can buy any combination of bananas and candies that

sum up to no more than $1.25. If we denote by ( ) the choice to buy

 bananas and  cand ies, then the set of possible actions is

 = {(0 0) (0 1) (0 2) (0 3) (0  4) (0 5) (1 0) (1 1) (1 2) (1 3) (2 0) (2 1)}

(b) Dra w the decision tree that is associated with this decision problem.

Answ er: For each c hoice you need to calculate the ﬁnal net value. For

example, if y ou buy one banana and 2 candies then y ou get 1.2 w orth

from the banana, 0.4 from the ﬁrst candy an d 0.2 from the second which

totals 1.8. To this w e need to add the $0.25 you ha ve left (the cost was

only $1) so the net ﬁnal v alue you hav e is 2.05.

6 1. The Single-Person Decision Problem

with a rational choice argum en t.

Answ er: Yes. The highest net ﬁnal value if from buying two bananas

and one candy. ¥

(d) No w imagine that the price of a candy increased to $030.Howmany

possible actions do you have? Does your answer to (c) above chan ge?

Answ er: Of the 12 options above, three are no longer possible: (0 5) (1 3)

and (2 1).Also,thenetﬁnal values change because each candy is 5

cen ts more expensive. The highest net ﬁnal value is 2.05 whic h can be

obtained from one of two choices: (1 1) and (2 0), both lea ving some

money in the decision maker’s poc ket. ¥

4. Alcohol Consumption: Recall the examp le in which you needed to choose

ho w m uc h to drink. Imagine that your payoﬀ function is given by  − 4

where  is a parameter that depends on yo ur ph ysique. Ev ery person m ay

ha v e a diﬀerent value of , and it is known that in th e population ()the

smallest  is 02;() the largest  is 6;and()largerpeoplehavehigher’s

than smaller people.

1. The Single-Person Decision Problem 7

(a) Can you ﬁnd an am ount of drinking that no person should drink?

Answ er: The utilit y from drinkin g 0 is equal to 0. If a decision maker

drinks  =2then, if he has the largest  =6,hispayoﬀ is  =6× 2 −

4 × (2)

= −4 and it is easy to see th at decision m akers w ith smaller

values of  will obtain an even more negativ e payoﬀ from consuming

 =2. Hence, no person should choose  =2. ¥

(b) How much should y ou drink if yo u r  =1?If  =4?

Answ er: The optimal solu t i on is ob t ained b y maxi mizing the p ay oﬀ

function ()= − 4

.Theﬁr st-order maximization condition is

 − 8 =0implyin g that  =



is the optimal solution. For  =1the

solution is  =

and for  =4it is  =

. ¥

Answ er: This fo llows from the so lution in part (b) a bo ve. For ev e ry

type of person , the solution is ()=



which is increasing in ,and

larger people ha ve higher values of  ¥

(d) Should an y person drink more than one bottle of wine?

Answ er: N o. Ev en the largest type of person with  =6should only

consume  =

of a bottle of wine. ¥

5. Buying a Car: Youplanonbuyingausedcar.Youhave$12,000,andyou

arenoteligibleforanyloans.Thepricesofavailablecarsonthelotaregiven

as follo ws:

Make, Model & Year Price

To yota Corolla 2002 $9,350

Toyota Camry 2001 $10,500

Buick LeSabre 2001 $8,825

Honda Civic 2000 $9,215

Subaru Impreza 2000 $9,690

For any given year, you prefer a Ca m ry to an Impreza , an Im preza to a

Corolla, a Corolla to a Civic and a Civic to a LeSabre. For any given ye ar,

8 1. The Single-Person Decision Problem

you are willing to pay up to $999 to move form a car to the next preferred

car. For example, if the price of a Corolla is $, then you are willing to buy

it over a Civic if the Civic costs mo r e that $( − 999) but you w ould prefer

buy ing the Civic if it costs less than this amo u nt. Similarly, you prefer the

Civic at $ to a Corolla that costs more than $( + 1000) but y ou prefer the

Corolla if it costs less. For any given c ar,youarewillingtomovetoamodel

a year older if it is chea per b y at least $500 For example, if the price of a

2003 Civic is $, then you are willing to buy it o ver a 200 2 Civic if the 2002

Civic costs more that $( −500) but you would prefer buying the 2002 Civic

ifitcostslessthanthisamount.

(a) What is y our set of possible alternatives?

Answ er: Given that you have $12,000, w hich is m ore than the price

of any car, y ou have six alternativ es: any one of the ﬁve cars or buying

nothing. ¥

(b) Whatisyourpreferencerelationbetweenthealternativesin(a)above?

Answ er: To answer this we need use the information on willingness to

pay giv en in the question, together with the prices. The least valued

car w ould be a 2000 LeSabre. Assum e that the value of ow ning that car

is given by . From the information above, a 2000 Civic is valued at

 +999, a 2000 Corolla is valued at  +1 998,andsoonuptoa2000

Camry valued at  +3 996. Similarly, eac h of these ca rs for the year

2001 is valued at 500 m ore than the 2000 model, and the 20 02 model

is valued at 1,000 more than the 2000 model. Hence, w e can write the

tableofvaluesasfollows:

Make and Model year 2000 y ear 2001 year 2002

Toyota Camry  +3 996  +4 496  +4 996

Subaru Impreza  +2 997  +3 497  +3 997

Toyota Corolla

 +1 998  +2 498  +2 998

Hond a Civic  +999  +1 499  +1 999

Buick LeSabre  +0  +500  +1 000

1. The Single-Person Decision Problem 9

Now, to see what the net value from eac h purchase would be w e must

deduct the price of the car from the value. Using the ﬁve prices given

aboveandthevalueswejustcalculatedwehavenetpayoﬀsas(e.g.,for

the 2002 Corolla, the net pa yoﬀ is  +2 998 − 9 350 =  − 6 352),

Make, Model & Year Price

Toyota Corolla 2002  − 6 352

Toyota Camry 2001  − 6 004

Buick LeSabre 2001  − 8 325

Honda Civic 2000  − 8 216

Subaru Impreza 2000  − 6 693

Assum ing that  is large enough to wan t to buy any car, the ranking

of the alternativ es is, Toyota Camry 2001, followed b y To yota Corolla

2002, followed by Subaru Impreza 2000, followed by Honda Civic 2000

and last being the Buic k LeSab re 2001. ¥

with the possible alternativ es. Wh at would y o u c h oose?

Answ er: This follow s directly from th e analy sis in (b) abo ve: you shou ld

c hoose the Toyota Camry 2001 (with six branches, including no pur-

chase.) ¥

(d) Can you dra w a decision tree with diﬀerent pa yoﬀs that represents the

same pro blem ?

Answ er: Because w e left  as undetermined, w e can ﬁnd many v alues

of  that will represen t this problem. Notice that if  is small enough

(less than 6,004) then the best choice w o uld be not to buy a car. ¥

6. Fruit Trees: You h ave room for up to t wo fruit bearing trees in your gar de n.

The fruit trees that can grow in y our garden are either apple, orange or pear.

The cost of maintenance is $100 for an ap ple tree, $70 for an orange tree and

$120 for a pear tree. Your food bill w ill be reduced b y $130 for each apple

tree you plant, b y $145 for each pear tree you plan t and b y $90 for eac h

10 1. The Single-Person Decision Problem

orange tree you plant. You care only about your total expenditure in makin g

any planting decisions.

(a) What is the set of possible actions and related outcomes?

Answ er: Youhavetwo“slots”thatcanbeleftempty,orhaveoneof3

possible trees plan ted in eac h slot. Hence, you have 10 possible c h oices.

Theoutcomeswilljustbethechoicesofwhattoplant.¥

(b) What is the pa yoﬀ of eac h action/outcome?

Answ er: To calculate the pay oﬀsfromeachchoiceitisconvenientto

useatableasfollows:

Choice cost food sa vings net pa yoﬀ

nothing 00 0

one apple tree 100 130 30

one orange tree 70 90 20

one pear tree 120 145 25

t wo ap ple trees 200 260 60

t wo orange trees 140 180 40

t wo pear trees 240 290 50

apple and orange 170 220 50

apple and pear 220 275 55

pear and orange 190 235 45

Answ er: All but choosing t wo apple trees are dominated. ¥

(d) Dra w the associated decision tree. What will a rational play er choose?

Answ er: Thetreewillhavetenbrancheswiththepayoﬀs associated

with the table abo ve, and the optim al c ho ice is two apple trees. ¥

Th is is a pro be m o f cho o sing 2 items out of 4 poss ib i litie s wit h rep la ce ment, wh ich i s equ a l to



4+2−1



(4+2−1)!

2!(4−1)!

5×4

=10.

1. The Single-Person Decision Problem 11

(e) Nowimaginethatthefoodbillreductionishalfforthesecondtreeof

thesamekind(youlikevariety).Thatis,theﬁrst apple still reduces

y o u r food bill by $130, but if y o u plan t t wo apple trees y o u r food bill

will be reduced b y $130 + $65 = $195. (Similarly for pear and orange

trees.) What will a rational player choose now?

Answ er: An ap ple tree is still the best ch oice for the ﬁrst tree, but no w

thesecondtreeshouldbeapeartree.¥

7. City P arks: A city’s mayor has to decide how much money to spend on

parks and recreation. City codes restrict this spending to be no more than

5% of the budget, and the yearly budget of the city is $20,000,0 00. He wants

to please his constituen ts who ha v e diminishing returns from parks. The

money-equivalen t beneﬁtfromspending$ on parks is ()=

√

400 −

.

(a) What is the action set of the city’s mayor?

Answ er: The lim it on spendin g is $1 million, so the actions set is

 ∈ [0 1000000]. ¥

(b) Ho w m uch should the mayor spend?

Answ er: The maximizat ion prob lem is

max

∈[01000000]

√

400 −



and taking the der ivativ e for the ﬁrst-order condition we obtain,

√



−

=0,

or  =$640 000. T he second order d erivative is −5

−

 0 so this is

indeed a maximum. ¥

people are mor e willing to pay for parks. The new pr eferences of the

people are giv en by ()=

√

1600 −

 .Whatnowistheactionset

12 1. The Single-Person Decision Problem

of the ma yor, and ho w much spending should he choose to cater to his

constituents?

Answ er: The ﬁrst-ord er condition is now,

√



−

=0,

or  =$2 560 000. This exceeds the budget and hence the optim al

solution is to spend $1 million. ¥

This is page 13

Printer: Opaque

Introdu cing U ncerta in t y a nd T im e

1. Getting an MB A : Recall the decision problem in Section 2.3.1, and now

assum e that the probabilit y of a strong labor market is ,ofanaveragelabor

marketis0.5andofaweaklabormarketis05 −. All the other values are

the same.

(a) For which values of  willyoudecidenottogetanMBA?

Answ er: The expected pay oﬀsfromeachchoicearegivenby,

(Get MBA)= × 22 + 05 × 6+(05 − ) × 2=20 +4

(Don’t get MBA)= × 12 + 05 × 8+(05 − ) × 4=8 +6

which imp lies that getting an MBA is worthw hile if and only if

20 +4≥ 8 +6

or,  ≥

 ¥

(b) If  =04, what is the highest price the univ ersity can charge for you

to be willing to go ahead and get an MB A ?

14 2. Introducing Uncertainty and Time

Answ er: If  =04 then the pa yoﬀsare,

(Get MBA)=04 × 22 + 05 × 6+01 × 2=12

(Don’t get MBA)=04 × 12 + 05 × 8+01 × 4=92

which implies that an extra charge of up to 2.8 can be c h arged by the

university and y ou wo uld still be willing to get an MB A . ¥

2. Recreation Ch oices: A player has three possible ven ues to c hoose from:

going to a football game, going to a bo xing mat ch, or going for a hik e.

The pay oﬀ from each of these alternatives will depend on the weather. Th e

following table gives the agent’s pa yoﬀ in eac h of the two re levant w ea ther

ev ents:

Alternative pay oﬀ if R ain payoﬀ if Shine

Football game 1 2

Bo xing Match 3 0

Hike 0 1

For Let  denote the probability of rain.

(a) Is there an alternative that a rational pla yer will nev e r take regardless

of ? (i.e., it is dom ina ted for any  ∈ [0 1].)

Answ er: For this decision maker c hoosing the hike is always w orse

(dominated) by going to the football game, and he should never go on

ahike.¥

(b) What is the optima l decision, or best response, as a function of .

Answ er: The expected payoﬀs from eac h of the remain ing two c h oices

are giv en by,

(Football)= × 1+(1− ) × 2=2− 

(Boxing)= × 3+(1− ) × 0=3

which implies that football is a better choic e if and only if

2 −  ≥ 3

2. Introducing Uncertainty and Time 15

or,  ≤

, and boxin g is better otherwise. ¥

3. At the Dog Races: You’re in Las Vegas, and you can decide what to do at

the dog-racing bet room. You can c hoose not to participate, or you bet on

one of t wo dogs as follo w s. Betting on Snoopy costs $1, and y ou will be paid

$2 if he wins. Betting on Lassie costs $1, and you will be paid $11 if she wins.

You believe that Snoop y has probab ility 0.7 of winnin g and that Lassie has

proba bility 0.1 of winn ing (there are other dogs that you are not considerin g

betting on). Your goal is to maximize the expected monetary return of your

action.

(a) Dra w the decision tree of this problem.

Answ er:

(b) What is your best course of action, and wh at is your expected value?

Answ er: The expected payoﬀ from betting on Snoopy is 07−03=04

while betting on Lassie yields 1 −09=01, so betting on Snoop y is the

best action. ¥

can agree or not. If you agree to it, you get paid $2 up front and y ou

agree to pa y back 50% of any winnings you receive. Draw the new de-

cision tree, and ﬁnd the optimal action.

16 2. Introducing Uncertainty and Time

Answ er:

The best action is still to bet on Snoopy with an expected payoﬀ of 1.7

v e rsu s 1.55 from betting on Lassie. ¥

4. Dr illin g for Oil: An oil dr illing com p any m u st decide whether or not t o

engag e in a new drilling activit y before regulators pass a la w tha t bans drillin g

at that site. The cost of drilling is $1,000,000. After drilling is complet ed and

the drilling costs are incurred, then the com pa ny will learn if there is oil or

not. If there is oil, operating proﬁts generated are estimated at $4,000,000.

If there is no oil, there will be no future proﬁts.

(a) Using  to denot e the likelihood that drilling results in oil, dra w th e

decision tree of this problem.

Answ er: Two decision br a nches: drill or n ot drill. Following drilling,

Nature c hooses oil with probabilit y ,withthepayoﬀ of $3 million (4

minus the initial investment). With p rob ab ility 1 −  Nature ch ooses

no-oil with a pa y oﬀ $ − 1 million. ¥

(b) The company estima tes that  =06. What is the expected value of

drilling? Sho u ld the comp any go ahead and drill?

Answ er: The expected pa yoﬀ (in millio ns ) from drilling is  × 3 −(1 −

) × 1=4 − 1=06, which means that the company should drill. ¥

more accurate estimate of . What is the minimum vale of  for wh ich

2. Introducing Uncertainty and Time 17

it w ou ld be the company’s best response to go ahead and drill?

Answ er: Th e minimum value of  for whic h drilling causes no expected

loss is calculate d by solving  × 3 − (1 − ) × 1 ≥ 0,or ≥

 ¥

5. Discount Prices: A local department store puts out products at an initial

price, and ev ery week the product goes unsold, its price is discoun ted by

25% of the origin al pr ice. If it is not sold a fte r 4 w ee ks, it is sent bac k t o

the r egiona l wa rehou se. There is a set of bu tcher kniv es that was just put

out for the price of $200. Your willingness to pa y for the knives (your dollar

value) is $180, so if you buy them at a price  ,yourpayoﬀ is  =180−  .

If you don’t buy the knives, the c han ces that they are sold to someo ne else

conditio nal on not selling in the week before are giv en in the follo w in g table:

week 1 0.2

week 2 0.4

week 3 0.6

week 4 0.8

For example, if y ou do not buy it during the ﬁrst t wo weeks, the likelihood

that it is a vailable at the beginning of the third w eek is the likelih ood that

it does not sell in either weeks 1 and 2, which is 08 × 06=048.

(a) Drawyourdecisiontreeforthe4weeksaftertheknivesareputoutfor

sale.

Answ er: We can draw each week as having nature mo ve ﬁrst to deter-

mine whether someone else bought the kniv es, and if they did not, then

our pla yer can buy or w ait. The tree therefore will be,

18 2. Introducing Uncertainty and Time

where the numbers in the squares next to Nature’s nodes mark the ex-

pected value from c h oosing w ait before tha t node. ¥

(b) A t the beginning of which w eek, if any, should you run to buy the

knives?

Answ er: We solve this bac kward. In week 4 the player will buy the

knives of they are there. Waiting in w eek 3 gives an expected payoﬀ

of only 02 × (180 − 50) = 26, while buying in w eek 3 gives a pa yoﬀ of

180−100 = 80  26, so buying in w eek 3 beats wa iting. Moving back to

week 2, waiting giv es an expected payoﬀ of 04 × 80 = 32 while buying

yields 180−150 = 30  32 so waiting beats buying, and moving back to

w eek 1 mak es waiting ev en more valuable compared to buying (buying

in w e e k 1 is dominated by not bu y ing. Hence, the player will wai t till

week 3 and then try to buy the kniv e s. ¥

buy at the beginning of the ﬁrst week.

Answ er: Waiting is risky so intuitively, to make an early p u rchase

valuable, the willingn ess to pay must be very high. Set the willingness

to pa y at 1000. In week 4 the pla yer will buy the kniv es. Waiting in w eek

3yields02 × (1000 − 50) = 190, while buying in week 3 give s a payoﬀ

of 1000 − 100 = 900  190, so buying in w eek 3 beats w aiting. Mo ving

back to w eek 2, waiting gives an expected payoﬀ of 04×190 = 76 while

buying yields 1000 − 150 = 850  76 so buying beats w aiting. Moving

2. Introducing Uncertainty and Time 19

back to week 1, w aiting gives an expected pa yoﬀ of 06 × 850 = 510

wh ile buying yields 1000 −200 = 800  510 so buying in the ﬁrst w eek

is the optim a l decision. ¥

(d) Find a willingn ess to pay that would make it optima l to buy at the

beginning of the fourth w eek.

Answ er: Similarlyto(c)above,tomakealatepurchasevaluable,the

willing nes s to pay m ust be quite lo w. Set the willingn ess to pa y at 100.

In any wee k but we ek 4 the price is above the willingne ss to pay, so the

optim al decision is to wait for we ek 4 and then buy the kniv es if they

are a vailable. ¥

6. Real Estate Developm ent: A real estate dev eloper wishes to build a new

dev elopment. Regulations impose an en vironmen tal impact study that will

yield an “imp act score,” whic h is an index n umber based on the impact the

developm e nt will likely have on traﬃc, air qualit y, sew age and water usage,

etc. The developer, w ho has lots of experience, knows that the score will

be no less than 40, and no m ore than 70. Furthermore, he knows th at any

score between 40 and 70 is as likely as any other score bet ween 40 and 70

(use continuous values ). The local go vernment’s past behavior implies that

there is a 35% chance that it will approve the development if the impact

score is less than 50, a 5% chance that it will approve the dev elopmen t if

the score is bet ween 50 and 55, and if the score is greater than 55 then the

project will surely be halted. The v alue of the developmen t to the devel-

oper is $20,000,000. Assuming that the developer is risk neutral, what is the

maxim u m cost of the impact study suc h that it is still w orth while for the

dev eloper to have it conducted?

Answ er: Observ e that there is a

probability of getting a score between

40 and 50 giv en that 40 to 50 is one-th ird of the range 40 to 70. There is

proba bility of getting a score bet ween 50 and 55 giv en that 50 to 55 is

one-sixth of the range 40 to 70. Hence, the expected value of doing a study

20 2. Introducing Uncertainty and Time

× 35 × $20 000 000 +

× 05 × $20 000 000 +

× 0 × $20 000 000

=$2 500 000

Hence, the most the dev eloper should pa y for the study is $2,500,000. ¥

7. Toys: WakTek is a renowned man ufactu rer of electronic to ys, with a spe-

cialty in r emote-controlle d (RC) miniature vehicles. WakTek is consid erin g

the introduction of a new product, an R C Ho vercraft called WakA tak. Pre-

liminary designs ha v e already been produced at a cost of $2 million. To

introduce a marketable product requires the building of a dedicated product

lineatacostof$12 million. Also, before the product can be launc hed a pro-

tot y pe need s to be built and tested for safety. The prototy pe can be crafted

ev en in the absence of a production line, at a cost of $05 million, but if the

prototype is built after the production line then its cost is negligible.

There

is uncertaint y over what safety rating WakAtak will get. This could have a

large impact on demand, as a lower safet y-rating will increase the minimum

age required from users. The safety-testing costs $1 million. The outcome of

the safet y-test is estimated to ha ve a 65% c h ance of resulting in a minimum

age of 8 years, a 30% c han ce of m inimum age 15 years, and a 5% chance of

being declared unsafe in whic h case it could not be sold at all. (The cost of

improving the safety status of a ﬁnished design is deem ed prohibitive.) Af-

ter successful safety-testing the product could be launched at a cost of $15

million .

There is also uncertaint y o ver demand, which will have a crucial impact on

the ev entual proﬁts. Currently the best estimate is that the ﬁnished product,

if available to the 8 − 14 demographic, has a 50 − 50 c han ce of resulting in

proﬁts of either $10 million or $5 million from that demographic. Similarly

there is a 50−50 ch ance of either $14 million or $6 million proﬁtfromthe15-

or-above demo grap hic. These dem an d outcom e s are independent across the

demographics. The proﬁts do not take into account the costs deﬁned abo v e;

“Negligible” mean s you can treat it as zero.

2. Introducing Uncertainty and Time 21

they are measured in expected present-value terms so they are directly com-

parable with the costs.

(a) What is the optimal plan of actio n for WakTek? W ha t is currently the

expected economic value of the WakAtak project?

Answ er: The optim al plan is to build the prototype ﬁrst and then do

the safety test, then build the production line and launch the product

only if the safet y test results in the “safe for 8 years and above” status.

The expected economic proﬁts from this plan are $1.1 million. For jus-

tiﬁcation of this answer, consider the follow ing decision tree:

Notice that the cost of the prelim inar y design is sunk (cannot be recov-

ered) and should be ignored. ¥

(b) Suddenly it turns out that the original estimate of the cost of safet y-

testing w as incorrect. Analyze the sensitivit y of WakTek’s optima l plan

of action to the cost of safet y-testing.

Answ er: If the cost of safety-testing is too high, then the expected

value becomes negativ e and the optimal plan is to exit the project. To

ﬁnd out the threshold cost of safety-testing abov e which exit becomes

optim al, notice that the cost of safet y-testing is incurred for sure under

22 2. Introducing Uncertainty and Time

the optimal plan of action which brings expected proﬁts of $1.1 million.

Therefor e, if the cost of safety-testing is increased by $1.1 million or

more (bringing it to $2.1 million or more) then the decision should be

c hanged to “exit.” ¥

which would tell exactly wh ich dem and scenario is true. This m a rket

researc h costs $15 million if done simu ltaneo usly for both demograph -

ics, and $1 million if done for one demogr ap hic only. How, if at all, is

theanswertoparta)aﬀected?

Answ er: Firstexaminethedecisiontreefromparta)toseewhether

we can simplify the eﬀect of the market research, b y elimina ting som e

logically possible alternativ e s. Which alternatives to eliminate from the

tree as “ob viously irrelevan t” is partly a matter of taste. For example,

there are poin ts in the tree where the opportunit y to exit is irrelevan t

(e.g.afterwe’vefoundoutthatdemandishighforthe“young”

)be-

cause the proﬁts will clearly be higher by not exiting. You can alwa ys

just include all alternativ e s, although that can lead to a v ery large tree;

the ﬁnal answer is of course unaﬀected. Elim inations that are not obvi-

ous but that were used in simplifying the decision trees are justiﬁed b y

logic as follow s:

(i) We can com plet ely ignore the possibility of building a production line

before the safety test. We already established in part (a) that doing the

safety test ﬁrst achiev es expected proﬁts that are (11−(−005) = 115)

million higher than doing the production line ﬁrst. The only poten tial

beneﬁtofdoingtheproductionlineﬁrst is the sa ved $0.5 million pr o-

totype cost. Thus no information could ever c hange the diﬀer ence in

pa y oﬀs to the advantage of a “production line ﬁrst” plan b y more than

this $0.5 million. Since research always costs at least $1 million, “prod.

line ﬁrst” can not become optim al due to the possibilit y of doing market

For b revity, th e 8-14 dem og raphic is henceforth referred to as the “young,” and th e 14+ dem ograp hic as the

“old.”

2. Introducing Uncertainty and Time 23

researc h.

(ii) It is never proﬁtable to do researc h after the safety test. If the re-

sult w e re “safe for both groups” then the only case where info is useful

(i.e. ch a nges the decision to ente r into exit) is if both groups have low

demand. (See Figure 1: exiting payoﬀ −15 is better than the −4 of Low-

Low demand scenario, but less than the payoﬀ under the other three de-

man d scenarios). This demand scenario could be ruled out b y researc h -

ing either group. The expected payoﬀ would be

(9+1+4−15)−1  25,

i.e., not worth it after pa y ing for the cost of research. Research after ﬁnd-

ing out that WakAtak is only “safe for old” is obviously not proﬁtable,

since even if the information caused the decision to c ha ng e (from “exit”

to “enter,” if demand is high) this results only in a pa yoﬀ of −1 before

the researc h cost, w hile exit guarantees −15; since r esea rch is mo re

costly than the 0:5 diﬀerence it canno t be w or thwhile.

(iii) The potential beneﬁt of researc h is that it allow s WakTek to sa ve

the cost of production line under unfavorable demand conditions, so

there w ou ld be no point in plans of action where research is conducted

after the production line is built.

Consider a plan where both groups are researched sim u ltaneously.

This would lead to e xpected value of $0.456 million, so not doing re-

24 2. Introducing Uncertainty and Time

search is better than researching both simultan eously. We can now de-

duce that researching only one of the group s cannot be optimal either.

The reason is th at it is less inform ative than researching both, so th e

expected payoﬀ could not be higher than $0.456 million for any other

reason than the fact that it is cheaper by $(15 −1=05) million. This

means that the expected value (EV ) of a plan where only one group is

researc h ed must be lower than ($0456 + $05=$0956) million. Thus

the $1.1 million value from no research is still the highest. Similar ly,

consider the possibilit y of researc hing both groups sequentially. This is,

at best, equally infor m ative as researchin g both groups simultan eously.

It oﬀers the added option of stopping the research after ﬁnding out

the results for one group, and thu s potentially a saving of $0.5 million

comp ared to the cost of researc hin g both sim u lta neou sly. Again, this

cost-saving could not increase the EV to abo ve $0.956, so the optimal

plan of action for part a) is not aﬀected.

(d) Suppose that demand is not independen t across demographics after all,

but instead is perfectly correlated (i.e., if deman d is high in one dem o-

graphic , then it is for sure high in the oth er one as w ell). How, if at all,

w ould that change y our answer to part c)?

Answ er: Now researc h ing either one of the demog raphic groups is

just as in formative as resea rching both (bu t c h ea per, at $1 million);

it tells WakTek whether the dem a nd is high for both groups or low for

both groups. In this case the optimal decision w o uld be to research one

(doesn’t matter which) group, and do the safety testing if the dema nd

is high for both group s, then build the production line and launch the

product unless deemed unsafe; This results in EV of $1.7375 million.

The follow ing ﬁgure show s the decision tree.

Note that exp ected values are not directly axoected by the correlation so the E V of no research is still 1.1.

However, the correlation of dem ands is goo d for WakTek, not just b ecause it m akes m arket research cheap er.

For example, compared to the case (in part c) w here WakTek researches b oth groups simultaneously, one ad ded

beneﬁt here is that WakTek will n ever have to “waste” the cost of safety-testing in the event where the result

turns out to be “safe for old only,” which leads to exit.

2. Introducing Uncertainty and Time 25

8. Juice: Bozoni is a reno w n ed Swiss ma ker of fruit and vegeta ble juice, w hose

products are sold at specialty stores around Western Europe. Bozoni is con-

sidering whether to add c her imoya juice to its line of products. “It w ou ld

be one of our more diﬃcult varieties to produce and distribute,” observ es

Johann Ziﬀenboeﬀel, Bozoni’s CEO. “The cherim oya would be ﬂowninfrom

New Zealand in ﬁrm , unripe form , and it w ou ld need its o w n dedicated ripen-

ing facility her e in Europe.” Three succ essful steps are absolute ly necessary

for the new c herimo ya variety to be worth producing. The industrial ripen-

ing process must be shown to allow the delicate ﬂavors of the cherimoya

to be preserved; the testin g of the ripening process requires th e building

of a small-sc ale ripening facilit y. M arket research in selected small regions

around Europe must sho w that there is suﬃcien t demand am ong consumers

for cherimo ya juice. And cherimo y a juice m ust be sho wn to withstand the

existing tin y gaps in the cold chain bet ween the Bozoni plan t and the end

consumers (these gaps would be prohibitively expensive to ﬁx). Once these

three steps have been completed , there are about 2,500,000 worth of ex-

penses in lau nching the new variety of juice. A successful new variet y w ill

then yield proﬁts, in expected presen t-value terms, of 42.5 m illion.

26 2. Introducing Uncertainty and Time

The three absolutely necessary steps can be done in parallel or sequentially

in any order. Data about these three steps is given in Table 1. “Prob ability

of success” refers to how lik ely it is that the step will be successful. If it is not

successful, then that means that cherim oya juice cannot be sold at a proﬁt.

All pro babilities are ind ependent of each other (i.e., whether a giv en step is

successful or not does not aﬀect the probabilities that the other steps will be

successful). “Cost” refers to the cost of doing this step (regardless of whether

it is successful or not).

(a) Suppose Mr. Ziﬀen boeﬀel calls y ou and asks y ou r advice about the

project. In particular, he wants to know (i) should he do the three

necessary steps in parallel (i.e., all a t once) or should he do them se-

quentially; and (ii) if sequentially, w hat’s the righ t order for the steps

to be done? What answers do y ou give him?

Answ er: Bo zoni should do the steps sequentially in this order: ﬁrst test

the cold cha in, then the ripening process, then do the test-marketing.

The expected value of proﬁts is 1.84 million. Observe that it wo uld not

be proﬁtable to launc h the product if Bozoni had to do all the steps

simultane ou sly. This is an exam ple of real options– by sequencin g the

steps, Bozoni creates options to switc h out of a doomed project before

too much money gets spen t. ¥

(b) Mr. Ziﬀenboeﬀel calls you back. Since Table 1 wa s produced (see below ),

Bozoni has found a small research ﬁrm that can perform the necessary

tests for the r ipening process at a lower cost than Bozoni’s in-house

researc h department.

Table 1: D ata on launching the Ch erimoya juice

Step Pr o b a bility of succes s Cost

Ripening process 0.7 1,000,000

Test marketing 0.3 5,000,000

Cold chain 0.6 500,000

At the same time, th e EU has raised the criteria for getting approval

for new food producing facilities, which raises the costs of these tests.

2. Introducing Uncertainty and Time 27

Mr. Ziﬀenboeﬀel would, therefore, like to kno w how your answe r to (a)

c h ang es as a function of the cost of the ripening test. What do you tell

him?

Answ er: This is sensitivit y analysis for the cost of testing the ripening

process. This can be done by varying the cost for ripening, and seeing

which expected payoﬀ (highlig hted y ellow) is highest for which values of

the cost. For example, whenever w e set the cost below 375,000 it turns

out that the pa yoﬀ from the sequence  →  →  giv es the highest

pa y oﬀ among the six possible sequences. (Excel’s GoalSeek is a partic-

ularly handy wa y for ﬁnding the threshold v alues quic k ly).

Speciﬁcally, the optimal sequence is

i)  →  →  if the cost of  ≤ 375 000

ii)  →  →  if the cost of 375 000 ≤  ≤ 2 142 857

iii)  →  →  if the cost of 2 142 857 ≤  ≤ 8 640 000

iv) don’t launch if  costsmorethan8 640 000

where “” stands for th e ripening process, “” stands for the cold

c hain, and “ ” stands for test marketing. ¥

el calls you back y et again. The good news is the EU

regulations and the ou tsourcing of the ripening tests “ balan ce” eac h

other out, so the cost of the test rem ains 1,000,000. No w the problem

is that his marketing department is suggesting that the probabilit y that

the market researc h will result in good news about the deman d could

be diﬀeren t in light of some recen t data on the sales of other subtropical

fruit products. He would, therefore, like to kno w how your answer to

(a) c ha nges as a functio n of the probability of a positiv e result from the

market researc h. What do you tell him?

Answ er: This can be found b y varying th e probability of success for

test mark eting (highlighted by blue in the excel sheet) bet ween 0 and

1. The optimal sequence turns out to be

28 2. Introducing Uncertainty and Time

i) don’t launch if  ≤ 01905

ii)  →  →  if 01905

where  is the probability that the test marketing will be successful.

9. Steel: AK Steel Hold ing Corporation is a producer of ﬂat-rolled carbon,

stainless and electrical steels and tubular products through its wholly o w ned

subsidiary, AK Steel Corporation. The recent surge in the demand for steel

signiﬁcan tly increased AK ’s proﬁts,

and it is no w engaged in a research

project to improve its production of rolled steel. The research involves three

distinct steps, eac h of whic h must be successfully completed before the ﬁrm

can implement the cost-sa ving new production process. If the research is

completed successfully, it will save the ﬁrm $4 million. Unfortunately, there

is a c ha nce that one or more of the research steps might fail, in which case

the project is worth less. The three steps are done sequen tially, so that the

ﬁrm knows whether one step w as successful before it has to in vest in the next

step.Eachstephasa08 probabilit y of success and eac h step costs $500 000.

The risks of failure in the three steps are uncor relate d with one another. AK

Steel is a risk neutral company. (In case you are worried about such things,

the interest rate is zero).

(a) Dra w the decision tree for the ﬁrm.

Answ er:

See “Demand Send s A K Steel ProﬁtUp32%,”New York Time, 07/23/2008.

http://www.nytimes.com/2008/07/23/business/23steel.html?partner=rssnyt&emc=rss

2. Introducing Uncertainty and Time 29

(b) If the ﬁrm proceeds with this project, what is the probabilit y that it

will succeed in implementing the new production process?

Answ er: For the project to be successful, each of the three independen t

steps must be complet ed. Since the probability of success in eac h stage

is 0.8 and the probabilities are independen t, the probabilit y of three

successes is  =08 · 08 · 08=08

=0512, just over one-half. ¥

from it before the project began?

Answ er: E[]=0512 · $4 000 000 + 0488 · 0=$2 048 000¥

(d) Should the ﬁrm begin the researc h , given that each step costs $500 000?

Answ er: The expected cost of the project is

02·$500 000+08·02·$1 000 000+08·08·$1 500 000 = $1 220 000

The ﬁrst term is the probability times cost of a failure in the ﬁr st step.

The second term is the probabilit y times cost of success in the ﬁrst step

and failure in the second step. The third term is the probability times

cost of success in the ﬁrst step and success in the second step (success

or failure in the third step does not aﬀect the cost of the project, just

30 2. Introducing Uncertainty and Time

the gain from it). The expected cost is less than the expected gain (b y

$828,000). Since the company is not risk averse, it shou ld begin the

project. Note that this is not the only way to do the calculation. An

alternate approach w o uld be to aggregate the costs and beneﬁts of each

possible outcom e:

08 · 08 · 08 · (4 000 000 − 500 000 − 500 000 − 500 000)

+08 · 08 · 02(−500 000 − 500 000 − 500 000)

+08 · 02(−500 000 − 500 000) + 02(−500 000)

=$828 000

Either way, the expected net gain is $828 000. ¥

(e) Once the researc h has begun, should the ﬁrm quit at any poin t ev en if

it has had no failures? should it ever continue the researc h ev en if it has

had a failure?

Answ er: NO to both. Obviously, if one stage fails, then the project

cannot be com pleted successfully, so any mo re expenditures on it are a

waste. If no stage has failed a nd at lea st one h as succeeded, then th e

beneﬁt/cost comp arison of going forward with the project is even more

favorable than when the project began. ¥

After the

ﬁrm has successfully completed steps one and t wo, it discov-

ers an alternate production process that w ould cost $150 000 and would

lo wer production costs b y $1 000 000 with certainty. This process, how-

ever, is a substitute for the three-step cost-saving process; they cannot

be used sim ulta neously. Furthermor e, to have this process available, the

ﬁrm m ust spend the $150 000 before it know s if it will successfully

comp lete step three of the three-step research project.

(f) Draw the augmented decision tree that includes the possibility of pur-

suing this alternate production process.

Answ er:

2. Introducing Uncertainty and Time 31

(g) If the ﬁrm continues the three-step project, what is the cha nce it would

get any value from also developing the alternate production process?

Answ er: Th e alt erna te process would be used only if step three of the

current project failed, which has a 0.2 probab ility. ¥

(h) If dev eloping the alternate production process wer e costless and if the

ﬁrm con tinues the three-step project, w hat is the expected value that

it would get from ha ving the alternate production process available (at

the beginning of researc h step 3)? (This is known as the option value of

ha ving this process a vailable.)

Answ er: There is a 0.2 probabilit y that the alternate process would be

used and a $1,000,000 value if it is used, so the option value of having

thealternateprocessavailableis$200,000.¥

(i) Should the ﬁrm:

i. Pursue only the third step of the three-step project

ii. Pursue only alternate production process

iii. Pursue both the third step of the three-step project and the alter-

nate process

32 2. Introducing Uncertainty and Time

Answ er: Since the option value of the alternate process is greater

than the cost of having this option, the alternate process should

be developed if one con tinues with the three-step project. The net

value of developing this option is $200 000 − $150 000 = $50 000.

Of course, the alternate process wo uld also be developed if the

three-step project w ere unavailable, since it will be used with cer-

taint y and the net value of the altern ate process would then be

$850,00 0. The remainin g question is whether AK should drop the

three-step project rather than attempting the third step. Giv en

that the alternate process will be developed, the extra (or mar-

ginal ) value of successfully completing the three-step project wo u ld

be $3,000,000, because it w ou ld save $3,000,000 more than the al-

ternate process. The expected value of attemp ting the third step

is then 08 · $3 000 000 = $2 400 000. This is greater than the

$500,000 cost of the third step, so AK should p roceed with th e

three-step project as well as the alternate process, i.e., tak e strat-

egy (iii). ¥

(j) If the ﬁrm had kno wn of the alternate production process before it began

the three-step research project, what should it have done?

Answ er: We know that AK should p ursue the alternate process: It

was wo rth doing after successful completion of steps one and two (see

(i)) and wo uld ha ve g rea ter expected value if th e p roba bility of the

three-step project failing were high er . In fact, the option value of the

alternative process declines with each step of success in the three-step

project. At the beginning of step three A K would pay up to $200,000

for the alternate process. Con vin ce yo urself that it w o uld be willing to

pay up to $360,000 for the alternate process at the beginning of step

t wo and up to $488,000 for the alternate process at the beginning of

step one, assuming in each case that it cou ldn’t wait to develop the

alternate later. In fact, the option to wait un til the beginning of the

third period to develop the alternate process could itself be valuab le,

but it isn’t in this case, w h en the process costs $150,000. The other

2. Introducing Uncertainty and Time 33

question is whether AK should pursue the three-step project given that

it will ha ve the alternate process available with certain ty. As in (i), the

marg inal value of successfully completing the three-step project wou ld

be $3,000,000, because it w ould sav e $3,000,000 more than the alternate

process. The expected value of attempting the three-step project is then

0512 · $3 000 000 = $1 536 000. Th is is greater than the the expected

cost of pursuing the three-step project, which is 02 · 500 000 + 08 · 02 ·

1 000 000 + 08 · 08 · 1 500 000 = 1 220 000, so AK should proceed

with the three-step project as w ell as the alternate process. This is the

same calculation as in (c) and (d) except the beneﬁt of success is no w

$3,000,000 instead of $4,000,000. ¥

10. Surgery: A patient is very sick, and will die in 6 months if he goes un trea ted.

The only available treatment is risky surgery. The patien t is expected to live

for 12 months if the surgery is successful, but the probability that the surgery

fails and the patient dies immed ia tely is 0.3.

(a) Dra w a decision tree for this decision problem .

Answ er: Using () to denote the value of living  more months, the

follow ing is the decision tree:

(b) Let () be the patien t’s pa yoﬀ function, where  is the n umber of

mon ths till death. Assuming that (12) = 1 and (0) = 0,whatisthe

lowest payoﬀ thepatientcanhaveforliving3monthssothathaving

surgery is a best response?

Answ er: The expected value of the surgery giv en the pa yoﬀsaboveis

[(surgery)] = 07(12) + 03(0) = 07

34 2. Introducing Uncertainty and Time

which implies that if (3)  07 then the surgery should be performed. ¥

For the rest of the problem , assume that (3) = 08.

wheth er or not surgery will be successful. A positive test implies an

increased likelihood that the patient will survive the surgery as follows:

True- p ositive rate: Th e probab ility that the results of this test will

be positive if surgery is to be successful is 0.90.

False-positive rate: The probability that the results of this test will

be positive if the patient will not survive the operation is 0.10.

W ha t is the prob ab ility of a successful surgery if the test is positive ?

Answ er: The easiest way to think about this is to imagine that the

original 0.7 probability of success is true because for 70% of the sic k

populatio n, call these the “treatable” patients, the surgery is success-

ful, while for the other 30% (“untreatab le”) it is not, and previously

the patient did not know which population he belongs to. The test can

be though t of as detecting which population the patien t belongs to.

The abo ve description means th at if the patient is treatable then the

test will claim he is treata b le with proba bility 0.9, while if the patie nt

is un trea ta ble then the test will claim he is treatable with probability

0.1. Hence, 63% of the population are treatable and detected as such

(0.7×09), while 3% of the population are untre atable but are detected

as treatable (0.3×01). Hence, of the population of people for whom the

test is positive, the probability of successful surgery is

63+3

=0955 ¥

(d) Assuming that the patient has the test done, at no cost, and the result

is positiv e, should surgery be performed?

Answ er: The value from not having surgery is (3) = 08, and a positive

test updates the probability of success to 0955 with the expected payoﬀ

being 0955 × 1 so the patient should ha ve surgery done. ¥

2. Introducing Uncertainty and Time 35

(e) It turns out that the test may ha v e some fatal complications, i.e., the

patient ma y die durin g the test. D raw a d ecision tree for this revised

problem.

Answ er: Given the data above, we kno w that without taking the test

thepatientwillnothavesurgerybecausetheexpectedvalueofsurgery

is 0.7 while the value of living 3 months is 0.8. Also, we showed abo ve

that after a positive test the patient will choose to have surgery, and it

is easy to sho w that after a negative test he won’t (the probability of

a successful outcome is

7+27

=0206) Hence, the decision tree can be

collapsed as follow s 9the decision to have surg ery ha ve been collapsed

to the relevant payoﬀs):

(f) If the proba bility of death during the test is 0.005, should the patient

opt to have the test prior to deciding on the operation?

Answ er: From the decision tree in part (e), the expected value con di-

tional on surviving the test is equal to

07(09 × 1+01 × 08) + 03(01 × 0+09 × 08) = 0902

which implies that if the test succeeds with probabilit y 0.995 then the

expected pa yoﬀ from taking the test is

0995 × 0902 + 0005 × 0=0897

36 2. Introducing Uncertainty and Time

which implies that the test should be tak en because 0897  08. ¥

11. To Run or not to Run: You’re a sprinter, and in practice toda y y ou fell

and h u rt y o ur leg. A n x-ray suggests that it’s broken with probabilit y 0.2.

Your problem is whether you should participate in next week’s tournamen t.

If you run, you think you ’ll win w ith probability 0.1. If you r leg is b roken

and y ou run, then it will be further damaged and y our payoﬀs are as follo w s:

+100 if you win the race and y ou r leg isn’t br oken;

+50 if y o u win and y ou r leg is broken;

0 if you lose and your leg isn’t broken;

−50 if you lose and your leg is broken;

−10 if you don’t run and if y our leg is brok en;

0 if you don’t run and your leg isn’t brok en.

(a) Dra w the decision tree for this problem.

Answ er:

(b) What is your best c h oice of action and its expected pa yoﬀ?

Answ er: The expected pay oﬀ from not running is

[(not run)] = 08 × 0+02(−10) = −2

and the expected pa yoﬀ from running is

[(run)] = 08×(01×100+09×0)+02×(01×50+09×(−50)) = 0

2. Introducing Uncertainty and Time 37

so the best c h oice is to run and have an expected payoﬀ of 0. ¥

You can gather some more inform a tion by having more tests, and y ou

can gather more information about whether y ou ’ll win the race b y talk-

ingtoyourcoach.

Answ er: If you knew you r leg is broken then running yields an expected

pa y oﬀ of 01×50+09×(−50) = −40 while not running yields a pay oﬀ

of −10,soyouwouldnotrunandget−10.Ifyouknewyourlegisnot

broken then the expected payoﬀ from runnin g is 01×100+09×0=10,

while the pay oﬀ from not running is 0, and hence you would run and

get 10. Before getting the information you know your leg is broken

with proba b ility 0.2, so before getting the perfect inform ation , your

expected pa yoﬀ from being able to then act on the perfect inform a tion

is 02(−10) + 08 × 10 = 6. Recall from (b) that the expected payoﬀ of

not ha vin g perfect information is 0, so the value of being able to obtain

the perfect inform ation is 6 − 0=6. ¥

(d) What is the value of perfect information about whether you’ll win the

tournam ent?

Answ er: In this case we know that you will run if y ou know y o u will

win and y ou will not if you know you will lose. Hence, with probability

01 yo u will learn that you’ll win and your expected payoﬀ (depending

on the state of yo ur leg) is 02 × 100 + 08

× 50 = 60Similarly, with

probab ility 0.9 you learn that you’ll lose in wh ich case you r expected

pa y oﬀ is 02 × (−10)+ 08 × 0=−2. Before getting the information y ou

know you will win with probab ility 0.1, so before getting the perfect

information, y our expected pa yoﬀ frombeingabletothenactonthe

perfect information is 01 × 60+ 09(−2) = 42.Recallfrom(b)thatthe

expected payoﬀ of not ha v in g perfec t informa tio n is 0,sothevalueof

beingabletoobtaintheperfectinformationis42 − 0=42. ¥

38 2. Introducing Uncertainty and Time

(e) As stated above, the probabilit y that your leg is broken and the proba-

bilit y that y ou will win the tournamen t are independent. Can y ou use a

decision tree in the case that the probability that you will win the race

depends on whether y our leg is broken?

Answ er: Yes. All you need to do is have diﬀerent proba bilities of win-

ning that depend on whether or not y our leg is brok en. ¥

12. Mo re O il: Ch evron, the No. 2 US oil company, is facing a tough decision.

The new oil project dubbed “Tahiti” is sch eduled to produce its ﬁrst comm er-

cial oil in mid-2008, ye t it is still unclear how productiv e it will be. “Tahiti

is one of Chevron’s ﬁv e big projects,” told Peter Robertson, vice ch airm an

of the compan y’s board to the Wall Str eet Journal.

Still, it was unclear

wheth er the project will result in the blockb u ster success Chevron is hop ing

for. As of June 2007, $4-billion has been in vested in the high-tec h deep sea

platform, w hic h suﬃces to perform early w ell tests. Aside from oﬀering in-

formation on the type of reservoir, the tests will produce enough oil to just

co ver the incremental costs of the testing (beyond the $4 billion investm ent).

Follo win g the test we lls, Chevron predicts one of three possible scenarios.

The optim istic one is that Tahiti sits on one giant, easily accessible oil reser-

voir, in whic h case the compa ny expects to extract 200,000 barrels a da y

after expending another $5 billion in platform setup costs, with a cost of

extraction at about $10 a barrel. This will continue for 10 years, after whic h

the ﬁeld will have no more econom ically recoverable oil. Chevron believes

this scenario has a 1 in 6 c h an ce of occurring. A less rosy scen ario, that is

twice as lik ely as th e optim istic one, is that Chev ro n wo uld ha ve to drill

t wo more w e lls at an additional cost of $0.5 billion each (above and beyo nd

the $5 billion set-up costs), and in w hich case production will be around

100,000 barrels a day with a cost of extraction at about $30 a barrel, and

the ﬁeld will still be depleted after 10 years. The worst case scenario in volves

“C h e v ron’s Tahiti Facility Bets Bi g on G u lf O il Boo m.” Jun 2 7, 20 0 7 . pg. B 5 C.

http://proquest.umi.com/pqdweb?did=1295308671& sid=1&Fmt=3&clientId=1566&RQT=309&V Name= PQD

2. Introducing Uncertainty and Time 39

the oil tucked awa y in n um ero us poc kets, requiring expensive water injection

tec h n ique s which w o uld include up-front costs of another $4 billion (abo ve

and bey on d the $5 billion set-up costs), extraction costs of $50 a barre l, and

production is estimated to be at about 60,000 barr els a day, for 10 y ea rs.

Bill Varnado, Tahiti’s project manag er, was quoted giving this least desir-

able outcome odds of 50-50.

The curren t price of oil is $70 a barrel. For simplicity, assum e that the price

of oil and all costs will remain constant (adjusted for inﬂation) and that

Chevr on’s faces a 0% cost of capital (also adjusted for inﬂation).

(a) If the test-wells w o uld not produce information about whic h one of three

possible scenarios will result, should Ch evron invest the set-up costs of

$5 billion to be prepared to produce at wh atever scenario is realized?

Answ er: We start b y noticing that the $2 billion that were in vested are

a sunk cost and hence irrelevant. Also, since the cost of capital is just

about the same as the projected increase in oil prices, w e do not need to

discount future oil reven ues to get the net present value (NPV) sine the

two eﬀects (price increase and tim e discounting) will cancel each other

out. If the com p any in vests the $2 .5 billion dollar s, then they will be

prepared to a ct upon whatev e r scenario arises (great with probabilit y

, ok with pro bability

 or bad w ith probabilit y

). N otice from the

table belo w that in eac h scenario the added costs of extraction that

Chevron need s to invest (once it becomes clear whic h scenario it is) is

worthwhile (e.g., even in the bad scenario, the proﬁts are $2.19 billion,

which covers the added drilling costs of $2 billion in this case.) Hence,

Chevron wou ld proceed to drill in eac h of the three scenarios, and the

expected proﬁts inclu din g the init ia l $2.5 billion investm e nt would be,

 =

×($21)+

×($73−$05)+

×($219−$2)−$25 =$3 511 666 667

(b) If the test-w ells do produce accura te info rm ation about which of three

possible scenarios is true, what is the added value of performing these

40 2. Introducing Uncertainty and Time

tests?

Answ er: Now, if the test drilling will revea l the scenario ahead of time,

then in the event of the bad scena rio the revenues w ou ld not cover the

total in vestment of $4.5 billion ($2.5 billion initially, a nd another $2

billion for the bad scenario.) In the great and ok scenarios, ho wever,

the revenues cover all the costs. Hence, with the inform atio n Chevron

would not p roceed with the investm ents at all when the bad scenario

happens (probability

), and proceed only when the scenario is great or

ok, yielding an expected proﬁtof

 =

×($21−25)+

×($73−$25−$05)+

×0=$4 666 666 667

Hence, the added value of perfor m ing the tests is,



info

=$4 666 666 667 − $3 511 666 667 = $1 155 000 000

13. To day, Tomorrow or the Day after: Aplayerhas$100 toda y that need

to be consum ed o ver the next three periods,  =1 2 3. The utility over

consuming $



in period  is given by the utility funct ion ()=ln(),and

at period  =1, the player valu es his net presen t value from all consum p tion

as (

)+(

)+

(

),where =09

2. Introducing Uncertainty and Time 41

(a) How w ill the player plan to spend the $100 over the three periods of

consumption?

Answ er: The player will max imize

max





ln(

)+ ln(

)+

ln(100 − 

− 

)

which yield s the followin g two ﬁrst-order equations:



−



100 − 

− 

=0





−



100 − 

− 

=0

From these two equation s conclude that









or 

= 

. We can then then substitute 

with 

in the ﬁrst equa-

tion above to obtain,



−



100 − 

− 

100 − 

− 

− 



and the solution is



100

 + 

and in turn



100

 + 

,and



100

 + 

. ¥

(b) Imagine that the pla yer know s that in period  =2he will receiv e an

additional gift of $20 How will he ch oose to allocate his original $100

initially, and how will he spend the extra $20?

42 2. Introducing Uncertainty and Time

Answ er: After spending 

≤ 100, the player has 100 − 

+20in the

beginnin g of the second period. We can now solv e this backward and

assume that the pla yer has 100 − 

+20in the beginning of period 2

and has to choose between 

and 

so that he solv e,

max



ln(

)+ ln(120 − 

− 

)

with the ﬁrst order condition



−



120 − 

− 

which yields,



120 − 

1+

and 

(120 − 

)

1+



Now we can step back to the ﬁrst period and solve the optimal c h o ice

of 

given the w a y 

and 

will be chosen later. The pla y er solves,

max





ln(

)+ ln(

120 − 

1+

)+

ln(

(120 − 

)

1+

)

and the ﬁrst order condition is,



−

(1 + )

120 − 

1+

−



(1 + )

120 − 

1+



120

1+ + 

Notice, howev er, that as  drops, 

increases, and for a small enough

 this equation will call for 

 100. In particular, the value of  for

which 

=100canbesolvedasfollows,

100 =

120

1+ + 



or,  =

√

5 −

≈ 017. Ho wever, 

 100 is not possible, so the

solution is,



120

1++

if  ≥ 017

100 if 017

2. Introducing Uncertainty and Time 43

and from the calculations earlier,



120 − 

1+

and 

(120 − 

)

1+



Part II

Static Gam es of Com plete

Information

This is page 45

Printer: Opaque

P r elimina r ies

1. eBa y: Hund red s of millions of people bid on eBa y auctions to purchase goods

from all over th e world. Despite being done online, in spirit these a uction s

are similar to those conducted cen turies ago. Is an auction a game? W h y or

wh y not?

Answ er: An auction is indeed a game. A bidder’s payoﬀ depends on his bid

and on the bid of other bidders, and hence there are players, actions (whic h

are bids) and pa yoﬀs that depend on all the bids. Th e winner gets the item

andpaystheprice(whichoneBayisthesecondhighestbidplustheauction

increm ent), while the losers all pay nothing and get nothing. ¥

2. Pena lty Kicks: Imagine a kick er and a goalie who confron t each other in a

penalty kic k that will determine the outcom e of the gam e. The kicker can kick

theballleftorright,whilethegoaliecanchoosetojumpleftorright.Because

of the speed of the kick, the decision s need to be made simultaneously. If the

goalie jumps in the same direction as the kick, then the goalie wins and the

kic ker loses. If the goalie jumps in the opposite direction of the kick then the

kic ker wins and the goalie loses. M odel this as a normal form game and write

down the matrix that represents the game yo u modeled.

46 3. Preliminaries

Answ er: There are two pla yers, 1 (kic ker) and 2 (goalie). Each has two

actions, 



∈ { } to denote left or right. The kicker wins when they

c h oose opposite directions while the goalie wins if they ch oose the same

direction. Using 1 to denote a win and −1 to denote a loss, w e can w rite



( )=

( )=

( )=

( )=1and 

( )=

( )=



( )=

( )=−1. The matrix is therefore,

Play er 1

Player 2







−1 1 1 −1

1 −1 −1 1

3. Meeting Up: Tw o old friends plan to meet at a conference in San Francisco,

and agreed to m eet by the to wer. When arriving in tow n , each realizes that

there a re two natural choices: Sutro To wer or C oit Tower. Not having cell

phones, eac h m ust c hoose independently which tower to go to. Eac h pla y er

prefers meeting up to not meeting up, and neither cares w her e this would

happen. Model this as a normal form came, and write dow n the matrix form

of the game.

Answ er: There are t w o play ers, 1 and 2.Eachhastwoactions,



∈ { }

to denote Sutro or Coit. Both players are happ y if they choose the same

to wer and unhapp y if they don’t. Using 1 to denote happ y and 0 to denote

unhapp y, we can write 



( )=



( )=1and 



( )=



( )=0

for  ∈ {1 2}. The matrix is therefor e,

Player 1

Player 2







1 1 −1 −1

−1 −1 1 1

3. Preliminaries 47

4. Hunting: Two hunters, players 1 and 2 can eac h ch oose to h u nt a stag,

which provides a rather large and tasty meal, or h unt a hare, also tast y, but

much less ﬁlling . Hun t in g stags is c hallen gin g and requires m ut ua l coopera-

tion. If either hunts a stag alone, then the stag will get away, while h u nting

the stag together guarantees that the stag is caught. Hun ting hares is an

individualistic enterprise that is not done in pairs, and whoever c hooses to

hunt a hare will catch one. The payoﬀ from hunting a hare is 1, while the

pa y oﬀ to each from hu nting a stag together is 3. The pa yoﬀ from an unsuc-

cessful stag-h u nt is 0.Representthisgameasamatrix.

Answ er: This is the fam ous “stag hunt” game. Using  for stag and  for

hare, the matrix is,

Player 1

Player 2







3 3 0 1

1 0 1 1

5. Matc hing P ennies: Players 1 and 2 both put a penny on a table sim ul-

taneously. If the t wo pennies come up the same side (heads or tails) then

play er 1 gets both pennies, otherwise pla yer 2 gets both pennies. Represent

this game as a matrix.

Answer: Letting  denote a c hoice of heads and  a c ho ice of tails, and

letting winning giv e a pa y oﬀ of 1 while losing gives −1, the m atrix is ther e-

fore,

Play er 1

Player 2







1 −1 −1 1

−1 1 1 −1

6. Price Com petition: Imagine a market with dem an d ()=100−.There

are t wo ﬁrms, 1 and 2, and each ﬁrm  has to sim ultaneously choose it’s price

48 3. Preliminaries





.If







,thenﬁrm  gets all of the m ark et while no one demands the

good of ﬁrm . If the prices are the same then both ﬁrm s equally split the

mark et demand. Imagine that there are no costs to produce any quantit y

of the good. (These are two large dairy farms, and the product is man ure.)

Write do w n the norm al form of this game.

Answ er: The players are  = {1 2} and the strategy sets are 



=[0 ∞]

for  ∈ {1 2} and ﬁrms choose prices 



∈ 



.Tocalculatepayoﬀs, we need

to kno w what the qua ntities will be for eac h ﬁrm giv en prices (



).Given

the assump tion on ties, the quantities are given by,





(







⎧

⎪

⎨

⎪

⎩

100 − 



if 







0 if 







100−



if 



= 



whichinturnmeansthatthepayoﬀ function is given by quantit y times price

(there are no costs):





(







⎧

⎪

⎨

⎪

⎩

(100 − 



)



if 







0 if 







100−







if 



= 



7. Pu blic Good Co ntribution: Three pla y ers live in a town and eac h can

c h oose to contribu te to fund a street lamp. T h e value of having the street

lam p is 3 for eac h player and the value of not having one is 0.TheMayor

asks eac h pla yer to either contrib ute 1 or nothing. If at least two pla yers

con trib ute then the lam p will be erected. If one or less people con tribu te

then the lamp will not be erected, in which case an y person wh o contribu ted

will not get their money back. Write dow n the normal form of this game.

Answ er: The set of players is  = {1 2 3} and eac h h as an strategy set





= {0 1} where 0 is not to con tribute and 1 is to contribu te. The payoﬀs

3. Preliminaries 49

of player  fro m a proﬁle of strategies (



) is given by,





(



⎧

⎪

⎨

⎪

⎩

0 if 



=0and 



=0for some  6= 

3 if 



=0and 



=1for both  6= 

−1 if 



=1and 



=0for both  6= 

2 if 



=1and 



=1for some  6= 

50 3. Preliminaries

This is page 51

Printer: Opaque

Rat io n ality a n d C o mm o n Knowled g e

1. Prove Proposition ??:IfthegameΓ = h{



}



=1

 {



}



=1

i has a strictly

dominan t strategy equilibrium 



,then



is the unique dominant strateg y

equilibrium .

Answ er: Assume not. That is, there is some other strategy proﬁle 

∗

6= 



that is also a strictly dominant strategy equilibrium . But this implies that

for ev ery , 

∗









, which contradicts that 



is a strictly dominant strategy

equilibrium . ¥

2. Weak dominance. We call the strategy proﬁle 



∈  is a weakly domi-

nant strategy equilibriu m if 





∈ 



is a wea kly dominant strategy for all

 ∈ .Thatisif



(





−

) ≥ 



(





−

) for all 



∈ 



and for all 

−

∈ 

−

(a) Provide an example of a game in whic h there is no weakly dominan t

strategy equilibrium.

Answ er:

Player 1

Play er 2







1 −1 −1 1

−1 1 1 −1

52 4. Rationality and Common Knowledge

(b) Pro vide an exam ple of a game in whic h there is more than one we akly

dom inant strategy equilibrium .

Answ er: In the follow ing gam e each pla yer is indiﬀerent between his

strategies and so each one is weakly dominated by the other. This means

that an y outc om e is a weakly dom ina nt strategy equilibrium.

Player 1

Play er 2







1 1 1 1

3. Discrete ﬁrst-price auction: An item is up for auction. Player 1 values

the item at 3 while player 2 values the item at 5 Eac h player can bid either

0 1 or 2.Ifplayer bids more than player  then  win’s the good and pays

his bid, while the loser does not pay. If both pla y ers bid the same amoun t

then a coin is tossed to determine who the winner is, who gets the good and

pa ys his bid while the loser pays nothing.

(a) Write dow n the game in matrix form.

Answ er: We need to determine what the pa y oﬀs are if the bidders

tie. The one who wins the coin toss bids his bid and the loser gets

and pays nothing. Hence, w e can just calculate the expected payoﬀ as a

50:50 lottery betw een getting nothing and winning. For example, if both

pla yers bid 2 then pla yer 1 gets 3 −2=1unit of pa y oﬀ with probability

and pla yer 2 gets 5 − 2=3units of payoﬀ with prob a bility

,sothe

4. Rationality and Common Kno wledge 53

pair of pa y oﬀsis(



)

Player 1

Player 2

012



0 4 0 3

1 2 0 1 2 0 3

2 1 0 1 0



(b) Does an y pla yer have a strictly dominated strategy?

Answ er: Yes-forplayer2bidding0 is strictly dominated b y bidding

2. ¥

Answ er: After removing the strategy 0 of pla yer 2, pla yer 1’s strategy

of 0 is dominated by 2, so we can rem ove that too. But then, in the

remain ing 2 × 2 game where both pla yers can choose 1 or 2, bidding

1 is strictly dominated by bidding 2 for play e r 2, and after this roun d,

bidding 1 is strictly domina ted by bid ding 2 forplayer1.Hence,the

unique strategy that survives IESD S is (2 2) yielding expected payoﬀs

of (



). ¥

4. eBa y’s recommendation: It is hard to imagine that an yon e is not familia r

with eBay

, the m ost popular auction website by far. The wa y a typical

eBa y auction w orks is that a good is placed for sale, and each bidder places

a“proxybid”,whicheBaykeepsinmemory.Ifyouenteraproxybidthat

is lo wer than the current highest bid, then y our bid is ignored. If, however,

it is higher, then the curren t bid increases up to one incremen t (say, 1 cent)

abo ve the second highest pro xy bid. For example, im agine that three people

placed bids on a used laptop of $55, $98 and $112. The current price will be

at $98.01, and if the auction ended the player who bid $112 would win at a

price of $9 8.0 1. If you were to plac e a bid of $103.45 then the who bid $11 2

w ould still win, but at a price of $103.46, while if y our bid was $123.12 then

54 4. Rationality and Common Knowledge

y ou w ould win at a price of $112.01.

Now consider eBay’s historical recomme ndation that y ou think hard about

y our value of the good, and that y ou enter your true value as y our bid, no

more, no less. Assume that the value of the good for each poten tial bidder is

independent of ho w m u c h other bidders value it.

(a) Argue that bidding more than y our v aluation is w eakly dominated b y

actually bidding y our valuation.

Answ er: If y o u put in a bid 



= 







where 



is yo ur valu a tion , then

only the three follow ing cases can happen: () All other bid s are belo w 



In this case bidding 



= 



will yield the exact same outcome: y o u’ll win

atthesameprice.() Some bid is above 



. In this case bidding 



= 



will yield the exact same outco m e: you’ll lose to a higher bid. ()No

bids are abo ve 



and some bid 

∗



is in between 



and 



.Inthiscase

bidding 



willcauseyoutowininandpay

∗







which means that

y our pa y oﬀ is negative, while if you w ou ld ha ve bid 



= 



then you

w ould lose and get nothing. Hence, in cases () and () bidding 



would

do as w ell as bidding 



, and in case () it wo uld do strictly better,

implying that bidding more than you r valuation is w eakly dominated

b y actually bidding y our valuation. ¥

(b) Argue that bidding less t han y o ur valuation is weakly dominated b y

actually bidding y our valuation.

Answ er: If y o u put in a bid 



= 







where 



is yo ur valu a tion , then

only the three follow ing cases can happen: () Some other bid are above 



In this case biddin g 



= 



will yield the exact sam e outcome: you’ll

lose to a higher bid. () All othe r bids are below 



. In this case bidding





= 



will yield the exact same outcome: you’ll win at the sam e price.

()Nobidsareabove



and some bid 

∗



is in between 



and 



.In

this case bidding 



will cause yo u to lose and get nothing, while if you

would have bid 



= 



then you would win and get a positiv e payoﬀ of





−

∗



. Hence, in cases () and () bidding 



w ould do as well as bidding





, and in case () it would do strictly better, im plying that biddin g

4. Rationality and Common Kno wledge 55

less than yo ur valua tion is w ea kly dominate d by actually bidding your

valuation. ¥

y o u follow it?

Answ er: The r ecom mendation is indeed supported by an analysis of

rational beha vior.

5. In the follo w ing normal-form game, whic h strategy proﬁles survive iterated

elimination of strictly dom inated strategies?

Player 1

Player 2



 6 8 2 6 8 2

 8 2 4 4 9 5

 8 10 4 6 6 7

Answ er: First,  is dominated b y  forplayer1.Intheremaininggame,

is dominated by  for play e r 2. No more strategies are strictly dominated, and

hence ( ) ( ) ( ) and ( ) all survive IESDS. (Note: after the

last stage abo ve,  is weakly dominated by  forplayer1,afterwhich is

dominated b y  for player 1, so that ( ) would be the only strategy proﬁle

that w o uld survive iterated elim ination of wea kly domin ated strategies. ¥

6. Roommates: Two roommates need to each c hoose to clean their apartmen t,

and each can ch oose an amount of time 



≥ 0 to clean. If their c h oices are 



and 



,thenplayer’s payoﬀ is given by (10 −



)



−



.(Thispayoﬀ function

impliesthatthemoreoneroommatecleans,thelessvaluableiscleaningfor

the other roomm ate.)

Th o s e familiar wit h eB ay know abo u t sniping , which i s bid d in g in the last m inut e. It st ill is a we a k ly do min a te d

strategy to bid your valuation at that time, and waiting for the last minute may b e a “best response” if you b elieve

other p eople may respond to an early bid. M ore on this is discussed in chapter 13.

56 4. Rationality and Common Knowledge

(a) What is the best response correspondence of each play e r ?

Answ er: Play er  maximizes (10 −



)



−



given a belief about 



,and

the ﬁrst-order optimality condition is 10 − 



− 2



=0implying that

the best response is 



10−



 ¥

(b) Whic h c hoices survive one round of IESDS?

Answ er: Themostplayer would choose is 



=5,whichisaBRto





=0. Hence, any 



 5 is do min ated by 



=5.

Hence, 



∈ [0 5] are

the choices that survive one round of IESDS.

Answ er: The analysis follow s the same ideas that were used for the

Courno t duopoly in section 4.2.2. In the second round of elimination,

because 

≤ 5 the best response 



10−



implies tha t ﬁrm 1 will

choose 

≥ 25, and a symm etric argument applies to ﬁrm 2. Hence,

the second round of elimination implies that the surviving strategy sets

are 



∈ [25 5] for  ∈ {1 2}. If this process w e re to con verge to an

interval, and not to a single point, then b y the symmetry betw een both

pla yers, th e resulting interval for each ﬁrm w ould be [

min



max

] that

simultaneou sly satisfy two equations with two unkno w ns: 

min

10−

max

and 

max

10−

min

. Ho wever, the only solution to these t wo equations is



min

= 

max

 Hence, the unique pair of c hoices that survive IESDS

forthisgameare

= 

. ¥

7. Campaigning: Tw o candidates, 1 and 2, are running for oﬃce. They each

ha v e one of three choices in running their campaign: focus on the positive

aspects of one’s o w n platform, call this a positive cam p a ign (or  ), focus on

thepositiveaspectsofone’sownplatformwhileattackingone’sopponent’s

This can b e shown directly: The payoﬀ from cho o sing 



=5when the opp onent is choosing 



is (5



(10 − 



)5 − 25 = 25 − 5



. T he payoﬀ fro m cho o sin g 



=5+ where 0 when th e opp onent is choosing 



(5 + 



)=(10−



)(5+)− (5+)

=25−5−

−







, and b ecause 0 it follows tha t (5+ 



) (5



)



4. Rationality and Common Kno wledge 57

campaign, call this a balanced campaign (or ), and ﬁnally, focus only on at-

tac king one’s opponent, call this a negative campaign (or  ). All a candidate

cares about is the probabilit y of win n ing , so assume that if a candid ate ex-

pects to win with probab ility  ∈ [0 1], then his payoﬀ is . The probabilit y

that a candidate wins depends on his c hoice of campaign and his opponen t’s

choice. The proba bilities of winning are given as follo w s:

• — If both choose the same campaign, each wins with probabilit y 0.5.

— If candidate  uses a positive campa ign while  6=  uses a balanced

one, then  loses for sure.

— If candidate  uses a positiv e cam paig n wh ile  6=  uses a negative

one, then  wins with probability 0.3.

— If candidate  uses a negativ e campa ign while  6=  uses a balanced

one, then  wins with probability 0.6.

(a) Model this story as a normal form game. (It suﬃces to be speciﬁcabout

the pa yoﬀ function of one pla yer, and explainin g how the other player’s

pa y oﬀ function is diﬀeren t and why.)

Answ er: There are t w o pla y ers  ∈ {1 2}, each has three strategies





= {  } an d the p ayoﬀsare



( )=



( )=



( )=

05; 

( )=

( )=1; 

( )=

( )=0; 

( )=



( )=03; 

( )=

( )=07; 

( )=

()=06;

and 

()=

( )=04. ¥

(b) Write the game in matrix form.

Answ er:

Play er 1

Player 2



 05 05 0 1 03 07

 1 0 05 05 04 06

 07 03 06 04 05 05

58 4. Rationality and Common Knowledge

gies? Will this procedure lead to a clear prediction?

Answ er: No tice that for each pla yer  strictly dom inates  .Inthe

remain ing 2 × 2 gam e without the strategies  ,  strictly dominates 

for eac h player. Hence, the unique clear prediction is that both candi-

dateswillengageinnegativecampaigns.¥

8. Consider the -Beauty contest presented in section 4.3.5.

(a) Show that if pla yer  believes that ev eryon e else is ch oosing 20 then 19

is not the only best response for an y number of pla yers .

Answ er: If everyone else is c h oosing 20 and if player  chooses 19 then

of the avera ge will be somew here belo w 15, and 19 is closer to that

n u mber, and therefore is a best response. Bu t the same argument holds

for any choice of player  that is bet ween 15 and and 20 regardless of

the number of players. (In fact, y ou should be able to con vinc e yourse lf

that this will be true for any choice of  between 10 and 20.) ¥

(b) Sho w that the set of best response strategies to everyone else c hoosing

the n umber 20 depends on the n umber of play ers .

Answ er: Imagine that  =2. If one player  is choosing 20, then an y

number 



between 0 and 19 will beat 20. This follow s because the target

number (

oftheaverage)isequalto

20+







, the distance

between 20 and the target nu mber is

−





(this will always be positive

because the target n umber is less than 20) while the distance between





and the target n umber is





−

. The latter will be smaller than

the former if and only if





−



−





,or−20 



 20.Given

the constraints on the c ho ices, 



∈ {0 1 19}. Now imag ine that

 =5. The target number is equal to

80+



=12+





,thedistance

bet ween 20 and the target number is 8−





whilethedistancebetween





and the target nu mber is





− 12

. The latter will be smaller than

4. Rationality and Common Kno wledge 59

the former if and only if

 − 12

 8 −

,or





 20. Hence,





= {6 7  19}. You should be able to convince yourself that as

 →∞,ifeveryonebut ch ooses 20 then ’s best response will converge

to 



= {10 11  19}. ¥

9. Consider the -Beauty contest presented in section 4.3.5. Show that if the

number of players 2 then the c hoices {0 1} foreachplayerareboth

Ratio na liz ab le, while if  =2then only the c hoice of {0} by eac h player is

Rationalizable.

Answ er: We start with  =2.Ifplayer2 c hooses 0 then pla yer 1’s best

response is clearly 0. Now imag in e that player 2 is ch oosing 1.Ifplayer1

chooses 

=1then they tie and he wins with probability 05, while if he

chooses 

=0then the target n umber is

and he win s for sure. Hen ce, 0 is

a best reply to 1 and only the choice of 0 b y both players is Ratio na lizable.

Now assume that 2. If all player’s but  choose 0, then ’s best response

is 0, and hen ce ch oosing 0 is Rationa lizable. Now assume that everyon e but

 chooses 1.Ifplayer1chooses

=1then he ties. If he c hooses 

=0then

thetargetnumberis



+1

≥

because  ≥ 2 (it is equal to

when  =2

and greater when 2). Hence, for  ≥ 2 the set of Rationalizable choices is

{0 1}. The analysis in the text sho ws that no other c hoice is Rationalizable

when  =

. ¥

10. P opsicle stands: There are ﬁve lifeguard towers lined along a beach , where

the left-most tower is number 1 and the righ t most tower is number 5. Two

vendors, pla yers 1 and 2, each have a popsicle stand that can be located next

to one of ﬁv e to wers. There are 25 people located next to each tower, and

eac h person will purchase a popsicle from the stand that is closest to him or

her. That is, if pla yer 1 locates his stand at tower 2 and p layer 2 at tower

3, then 50 people (at towers 1 and 2) will purc hase from player 1, while 75

(from towers 3,4 and 5) will pu rchase from vendor 2. Eac h purc ha se yields a

proﬁtof$1.

60 4. Rationality and Common Knowledge

(a) Specify the strategy set of each player. Are there any strictly dominated

strategies?

Answ er: The strategy sets for each pla yer are 



= {1 25} where

each cho ice represen ts a tower. To see whether there are an y strictly

dom inated strategies it is useful to construct the matrix representation

of this game. Assume that if a group of people are indiﬀerent between

the two places (equidistant) then they will split between the two v en dors

(e.g., if the v end o rs are at the same tower then their payoﬀs will be 62.5

each, while if they are located at towers 1 and 3 then they split the

people from tower 2 and their payoﬀs are 37.5 and 87.5 respectively.)

Otherw ise they get the people closest to them, so payoﬀsare:

Player 1

Play er 2

12345

1 625 625 25 100 375 875 50 75 625 625

2 100 25 625 625 50 75 625 625 75 50

3 875 375 75 50 625 625 75 50 875 375

4 75 50 625 625 50 75 625 625 100 25

5 625 625 50 75 375 875 25 100 62 5 625

Notice that the choices of 1 and 5 are strictly dominated b y any other

c hoice for both players 1 and 2. ¥

(b) Find the set of strategies that survive Rat ionaliza b ility.

Answ er: Because the strategies 1 and 5 are strictly domin ated then

they cannot be a best response to an y belief (P r oposition 4.3). In the

reduced game in whic h these strategies are remov ed , both strategies 2

and 4 are dominated by 3, and therefore cannot be a best response in

this second stage. Hence, only the choice {3} is rationalizable. ¥

This is page 61

Printer: Opaque

Pinning Down Beliefs: Nash Equilibrium

1. Prove Proposition ??.

Answ er: (1) Assum e that 

∗

is a strict dominant strategy equilibrium . This

implies that for an y pla yer , 

∗



is a best response to any c hoice of his oppo-

nents including 

∗

−

, whic h in turn implies that 

∗

is a Na sh equilibrium.

(2)Assumethat

∗

istheuniquesurvivorofIESDS.Byconstructionofthe

IESDS procedure, there is no round in which 

∗



is strictly domina ted against

the surviving strategies of ’s opponents, an in particular, against 

∗

−

,imply-

ing that 

∗



is a best response to 

∗

−

, which in turn imp lies that 

∗

is a Nash

equilibrium .

(3) Assume that 

∗

is the unique Rationalizable strategy proﬁle. By construc-

tion of the Rationa lizability procedur e, any strategy of player  that surviv es

a round of rationalizability can be a best response to some strategy of ’s op-

ponen ts that surviv es that round. Hence, by deﬁnition, 

∗



is a best response

to 

∗

−

, which in turn implies that 

∗

is a Na sh equ ilibr ium . ¥

2. A strategy 



∈  is a w eakly dom inant strategy equilibrium if 





∈





is a weakly dominan t strategy for all  ∈ .Thatisif



(







−

) ≥





(





−

) for all 



∈ 



and for all 

−

∈ 

−

. Pro vide an example of a gam e

62 5. Pinning Down Beliefs: Nash Equilibrium

for which there is a w eakly dominan t strategy equilibrium, as w ell as another

Nash equilibriu m .

Answ er: Con sid er the follo wing game:

Player 1

Player 2







1 1 1 1

1 1 2 2

In this game, ( ) is a we ak ly domin ant strategy equilibrium (and of

cour se, a Na sh equilib riu m), yet ( ) is a Nash equilibrium that is not

a w e akly dom in ant strategy equilibrium. ¥

3. Consider a 2 pla yer game with  pure strategies for each player that can be

represen ted by a  ×  m atrix. Assum e that for each pla yer no two pa yo ﬀs

in the matrix are the sam e.

(a) Show that if  =2and the game has a unique pure strategy Nash

equilib riu m then this is the unique strategy proﬁle that survives IESDS.

Answ er: Consider a general 2 × 2 game as follows,

Player 1

Play er 2



2



2



1



1

































and assume without loss of generalit y that (

1



2

) is the unique pure

strategy Nash equilibrium.

Two statements are true: ﬁrst, because

(

1



2

) is a Nash equilibrium and no two pa yoﬀs are the same for

each player then 







and 







.Second,because(

1



2

) is

not a Nash equilibrium then 







and 







cannot hold to-

gether (otherwise it w ould hav e been another Nash equilibrium ). These

The term “w ithou t loss of generality” me ans that we are choosing one particular strategy pro ﬁle bu t th e re is

nothing sp ecial ab out it an d we could have chosen any one of the others using the sam e argument.

5. Pinning Down Beliefs: Nash Equilibrium 63

t wo statemen ts imply that either () 







and 







} in which

case 

1

is dominated by 

1

,or() 







and 







in which

case 

2

is strictly dom inate d by 

2

. This implies that either 

1

or 

2

(or both) will be elim ina ted in the ﬁrst round of IESDS, and from the

fact that 







and 







it follo w s that if only one of the strate-

gies wa s removed in the ﬁrst round of IESDS then the remaining one

will be remo ved in the second and ﬁnalround,leaving(

1



2

) as the

unique strategy that survives IESDS. ¥

(b) Show that if  =3and the game has a unique pure strategy equilibrium

then it ma y not be the only strategy proﬁle that survives IESDS.

Answ er: Cons ider this following game :

Play er 1

Play er 2



 7 6 3 0 6 5

 1 3 4 4 0 2

 8 7 2 1 5 8

Notice that for both players none of the strategies are strictly dominated

implying that IESDS does not restrict any strategy proﬁle survives

IESD S. Ho wever, th is game has a unique N ash equilibrium: ( ).

4. Splitting Pizza: You and a friend are in an Italian restaurant, and the ow ner

oﬀers both of you an 8-slice pizza for free under the follo w ing condition. Eac h

of y ou m ust simultaneously announce how man y slices you w ould lik e; that

is, each p layer  ∈ {1 2} name s his desired am ount of pizza, 0 ≤ 



≤ 8.

If 

+ 

≤ 8 then the play ers get their demands (and the owner eats an y

lefto ver slices). If 

+ 

 8, then the play ers get nothing. Assume that

y ou eac h care only about how m uc h pizza you individually consume, and the

more the better.

(a) Write out or graph each player’s best-response correspondence.

64 5. Pinning Down Beliefs: Nash Equilibrium

Answ er: Restrict attention to integer demands (more on contin uous

demands is belo w). If pla yer  demands 



∈ {0 1  7} then ’s best

response is to demand the complemen t to 8 slices. If  asks for more

then both get nothing wh ile if  asks for less then he is leaving some

slices unclaim ed . If instead player  demands 



=8then pla yer  gets

nothing regardless of his request so any demand is a best response. In

summ ary,





(



(

8 − 



if 



∈ {0 1  7}

{0 1  8} if 





Note: if the pla yers ca n ask for amounts that ar e not restricted to

integers then the same logic applies and the best response is





(



(

8 − 



if 



∈ [0 8)

[0 8] if 





(b) What outcome s can be supported as pur e-strateg y Nash equilibria?

Answ er: It is easy to see from the best response correspondence that

any pair of deman d s that add up to 8 will be a Nash equilib rium, i.e.,

(0 8) (1 7)(8 0). Ho wev er, there is another Nash equilibrium: (8,8)

in w h ich both players get n othin g. It is a Nash equilibriu m because

given that each player is asking for 8 slices, the other player gets noth-

ing regardless of his request, hence he is indiﬀerent bet ween a ll of h is

requests including 8.

Note: The pair 



=8and 



=  where  ∈ {1 27} is not a Nash

equilibrium because ev en thou gh player  is pla y ing a best response to





,player is not playing a best response to 



because b y demanding

8 play er  received noth ing, but if he instead dem an ded 8 − 0 then

he would get those amoun t of slices and get something. ¥

5. Pu blic Good Contribution : Th ree players live in a town and each can

c h oose to contribu te to fund a street lamp. T h e value of having the street

5. Pinning Down Beliefs: Nash Equilibrium 65

lam p is 3 for eac h player and the value of not having one is 0.TheMayor

asks eac h pla yer to either contrib ute 1 or nothing. If at least two pla yers

con trib ute then the lam p will be erected. If one or less people con tribu te

then the lamp will not be erected, in which case an y person wh o contribu ted

will not get their mo ney back.

(a) Write out or graph each player’s best-response correspondence.

Answ er: Consider pla y er  with beliefs about the choice s of pla yers

 and . If neither  nor  contribute then play er  does not w ant to

contribute because the lamp would not be erected and he would lose

his con tribution. Similarly, if both  and  con tribute then pla yer  does

notwanttocontributebecausethelampwouldbeerectedwithouthis

con tribution so he can “free ride” on their contributions. The remaining

cases is where only one of the players  and  contribute, in which case b y

contributin g 1 player  receive s 3, while by not contributing he receiv es

0, and hen ce con tributing is a best response. In sum m ary,





(







(

0 if 



= 



1 if 



6= 





(b) What outcome s can be supported as pur e-strateg y Nash equilibria?

Answ er: The best response corresponden ce described in (a) above im-

plies that there are two kinds of Nash equilib ria: one kind (which is

unique) is where no pla yer contributes, and the other kind has t wo of

the three play ers con trib uting and the third free riding . Hence, either

the lamp being erected with two players co ntributing or the lamp not

being erected with no pla yer contribu ting can be supported as Nash

equilib r ia . ¥

6. Hawk-Dov e: The follo wing game has been widely used in evolutionary biol-

ogy to understand how “ﬁghting” and “display” strategies b y animals could

coexist in a population. For a typical Ha wk-Do ve game there are resources to

66 5. Pinning Down Beliefs: Nash Equilibrium

be gained (i.e. food, mates, territories, etc.) denoted as .Eachoftwoplayers

can c hooses to be aggressive, called “Ha wk ” (), or can be compromising,

called “Dove” (). If both players choose  then they sp lit the resources,

butloosesomepayoﬀ from inju ries, den o ted as . Assum e that 



.If

both choose  then they split the resources, but engage in som e display of

po wer that a display cost ,with



. Fina lly, if player  chooses  w h ile

 ch ooses ,then gets all the resources while  leav es with no beneﬁts and

no costs.

(a) Describe this gam e in a matrix

Answ er:

Player 1

Play er 2







− 



−   0

 0



− 



− 

(b) Assume that  =10,  =6and  =4. What outcomes can be supported

as pure-strategy Nash equilibria?

Answ er: The game is:

Play er 1

Play er 2



 −1 −1 10 0

 0 10 1 1

and the t wo strategy proﬁles that can be supported as pure strategy

Nash equilibria are ( ) and ( ), leading to outcom es (10 0) and

(0 10) respectively. ¥

In the evolutio n a r y bi o lo gy litera tu re , th e a n a ly sis pe rfo rmed i s o f a very diﬀerent nature. Instead of considering

the Nash equilibriu m an a ly s is o f a sta tic gam e , the a nalysis is a d ynamic a n a ly sis whe re succ es sful stra te g ie s

“replicat e ” in a larg e p opu la tio n . This a n a ly s is is p a rt o f a met h od o lo g y c a ll ed “evolu t io n a r y g a me th e or y.” For

more on this see Gintis (2 000).

5. Pinning Down Beliefs: Nash Equilibrium 67

7. The  player Tragedy of the Commons: Suppose there are  pla y ers in

the Tragedy of the Commons example in section 5.2.2.

(a) Find the N ash equilibrium of this g am e. H ow does  aﬀect the Nash

outcome?

Answ er: The analysis in section 2 concluded that in the -player game

the best response of pla yer  is given b y





(

−

 −

6=







First, let’s consider a symm etric Nash equilibrium wh ere eac h player

c hooses the same level of consum ption 

∗

. Because the best response

must hold for each  and they all c h oose the same level 

∗

then in the

Nash equilibrium the best response reduces to,



∗

 − ( − 1)

∗



or,



∗



 +1



The wa y in whic h  aﬀects the outcome is that ﬁrst, as there are more

ﬁrm s, eac h will consum e less clean air. Second, as there are more ﬁrms,

the sum of clean air consumed b y the ﬁrms is



−1

, whic h increases with

.

It is more subtle to show that there cannot be other Nash equilibria. To

show this we will sho w that conditional on whatever is c ho sen by all but

t wo players, the two players must ch oose th e same amou nt in a Nash

equilibrium . Assume that there is another asymmetric Nash equilibrium

in which t wo players,  and , choose two diﬀeren t equilibrium levels



∗



6= 

∗



.Let =

6=



∗



be the sum of all the other equilibrium

c ho ices of the players who are not  or . Because we assumed that this

is a Nash equ ilibrium , the best response function of both  and  must

hold simultaneously, that is,



∗



 −

 − 

∗



 (5.1)

68 5. Pinning Down Beliefs: Nash Equilibrium

and



∗



 −

 − 

∗



 (5.2)

If we substitute (5.2) into (5.1) w e obtain,



∗



 −

 −

−−

∗





which implies that 

∗



−

.Ifwesubstitutethisbackinto(5.2)we

obtain,



∗



 −

 −

−

 −



= 

∗



which con tra dicts the assumption w e started with, that 

∗



6= 

∗



. Hence,

the unique Nash equilibrium has all the players c hoosing the same level



∗



+1

. ¥

(b) Find the socially optimal outco m e with  players. How does  aﬀect

this outcome?

Answ er: The socially optimal outcome is found m y maximizing,

max

(







)



=1

ln(



)+ ln( −



=1





) 

The  ﬁrst order conditions for this problem are,





−



 −



=1





=0for  =1 2   .

Just as for the analysis of the Nash equilibrium in part (a), the solution

here is also symmetr ic. Therefore the optimal solution, 



,canbefound

using the followin g equa tion :





−



 − 



=0,

or, 





2

, and the socially optimal total consum ptio n of clean air w ill

be equal to



regardless of the number of play ers. This implies that the

socially optimal solution is for the players to equally divide up half of

the clean air. ¥

5. Pinning Down Beliefs: Nash Equilibrium 69

cient outcome as  approaches inﬁnit y?

Answ er: The Nash equilibrium outcome alw ays has the ﬁrms consume

too m uch clean air as compared to the total



amount that social opti-

mality requires. Furthermore, as  approac hes inﬁnity the Nash levels of

consumption approach the total amoun t of clean air  and the payoﬀs

of the players approaches −∞. ¥

8. The  ﬁrm Cournot M odel: Suppose there are  ﬁrms in the Cournot

oligopoly model. Let 



denote the quantity produced by ﬁrm ,andlet

 = 



+···+



denote the aggregate production. Let  () denote the mark et

clearing price (when demand equals ) and assume that inv erse demand

function is given b y  ()= −  (where ). Assume that ﬁrms ha ve

no ﬁxed cost, and the cost of producing quantity 



is 



(all ﬁrmshavethe

same mar gin al cost, and assume that ).

(a) Model this as a Norm al form game

Answ er: The pla y ers are  = {1 2  },eachplayerchooses



∈ 



where the strategy sets are 



=[0 ∞) for all  ∈ ,andthepayoﬀsof

each pla yer are given b y,





(





−

⎧

⎪

⎨

⎪

⎩

( −



=1





)



− 





=1







−





=1





≥ 

(b) What is the Nash (Cournot) Equilibrium of the game where ﬁrms c hoose

their quantities simultaneously?

Answ er: Let’s begin by assuming that there is a symm etric “inter ior

solution” where eac h ﬁrm ch ooses the same positive quantit y as a Nash

equilibr iu m , and then w e will show that this is the only possible Nash

70 5. Pinning Down Beliefs: Nash Equilibrium

equilib riu m. Because each ﬁrm maximizes





(





−

)=( −



=1





)



− 



the ﬁrst order condition is

 −

6=





− 2



−  =0

which yields the best response of play er  to be





(

−

 −

6=





− 

Imposing symm etry in equilibrium implies that all  best response con-

ditions will hold with the same values 

∗



= 

∗

for all  ∈ ,andcanbe

solved using the best response function as follows,



∗

 − ( − 1)

∗

− 

which yields



∗

 − 

 +1



It is more subtle to show that there cannot be other Nash equilibria. To

show this we will sho w that conditional on whatever is c ho sen by all but

t wo players, the two players must ch oose th e same amou nt in a Nash

equilibrium . Assume that there is another asymmetric Nash equilibrium

in which two players,  and , choose two diﬀerent equilibrium quantities



∗



6= 

∗



.Let =

6=



∗



be the sum of all the other equilibrium

quantity choices of the players who are not  or . Because we assumed

that this is a Nash equilibrium, the best response function of both  and

 m u st hold simultaneo usly, that is,



∗



 −

 − 

∗



− 

 (5.3)

and



∗



 −

 − 

∗



− 

 (5.4)

5. Pinning Down Beliefs: Nash Equilibrium 71

If we substitute (5.4) into (5.3) w e obtain,



∗



 −

 −

−−

∗



−

− 



which implies that 

∗



−−

. If we substitute this back into (5.4) w e

obtain,



∗



 −

 −

−−

− 

 −

 − 

= 

∗



which contradicts the assumption we started with, that 

∗



6= 

∗



. Hence,

the unique Nash equilibrium has all the players c hoosing the same level



∗

−

+1

. ¥

familiar?

Answ er: First consider the total quantity in the Nash equilibrium as

afunctionof,



∗

= 

∗

( − )

 +1

and the resulting limit pric e is

lim

→∞

 (

∗

) = lim

→∞

 −

( − )

 +1

= .

Thismeansthatasthenumberofﬁrms gro w, the Nash equilibrium

price will also fall and will approach the marginal costs of the ﬁrms as

the number o f ﬁrms gro w s to inﬁnit y. Th ose familiar with a stan d ard

econom ics class kno w that in perfect competition price will equal mar-

ginal costs, which is what happens here when  approaches inﬁnit y. ¥

9. Tragedy of the Ro ommates: You and your  −1 roommates each hav e 5

hours of free time you could spend cleaning your apartment. You all dislike

cleaning, but you a ll like having a clean room: each person’s pa yoﬀ is the

72 5. Pinning Down Beliefs: Nash Equilibrium

total hours spent (b y everyone) cleaning, min us a nu mber  times the hours

spent (individu a lly) cleanin g. That is,





(







)=− · 





=1





Assum e every o ne c h ooses simultaneously how much time to spend cleaning.

(a) Find the Nash equilibrium if 1.

Answ er: The pay oﬀ function is linear in one’s o wn time spen t 



and

in the tim e spent b y the other roommates 



, an d we can rew rite the

pa y oﬀ function as





(





−

)=



− 



6=







Consid erin g this pa yo ﬀ function, if 1 then every additional amou nt 

of time that  spend s cleaning giv es him an extra payoﬀ of (1−)0 so

that each player  would choose to spend all the 5 hours cleaning. Note

that using a ﬁrst-order condition w o uld not work here because taking

thederivativeof



(





−

) with respect to 



will just yield 1 −  =0

which is not true for 1. This implies that there is a “corner” solution

in the range 



∈ [0 5], in this case the Nash equilibrium is at the corner



∗



=5for all  =1 2  . ¥

(b) Find the Nash equilibrium if 1.

Answ er: Similarly to (a) above, every ad ditional am ount  of time

that  spends cleaning giv es him an extra pa yoﬀ of (1 − )0,so

that eac h player  would choose to spend no time cleaning and the Nash

equilib r iu m is 

∗



=0for all  =1 2  . ¥

can y ou ﬁnd an outcome where everyone is better oﬀ than at the Nash

equilib riu m outcome?

Answ er: Follo wing the analysis in part (b), the unique Nash equilib-

rium is where ev eryone c hooses to spend no time cleaning and ev eryone’s

5. Pinning Down Beliefs: Nash Equilibrium 73

pa y oﬀ is equal to zero. Consider th e ca se w h ere everyone is someh ow

forced to choose 



=1.Eachplayer’spayoﬀ will be





(





−

)=



− 



6=





=1− 2 × 1+4× 1=3 0 

so that all the players will be better oﬀ if they all chose 



=1.Infact,

eac h amou nt of time 0 that pla yer  c h ooses to clean cause him

a person al loss of  − 2 = , but increases the pa yoﬀ of eac h of the

other players by . Hence, if w e can get eac h pla yer to increase his time

cleaning by ,thisyieldsanincreaseofvalueforeachplayerthatequals

his o w n loss, but the former is multiplied by the number of players.

Hence, the best sym metr ic outcome is when each player c hooses 



=5.

10. Synergies: Two division manag ers can invest time and eﬀort in creating a

better working relationship. Eac h in vests 



≥ 0, and if both invest more then

both are better oﬀ, but it is costly for each manager to in vest. In particular,

the payoﬀ function for play e r  from eﬀort lev els (







) is 



(







)=( +





)



− 



(a) What is the best response correspondence of each pla yer?

Answ er: If player  believe s that pla yer  chooses 



then ’s ﬁrst order

optim ality conditio n for maxim izin g his payoﬀ is,

 + 



− 2



=0

yielding the best response function,





(



 + 



for all 



≥ 0

74 5. Pinning Down Beliefs: Nash Equilibrium

(b) In what way are the best response correspondences diﬀerent from those

in the Courn ot game? Why?

Answ er: Here the best response function of pla yer  is increasing in

the cho ice of player  wherea s in the Cournot model it is decreasing in

thechoiceofplayer. This is because in this game the choices of the

t wo players are strateg ic complements while in the Cournot game they

are strategic substitutes. ¥

Answ er: We solve two equation s with t wo unk now ns,



 + 

and 

 + 

which yield the solution 

= 

= . It is easy to see that it is uniqu e

because it is the only poin t at wh ich these t wo best response functions

cross. ¥

11. Wasteful Shipping Costs. Consider t wo coun tries,  and  each with a

monopolist that owns the only coal mine in the coun try, and it produces coal.

Let ﬁrm 1 be the one located in country ,andﬁrm 2 the one in country

.Let





∈ {1 2} and  ∈ { } denote the quantit y that ﬁrm  sells in

coun try . Consequ ently, let 



= 





+ 





be the total quantit y produced by

ﬁrm  ∈ {1 2},andlet



= 



+ 



be the total quan tity sold in cou ntry

 ∈ { }. The demand for coal in coun tries  and  is given respectively

by ,





=90−



∈ { }

and the costs of production for eac h ﬁrm is giv e n by,





(



)=10



∈ {1 2}

(a) Assume that the coun tries do not ha ve a trade agreement and, in fact,

imports in both countries are prohibited. This implies that 



= 



is set as a political constraint. Wh at quantities 



and 



will both

5. Pinning Down Beliefs: Nash Equilibrium 75

ﬁrm s produce?

Answ er: Eac h ﬁrm is a monopolist in its ow n coun try. Let and maxi-

mizes,

max







≥0

(90 − 





)





− 10





where either  =1and  =  or  =2and  =  (so that 







=0is set by assumption.) The ﬁrst order maximization condition is

90 − 2





− 10 = 0, which yields 



= 



=40 The pay oﬀ for each ﬁrm

is 1 600. ¥

Now assume that the two coun tries sign a free-trade agreem ent that

allows foreign ﬁrms to sell in their coun tr ies without any tariﬀs. There

are, ho wever shipping costs. If ﬁrm  sells quantity 





in the foreign

country (i.e., ﬁrm 1 selling in  or ﬁrm 2 selling in ) then shipping

costs are equal to 10





. Assume further that each ﬁrm chooses a pair of

quantities 











simultaneo u sly,  ∈ {1 2} so that a proﬁle of actions

consists of four quantity c hoices.

(b) Model this as a normal form game and ﬁn d a Nash equilibriu m of the

game y ou described. Is it unique?

Answ er: This game has tw o play ers,  ∈ {1 2}, eac h ch oosing a strategy

that consists of two non -n egative quan tities, (











) ∈ R

,andthe

pa y oﬀ ofthetwoplayersaregivenby,



(















)=(90− 



− 



)



+(90− 



− 



)



− 10(



+ 



) − 10







(















)=(90− 



− 



)



+(90− 



− 



)



− 10(



+ 



) − 10





where the ﬁrst term is the ﬁrm’s reve nue in marke t , the second is the

revenue in market , the third is the total production cost and the last

is the shippin g cost. Given beliefs (







) about what ﬁrm 2 chooses

to produce, ﬁrm 1’s optimization requires two partial derivative s with

76 5. Pinning Down Beliefs: Nash Equilibrium

respect to 



and 



as follo ws,



(















)





=90− 



− 2



− 10 = 0



(















)





=90− 



− 2



− 20 = 0

whichinturnleadtothetwopartsofﬁrm 1’s best response function,





80 − 



 (5.5)





70 − 



 (5.6)

It is easy to see that the objectiv e of ﬁrm 2 is symmetric to that of ﬁrm

1 and hence w e can directly write do wn ﬁrm 2’s best responses as,





70 − 



 (5.7)





80 − 



 (5.8)

The Nash equilibrium is solv ed b y ﬁnding a proﬁle of strategies (















)

for whic h (5.5), (5.6), (5.7) and (5.8) all hold simultaneou sly. From (5.5)

and (5.7) w e obtain 



=30and , 



=20. Sim ila rly, from (5.6) and

(5.8)weobtain



=20and , 



=30.Thepayoﬀ of each ﬁrm s would

be equal to 1 300.

Now assume that before t he game yo u described in (b) is p layed, the

research department of ﬁrm 1 disco v ered that shipping coal with the

current ships causes the release of pollutants. If the ﬁrm would disclose

this report to the World-Trade-O rgan ization (WT O) then the WTO

w ould prohibit the use of the current ships. Instead, a new shipping

Because the payoﬀ function has no interactions b etween the m arkets (i.e., it is separable in the two m arkets

so tha t there a re no interactions throu gh the cost function) then 



depend s only on 



and 



depends only

on 



(and vice versa for ﬁrm 2). If costs were not linear then this would not b e the case and the solution would

involve solving four equations wit four unknowns simultaneously.

5. Pinning Down Beliefs: Nash Equilibrium 77

tec hnology would be oﬀered that w ould increase shipping costs to 40





(instead of 10





as abo ve).

y our answ er with an equilibrium analysis.

Answ er: To answer this w e need to solv e the Nash equilibrium with the

more expensive shipping tec hnology and compare the proﬁts to that of

the curren t cheaper technology. We know that a monopolist (or compet-

itive ﬁrm ) would never prefer a m ore expensiv e tec hnology to a cheaper

one, but here there may be interesting strategic eﬀects: the more ex-

pensive shipping technology will dampen competition. The new pa y o ﬀ

functions are



(















)=(90− 



− 



)



+(90− 



− 



)



− 10(



+ 



) − 40







(















)=(90− 



− 



)



+(90− 



− 



)



− 10(



+ 



) − 40





and follo w ing the same argumen ts in part (b) above, the four equations

that will deﬁne the best responses of both ﬁrms are,





80 − 



 (5.9)





40 − 



 (5.10)

and,





40 − 



 (5.11)





80 − 



 (5.12)

From (5.9) and (5.11) we obtain 



=40and , 



=0. Similarly, from

(5.10) and (5.12) w e obtain 



=0and , 



=40. The pay oﬀ of each

ﬁrm s would be equal to 1 600, as w e calculated in part (a) above. Hence,

the ﬁrm wo uld lik e to disclose the informa tion and let the WT O impose

abanthatwouldeﬀectively kill cross-border competition. ¥

78 5. Pinning Down Beliefs: Nash Equilibrium

12. Asym m e tric Be rtrand : Consider the Bertrand game with 

(

)=

and



(

)=2

,demandequalto =100− ,andwhereﬁrm s m ust c hoose

prices in increments of one cen t. We have seen in section ?? that one possible

Nash equilibriu m is (

∗



∗

)=(199 200).

(a) Sho w that there are other Nash equilibria for this game.

Answ er: Another Nash equilib rium is (



)=(150 151)In this

equilib r iu m ﬁrm 1 fulﬁlls market demand at a price of 1.50 and has no

incentive to cha ng e the price in eith er direction. Firm 2 is indiﬀeren t

between the curr ent price and any higher price, and strictly prefers it

to lower prices. ¥

(b) How man y Nash equilibria does this game ha ve?

Answ er: There are 100 Nash equilibria of this gam e starting with

(



)=(100 101) and going all the w a y up with one-cent increases

to (

∗



∗

)=(199 200). The same logic explains why eac h of these is

a Nash equilib riu m. ¥

13. Com parativ e E conom ics: Two high tech ﬁrms (1 and 2) are considering

a joint venture. Each ﬁrm  can in vest in a no vel techno logy, and can c hoose

a level of inv estm ent 



∈ [0 5] at a cost of 



(







(think of 



as how

many hours to train employees, or how much capital to buy for R&D labs).

The rev enue of each ﬁrm depends both on its investm ent, and of the other

ﬁrm ’s investmen t . In particular, if ﬁrm  and  choose 



and 



respectively,

then the gross revenue to ﬁrm  is

(







⎧

⎪

⎨

⎪

⎩

0 if 



 1

2 if 



≥ 1and 



 2





· 



if 



≥ 1and 



≥ 2

(a) Write do wn mathematically, and draw the proﬁtfunction(gross rev-

enue min u s costs) of ﬁrm  as a function of 



for three cases: () 



 2,

() 



=2,and() 



5. Pinning Down Beliefs: Nash Equilibrium 79

Answ er: For the case where 



 2 the pa yoﬀ (proﬁt) function of ﬁrm

 is,





(







(

0 −





if 



 1

2 −





if 



≥ 1



for the case where 



=2the payoﬀ function of ﬁrm  is,





(







(

0 −





if 



 1

2



−





if 



≥ 1



and for the case where 



=4the payoﬀ function of ﬁrm  is,





(







(

0 −





if 



 1

4



−





if 



≥ 1



The three proﬁt functions are depicted in the following ﬁgure:

1 2 3 4 5

-4

-2

All three share the same proﬁts in the range 



∈ [0 1) which is the red

line. The blac k line depicts the rest of the pa yoﬀ function for the case

of 



 2, the green line depicts the rest of the payoﬀ function for the

case of 



=2, and the blue line depicts the rest of the payoﬀ function

for the case of 



=4. ¥

(b) What is the best response function of ﬁrm  ?

Answ er: It is easy to see (and calculate) that when 



 2 then ﬁrm

’s best response is to c h oose 



=1,andwhen



 4 then ﬁrm ’s best

80 5. Pinning Down Beliefs: Nash Equilibrium

response is to choose 



=5(a “corner” solution.) When 



=2then

ﬁrm ’s best response solves

max





∈[05]

2



−







and the ﬁrst order optimality condition is 2−





=0which yield s 



=4.

More generally, as 



gro ws abov e 2 the best response of ﬁrm  will grow

above 4 until it hits the corner solution of 



=5. In the range in which

player ’s best response in between 4 and 5 he maximizes his pa y oﬀ

function which is,

max





∈[05]









−







and his best response is deriv ed form the ﬁrst order condition 



−





=0,

which yields,





(



)=2



It is easy to see that for any 



∈ [2 25] the best response of ﬁrm 

is within the range [4 5] and for any 



 25 thebestresponseof is

“stuc k” at the corner solutio n 



5. Hen ce, we can write down the

general best response functio n of ﬁrm  as,





(



⎧

⎪

⎨

⎪

⎩

1 if 



 2

2



if 



∈ [2 25]

5 if 



 25



(c)Itturnsoutthattherearetwoidentical pairs of suc h ﬁrm s (that is,

the techn ology above describes the situation for both pairs). One pair

in R ussia where coordination is hard to achiev e and business people

are v ery cautious, and the other pair in G erman y where coordination

is common and business people expect their partners to go the extra

mile. You learn that the R u ssia n ﬁrms are earning signiﬁca ntly less

proﬁts than the German ﬁrms, despite the fact that their tec h nolog ies

are identical. Can you use Na sh equilibrium analysis to shed light on

5. Pinning Down Beliefs: Nash Equilibrium 81

this dilemm a? If so, be precise and use your previo us analy sis to do so.

Answ er: The best response function described in part (b) leads to two

Nash equilibria: in the ﬁrst (

∗





∗



)=(1 1) and (

∗∗





∗∗



)=(5 5).In

the ﬁrst Nash equilibriu m the proﬁts of eac h ﬁrm are 

∗



=175, wh ile

in the second Nash equilibrium 

∗∗



=1875.Thisisanexamplewhere

“self fulﬁlling expectations” can lead to two Nash equilib ria, one with

high pay oﬀsandonewithlowpayoﬀs. This is an e xample of a gam e

with strateg ic com plements (see page 93) wh ere the co m plem e ntarity

cause m ultiple equilibria. ¥

14. Negativ e Ad Campaigns: Each one of two political parties can choose

to buy time on commercial radio sho w s to broadcast negativ e ad campaign s

against their rival. These c hoices are made sim ultaneously. Due to gov ern-

ment regulation it is forbidden to buy more than 2 hours of negativ e campaign

time so that eac h part y cannot choose an amount of negative campaigning

abo v e 2 hours. Giv en a pair of c hoices (



),thepayoﬀ of party  is given

by the follow ing function : 



(



)=



− 2



+ 







− (



)



(a) What is the normal form representation of this gam e?

Answ er: Tw o play ers  = {1 2}, for eac h pla yer the strategy space is





=[0 2] and the pa yoﬀ of play er  is given by 



(



)=



− 2











− (



)

. ¥

(b) What is the best response function for each part y?

Answ er: Each player maximizes 



(



) resulting in the ﬁrst order

optim ality condition 1+



− 2



=0resulting in the best response

function,





(



1+





82 5. Pinning Down Beliefs: Nash Equilibrium

Answ er: Solving the two best response functions simultaneou sly,



1+

and 

1+

yields the Nash equilibrium 

= 

=1, and this is the unique solution

to these equation s imp lyin g tha t this is the un iqu e equ ilibr ium. ¥

(d) If the parties could sign a binding agreement on how much to campaign ,

what levels would they c hoose?

Answ er: Both parties would be better oﬀ if they can c hoose not to

spend mon ey on negative cam paigns. Th e payoﬀs for each player from

the N ash equilibrium solved in part (c) are 



(1 1) = −1 while of they

agreed not to spend anything they eac h w ould obtain zero. This is a

varian t of the Prisoners’ Dilemma. ¥

15. Hotelling’s Con tin uous Model: Consider H ot ellin g’s m odel where the

citizens are a continuum of v oters on the interval  =[− ], with uniform

distribution ().

(a) What is the best response of candidate  if candidate  is choosing





 0.

Answ er: Pla y er ’s best response will depend on the position of 



relative to the ch oice

. For example, if 



= − ∈ [− 0) then any

choice 



∈ (



) will guaran tee pla yer  victory. This follow s because

player  will get mor e than  −  ofthevotefromtheinterval(



)

on his “right” (it is more because 



), and he will split the inner

interval (− 



) with play er  so that his total share of the vote is





−  +





− (−)

2 + 



− 



while the total share of pla y er  is the interval (− −) on his “left”

plus splitting the inner inte rval (− 



),whichis,





= − − (−)+





− (−)

2 + 



− 



5. Pinning Down Beliefs: Nash Equilibrium 83

which imp lies that 







and  will win the vote. A symmetric argu-

ment works for an y choice 



=  ∈ (0].Theremainingcaseis



=0.

In this case if 



6= 



then play er  gets less tan half the vote while

if 



= 



then pla yer  gets half the v o te, making 



= 



the best

response to 



=0. We conclude that the best response correspondence

is,





(



⎧

⎪

⎨

⎪

⎩

(



 −



) if 



 0

0 if 



(−







) if 



 0



(b) Sho w that the unique Nash equilibrium is 

= 

=0.

Answ er: This follo w s imm ed iately from the best response correspon-

dence in part (a) abo ve: only at the pair (







)=(0 0) are both pla yers

pla y ing a best response to each other. ¥

equilibrium is where is candidate c hooses the policy associated with the

median voter.

Answ er: This again follo ws from the analysis in part (a) above just

that instead of 0 being the poin t at which half the v ote is obtained, it

is the median vot er 

∗

for which half the v ote is at or abo v e 

∗

and half

thevoteisatorbelow

∗

16. Ho tellin g ’s Price Com petition: Imagine a con tinuum of potential buy ers,

located on the line segment [0 1], with uniform distribution. (Hence, the

If th e d is tr ib u tion  () is c ontinu ous the the re will be s o me 

∗

such that  (

∗

and that will be the

median voter. If there are “jumps” in the distribution  () then the median voter can b e som e 

∗

for wh ich

 (

∗

) 

. For instance , if half th e p o p u la t io n is distr ib u te d Un ifo rmly on [−1 1] and the oth er half are all

locat e d at the point 

∗

then

of the p op ulation are s trictly below 

∗

of the p opulation are strictly ab ove



∗

,and

of the population is exactly at 

∗

.Inthiscase

 ()=



1+

if − 1 ≤ 

3+

≤  ≤ 1



so that  (

∗

,but

∗

is still the median vote r.

84 5. Pinning Down Beliefs: Nash Equilibrium

“mass” or quan tit y of buyers in the interval [ ] is equal to  − .) Imagine

two ﬁrm s, pla yers 1 and 2 whoarelocatedateachendoftheinterval(player

1 at the 0 point and player 2 at the 1 point.) Each player  can c hoose its price





, and each customer goes to the v en dor who oﬀers them the highest value.

Ho wev er, price alone does not determine the value, but distance is importan t

as w ell. In particular, each buyer who buys the product from pla yer  has

anetvalueof − 



− 



where 



isthedistancebetweenthebuyerand

vendor , and represen ts the transportation costs of buying from v e ndor .

Th us, buy er  ∈ [0 1] buys from 1 and not 2 if  −

−

−

−

,andif

buying is better than getting zero. (He re 

=  and 

=1−. The buying

c h oice would be reversed if the inequality is rever sed.) Finally, assume that

thecostofproductioniszero.

(a) Assume that  is v ery large so that all the customers will be serv ed by at

least one ﬁrm, and that some customer 

∗

∈ [0 1] is ind i ﬀeren t between

the t wo ﬁrm s. What is the best response function of each player?

Answ er: Because customer 

∗

’s distance from ﬁrm 1 is 

∗

and his

distance from ﬁrm 2 is 1 − 

∗

, his indiﬀerence implies that

 − 

− 

∗

=  − 

− (1 − 

∗

)

which gives the equation for 

∗



∗

1+

− 



It follows that under the assumptions above, given prices 

and 

,the

demands for ﬁrms1and2aregivenby



(



)=

∗

1+

− 





(



)=1− 

∗

1+

− 



Firm 1’s maxim ization problem is

max



1+

− 



5. Pinning Down Beliefs: Nash Equilibrium 85

which yields the ﬁrst order condition

1+

− 2

=0

implying the best response function



A symm etric analysis yields the best response of ﬁrm 2,



(b) Assume that  =1. What is the Nash equilibrium? Is it unique?

Answ er: If we u se the best response functions calculated in part (a)

above then we obtain a unique Nash equilibrium 

= 

=1,andthis

implies that 

∗

so tha t each ﬁrm gets half the market. However,

when  =1th en the utilit y of customer 

∗

is −

−

=1−1−

−

, implying that he would prefer not to buy, and by contin uit y, an

in terval of customers around 

∗

w ould also prefer not to buy. his violated

the assum p tions we used to calculate the best response functions.

So,

the analysis in part (a) is invalid when  =1. It is therefore useful to

start with the monopoly case when  =1and see how each ﬁrm would

have priced if the other is absen t. Firm 1 maxim izes

max



(1 − 

)

wh ich yie lds the so lu t i on 

so th at everyone in the in terval  ∈

[0

] wished to buy from ﬁrm 1 and no other customer w ould buy. By

symm etry, if ﬁrm 2 w ere a monopoly then the solution w o uld be 

so that everyone in the interval  ∈ [

 1] would buy from ﬁrm 2 and no

other custom er would buy. But this implies that if both ﬁrm s set their

We need  ≥ 15 for c u sto mer 

∗

to be just indiﬀerent between buying and not b uying when 

= 

=1.

All the other customers will strictly prefer buying.

86 5. Pinning Down Beliefs: Nash Equilibrium

monopoly prices 

= 

then eac h wou ld maxim ize proﬁts ignoring

the other ﬁrm,andhencethisisthe(trivially)uniqueNashequilibrium.





,so

that a buy er buys from 1 if a nd only if  − 

−



− 

−



Write the best response function of each player and solv e for the Nash

Equ ilibrium .

Answ er: Lik e in part (a), assum e that cu stom er 

∗

’s distance from

ﬁrm 1 is 

∗

and his distance from ﬁrm 2 is 1 − 

∗

, and he is indiﬀeren t

between buying from either, so his indiﬀerence implies that

 − 

−



∗

=  − 

−

(1 − 

∗

)

which gives the equation for 

∗



∗

+ 

− 



It follows that under the assumptions above, given prices 

and 

,the

demands for ﬁrms1and2aregivenby



(



)=

∗

+ 

− 





(



)=1− 

∗

+ 

− 



Firm 1’s maxim ization problem is

max



+ 

− 



which yields the ﬁrst order condition

+ 

− 2

=0

implying the best response function



5. Pinning Down Beliefs: Nash Equilibrium 87

A symm etric analysis yields the best response of ﬁrm 2,



The Nash equilibriu m is a pair of price s for which these two best re-

sponse functions hold sim ultaneously, which yields 

= 

,and



∗

. To verify th at this is a Nash equilibrium notice th at for cus-

tomer 

∗

, the utility form buying from ﬁrm 1 is −

−

=1−

−

implying that he is indeed indiﬀerent between buying or not, whic h in

turn implies that eve ry other customer prefer buying ov er not buying.

(d) Follow ing your analysis in (c) above, imagine that transportation costs

are 



,with ∈ [0

]. Wha t h appens to the N ash equilibrium as

 → 0? What is the intuition for this result?

Answ er: Usin g the assumed indiﬀerent customer 

∗

, his indiﬀerence

imp lies that

 − 

− 

∗

=  − 

− (1 − 

∗

)

 − 

−  =  − 

− (1 − )

which gives the equation for 

∗



∗

2

(

− 

) 

It follows that under the assumptions above, given prices 

and 

,the

demands for ﬁrms1and2aregivenby



(



)=

∗

2

(

− 

) 



(



)=1− 

∗

2

(

− 

) 

Firm 1’s maxim ization problem is

max



2

(

− 

)



88 5. Pinning Down Beliefs: Nash Equilibrium

which yields the ﬁrst order condition



2

−





=0

implying the best response function







A symm etric analysis yields the best response of ﬁrm 2,











The Nash equilibriu m is a pair of price s for which these two best re-

sponse fu nctions ho ld simultaneously, whic h y ields 

= 

= ,and



∗

. From the analysis in (c) abo ve we know that for any  ∈ [0

)

custom er 

∗

will strictly prefer to buy o ver not buying and so will every

other customer. We see that as  decreases, so do the equilibrium prices,

so that at the limit of  =0the prices will be zero. The intu ition is that

the transportation costs  cause ﬁrms 1 and 2 to be diﬀeren tiated, and

this “softens” the Bertrand competition bet ween the t wo ﬁrm s. When

the transportation costs are higher this implies that competition is less

ﬁerce and prices are higher, and the opposite holds for lo wer transporta-

tion costs.¥

17. To v ote or not to vote: Two candidates,  and , are running for ma y oral

election in a tow n with  residen ts. A total of 0 residen ts support

candidate  while the remainder  =  − support candidate .Thevalue

for each resident for having their candida te win is 4, for having him tie is 2,

and for having him lose is 0. Going to vote costs each resident 1

(a) Let  =2and  =1. Write down this game as a matrix and solv e for

the Nash equilib r ium.

Answ er: Thegameisbetweentheresidentsasthecandidatesseemnot

5. Pinning Down Beliefs: Nash Equilibrium 89

to pla y a role and the question is whether to v ote or not to v ote. Letting

 denote “y es” v ote and  denote “no” vote, the matrix representation

of this t wo player game is

Player 1

Play er 2



 1 1 3 0

 0 3 2 2

If both vote or both don’t v ote then there is a tie and the only diﬀerence

is the cost of v otin g. If only one v otes then his candidate wins and he

exerts the v o ting costs, while the other gains and expends nothing.

Voting is a dominant strategy so (  ) is the unique Nash equilib riu m.

(b) Let 2 be an even number and let  =  =



. Find all the N ash

equilib r ia .

Answ er: Observe that ev e ryo ne v o ting is a Nash equilibriu m . Like in

part (a) there will be a tie and every pla yer’s pay oﬀ is 1, while if he

c h ose not to vote then his c andid ate will lose and his payoﬀ will be

0, hence it is a Nash equilibrium. We no w show that no other proﬁle

of strategies is a Nash equilib rium in three steps. Let 

and 

denote

the number of member of eac h group that plan on voting. ()Assume

that a n identical number of voters from each side v otes so that there

is a tie but some v oter s are not vo ting, that is, 

= 





.Inthis

case any one of the vo ters who is no t voting w ou ld prefer to devia te,

expend a voting cost of 1 and increase his pa yoﬀ from2to3becausehe

would tip the ele ction . Hen ce , this cannot be a N ash equilibrium . ()

Now assume that the number of supporters of candidate  is is at least

2morethanthatofcandidate,thatis,

≥ 

+2.(Asymmetric

argum ent will apply to the case of 

≥ 

+2.) In this case any one of

the  supporters who plans to v ote knows that his v ote is redundant,

and hence he w ould prefer not to v ote and sa ve the v oting costs. Hence,

this cannot be a Nash equilibrium . ()Nowassumethatthenumber

90 5. Pinning Down Beliefs: Nash Equilibrium

of supporters of candidate  is is exactly 1 more than that of candidate

,thatis,

= 

+1. (A symm etric argument will apply to the case

of 

= 

+1.) In this case an y one of the  supporters who does not

plan to vote know s that his v ote can turn a loss into a tie, and hence

he wo uld prefer to v ote and c han ge the election giving him a pa yoﬀ of 1

instead of 0. H ence , this too cannot be a Nash equilibrium. This covers

all the possible scenarios and shows that every one v oting is the unique

Nash equilibrium . ¥

answer to (a)and(b)change?

Answ er: The tw o play er game is now

Play er 1

Play er 2



 −1 −1 1 0

 0 1 2 2

and the dominated strategy is voting, implying that the unique N ash

equilibr iu m is for the players not to v ote , ( ). A similar argument

to part (b) abov e sho ws that all pla yers not v oting is the unique Nash

equilib r iu m . ¥

18. P olitical Cam paigning: Two candidates are competing in a political race.

Each candidate  can spend 



≥ 0 on adds that reac h out to voters, whic h

in turn increases the probability that candidate  wins the race. Giv en a pair

of spending c h oices (



), the probability that candidate  wins is given







+

. If neither spends any resources then eac h wins with probability

Eac h candidate values winning at a pa y oﬀ of 0,andthecostofspending





is just 



(a) Giv e n t wo spend lev els (



), write the expected pa yoﬀ of a candidate



5. Pinning Down Beliefs: Nash Equilibrium 91

Answ er: Play er ’s payoﬀ function is





(











+ 

− 





(b) What is the function that represents eac h pla yer’s best response func-

tion?

Answ er: Pla y er 1 maximizes his pay oﬀ 

(



) shownin(a)above

and the ﬁrst order optimality condition is,

(

+ 

) − 



(

+ 

)

− 1=0

and if w e use 

(

) to denote pla yer 1’s best response function then

it explicitly solves the follo w ing equalit y that is derived from the ﬁrst-

order condition,

[

(

)]

+2

(

)

+(

)

− 

=0

Becau se this is a quadratic equation we can no t write an explicit best

response function (or correspondence). However, if we can graph 

(

)

asshowninthefollowingﬁgure (the values correspond for the case of

 =1).

0.2 0.4 0.6 0.8 1.0 1.2

-0.2

-0.1

0.0

0.1

0.2

0.3

0.4

Similarly w e can derive the symm et ric func tion for player 2. ¥

92 5. Pinning Down Beliefs: Nash Equilibrium

Answ er: The best response functions are symm etr ic mirror images and

have a symmetric solution where 

= 

in the unique Nash equilibr iu m .

We can therefore use an y one of the two best response functio ns and

replace both variables with a single variable ,



+2

+ 

−  =0

or,

 =



so that the uniqu e Nash equ ilibr iu m has 

∗

= 

∗



¥

(d) What happens to the Na sh equilib rium spending levels if  increases?

Answ er: It is easy to see from part (c) tha t higher values of  cause

the pla yers to spend more in equilibriu m . As the stak es of the prize rise,

it is mo re valuable to ﬁgh t over it. ¥

(e) What happens to the N ash equilibr iu m levels if pla yer 1 still values

winning at , but play er 2 values winning at  where 1?

Answ er: N ow the two best response functions are not symm etric. The

best response function of player 1 rema ins as above, but that of pla yer

2willnowhave instead of ,

(

)

+2



+(

)

− 

=0 ((BR1 ))

and

(

)

+2



+(

)

− 

=0 ((BR2 ))

Subtracting (BR2) from (BR1) w e obtain,



= 

which implies that the solution will no longer be sym m e tric and, more-

over, 



 which is in tuitive because no w pla y er 2 cares more about

the prize. Using 

= 

we substitute for 

in (BR1) to obtain,

(

)

+2(

)

+ 

(

)

− 

5. Pinning Down Beliefs: Nash Equilibrium 93

which results in,





1+2 + 





1+2 + 





where both inequalities follow from the fact that 1.From

= 

abovewehave







1+2 + 









+2

+ 



where the ine quality fo llows from 1. ¥

94 5. Pinning Down Beliefs: Nash Equilibrium

This is page 95

Printer: Opaque

M ixed Strategies

1. Use the best response correspondences in the B attle of the Sexes game to

ﬁnd all the Nash equilibria. (Follow the approac h used for the example in

section 6.2.3.)

Answ er: TheBattleoftheSexesgameisdescribedbythefollowingmatrix,

player 1

Player 2



 2 1 0 0

 0 0 1 2

Let  denote the probabilit y that player 1 pla ys  and let  be the probability

that player 2 pla ys . The expected pa yoﬀ of player 1 from playing  is



( )=2 and of pla ying  is 

( )=1− . It is easy to see that  is

better than  if and only if 21 − ,or

. Hence, the best response

correspondence of play er 1 is:



()=

⎧

⎪

⎨

⎪

⎩

 =1 if 

 ∈ [0 1] if  =

 =0 if 



96 6. Mixed Strategies

The best response of p layer 2 is derived a nalogou sly: 

( ) 

(  ) if

and only if 2(1 − ),or,

, implyin g that ,



()=

⎧

⎪

⎨

⎪

⎩

 =1 if 

 ∈ [0 1] if  =

 =0 if 



It is now easy to see that there are three Nash equilibria: ( ) ∈ {(1 1) (



) (0 0)}.

2. Let 



be a mixed strategy of pla yer  that puts positive weight on one strictly

dom inated pure strategy. Show that there exists a mixed strategy 



that puts

no w eight on any dom inated pure strategy and that dom inates 



Answ er: Let player  hav e  pure strategies 



= {

1



2

  



} and let 



be a pure strategy which is strictly dominated by 



,thatis,



(





−

) 





(





−

) for an y strategy proﬁle of ’s opponents 

−

.Let



=(

1



2

  



)

be a mixed strategy that puts some positiv e w eight 



 0 on 



and let 



be iden tical to 



except that it puts weight 0 on 



and div erts that w eight

over to 



.Thatis,



=0and 



= 



+ 



,and



= 



for all  6= 

and  6= 

. It follo ws that for all 

−





(





−



=1









(





−

) 



=1









(





−

)=



(





−

)

because 



(





−

) 



(





−

) and the w a y in which 



was constructed.

Hence, 



is strictly dom ina ted by 



. ¥

3. Consider the game used in section ?? as follows:

Play er 1

Player 2



 5 1 1 4 1 0

 3 2 0 0 3 5

 4 3 4 4 0 3

6. Mixed Strategies 97

(a) Find a strategy diﬀerent from (

()

()

()) = (0



) that

strictly dominates the pure strategy  forplayer2.Arguethatyoucan

ﬁnd an inﬁn ite n u mber of suc h strategies.

Answ er: The expected pa yoﬀ of any player in a m a trix game is con-

tinuous in the proba bilities of his mix ed strategy (because it is a lin-

ear fun ction of the probab ility weights), and hence if we “tweak” the

strategy (

()

()

()) = (0



) just a little bit then the pay-

oﬀs will be the same for an y c h oice of pla yer 1. For exam ple, tak e



=(

()

()

()) = (0



). The expected pay o ﬀ of pla yer

2from

against an y one of the three strategies of pla yer 1 are,



( 

)=04 × 4+06 × 0=16  1=

( )



( 

)=04 × 0+06 × 5=3 2=

( )



( 

)=04 × 4+06 × 3=34  3=

( )

which shows that 

strictly dom inates . It is therefore follows by the

continuit y of the expected payoﬀ function that an y one of the inﬁnitely

many mixed strategies that puts weights close to 0.5 on  and the

remain ing pro ba bility on  will dom inate .

(b) Find a strategy diﬀeren t from (

()

()

()) = (0



) that

strictly dominates the pure strategy  forplayer1inthegameremain-

ing after one stage of elimination. Argue that you can ﬁnd an inﬁnite

number of such strategies.

Answ er: This is an identical procedure as for part (a).

4. Monitoring: An emplo yee (pla yer 1) w h o w orks for a boss (play er 2) can

either w ork ( )orshirk(), while his boss can either monitor the employ e e

() or ignore him (). L ike most employee-boss rela tionship s, if th e em-

ployee is w orking then the boss prefers not to monitor, but if the boss is not

A m ore elegant way 0f writing this would b e to choose a m ixed strategy 

=(0

+ 

− ) and show that

for sm all enough values of  it follows t h a t 

strictly dominates , a nd it fo llows th a t th e re ar e inﬁnitely many

such values o f .

98 6. Mixed Strategies

monitorin g then the employee prefers to shirk. The game is represented in

the follo w ing matrix :

player 1

Play er 2



 1 1 1 2

 0 2 2 1

(a) Dra w the best response functions of each player.

Answ er: Let  be the probability that pla yer 1 ch ooses  and  the

probability that player 2 c hooses . It follows that 

( ) 

( )

if and only if 1  2(1 −),or

,and

( ) 

( ) if and only

if  +2(1− )  2 +(1− ),or

. It follo ws that for pla y er 1,



()=

⎧

⎪

⎨

⎪

⎩

 =0 if 

 ∈ [0 1] if  =

 =1 if 

and for player 2,



()=

⎧

⎪

⎨

⎪

⎩

 =1 if 

 ∈ [0 1] if  =

 =0 if 



Notice that these are iden tical to the best response functions for the

matchingpenniesgame(seeFigure6.3).¥

(b) Find the Nash equilibrium of this game. What kind of game does this

game remind yo u of?

Answ er: From the two best response correspondences the unique Nash

equilib r iu m is ( )=(



) and the game’s strategic forces are identical

to those in the Match ing Pennies game. ¥

5. Cops and Robbers: Player 1 is a police oﬃcer who must decide whether to

patrol the streets or to hang out at the coﬀee shop. His pa yoﬀ from hang ing

out at the coﬀee shop is 10, while his pay oﬀ from p atrolling the streets

6. Mixed Strategies 99

depends on whether he catches a robber, who is pla yer 2. If the robber prowls

the streets then the police oﬃcer will catch him and obtain a payoﬀ of 20.

If the robber stays in his hidea way then the oﬃcer’s payoﬀ is 0. The robber

m u st choose between sta ying hidden or pro wling the street. If he stays hidden

then his payoﬀ is 0, while if he wa lk s the street his pay o ﬀ is (−10) if the oﬃcer

is patrolling the streets, and it is 10 if the oﬃcer is at the coﬀee shop.

(a) Write dow n the matrix form of this gam e.

Answ er: Let  denote patrol and  coﬀee shop for player 1, and  is

the robber’s choice of pro w ling while  is remaining hidden. Th e game

is therefore

player 1

Play er 2



 20 −10 0 0

 10 10 10 0

(b) Dra w the best response functions of each player.

Answ er: Let  be th e probability th at pla yer 1 chooses  and  the

probability that player 2 ch ooses . It follow s that 

( ) 

( )

if and only if 2010,or

,and

( ) 

( ) if and only if

−10 +10(1− )  0,or

.Itfollowsthatforplayer1,



()=

⎧

⎪

⎨

⎪

⎩

 =0 if 

 ∈ [0 1] if  =

 =1 if 

and for player 2,



()=

⎧

⎪

⎨

⎪

⎩

 =1 if 

 ∈ [0 1] if  =

 =0 if 



Notice that these are iden tical to the best response functions for the

matchingpenniesgame(seeFigure6.3).¥

100 6. Mixed Strategies

game remind yo u of?

Answ er: From the t wo best response correspondences the unique Nash

equilib r iu m is ( )=(



) and the game’s strategic forces are identical

to those in the Match ing Pennies game. ¥

6. Declining Industry: Consider tw o competing ﬁrms in a declining indus-

try that cannot support both ﬁrms proﬁta bly. Each ﬁrm has three possible

choicesasitmustdecidewhetherornottoexittheindustryimmediately,at

the end of this quarter, or at the end of the next quarter. If a ﬁrm c h ooses

to exit then its payoﬀ is 0 from that poin t on ward. Ev ery quarter that both

ﬁrm s operate yields each a loss equal to −1, and eac h quarter that a ﬁrm

operates alone yields a pa yoﬀ of 2 For example, if ﬁrm 1 plans to exit at the

end of this quarter while ﬁrm 2 plans to exit at the end of the next quarter

then the payoﬀsare(−1 1) because both ﬁrms lose −1 in the ﬁrst quarter

and ﬁrm 2 gains 2 in the second. The pay oﬀ for each ﬁrm is the sum of its

quarterly payoﬀs.

(a) Write dow n this game in matrix form.

Answ er: Let  denote immediate exit,  denote exit this quarter , and

 denote exit next quarter.

Play er 1

Player 2

 

 0 0 0 2 0 4

 2 0 −1 −1 −1 1

 4 0 1 −1 −2 −2

(b) Are there any strictly dominated strategies? Are there an y w eakly dom-

inated strategies?

Answ er: There are no strictly dominated strategies but there is a

w eakly dominated one:  . To see this note that choosing both  and

6. Mixed Strategies 101

 with probability

eac h yields the same expected payoﬀ as c hoos-

ing  against  or  , and a higher expected pay oﬀ against  and

hence 



=(



()



( )



()) = (

 0

) weakly dominates  .The

reason there is no strictly domina ted strategy is that, starting with 



increasing the weight on  causes the mixed strategy to be w or se than

 against , while increasing the w eig ht on  causes the mixed strat-

egy to be w orse than  against  , implying it is impossible to ﬁnd a

mixed strategy that strictly dominates  . ¥

Answ er: Because  is we ak ly dom ina ted , it is suspect of never being

a best response. A quic k observation should conv ince you that this is

indeed the case: it is never a best response to an y of the pure strategies,

and hence cannot be part of a pure strategy Nash equilibrium . Rem oving

 fro m consid eration results in the reduced game:

Play er 1

Player 2



 0 0 0 4

 4 0 −2 −2

for whic h there are two pure strategy Nash equilibria, ( ) and ( ).

(d) Find the unique mixed strategy Nash equilibr ium (hint: you can use

y o ur answer to (b) to mak e things easier.)

Answ er: We start by ignoring  and using the reduced game in part

part of a Nash equilibrium . We need to ﬁn d a pair of mixed strategies,

(

()

()) and (

()

()) that mak e both players indiﬀerent

between  and  . For pla yer 1 the indiﬀerence equatio n is,

0=4

() − 2(1 − 

())

which results in 

()=

,andforplayer2theindiﬀerence equation

is symmetric, resulting in 

()=

. Hence, the mixed strategy Nash

102 6. Mixed Strategies

equilib riu m o f the original gam e is (



()



( )



()) = (

 0

Notice that at this Nash equilibrium, each player is not only indiﬀerent

between  and , but choosing  giv es the sam e expected pa yoﬀ of

zero. However, c hoosing  with positiv e probabilit y cannot be part of

a mixed strategy Nash equilibrium. To pro ve this let player 2 play the

mixed strategy 

=(

()

( )

()) = (

2



2

 1 −

2

−

2

)

The strategy  forplayer1isatleastasgoodas if and only if,

0 ≤ 2

2

− 

2

− (1 − 

2

− 

2

)

or, 

2

≥

. The strategy  for player 1 is at least as good as  if and

only if,

4

2

− 

2

− 2(1 − 

2

− 

2

) ≤ 2

2

− 

2

− (1 − 

2

− 

2

)

or, 

2

≤ 1−3

2

.Butif

2

≥

(when  is as good as )then

2

≤

1 − 3

2

reduces to 

2

≤ 0, whic h can only hold when 

2

and



2

=0(which is the Nash equ ilib rium w e found abo ve). A symm etric

argument holds to conclude that (



()



( )



()) = (

 0

) is the

uniqu e mixed strategy Nash equilibrium. ¥

7. Grad School Competition: Twostudentssignupforanhonorsthesis

with a Professor. Eac h can in vest time in their o w n project: either no time,

oneweek,ortwoweeks(thesearetheonlythreeoptions).Thecostoftimeis

0 fornotime,andeachweekcosts1 unit of pa yoﬀ. The more time a student

puts in the better their work will be, so that if one student puts in more time

than the other there will be a clear “leader”. If they put in the same amount

of time then their thesis projects will ha ve the same quality. The professor,

however, will give out only one “A” grade. If there is a clear leader then he

will get the A, while if they are equally good then the professor will toss a

fair coin to decide who gets the A grade. The other studen t gets a “B”. Since

both wish to con tinue to gra dua te sch ool, a gr ade of A is worth 3 wh ile a

grade of B is wo rth zero.

(a) Write dow n this game in matrix form.

6. Mixed Strategies 103

Answ er: Let  denote no time,  denote one w eek, and  denote t wo

weeks. The matrix game is,

Player 1

Player 2

 

 15 15 0 2 0 1

 2 0 05 05 −1 1

 1 0 1 −1 −05 −05

The pa y oﬀs are deriv ed b y the fact that a tie is an equal c hance of

getting 3 so each player gets 1.5 in expectation. ¥

(b) Are there any strictly dominated strategies? Are there an y w eakly dom-

inated strategies?

Answ er: Eachoneofthethreestrategiescanbeastrictbestresponse:

 is a best response to  ,  is a best response to ,and is a best

response to . Hence, no strategy is strictly or weakly dominated . ¥

Answ er: Let 



=(







 1 −



−



) denote a mixed strategy for

player . Because the game is symm etric it suﬃces to solve the indiﬀer-

ence condition s for one pla yer. For player  to be indiﬀeren t bet ween 

and ,

15



=2



+05



− (1 − 



− 



)

and for him to be indiﬀerent bet ween  and  ,

15



= 



+ 



− 05(1 − 



− 



)

Solving these t wo equations with t wo unknowns yields 



= 



implying that the unique mixed strategy Nash equilibrium has the

pla yers mixing between all three pure strategies with equal probabilit y.

8. Market en try: There are 3 ﬁrm s that are considering enter ing a new market.

The pa y oﬀ for each ﬁrm that enters is

150



where  is the n umber of ﬁrms

that en ter . The cost of entering is 62.

104 6. Mixed Strategies

(a) Find all the pure strategy Nash equilibria.

Answ er: The costs of entry are 62 so the beneﬁts of entry m u st be at

leastthatforaﬁrm to choose to enter. Clearly, if a ﬁrm believes the

other two are not entering then it wan ts to en ter, and if it believ es that

the other ﬁrm s are entering then it would sta y out (it w ould only get

50). If a ﬁrm believes that only one other ﬁrm is entering then it prefers

to enter and get 75. Hence, there are three pure strategy Nash equilibria

in which two of the three ﬁrms en ter and one stays out. ¥

(b) Find the symm etric mixed strategy equilibrium where all three players

enter with the same probability.

Answ er: Let  be the probability that a ﬁrm enters. In order to be

willing to mix th e ex pected payoﬀ of en tering must be equal to zero.

If a ﬁrm en ters then with pro bab ility 

it will face two other en trants

and receive 



=50− 62 = −12 with prob ab ility (1 − )

it will face

no other en tran ts and receive 



=150− 62 = 88 and with probab ility

2(1 − ) it will face one other en trant and receive 



=75− 62 = 13

Hen ce, to be willing to mix the expected pa yo ﬀ must be zero,

+1−

(1 − )

88 + 2(1 − )13 − 

12 = 0

which results in the qu ad ratic equation 25

− 75 +44=0,andthe

relevan t solution (between 0 and 1) is  =

. ¥

9. Discrete all pay auction: In section 6.1.4 we introduced a v ersion of an all

pa y auction that work ed as follo ws: Eac h bidder submits a bid. The highest

bidder gets the good, but all bidders pay there bids.Consideranauctionin

which player 1 values the item at 3 while player 2 values the item at 5 Each

player ca n bid either 0 1 or 2. T he twist is that each pla yer pays his bid

regardless of wheth er he wins the good. If pla yer  bids more than pla yer 

then  win’s the good and both pa y. If both players bid the sam e am ount

then a coin is tossed to determine who gets the good but again, both pay.

6. Mixed Strategies 105

(a) Write down the game in ma trix form. Which strategies survive IESDS ?

Answ er: Let  denote zero,  denote one, and  denote tw o. The

matrix game is,

Player 1

Player 2

 

 15 25 0 4 0 3

 2 0 05 15 −1 3

 1 0 1 −1 −05 05

The pa y oﬀs are deriv ed b y the fact that a tie is an equal c hance of

winnin g so pla yer 1 gets 1.5 and pla yer 2 gets 2.5 in expectation. It is

easy to see that for play er 2, pla ying  is dominated b y pla ying  ,so

it is eliminated in the ﬁrst stage of IESD S . In the second stage  is

dominated b y  for player 1 and we are left with the following reduced

game that survives IESDS,

Player 1

Play er 2



 0 4 0 3

 1 −1 −05 05

(b) Find the Nash equilibria of this game.

Answ er: From the reduced game it is easy to see that there is no pure

strategy Nash equilibrium . Let 

=(

1



1

) and 

=(

2



2

)

denote the mixed strategies for the players in the reduced game. For

player 1 to be indiﬀeren t bet ween  and  ,

0=

2

− 05(1 − 

2

)

which yields 

2

.Forplayer2tobeindiﬀerent between  and  ,

4

1

− (1 − 

1

)=3

1

+05(1 − 

1

)

106 6. Mixed Strategies

which yields 

1

=06. Thus, the unique mixed strategy Nash equilib-

rium has the pla yers mixing 



) and 



) in the reduced

game, or 

 0

) and 

=(0



) in the original game. ¥

10. Contin uous all pay auction: Consid er an all-pa y auction for a good w orth

1toeachofthetwobidders.Eachbiddercanchoosetooﬀer a bid from the

unit interval so that 



=[0 1]. Players only care about the expected value

they will end up with at the end of the game (i.e., if a player bids 0.4 and

expects to win with probab ility 0.7 then his payoﬀ is 07 × 1 − 04).

(a) Model this auction as a norm al-form game.

Answ er: There are t wo players,  = {1 2}, eac h has a strategy set





=[0 1], and assuming that the players are equa lly lik ely to get the

good in case of a tie, the payoﬀ to play er  is given by





(







⎧

⎪

⎨

⎪

⎩

1 − 



if 







− 



if 



= 



−



if 







(b) Sho w that this game has no pure strategy Nash Equilibrium.

Answ er: First, it cannot be the case that 



= 



 1 because then each

player w o uld beneﬁt from raising his bid b y a tiny amount  in order to

win the auction and receive a higher payoﬀ 1 − −





−



. Second,

it cannot be the case that 



= 



=1because each pla y er would prefer

to bid nothing and receiv e 0  −

. Last, it cannot be the case that









≥ 0 because then pla y er  would prefer to lower his bid by 

while still beating player  and paying less money. Hence, there cannot

be a pure strategy Na sh equilib rium. ¥

player is rando m izing over a ﬁnite number of bids.

Answ er: Assume that a Nash eq uilibriu m involves player 1 mixin g

6. Mixed Strategies 107

between a ﬁnite n um ber of bids, {



12

 

1

} where 

≥ 0 is the

lowest bid, 

1

≤ 1 is the highest, 

1



1(+1)

andeachbid

1

bein g played with som e positive proba b ility 

1

. Similarly assume that

player 2 is mixing between a ﬁnite number of bids, {



22

 

2

}

andeachbid

2

is being pla yed with som e positiv e probability 

2

.()

First observe that it cannot be true that 

1



2

(or the reverse by

symm etry). If it w er e the case then player 2 will win for sure when he

bids 

2

and pay his bid, while if he reduces his bid b y some  such that



1



2

−  then he will still win for sure and pay less, cont radictin g

that pla y ing 

2

was part of a Nash equilibrium . () Second observe

that when 

1

= 

2

then the expected pay oﬀ of pla y er 2 from bidding



2



=Pr{

1



2

}(1 − 

2

)+Pr{

1

= 

2

}(

− 

2

)

=(1− 

1

)(1 − 

2

)+

1

(

− 

2

)

=1− 

2

−





≥ 0

where the last inequalit y follow s from the fact that 

2

 0 (he would

not play it w ith positive probability if the expected pa yoﬀ were nega-

tive.) Let 

2

= 

2

+  where  =





. If instead of bidding 

2

player

2bids

2

then he wins for sure and his utility is



=1− 

2

=1− 

2

−





 1 − 

2

−





contradicting that playin g 

2

was part of a Nash equilibriu m . ¥

(d) Consider mixed strategies of the follo wing form: Eac h play er  chooses

and interval, [



 



] with 0 ≤ 



 



≤ 1 together with a cum u lative

distribution 



() over the interval [



 



] (Alternatively y ou can think

of each player choosing 



() over the interv al [0 1], with two values 



and 



suc h that 



(



)=0and 



(



)=1.)

i. Show that if two such strategies are a mixed strategy Nash equilib-

rium then it m ust be that 

= 

and 

= 



108 6. Mixed Strategies

Answ er: Assum e not. There are two cases: () 

6= 

:With-

out loss assume that 



. This means that there are values

of 

∈ (



) for wh ich 

 0 but for which pla yer 1 loses

with probab ility 1. This implies that the expected payoﬀ from this

bidisnegative,andplayer wouldbebetteroﬀ bidding 0 instead.

Hence, 

= 

must hold. () 

6= 

: Witho ut loss assum e that



 

. This means that there are values of 

∈ (

 

) for whi ch





 1 but for w hich player 2 wins with prob ability 1. But

then play er 2 could replace 

with 

= 

− with  small enough

such that





 1, he will win with proba bility 1 and pay

less than he would pa y with 

. Hence, 

= 

must hold. ¥

ii. Show that if two such strategies are a mixed strategy Nash equilib-

rium then it m ust be that 

= 

=0

Answ er: Assume not so that 

= 

=   0. This means that

when player  bids 

then he loses with probability 1, and get

an expected pa yoﬀ of −

 0. But instead o f biddin g  pla y er 

can bid 0 and receive 0 w hich is better than −

, imp lying that



= 

=   0 cannot be an equilibrium. ¥

iii. Using the abo ve, argue that if t wo such strategies are a mixed strat-

egy Nash equilibrium then both pla yers m u st be getting an expected

pa y oﬀ of zero.

Answ er: As proposition 6.1 states, if a player is randomizing be-

t ween two alternatives then he m u st be indiﬀerent betw een them.

Because both players are including 0 in the support of their mixed

strategy, their pa yoﬀ from 0 is zero, and hence their expected pa yoﬀ

from any ch oice in equilibrium m u st be zero. ¥

iv. Show that if two such strategies are a mixed strategy Nash equilib-

rium then it m ust be that



= 

=1

Answ er: Assume not so that



= 

= 1.From()abovethe

expected payoﬀ from any bid in [0

] is equal to zero. If one of the

6. Mixed Strategies 109

play ers deviates from this strategy and choose to bid  + 1 then

he will win with probab ility 1 and receiv e a payoﬀ of 1−(

+)  0,

con tradicting that



= 

= 1 is an equilibrium. ¥

v. Sho w that 



() being uniform over [0 1] is a symm etric Nash equi-

librium of this game.

Answ er: Ima g ine tha t player 2 is playing accor ding to the pro-

posed strategy 

() uniform ov er [0 1].Ifplayer1 bids some value



∈ [0 1] then his expected pa yoﬀ is

Pr{



}(1−

)+Pr{



}(−

)=

(1−

)+(1−

)(−

)=0

imply ing that player 1 is willing to bid an y value in the [0 1] in terval,

and in particular, c hoosing a bid according to 

() uniform o ver

[0 1]. Hence, this is a symm etric Nash equilibrium. ¥

11. Bribes: Two play ers ﬁnd themselves in a legal battle over a paten t. The

patent is worth 20 foreachplayer,sothewinnerwouldreceive20 and the

loser 0.Giventhenormsofthecountrytheyarein,itiscommontobribe

the judge of a case. Each pla yer can oﬀer a bribe secretly, and the one whose

bribe is the largest is awarded the patent. If both c hoose not to bribe, or

if the bribes are the same amount, then eac h has an equal c han ce of being

a warded the patent. If a pla yer does bribe, then bribes can be either a value

of 9 or of 20. Any other number is considered to be very unluc ky and the

judge w ould surely rule against a part y who oﬀers a diﬀerent n umber.

(a) Find the uniq ue pure-strategy N ash equilibr ium of this game.

Answ er: The gam e is captured in the follow ing two player matrix,

where  represents no paym ent,  represents a bribe of 9 and  a

bribe of 20. For exam ple, if both choose 9 then they have an equal

110 6. Mixed Strategies

c hance of getting 20, so the expected payoﬀ is

× 20 − 9=1,

Player 1

Player 2

 

 10 10 0 11 0 0

 11 0 1 1 −9 0

 0 0 0 −9 −10 −10

It is easy to see th at  is strictly dominated by .Intheremaining

game,  is strictly dom in ate d by  , and hence ( ) is the unique

Nash equilibrium . ¥

(b) If the norm were diﬀerent so that a bribe of 15 w ere also acceptable, is

there a pure strategy Nash equilibrium?

Answ er: Now the game is as follo w s (where  denotes a bribe of 15),

Player 1

Player 2

  

 10 10 0 11 0 5 0 0

 11 0 1 1 −9 5 −9 0

 5 0 5 −9 −5 −5 −15 0

 0 0 0 −9 0 −15 −10 −10

Using the best responses of eac h player it is easy to see that there is no

pure strategy Nash equilibrium . ¥

possible bribes of 9 15 and 20

Answ er: Note ﬁrst that  is w ea kly domina te d by ,soconsiderthe

game without  ,

Play er 1

Player 2

 

 10 10 0 11 0 5

 11 0 1 1 −9 5

 5 0 5 −9 −5 −5

6. Mixed Strategies 111

Let 



=(











) denote a mixed strategy for p layer  where





=1− 



− 



. The game is symmetric so for player 1 to be

indiﬀer ent bet ween  and  it must be that,

10

2

=11

2

+ 

2

− 9(1 − 

2

− 

2

)

which implies that 

2

−

2

.Forplayer1 to be indiﬀerent bet ween

 and  it must be that,

10

2

=5

2

+5

2

− 5(1 − 

2

− 

2

)

which im plies tha t 

2

. Hence, the unique (mixed strategy) Nash

equilib riu m has each player play 





). ¥

12. The Tax Man: A citizen (play er 1) m ust c hoose whether or not to ﬁle taxes

honestly or whether to cheat. The tax man (pla y er 2) decides how m uc h eﬀort

to in vest in auditing and can c hoose  ∈ [0 1], and the cost to the tax man of

investing at a lev el  is ()=100

. If the citizen is honest then he receives

the benc hmark pa yoﬀ of 0, and the tax man pays the auditing costs without

an y beneﬁt from the audit, yielding him a pa yoﬀ of (−100

). If the citizen

c h eats then his pa yoﬀ depends on whether he is caught. If he is caught then

his payoﬀ is (−100) and the tax man’s payoﬀ is 100 − 100

.Ifheisnot

caugh t then his payoﬀ is 50 whilethetaxman’spayoﬀ is (−100

) If the

citizen cheats and the tax man chooses to audit at lev el  then the citizen is

caught with probability  and is not cau ght with proba bility (1 − ).

(a) If the tax man believ es that the citizen is cheating for sure, w ha t is his

best response level of ?

Answ er: The tax man maximizes (100 −100

)+(1−)(0−100

100 − 100

.Theﬁrst-order optimality condition is 100 − 200 =0,

yielding  =

. ¥

(b) If the tax man believes that the citizen is ho nest for sure, what is h is

best response level of ?

112 6. Mixed Strategies

Answ er: The tax man maximizes −100

which is maxim ized at  =0.

what is his best response level of  as a function of ?

Answ er: The tax man maximizes (−100

)+(1−)(100 −100

100(1 −) −100

.Theﬁrst-ord er optimality condition is 100(1 −)−

200 =0, yielding the best response functio n 

∗

()=

1−

. ¥

(d) Is there a pu re strategy Nash equilibrium o f th is game? W hy or why

not?

Answ er: T here is no pure strategy N ash equilibrium. To see this, con-

sider the best response of player 1 who believ e s that pla yer 2 c hooses

some lev el  ∈ [0 1].Hispayoﬀ from being honest is 0 wh ile his payoﬀ

from c h ea ting is (−100) + (1 −)50 = 50 − 150. Hence, he prefers to

be honest if and only if 0 ≥ 50 − 150 or  ≥

. Letting 

∗

() denote

the best response correspondence of player 1 as the probability that he

is honest, we hav e that



∗

()=

⎧

⎪

⎨

⎪

⎩

1 if 

[0 1] if  =

0 if 

and it is easy to see that there are no values of  and  for wh ich both

pla yers are play ing mutual best responses. ¥

(e) Is there a mixe d strate gy Nash equilibrium of this gam e? Why or wh y

not?

Answ er: From (d) above w e kno w that pla yer 1 is willing to mix if

and only if  =

, which must therefore hold true in a mixed strategy

Nash equilibrium. For play er 2 to be willing to pla y  =

we use his

best response from part (c),

1−

, which yields,  =

. Hence, the

uniqu e mixed strategy Nash equilibriu m has player 1 being honest with

probab ility

and pla yer 2 c hoosing  =

. ¥

Part III

Dynamic Games of Complete

Information

113

114

This is page 115

Printer: Opaque

P r elimina r ies

1. Strategies: Im a gin e an extensive form game in which pla yer  has  infor-

matio n sets.

(a) If the player has an identical nu mber of  possible action s in each

informationset,howmanypurestrategiesdoeshehave?

Answ er: The play er has 



pure strategies.

(b) If the play er has 



actions in information set  ∈ {1 2  },how

many pure strategies does the player ha ve?

Answ er: The play er has 

× 

×···× 



pure strategies. ¥

2. Strate gies and equilibrium : Consider a two player game in wh ich player

1canchoose or . The game ends if he chooses  w h ile it continu es to

player 2 if he chooses .Player2canthenchoose or ,withhegame

ending after  and con tinuing again with player 1 after . Player 1 then can

choose  or  , and the game ends after each of these choices.

(a) Model this as an extensive form game t ree. Is it a game of perfect or

imperfect informa tio n?

116 7. Preliminaries

Answ er:

This gam e is a game of perfect informatio n . ¥

(b) Ho w man y terminal nodes does the game ha v e? Ho w man y information

sets?

Answ er: The game has 4 terminal nodes (after choices A, C, E and F)

and 3 information sets (one for each player. ¥

Answ er: Player 1 has 4 pure strategies and player 2 has 2. ¥

(d) Imagine that the pa yoﬀsfollowingchoice by player 1 are (2 0),fol-

lowing  by player 2 are (3 1), follo w ing  by player 1 are (0 0) and

follow in g  by player 1 are (1 2). Wha t are the Nash equilibria of this

game? Does one strik e y ou as more “appealing” than the other? If so,

explain why.

Answ er: We can write down the matrix form of this game as follows

( denotes a strategy for player 1 where  ∈ { } is what he does

in his ﬁrstinformationsetand ∈ {} in his second one),

Play er 1

Player 2



 2 0 2 0

 2 0 2 0

 3 1 0 0

 3 1 1 2

7. Preliminaries 117

It’s easy to see that there are three pure strategy Nash equilibria:

( ) ( ) and ( ). The equilibria ( ) ( ) are

Pareto dom inated by the equilibrium ( ), and hence it would be

temp ting to argue that ( ) is the mo re “appealing” equilibrium .

AswewillseeinChapter8itisactually( ) that has properties

that are more appealing (sequential rationality). ¥

3. Tick-tack-toe: The extensiv e form represen tation of a game can be cumber-

some even for very simple games. Consider the game of Tick-tack-toe where

2 pla yers mark “”or“”ina3 × 3 matrix. Play er 1 mov es ﬁrst, then

play er 2, and so on. If a player gets three of his kind in a row , column, or

one of the diagon als then he wins , and otherwise it is a tie. For this question

assum e that even after a winner is declared, the players mu st co mple t e ly ﬁll

the matrix before the game ends.

(a) Is this a game of perfect or imperfect information? Why?

Answ er: This is a game of perfect information because each pla yer

knows exactly what transpired before he m oves, and hence every infor-

mation set con tains one node. ¥

(b) Ho w many inform a tion sets d oes pla yer 2 have after player 1’s ﬁrst

move?

Answ er: Pla yer 2 has 9 inform ation sets, one for eac h of the moves of

player 1. ¥

ﬁrst move?

Answ er: Play er 2 has 8 possible mo ves in his ﬁrst turn, and this is

true for each one of the 9 possible moves that pla yer 1 has in hid ﬁrst

turn. Hence, player 1 has 9×8=72inform ation sets in his second move

(after pla yer 2’s ﬁrst mo ve).

118 7. Preliminaries

(d) Ho w many information sets does each pla yer ha ve in total? (Hin t: For

this and the next part y ou may want to use a program like Excel.)

Answ er: Continuing the logic of part (c), after player 1’s second mo ve,

player 2 has 9 × 8 × 7=504information sets, then pla yer 1 has 9 × 8 ×

7 × 6=3 024 informationsets,andsoon(15 120; 60 480; 181 440 and

362 880). We add the alternating n umbers to get ho w many information

set each player has, and we ha ve to remember to add the root which

belongs to player 1. Hence, pla yer 1 has 426,457 information sets while

player 2 has 197,073. ¥

(e) Ho w many terminal nodes does the game have?

Answ er: The number of terminal nodes is equal to the number of

information sets in pla yer 1’s last turn because at that pint he just has

one mo v e, to complete the tick-tack-toe matrix, whic h is 362 880. ¥

4. Cen tipedes: Imagine a two pla y er game that proceeds as follows. A pot of

money is created with $6 in it initially. Play e r 1 moves ﬁrst, then pla yer 2,

then pla yer 1 again and ﬁnally pla yer 2 again. At eac h player’s turn to mo ve,

he has two possible actions: grab ()orshare(). If he grabs, he gets

thecurrentpotofmoney,theotherplayergets

of the pot and the game

ends. If he shares then the size of the current pot is multiplied b y

and the

next player gets to mov e . At th e last stage in which player 2 mo ves, if he

chooses share then the pot is still mu ltiplied by

,player2gets

of the pot

and player 1 gets

of the pot.

(a) Model this as an extensive form game t ree. Is it a game of perfect or

imperfect informa tio n?

Answ er:

7. Preliminaries 119

This is a game of perfect informat ion . Note that we draw the game from

left to right (which is the common con vention for “cen tipede games” of

this sort.) We use capital letters for player 1 and lo wer case for pla yer

2. ¥

(b) Ho w man y terminal nodes does the game ha v e? Ho w man y information

sets?

Answ er: Thegamehasﬁve terminal nodes and four information sets.

Answ er: Each play er has four pure strategies (2 actions in eac h of his

2 information sets). ¥

(d) Find the Nash equilibr ia of this gam e. How many outcom es can be

supported in equilibrium?

Answ er: Using the convention of  to denote a strategy of player

where he c h ooses  in his ﬁrst information set and  in his second, we

can dra w the following matrix representation of this game,

Player 1

Play er 2

   

 4 2 4 2 4 2 4 2

 4 2 4 2 4 2 4 2

 3 6 3 6 9 45 9 45

 3 6 3 6 675 135 2025 10125

We see that only one outcome can be supported as a Nash equilibrium:

player 1 grabs imm ediately and the players’ payoﬀsare(4 2). ¥

(e) Now imag ine that at the last stage in whic h play er 2 moves, if he chooses

tosharethenthepotisequallysplitamongtheplayers.Doesyour

answer to part (d) above chan ge?

Answ er: The answ er does change because the pay oﬀs from the pair of

120 7. Preliminaries

strategies ( ) c hanges from (2025 10125) to (151875 151875) in

which case pla yer 2’s best response to  will be , and player 1’s best

response to  remains ,sothat() is another Nash equilibrium

in which they split 30375 equally (the previous Nash equilibria are still

equilib ria ). ¥

5. Veto Power: T wo pla yers must choose between three alternatives,   and

. Pla yer 1’s preferen ces are given by  Â

 Â

 while player 2’s preferences

are giv en by  Â

 Â

. The rules are that player 1 moves ﬁrst and can

veto one of the three alternativ e s. Then, play e r two chooses whic h of th e

rema in ing two alternatives will be chose n.

(a) Model this as an extensiv e form game tree (choose pay o ﬀsthatrepre-

sen t the preferences).

Answ er: Assume that the pay oﬀ of the best option is 3, the second best

2 and the w orst is 1. Player1’s actions are which alternativ e to remo ve

and player 2’s whic h of the remain ing two to choose.

(b) Ho w ma ny pure strategies does each pla yer ha ve?

Answ er: Player 1 has three pure strategies while play er 2 has eigh t (2

actions in each of three information sets.) ¥

7. Preliminaries 121

Answ er: Let  be a strategy for pla y er 2 where  is w h at he does

follow ing the remo val of ,  fo r  and  for  so that we can use the

follow ing m a t r ix,

Player 1

Player 2

       

 2 2 2 2 2 2 2 2 1 3 1 3 1 3 1 3

 3 1 3 1 1 3 1 3 3 1 3 1 1 3 1 3

 3 1 2 2 3 1 2 2 3 1 2 2 3 1 2 2

and we see that  is the only outcome that an be supported as an

equilib rium via two Nash equilibria , ( ) and ( ) ¥

6. Entering an Industry: A ﬁrm (pla yer 1) is considering entering an estab-

lished industry with one incumbent ﬁrm (player 2). Player 1 must c hoose

whether to enter or to not enter the industry. If play er 1 enters the industry

then play er 2 can either accomm odate the entry, or ﬁght the entry with a

price w a r. Pla yer 1’s most pr eferred outcom e is entering with pla yer 2 not

ﬁghting, and his least preferred outcome is entering with pla yer 2 ﬁghting.

Player 2’s most preferred outcome is pla yer 1 not entering, and his least

preferred outcome is player 1 enter ing with play er 2 ﬁghting.

(a) Model this as an extensiv e form game tree (choose pay o ﬀsthatrepre-

sen t the preferences).

Answ er:

122 7. Preliminaries

(b) Ho w ma ny pure strategies does each pla yer ha ve?

Answ er: Each player has two pure strategies. ¥

Answ er: Th ere are two Nash equilibria which can be seen in the matrix,

Play er 1

Player 2



 0 2 0 2

 1 1 −1 −1

Both ( ) and ( ) are Nash equilibria of this game. ¥

7. Roommates Voting: Three roommates need to vote on whether they will

adopt a new rule and clean their room once a week, or stick to the current

once a mon th rule. Each votes “y es” for the new rule or “no” for the curren t

rule. Imagine that players 1 and 2 prefer the new rule while player 3 prefers

the old rule.

(a) Imagine that the pla yers require a unanimou s vote to adopt the new

rule. Player 1 vot es ﬁrst, then player 2, and then player 3, e ach one

observing the previous v otes. Dra w this as an extensive form game and

ﬁnd the Nash equ ilibr ia.

Answ er: The game is,

7. Preliminaries 123

and there are man y Nash equilibria. Player 1 has two pure strategies: 

and .Player2has4:{} (where the left entry corre-

sponds to his left information set) and player 3 has 16 (again, with the

natural left to righ t in terpretation): {



    }.Because

a unanimous v ote is needed, the only strategy proﬁles t hat are not a

Nash equilibrium are those for whic h players 1 or 2 can ch ang e a “no”

v o te to a “ye s” vote, or player 3 can chan ge a “yes” v ote to a “no”

vote. These proﬁles are () (  ) where     ∈ {}

from whic h player 1 can proﬁtably deviate from ;()(  )

from which player 2 can pr o ﬁtably devia te fro m  to ;and()

(  ) from which player 3 can proﬁtably deviate from 

to . Th u s, the Nash equilibria are proﬁles of strategies that belong

to one of two classes: () player 3 votes  and players 1 and 2 vote

anything (a total of 64 strategy proﬁles which include 8 of player 3, 4

of play er 2 and 2 of play er 1); ()player3 votes , player 2 votes

 and player 1 votes  (a total of 16 strategy proﬁles whic h include 8

of player 3, 2 of player 2 and 1 of player 1). All the outcomes ha ve the

curren t rule surviving. ¥

(b) Imagine now that the pla yers require a majority vote to adopt the new

rule (at least two “y es” votes). Again, player 1 votes ﬁrst, then pla yer

2, and then pla yer 3, each one observing the previous v otes. Dra w this

as an extensive form gam e and ﬁnd the Nash equilibria.

Answ er: The game’s pa y oﬀ no w c hange as follows,

124 7. Preliminaries

Now only a majorit y vote is needed, but still, the only strategy proﬁles

that are not a Nash equilibrium are those for which players 1 or 2 can

c hange a “no” v ote to a “yes” vote, or pla yer 3 can c hange a “yes” v o te

to a “no” vote. Th ese proﬁles are () (  ) or (   )

where      ∈ {} fromwhichplayer1canproﬁtably devi-

ate from ;()() or (  ) from wh ich player 2

can proﬁtab ly deviate from  to  or from  to  ;and()

( )or(  ) from whic h pla yer 3 can proﬁtably de-

viate from  to  or from   to .Thus,theNash

equilib ria are proﬁles of strategies that belong to one of two classes: ()

player 3 votes , player 1 votes  and 2 v otes  (a total of 32

strategy proﬁles whic h include 16 of play er 3, 2 of pla yer 2 and 1 of

pla yer 1). These all ha ve players 1 and 2 v o ting  and support the new

rule; ()player3 votes ,player2votes and pla yer 1 votes 

(a total of 8 strategy proﬁ

les whic h include 4 of play er 3, 2 of pla yer 2

and 1 of pla yer 1). These have pla yers 1 and 3 or 1 and 2 (or all three)

vote  and support the curren t rule surviving, and neither player 1 nor

2 can change the outcome by deviating unilaterally. ¥

v o tes in a hat, so that the votes of earlier mo vers are not observed by

the votes of later mo vers, and at the end the v otes are counted. Draw

this a s an extensive form gam e and ﬁnd the Nash equilib ria. In w ha t

7. Preliminaries 125

wayisthisdiﬀerent from the result in (b) above?

Answ er: This gam e is on e of imperfect info rm ation where eac h player

has one information set,

Lik e part (b), an y strategy in whic h both players 1 and 2 are v ot-

ing  oroneinwhichthereareatleast2novotesthatcannotbe

c h ang ed to only one by pla yers 1 and 2 will be an equilibrium, but

the strategy sets are small because pla yers 2 and 3 cannot condition

their play on the history of what the “previous” players did . Hence,

for each pla yer the strategy set is 



= { } and the Nash equilib ria

are (



) ∈ {(  ) (  ) (  )}.Likeinpart(b)both

outcom es can be supported by a N ash equilibrium, just th at no w the

strategy combination s that support it are fewer. ¥

8. Brothers: Consider the followin g game that proceeds in t wo steps: In the

ﬁrst stage one brother (pla yer 2) has t wo $10 bills and can c hoose one of t wo

options: he can give his you nger brother (player 1) $20, or give him one of

the $10 bills (giving nothing is inconceivable giv en the wa y they were raised.)

This money will be used to buy snack s at the show they will see, and each

one dollar of snack yields one unit of pa yoﬀ for a player who uses it. The

126 7. Preliminaries

show they will see is determin ed by the follow ing “battle of the sexes” gam e:

Player 1

Player 2



 16,12 0,0

 0,0 12,16

(a) Presen t the entire gam e in extensive form (a game tree).

Answ er: Let the choices of play er 1 ﬁrst be  fo r spliting the $2 0 and

 for giving it all away. T he en tire game will hav e the payoﬀsfromthe

c hoice of ho w to split the money added to the pa y oﬀs from the Battle

of the Sexes part of the game as follows,

Because the latter is simultaneous, it does not mater which pla y er mo ves

after pla yer 1 as long as the last pla yer cannot distingu ish between the

choiceoftheplayerwhomovesjustbeforehim.¥

(b) Write the (pure) strategy sets for both pla yers.

Answ er: Both players can conditio n their choice in the Battle of the

Sexes gam e on th e initial split/give c ho ice of player 1. For player 2,



= {} where 

=  means that player 2 chooses

 ∈ { } afterplayer1chose while pla yer 2 chooses  ∈ {  }

after player 1 ch ose . For player 1, however, even though he chooses

ﬁrst bet ween  or , he must specify his action for eac h information

set even if he knows it will not happen (e.g., wha t he will do follo w in g

7. Preliminaries 127

 even w hen he plans to play ). Hence, he has 8 pure strategies,



= {  } where 

 means that pla y er 1 ﬁrst c hooses  ∈ { } andthenchooses

 ∈ { } if he pla yed  and  ∈ { } if he played . ¥

Answ er: This will be a 8 × 4 matrix as follows,

Play er 1

Player 2

    

 26 22 26 22 10 10 10 10

 26 22 26 22 10 10 10 10

 10 10 10 10 22 26 22 26

 10 10 10 10 22 26 22 26

 16 32 0 20 16 32 0 20

 0 20 12 36 0 20 12 36

  16 32 0 20 16 32 0 20

  0 20 12 36 0 20 12 36

(d) Find the Nash equilibria of the entir e game (pure and mixed strategies).

Answ er: First note that for play er 1, mixing equally bet ween  and

 will strictly dominate the four strategies     and

  . Hence, we can consider the reduced 4 × 4 game,

Play er 1

Player 2

   

 26 22 26 22 10 10 10 10

 26 22 26 22 10 10 10 10

 10 10 10 10 22 26 22 26

 10 10 10 10 22 26 22 26

The simple ov erline-underline method shows that w e ha ve eigh t pure

strategy Nash equilibria, four yielding the pay oﬀs (26 22) and the other

128 7. Preliminaries

four yielding (22 26). Because of each players indiﬀerence between the

w a ys in which the pa y oﬀs are reac hed, there are inﬁnitely man y mixed

strategies that y ield the same payoﬀs. For exam ple, an y proﬁle where

pla yer 1 mixes bet ween  and  and where player 2 mixes be-

tween  and  w ill be a Nash equilibrium that yields (26 22).Sim-

ilarly, any proﬁle wher e play er 1 mixes bet ween  and  and

where player 2 mixes between  and  will be a Nash equilibrium

that yields (22 26). There is, however, one more class of m ixed strategy

Nash equilibria that are similar to the one found in section 6.2.3. To see

this, focus on an e ven simpler game where we eliminate the duplicate

pa y oﬀsasfollows,

Play er 1

Player 2

  

 26 22 10 10

 10 10 22 26

which p reserve th e nature of the gam e. For player 1 to be indiﬀerent

between  and  it must be that player 2 chooses  with

probab ility  such that

26 +10(1− )=10 +22(1− )

which yields  =

. Similarly, for player 2 to be indiﬀeren t between 

and  it m u st be that player 1 chooses  with probability  such

that

22 + 10(1 − )=10 +26(1− )

which yields  =

. Hence, we found a mixed strategy Nash equilibrium

that results in each player getting an expected pa yoﬀ of 26×

+10×

 Notice, however, tha t player 1 is always indiﬀeren t bet ween 

and ,aswellasbetween and  so there are inﬁnitely

many w ays to ac hieve this kind of mixed strateg y, and similarly for

player 2 because of his indiﬀerence between  and  as w ell as 

and . ¥

7. Preliminaries 129

9. The Dean’s Dilemm a: A student stole the DVD from the studen t lounge.

The dean of studen ts (player 1) suspects the student (player 2) and engages

in evidence collection. Ho wever, evidence collection is a random process, and

concret e evidence will be available to the dean only with probability

.The

student knows the evidence generating process, but does not kno w whether

the dean received eviden ce or not. The game proceeds as follow s: The dean

realizes if he has evidence or not, and then can choose his action, whether to

Accuse the studen t (), or Bounce the case () and forget it. Once accused,

the studen t has two options: he can either Confess ()orDeny().

Payoﬀs are realized as follow s: If the dean bounces the case then both play ers

get 0. If the dean accuses the student, and the student confesses, the dean

gains 2 and the studen t loses 2. If the dean accuses the student and the

student denies, then pa y oﬀs depend on the evidence: If the dean has no

evidence then he loses face which is losing 4, while the studen t gains glory

which gives him a payoﬀ 4. If, ho wev er, the dean has evidence then he is

triumphant and gains 4, while the studen t is put on probation and loses 4.

(a) Dra w the game-tree that represents the extensiv e form of this game.

Answ er: Letting the Dean be pla yer 1 and the studen t play er 2,

(b) Write down the matrix that represen ts the normal form of the extensive

form you did in (a) abo ve.

130 7. Preliminaries

Answ er: Because player 1 can condition whether or not to accuse

on whether or not there is evidence, he has four pure strategies. Let

 ∈ {   } be the strategy of player 1 wh ere  follow s

“evidence” and  follow s “no evidence.” Player 2 does not kno w whether

there is evidence and can only respond by confessing or not:

Play er 1

Player 2



 2 −2 0 0

 1 −1 2 −2

 1 −1 −2 2

 0 0 0 0

Answ er: It is easy to see that  is strictly domina ted b y  and

 is strictly domin ated b y . The reduced gam e is therefore,

Play er 1

Player 2



 2 −2 0 0

 1 −1 2 −2

Let play er 1 c hoose  with probability  and player 2 c hoose  with

probab ility . For pla yer 2 to be indiﬀerentitmustbethat

(−2) + (1 − )(−1) = (0) + (1 − )(−2)

and the solution is  =

. Similarly, for player 1 to be indiﬀerentitmust

be that

(2) + (1 − )(0) = (1) + (1 − )(2)

and the solution is  =

. Hence, ( )=(



) istheuniquemixed

strategy Nash equilibrium of this game. As you will see in c hap ter 15,

this is a dyna m ic gam e of incomplet e information. ¥

7. Preliminaries 131

10. P erfect and Imperfect Recall: Consider the game dep icted in Figure ??

Exercise

(a) What are the pure strategies sets for eac h play er?

Answ er: T

(b) Show that for an y behavioral strategy for player 1, there is a mixed

strategy that leads to the same distribution ov er the terminal nodes

regardless of the strategy c hosen by pla yer 2.

Answ er: T

strategy that leads to the same distribution ov er the terminal nodes

regardless of the strategy c hosen by pla yer 1.

Answ er: T

(d) No w imagin e that the game does not have perfect recall so that player

2’s two bottom information sets are no w one large information set. Can

you ﬁnd an example sho wing that the claim in (a) abo v e is no longer

true?

Answ er: T

132 7. Preliminaries

This is page 133

Printer: Opaque

C r e d ib ility a n d Se q u e ntia l R a t ionality

1. Find the mixed strategy subgame perfect equilibrium of the Sequen tial Battle

of the Sexes game depicted in Figure ??

Answ er: The subg ame starting with play e r 1 c h oosing bet ween  and  is

given in the follo w ing ma trix:

Player 1

Player 2



 2 1 0 0

 0 0 1 2

Let player 1 choose  with probab ility  and pla yer 2 c hoose  with proba-

bility .Forplayer2tobeindiﬀerent it must be that

(1) + (1 − )(0) = (0) + (1 − )(2)

and the solution is  =

. Similarly, for pla yer 1 to be ind iﬀerentitmustbe

that

(2) + (1 − )(0) = (0) + (1 − )(1)

and the solution is  =

. Hen ce, ( )=(



) is the unique mixed strategy

Nash equilibrium of this subgame with expected payoﬀsof(



)=(



Working backward, pla yer 1 w ould prefer to c hoose  over  . ¥

134 8. Credibility and Sequential Rationality

2. M utu ally Assure d De struction (revisite d): Consider the game in sec-

tion ??.

(a) Find the mixed strategy equilibrium of the w ar stage game and argue

that it is unique.

Answ er: The war-game in the text has a weakly dominated Nash equi-

librium ( ) and hence does not ha ve an equilibrium in which any

player is mixing. This exercise should have replaced the war-stage game

with the follow ing gam e:

The subgame we called the w ar -stag e gam e is given in the follow ing ma-

trix:

Play er 1

Play er 2



 −5 −5 −120 −80

 −80 −120 −100 −100

Let player 1 c hoose  with probabilit y  and player 2 choose  with

probab ility .Forplayer2tobeindiﬀerentitmustbethat

(−5) +

(1 −

)(−120) = −

1820

(−5) + (1 − )(−120) = (−80) + (1 − )(−100)

and the solution is  =

.Bysymmetry,forplayer1tobeindiﬀeren t

it must be that  =

. Hence, ( )=(



) is the unique mixed

strategy Nash equilibrium of this subgame with expected pa yoﬀsof

(



)=(−9578 −9579). ¥

8. Credibility and Sequential Rationality 135

(b) What is the unique subgame perfect equilibrium that includes the mixed

strategy you found abo ve?

Answ er: Working backward, play er 2 w ould prefer to ch oose  over 

andplayer1wouldprefer over .

3. Bro the rs (revisited ): Find all the subgame prefect equilibria in the “broth-

ers” exercise (exercise 7.8) from the previous c h apter .

Answ er: In part (d) o f exercise 7.8 w e found all the N ash equilibria as

follo w s: For pla yer 2, 

= {} where 

=  means that

play er 2 chooses  ∈ {  } after play er 1 chose  while player 2 c h ooses

 ∈ { } after pla yer 1 chose .Forplayer1,



= {    } where 

= 

meansthatplayer1ﬁrst c h ooses  ∈ { } and then chooses  ∈ {  } if

he played  and  ∈ { } if he played .Weﬁrst noted that for player

1, mixing equa lly bet ween  and  will strictly dominate the four

strategies     and   . Hence, w e can consider the reduced

4 × 4 game,

Player 1

Play er 2

    

 26 22 26 22 10 10 10 10

 26 22 26 22 10 10 10 10

 10 10 10 10 22 26 22 26

 10 10 10 10 22 26 22 26

Thesimpleoverline-underlinemethodshowsthatwehaveeightpurestrategy

Nash equilibria, four yielding the payoﬀs (26 22) and the other four yielding

(22 26).

Now w e know that any subgam e perfect equ ilibrium must be a N ash equi-

librium , so w e can consider the set of Nash equilibria and see which surviv es

backward induction. Because the second stage of the game has play ers 1 and

2 move simu ltaneo usly, the only restriction of subgam e perfection is that in

eac h of the simultaneous move games, the players are playing a Nash equi-

librium. This implies that the pairs of strategies () and ()

136 8. Credibility and Sequential Rationality

are not subgame perfect, which is also true for () and ().

Hence , of the eigh t pure strategy Na sh equilibria only four are subgame per-

fect: ( ),(),() and ().

Because of each players indiﬀerence bet ween the w ays in w h ich the p ayoﬀs

are reached, there are inﬁnitely many mixed strategies that yield the same

pa y oﬀs. For example, any proﬁle where pla yer 1 mixes between  and

 and wher e player 2 m ixes bet ween  and  w ill be a N ash equilib-

rium that yields (26 22). Sim ilarly, any proﬁle wh ere player 1 mi xes between

 and  and where play er 2 mixes between  and  will be a

Nas h equilibriu m that yields (22 26). But most of these will not be subgame

perfect because in the subgame following ,whichisoﬀ the equilibrium path,

the pla yers are not playing a best response. There is, however, one more class

of mixed strategy Nash equilibria that are similar to the one found in section

6.2.3. To see this, focus on an ev e n simpler game where we eliminate the

duplicate pay oﬀsasfollows,

Play er 1

Play er 2

  

 26 22 10 10

 10 10 22 26

which preserv e the nature of the game. For player 1 to be indiﬀerent between

 and  it must be that pla yer 2 chooses  with probability  such

that

26 +10(1− )=10 +22(1− )

wh ich yie lds  =

. Similarly, for player 2 to be indiﬀerent between  and

 it must be that player 1 ch ooses  with prob ability  suc h that

22 +10(1− )=10 + 26(1 − )

which yields  =

. Hence, we found a mixed strategy Nash equilibrium that

results in eac h pla yer getting an expected pa yoﬀ of 26 ×

+10×

=16



Notice, ho wev er, that player 1 is always indiﬀerent between  and ,

as w ell as between  and  so there are inﬁnitely man y wa ys to

8. Credibility and Sequential Rationality 137

ac h ieve this kind of mixed strategy, and similarly for player 2 because of

his indiﬀerence between  and  as well as  and .Themixed

strategy subgame perfect equilibria will be those for which in each subgame

the players are play ing a Na sh equilibr ium , and hence there will be only 6

such pairs: wh ere they mix after  and pla y one of three Nash equ ilibria in

the subgam e after , and similarly where they mix after  and play one of

the three Nash equilibria in the subga m e after . In all of these player 1’s

backward induction choice is the play . ¥

4. The Industry Leader: Three oligopolists operate in a mark et with in v erse

demand giv en by  ()=− ,where = 

+

,and



is the quantity

produced b y ﬁrm .Eachﬁrm has a constan t marginal cost of production, ,

and no ﬁxed cost. The ﬁrm s c h oose their quantities dynamically as follow s:

(1) Firm 1, wh o is the industry leader, chooses 

≥ 0 ; (2) Firms 2 and 3

observ e 

and then sim ultaneously c hoose 

and 

respectively.

(a) Ho w m an y proper subgames does this dynamic game ha ve? Explain

Brieﬂy.

Answ er: There are inﬁnitely many proper subgame s because ev ery

quantit y choice of pa yer 1 results in a proper subgam e. ¥

(b) Is it a game of perfect or imperfect informa tion ? Explain Brieﬂy.

Answ er: Th is is a game of imperfe ct inform ation because play e rs 2 and

3 make their choice without observing eac h other’s cho ice ﬁrst. ¥

unique.

Answ er: ﬁrst w e solve for the Nash equilib rium o f the simultan eous

move stage in which pla yers 2 an d 3 make their choices as a function

of the choice made ﬁrst b y player 1 . Given a c h oice of 

and a belief

about 

 player 2 maxim ize s

max



( − (

+ 

) − )

138 8. Credibility and Sequential Rationality

which leads to the ﬁrst order condition

 − 

− 

−  − 2

yielding the best response function



 − 

− 

− 



and symm etrically, the best response function of pla yer 3 is



 − 

− 

− 

Hence, follo wing an y choice of 

b y pla yer 1, the unique Nash equilib-

rium in the resulting subga m e is the solution to the two best response

function s, which yields



∗

(

)=

∗

(

 −  − 



Moving bac k to pla yer 1’s decision node, he will ch oose 

kno wing that



and 

be be cho sen using the best response fu nction above, a nd

hence play er 1 maximizes,

max



( − (

 −  − 

) − )

which leads to the ﬁrst order equation

( −  − 2

)=0

resulting in a un ique solution 

−

. H en ce, the unique subgame

perfect equilibriu m dictates that 

∗

−

,and

∗

(

)=

∗

(

−−

(d) Find a Na sh equilibrium that is not a sub ga m e perfect equilibriu m .

Answ er: There are inﬁnitely man y Nash equilibria of the form “if pla yer

1plays

then players 2 and 3 play 

∗

(

)=

∗

(

−−

, and oth-

erwise they play 

= 

= .” In any such Nash equilibrium, players

8. Credibility and Sequential Rationality 139

2 and 3 are playing a Nash e quilibriu m on the equ ilib riu m path (fol-

lowing 

) w hile they are ﬂooding the market and casing the price to

be zero oﬀ the equilibriu m path. One exam ple wo uld be 

=0.Inthis

case, following 

=0the remaining two play ers pla y the duopoly Nash

equilib riu m , and player 1 gets zero proﬁts. If pla yer 1 were to choose

any positive quantity, his belief is that players 2 and 3 w ill ﬂood the

market and he will earn −

 0, so he w ould prefer to c h oose 

given those beliefs. Of course, the threats of pla yers 2 and 3 are not

sequentia lly rational, which is the reason tha t this Nash equilibr iu m is

not a subgame perfect equilibrium. ¥

5. Technology Adoption: During the adoption of a new technology a CE O

(player 1) can design a new task for a division manager. The new task can

either be a high level ()orlowlevel(). The manager simultaneously

chooses to invest in good training ()orbadtraining(). The payoﬀsfrom

this in teractio n is giv en by the following m atr ix:

pla yer 1

Play er 2







5 4 −5 2

2 −2 0 0

(a) Presen t the gam e in extensive form (a gam e tree) and solve for all the

Nash Equilibria and sub game perfect equilib ria .

Answ er:

140 8. Credibility and Sequential Rationality

It is easy to see in the m atrix above that both ( ) and ( ) are

pure strategy Nash equilibria. To ﬁnd the mixed strategy Nash equilib-

rium, let player 1 choose  with probability  and pla yer 2 choose 

with probability .Forplayer2tobeindiﬀerentitmustbethat

(4) + (1 − )(−2) = (2) + (1 − )(0)

and the solution is  =

. Similarly, for player 1 to be indiﬀerentitmust

be that

(5) + (1 − )(−5) = (2) + (1 − )(0)

and the solution is  =

. Hence, ( )=(



) istheuniquemixed

strategy Nash equilibrium of this game with expected payoﬀsof(



(

 1). ¥

(b) No w assume that before th e game is played the CEO can choose not

to adopt this new tec h nolo gy, in which case the pa yoﬀsare(1 1),orto

adoptitandthenthegameaboveisplayed.Presenttheentir e game in

extensiveform.Howmanypropersubgamesdoesithave?

Answ er:

The gam e has two proposer subgames. The ﬁrstisthewholegameand

the second starts at player 1’s secon d inform a tion set. ¥

game described in (b) abo ve.

8. Credibility and Sequential Rationality 141

Answ er: The game can be represen ted by the following ma trix,

Play er 1

Play er 2



 5 4 −5 2

 2 −2 0 0

 1 1 1 1

 1 1 1 1

and it is easy to see that there are three pure strategy Nash equilib-

ria: ( ) () and (). From part (a) above w e know that

player 1 choosing  followed by the mix ed strategy Nash eq uilib riu m

( )=(



) will be a (subgam e perfect) Nas h e quilibr ium with ex-

pected payoﬀsof(



)=(

 1). Also, there are inﬁnitely man y mixed

strategy Nash equilibria in which pla yer 1 is mixing bet ween  and

 in any arbitrary way and pla yer 2 c hooses .Finally,thereisan-

other inﬁnite set of mixed strategy equilibria in which player 1 mixes

between ,  and , and pla yer 2 mixes bet ween  and .Tosee

this, ignore  for the moment (as it yields the same pa yoﬀsas),

and let play er 1 ch oose  with probabilit y  and  with probability

(1 −), and let player 2 c h oose  with probability .Forplayer2tobe

indiﬀerentitmustbethat

(4) + (1 − )(−2) = (1) + (1 − )(1)

and the solution is  =

. Similarly, for player 1 to be indiﬀerentitmust

be that

(5) + (1 − )(−5) = (1) + (1 − )(1)

and the solution is  =

. Hence, ( )=(



) is a mixed strateg y

Nash equilibrium of this game with expected payoﬀsof(



)=(1 1).

Of course, because of the iden tity bet ween the  and  any pair of

the following strategies will also be a mixed strategy Nash equilibriu m :

player 1 c hooses  with p rob ability

,chooses and  with

probab ilities that a dd up to the remaining

, and pla y er 2 chooses 

142 8. Credibility and Sequential Rationality

with probability

Turning to subgame perfect equilibria, of the three Nash equilibria only

t wo are subgame perfect: ( ) and (). In the subgame starting

with player 1 c hoosing bet ween  and ,part(a)aboveshowedthatthe

unique (non-degener ate) mixed strategy Nash equilibriu m was ( )=

(



) with expected payoﬀsof(



)=(

 1). If this will be pla y ed by

the pla yers after pla yer 1 c hooses , then pla yer 1’s best reply at the

root is indeed to ch oose  because

 1. Hence, as suggested earlier,

player 1 choosing  followed by the mix ed strategy Nash eq uilib riu m

( )=(



) is also a subgame perfect N ash equilib rium. ¥

6. Inv e stment in the Future: Consider tw o ﬁrms that pla y a Cournot com-

petition game w ith demand  =100− , and costs for each ﬁrm giv en b y





(



)=10



.Imaginethatbeforethetwoﬁrms play the Cournot game, ﬁrm

1 can invest in cost reduction. If it invests, the costs of ﬁrm 1 will drop to



(

)=5

. The cost of investment is 0. Firm 2 does not hav e this

investment opportunity.

(a) Find the value 

∗

forwhichtheuniquesubgameperfectequilibrium

involves ﬁrm 1 investing.

Answ er: If ﬁrm 1 does not invest then they are expected to pla y the

Courn ot Nash equilibrium wh ere both ﬁrms ha ve costs of 10



.Each

ﬁrm solves,

max





(100 − (



+ 



) − 10)



which leads to the ﬁrst order condition

90 − 



− 2



yielding the best response function





(



90 − 





and the unique Courn ot Nash equ ilibr ium is 

= 

=30with proﬁts



= 

=900.Ifﬁrm 1 does invest then for ﬁrm 1 the problem becomes

max



(100 − (

+ 

) − 5)

8. Credibility and Sequential Rationality 143

which leads to the best response function



(

95 − 



For ﬁrm 2 the best response function remains the same as solved earlier

with costs 10

, so the unique Cournot Nash equilibriu m is now solved

using both equations,



95 −

90−



which yield s 

100

, 

,andproﬁts are 

=1 111

while



= 802

. H ence, the increase in pro ﬁts from the equilibrium with

investmentforplayer1are

∗

=1 111

−900 = 211

,whichisthemost

that player 1 would be willing to pay for the in vestment anticipating that

they will play the Courn ot Nash equilibrium after any choice of player

1 regar ding in vestment. If 

∗

then the unique subgame perfect

equilib riu m is that ﬁrst, player 1 in vests, then they players choose 

100

, 

, and if pla yer 1 did not invest the payers choose 

= 

=30.

(Notethatif

∗

then the unique subgame perfect equilibrium is

that ﬁrst, player 1 does not invest, then they players choose 

= 

30, and if player 1 did invest the pa yers choose 

100

, 

.) ¥

(b) Assume that 

∗

. Find a Nash equilibrium of the game that is not

subgame perfect.

Answ er: We construct a Nash equilibrium in which player 1 will invest

despite 

∗

. Player 2’s strategy will be, play 

if pla yer 1

inv e sts, and 

=100if he does not invest. With this belief, if play er 1

does not inv est then he expects the price to be 0, and his best response

is 

=0leadingtoproﬁts 

=0. If he in vests then his best response

to 

is 

100

, which togeth er ar e a N a sh eq uilibr ium in the

Cournot game after in v estmen t. For an y 1 111

this will lead to

positive proﬁts, and hence, for 

∗

 1 111

the strategy of pla yer 2

described abo ve, together with pla yer 1 choosing to invest, play 

100

if he invests and 

=0if he does not is a Nash equilibrium . It is not

144 8. Credibility and Sequential Rationality

subgam e perfect because in the subgame follo w ing no in vestment, the

pla yers are not play ing a Nash equilibrium. ¥

7. Debt and Repayment: A project co sting $100 yields a gross return of

$110. A lender (player 1) is approac hed by a debtor (player 2) requesting a

standard loan con tra ct to complete the project. If the lender chooses not to

oﬀer a loan, then both parties earn nothing. If the lender c hooses to oﬀer

a loan of $100, the debtor can realize the projects gains, and is obliged by

con tract to repay $105. For simplicity, assume that money is continuous, and

that the debtor can choose to return an y amount of m on ey  ≤ 110.Also,

ignorethetimevalueofmoney.Assumeﬁrst that no legal system is in place

that can cause the lender to repay, so that default on the loan (less than full

repa ym ent) carries no repercussions for the deb tor.

(a) Model this as an extensive form game tree as best as you can and ﬁnd

a subgame perfect equilibrium of this game. Is it unique?

Answ er: Pla yer 1 has t wo cho ices ﬁrst, lend ()ordon’tlend().

After  both play ers get zero, while after  player2choosesavalue

 ∈ [0 110] to repay. The game can be described as follo w s:

There is a unique subgame perfect equilibrium . If player 2 is oﬀered the

loan then he suﬀers no penalty from repaying, and his best response is

to choose  =0. Anticipat in g this behavior player 1 should c hoose .

8. Credibility and Sequential Rationality 145

(b) No w assume that there is a legal system in place that allow s the lender

to voluntarily choose whether to sue or not to sue wh en the debtor de-

faultsandrepaysanamount105. Furtherm ore, assume that it is

costless to use the legal system (it is supplied b y the state), and if the

lender sues a debtor that defaulted, the lender will get the $105 repaid

in full. After paying the lender, the borrow er will pay a ﬁne of $5 to the

court above and beyond the repayment. Model this as an extensiv e form

game tree as best as you can and ﬁnd a subgame perfect equilibrium of

this game. Is it unique?

Answ er: Th e game no w distinguished bet ween two conditions:  ≥ 105

in which case it is like the game in part (a) above, and 105 in which

case player 1 has a new decision node where he can ch oose to sue ()

or not sue ().

Starting at the last decision node of player 1, because it is relevan t only

when 105, it follow s that −100  5 implying that  dominates .

Anticipatin g this, pla yer 2’s best response in the repa y m ent phase is to

choose  = 105 This is the lowest paym ent that does not trigger a suit.

At the root of the tree player 1 anticipa tes a pa yoﬀ of 105−100 = 5  0

and hence prefers to c hoose . The resulting outcome yields the pa yoﬀs

146 8. Credibility and Sequential Rationality

(5 5) This backward ind uctio n arg u ment shows tha t this is the unique

subgame perfect equilibriu m . ¥

not subgame perfect equilibria?

Answ er: For player 1, ch oosing  followed by  is a domina nt strategy

becauseitguarantieshimapayoﬀ of at least 5 (exactly 5 when 105

and  −100 when  ≥ 105.) Given this strategy, pla yer 2’s best reply is

to c h oose  =105. Hence, the only Nash equilibrium is also the subgam e

perfect Nash equilib rium. ¥

(d) No w assume that using the legal system is costly: if the lender sues,

he pays la wyers a legal fee of $105 (this is the lawy ers price whic h is

unrelated to the con tract above). The rest proceeds the sam e as before

(if the lender sues a debtor that defaulted , the lender will get repaid in

full; after pa yin g the lender, the borrower will pay a ﬁne of $5 above

and bey o nd the repa ym ent.) Model this as an extensiv e form game tree

as best as y ou can and ﬁnd a subgame perfect equilibrium of this game.

Is it unique?

Answ er: Thegameisnow,

Because  − 100 ≥−100 it follows that tha t  is w eakly dominated by

8. Credibility and Sequential Rationality 147

. A nticipatin g this, player 2’s best response in the repaym ent phase

is to c hoose  =0 At the root of the tree player 1 an ticipates a payoﬀ

of −100  0 and hence prefers to ch oose , and the outcom e results in

pa y oﬀs (0,0). This backward induction argumen t show s that this is the

unique subgame perfect equilibrium. ¥

(e) Are there Nash equilibria in the game described in (d) abo ve that are

not subgame perfect equilibria?

Answ er: There are inﬁnitely many . Any choice by player 2 of  ≤ 100,

for which play ing  is a best res ponse, will be a Nash equilibrium in

which pla yer 2 is not pla yin g a best response. ¥

(f) Now assume that a law change is proposed: upon default, if a debtor is

sued he has to ﬁrst repa y the lender $105, and then pay the legal fees

of $105 abo ve and beyond repa yment of the loan, a nd no extra ﬁne is

imposed. Should the lender be willing to pay for this la w c h ange ? If so,

ho w m uch?

Answ er: Thegameisnowasfollows:

Thebackwardinductionargumentfollowsthesamelogicasinpart(b)

resulting in the outco m e (5 5). This yields player 1 an extra pay o ﬀsof

5 relative to the solution in part (d), implying that he should be willing

to pay up to 5 in order to ha ve the law implemented. ¥

148 8. Credibility and Sequential Rationality

(g) If you were the “social planner”, w ould you implemen ted the suggested

law?

Answ er: Yes because it results in a Pareto superior outcome of (5 5)

instead of (0 0). ¥

8. En try Deterrence 1: NSG is considering en try into the local phone market

in the Ba y Area. The incumbent S&P, p redicts that a price w ar will result

if NSG enters. If NSG stays out, S&P earns monopoly proﬁts valued at $10

million (net present value, or NPV of proﬁts), while NSG earns zero. If NSG

enters, it must incur irrev ersible en tr y costs of $2 million. If there is a price

war, each ﬁrm earns $1 million (NPV) . S&P alw ays has the option of accom-

modating en t ry (i.e., not starting a price war). In such a case, both ﬁrm s

earn $4 million (NPV ). Suppose that the timing is such that NSG ﬁrst has

to choose whether or not to enter the market. Then S&P decides whether

to “accommodate entry” or “engage in a price w ar.” What is the subgame

perfect equilibrium outc om e to this sequential game? (Set up a gam e tree.)

Answ er: Letting NSG be play er 1 and S&P be player 2,

Backward induction implies that player 2 will Accom m odate, and player 1

will therefore enter. Hence, the unique subgam e perfect equilibrium is (En-

ter,Accom m odate). ¥

9. En try Deterrence 2: Consider the C ourn ot duopoly game with dem and

 =100− (

+ 

), and variab le costs 



(



)=0for  ∈ {1 2}.Thetwistis

8. Credibility and Sequential Rationality 149

that there is now a ﬁxed cost of production 0 that is the sam e for both

ﬁrm s.

(a) Assume ﬁrstthatbothﬁrms choose their quantities simultaneou sly.

Model this as a normal form game.

Answ er: This is a standard Cournot game with two play ers:  =

{1 2}



= R

(the non-negativ e real line) and w e need to add the

ﬁxed costs to the pa yoﬀ function , 



(



)=(100− 

− 

)



−  for

 ∈ {1 2}.

(b) Write dow n the ﬁrm’s best response function for  = 1000 and solv e for

pure strategy Nash equilibrium . Is it unique?

Answ er: Because the ﬁxed costs do not aﬀect the ﬁrst order conditions,

from section 5.2.3 we know that the two best response functions ignor ing

the ﬁxed costs are,





(



100 − 



With ﬁxed costs, howev er, each ﬁrm will produce only if it has positiv e

proﬁts. For example, using ﬁrm 1’s best response function , its pro ﬁts

conditional on pla ying a best response are



(

)

) = (100 − (

100 − 

+ 

))

100 − 

− 

=2500+



− 50

− 

=1500+



− 50



where the last inequality follo w s from  =1000. Now w e can compute

the value of 

for w hich playin g a best response by ﬁrm 1 w ill yield

zero proﬁts, which in turn w ill imply that for higher levels of 

ﬁrm

1 will incur a loss even when it plays a best response conditional on

producing. We have,

1500 +



− 50

≥ 0

150 8. Credibility and Sequential Rationality

which holds when 

≤ 100 − 20

√

10 ≈ 3675. A symmetric argumen t

will hold for ﬁr m 2, which yields the best response function with a ﬁxed

cost of  =1000to be,





(



(

100−



if 



≤ 100 − 20

√

0 if 



 100 − 20

√

Using the ﬁrst portion of the best response function to try and solve for

a Nash equilibriu m , we obtain that 

= 

=33

 3675.Thus,when

 =1000, 

= 

=33

is the unique Nash equilib rium of this game.

moves ﬁrst and chooses 

, and then after observing 

ﬁrm 2 c hooses



. Also assume that if ﬁrm 2 cannot make strictly positive proﬁts then

it will not produce at all. Model this as an extensive form game tree as

best as you can, and ﬁnd a subgame perfect equilibrium of this game

for  =25.Isitunique?

Answ er: similar to the analysis in section 8.3.2 we kno w that, ignorin g

ﬁxed costs, ﬁrm 2 will choose 

(

100−

as derived above. With

 =25it will not produce for so m e valu es of 

close to 100 (Similar

to the ana lysis in p ar t (b), 

must satisfy 2500 +



− 50

− 0

with  =25. Th i s will be sat is ﬁed when 

≤ 90.) Given ﬁrm 2’s best

response, ﬁrm 1 maxim izes

max



(100 − 

−

100 − 

)

− 25 

which yields the ﬁrst order condition 50 − 

=0or 

∗

=50. Because



∗

 90 we know that ﬁrm 2 will indeed follow 

(

100−

=25,

proﬁts for ﬁrm 1 are 

=25× 50 − 25 = 1 225,andforﬁrm 2 are



=25× 25 − 25 = 600. By construction, this is the unique subgame

perfect equilib riu m. ¥

(d) Ho w does your answ er in (c) c hange for  =725?

Answ er: Now ﬁrm 2 will follo w 

(

100−

as long as 1775 +



−

8. Credibility and Sequential Rationality 151

50

≥ 0, which holds for 

≤ 100 −10

√

29 ≈ 4615.Aswesawinpart

(c), if ﬁrm 1 ant icipates ﬁrm 2 to produce according to 

(

100−

then ﬁrm 1 produces 

∗

=50. It turns out that if ﬁrm 1 anticip ates ﬁrm

2 to stay out then it will also produce 

∗

=50which is the monopolists

optim al choice for this market w ith only ﬁxed costs. Ho wev er, since

50  4615 this c h oice will indeed cause ﬁrm2tostayout,andthe

unique subgame perfect equilibrium is no w 

∗

=50and



(

(

100−

if 

≤ 100 − 10

√

0 if 

 100 − 10

√

resulting in 

∗

=0. ¥

10. Playing it safe: Consider the following dynamic game: Play er 1 can choose

toplayitsafe(denotethischoiceby), in whic h case both he and play er 2 get

a payoﬀ of 3 eac h , or he can risk playing a game with pla yer 2 (den ote this

choice by ). If he chooses , then they play the following simultane ou s

mo ve game:

player 1

Play er 2







8 0 0 2

6 6 2 2

(a) Dra w a game tree that represen ts this gam e. Ho w many proper sub-

gamesdoesithave?

Answ er:

152 8. Credibility and Sequential Rationality

The game has t wo proper subgames: the whole game and the subgame

starting at the node where 1 ch ooses between  and . ¥

(b) Are there other gam e trees that would work? Explain brieﬂy.

Answ er: Yes - it is possible to have player 2 move after 1’s initial move,

and then have player 1 w ith an inform a tion set as follows:

game.

8. Credibility and Sequential Rationality 153

Answ er: The game can be represen ted by the following ma trix,

Player 1

Play er 2



 8 0 0 2

 6 6 2 2

 3 3 3 3

 3 3 3 3

(d) Find all the Nash and subgame perfect equilibr ia of the dynam ic gam e.

Answ er: It is easy to see that there are t wo pure strategy Nash equi-

libria: ( ) and (). It follows immediately that there are inﬁ-

nitely m any mixed strategy Nash equilibria in whic h play e r 1 is mixing

between  and  in an y arbitrary w ay and player 2 chooses .Itis

also easy to see that follo w ing a c ho ice of , there is no pure strategy

Nash equilibriu m in the resultin g subga m e. To ﬁnd the mixed strategy

Nash equilibriu m in that subga m e, let pla yer 1 choose  with proba-

bility  and  with probabilit y (1 −), and let player 2 ch oose  with

probab ility . For pla yer 2 to be indiﬀerentitmustbethat

(0) + (1 − )(6) = (2) + (1 − )(2)

and the solution is  =

. Similarly, for player 1 to be indiﬀerentitmust

be that

(8) + (1 − )(0) = (6) + (1 − )(2)

and the solution is  =

. Hence, ( )=(



) is a mixed strateg y

Nash equilibriu m of the subgame after play er 1 chooses , yield in g

expected payoﬀsof(



)=(4 2). In an y subgame perfect equilibrium

the players will have to pla y this mixe d strateg y equilib rium follo w in g

,andbecause4  3 player 1 will prefer  over . Hen ce, ch oosing 

follo wed by the m ixed strategy com puted abo ve is the unique subgame

perfect equilib riu m. ¥

154 8. Credibility and Sequential Rationality

11. RA Sele ction with a Tw ist:Twostaﬀ managers in the ΠBΦ sororit y, the

house manager (player 1) an d kitch en manager (player 2), a re supposed to

select a residen t assistan t (RA) from a pool of three candidates: {  }.

Play er 1 prefers  to  and  to . Play e r 2 prefers  to  and  to .The

process that is imposed on them is as follo w s: First, the house manager vetoes

one of the candidates, and announces the v eto to the central oﬃce for staﬀ

selection, and to the kitchen m a nager. Next, the kitchen manager vetoes

one of the remaining two candidates and announ ces it to the cen tr al oﬃce.

Finally, the director of the central oﬃce assigns the remaining candida te to

be an RA at ΠBΦ.

(a) Model this as an exten sive form ga m e (usin g a gam e tree) wh ere a

player’s most preferred candidate gives a payoﬀ of 2, the second gives a

pa y oﬀ of 1, and the last giv es 0.

Answ er: Since ﬁrst player 1 eﬀectively removes a candidate, each of

the three choices (to v eto) of player 1 are followed b y two possible veto

c hoices of play er 2:

(b) Find the subgame perfect equilibria of this gam e. Is it unique?

Answ er: If player 1 vetoes  or  then player 2 will veto , and if play er 1

vetoes  then player 2 will v eto . By bac kward induction, an ticipating

player 2’s behavior play er 1 will v eto candidate . This is the unique

8. Credibility and Sequential Rationality 155

subgam e perfect equilibrium by bac kward induction resulting in pa yoﬀs

of (2 1). ¥

Answ er: Yes. Play er 2 can threaten to veto candidate  whenever player

1 vetoes either  or , and v eto candidate  when player 1 v etoes .Player

1’s best reply to this strategy is to veto either  or . The players will

both be playin g best responses on the equilibrium path but play er 2

is not pla ying a best response follo wing the c hoice of pla yer 1 to veto

candidate . Hence, this is a Nash equilibrium that is not subgam e

perfect.

(d) No w assume that before the t wo players play the gam e, player 2 can

send an alienating E-mail to one of the candidates, which would result

in that candidate withdrawing her application. Would player 2 ch oose

to do this, and if so, with wh ich candidate ?

Answ er: Player 2 would lik e to send the email to candidate .That

way, only candidates  and  will be in the pool and both players will

veto ,resultinginthepayoﬀs (1 2) which are better for player 2 than

theuniquesubgameperfectequilibriumpayoﬀs derived in part (a). ¥

12. Agenda Setting: An agenda-setting game is described as follow s. The “issue

space” (set of possible policies) is an int erval  =[0 5].AnAgendaSetter

(player 1) proposes an alternative  ∈  against the status quo  =4.

Afterplayer1proposes, the Legislator (player 2) observ es the proposal

and selects between the proposal  and the status quo . Player 1’s most

preferred policy is 1 and for any ﬁnal policy  ∈  his pa yoﬀ is given by



()=10− | − 1|

where | −1| denotes the absolute v alue of ( −1). Player 2’s most preferred

policy is 3 and for any ﬁnal policy  ∈  her pay oﬀ is given by



()=10− | − 3|

156 8. Credibility and Sequential Rationality

That is, each player prefers policies that are closer to their m ost preferred

policy.

(a) Write the game down as a normal form game. Is this a game of perfect

or imperfect information?

Answ er: There are two players,  ∈ {1 2} with strategy sets 

=  =

[0 5] and 

= { } where  denotes accepting the proposal  ∈ 

and  means rejecting it and adopting the status quo  =4. The pay oﬀs

are giv en by



(



(

10 − |

− 1| if 

= 

7 if 

= 

and



(



(

10 − |

− 3| if 

= 

9 if 

= 

(b) Find a sub game perfect equilibrium of this gam e. Is it uniqu e?

Answ er: Pla yer 2 can guarantee himself a payoﬀ of 9 by choosing ,

implying that his best response is to c h oose  if and only if 10−|

−3| ≥

9, which will ho ld fo r any 

∈ [2 4]. Player 1 would like to have an

alternative adopted that is closest to 1, which im plies that his best

response to pla yer 2’s sequentially rational strategy is to ch oose 

=2.

This is the unique subgame perfect eq uilibriu m which results in the

pa y oﬀsof(



)=(9 9). ¥

y e s, explain. If not, sho w all the Nash equilibria of this game.

Answ er: One Nash equilib rium is where pla yer 2 adopts the strategy

“I will reject anything except 

=3”Ifplayer1chooses

=3then

his pay oﬀ is 8, while an y other choice of 

is expected to yield player

1apayoﬀ of 7. H en ce, player 1s best response to pla yer 2’s proposed

strategy is indeed to c h oose 

=3and the payoﬀsfromthisNash

8. Credibility and Sequential Rationality 157

equilib riu m are (



)=(8 10). Since pla yer 2 can guaran tee himself

apayoﬀ of 9,thereareinﬁnitely many Nash equilibria that are not

subgam e perfect and that follow a similar logic: player 2 adopts the

strategy “I will reject an ything except 

= ”forsomevalue ∈ (2 4).

Play er 1 wou ld strictly prefer the adoption of  over 4, and hence would

indeed propose , and player 2 would accept the pro posal. For  =4

both players are indiﬀerent so it w o uld also be supported as a Nash

equilib r iu m . ¥

13. Ju n k Ma il Advertisin g: Suppose there is a single good that is owned b y a

single seller who values it at 0 (he can consume the good and get a pay oﬀ

of ). There is a single buyer who has a small transportation cost 0 to

get to an d back from the seller’s store, and he values the good at + .

The buyer ﬁrst decides whether to make the comm ute or stay at home, not

buy the good and receive a pa yoﬀ of 0. If the buy ers commutes to the store,

the seller can then mak e the buyer a Take-It-Or-Lea ve-It price oﬀer  ≥ 0.

The buy er can then accept the oﬀer, pay  andgetthegood,orhecanwalk

out and not buy the good. Assume that   and , are common kno w ledg e.

(a) As best as you can, dra w the extensive form of this gam e. What is the

best response of the buy er at the node where he decides whether to

accept or reject the seller’s oﬀer?

Answ er: Letthebuyerbeplayer1.Denoteby goingtothestoreand

 sta y ing home, and by  accepting or  rejecting the seller’s oﬀer.

The game can be described as follo w s:T he best response of player 1 after

the oﬀer  is to c h oose  if and only if  −  −  ≥− or  ≤

. ¥

(b) Find the subgam e perfect equilibrium of the game and show that it is

unique. Is it P areto Optimal?

Answ er: Given the buyer’s best response at the accept/reject node,

backward induction implies that the seller’s unique best response is to

oﬀer  =  because + . An ticipa ting this, the buy er knows

that if he chooses  then his payoﬀ will be −, so his unique best

158 8. Credibility and Sequential Rationality

FIGURE 8.1.

response is to choose  and the pa y oﬀs from the unique subgame perfect

equilib riu m are (



)=(0). This outcome is not P areto optimal

because if the t wo play ers trade at an y price  such that  − 

then they w o uld both be better oﬀ. ¥

compared to the subgame perfect equilibrium y ou found in (b) abo ve?

Answ er: There are inﬁnitely many suc h Nash equilibria. Fix some 

∗

∈

(  − ) and let player 1’s strategy after the proposal stage be “I w ill

accept an y oﬀer  ≤ 

∗

and reject anything else.” Giv en this strategy,

and giv en that 

∗

, player 2’s best response is to oﬀer 

∗

and pla yer

1 will therefor e wish to choose  because 

∗

−  Th e resulting

pa y oﬀs will be (



)=( − 

∗

−  

∗

)  (0). ¥

(d) No w assume that before the game is played, the seller can, at a small

cost ( −  − ) send the buyer a postcard that commits the seller

to a certain price at whic h the buyer can buy the good (e.g., “bring this

coupon and get the good at a price ”). Would the seller choose to do

so? Justify y ou r answer with an equilibrium analysis.

Answ er: Th e seller wou ld inde ed ben e ﬁt from sending such a postcard.

8. Credibility and Sequential Rationality 159

To see this, let  =  −− −0 and imagine that the seller sends he

postcard with a price 

∗

=  −  −



. The buy er wh o receiv es this card

kno w s that he can go to the store at a cost of  and pay 

∗

for the good

which w ou ld lea ve the buyer with a payoﬀ of 

=  −  − 

∗



 0,

and hence w o uld prefer to go shoppin g. The seller would receive a pa yoﬀ

of 

= 

∗

−  =  +



, which is better than no trade. This w ould

work for any price 

∗

=  −  −  for any  ∈ (0 1). ¥

14. Hy perbolic Discounting: Consider the three period example of a player

with hyperbolic discou nting described in section 8.3.4 with ln() utility in

eac h of the three periods and with discoun t factors 0 1 and 0 1.

(a) Solv e the optimal choice of player 2, the second period self, as a function

of his budget 

,  and .

Answ er: Player 2’s optimization problem is given b y

max





(

 − 

)=ln(

)+ ln( − 

)

for which the ﬁrst order condition is







−





− 

=0

which in turn imp lies that player 2’s best response function is,



(



 +1

which leav es 

= 

− 

(



+1

for consum ption in the third

period. ¥

(b) Solve the optimal c hoice of player 1, the ﬁrst period self, as a function

of ,  and .

Answ er: Pla yer 1 decides how much to allocate between his o wn con-

sump tion and that of pla yer 2 taking into account that 

(



+1

hence player 1 solv es the follo w ing proble m ,

max





(



 − 

 +1



( − 

)

 +1

)=ln(

)+ ln(

 − 

 +1

)+

ln(

( − 

)

 +1

)

160 8. Credibility and Sequential Rationality

for which the ﬁrst order condition is,







−



 − 

−



 − 

=0

which in turn imp lies that player 1’s best response function is,



()=



 + 

15. Time Inconsistency: Consider the three period example of a player with

h y perbolic discoun ting described in section 8.3.4 with ln() utility in eac h of

the three periods, with initial bud get  and with discount factors  =1and

 =

(a) Solve the optimal plan of action of a “naiv e” pla yer 1 who does not tak e

into account ho w his future self, pla y er 2, will alter the plan. What is

player 1’s optimal plan 

∗

, 

∗

and 

∗

as a function of ?

Answ er: Anaiveplayer1willsolve,

max





( − 

− 



)=ln( − 

− 

)+ ln(

)+

ln(

)

=ln( − 

− 

ln(

)

when  =

and  =1.Thetwoﬁst order conditions are,





= −

 − 

− 

2

=0

and,





= −

 − 

− 

2

=0

Solving these two equatio ns yields the solution



= 





and using 

=  − 

− 

gives,







8. Credibility and Sequential Rationality 161

(b) Let 

betheamountleftfromthesolutiontopart(a)aboveafter

player 1 consumes his planned choice of 

∗

.Given

, what is the opti-

mal plan of player 2? In what way does it diﬀer from the optimal plan

set out by player 1?

Answ er: This w as solv ed at the bottom of page 168 in the textbook.

Afterplayer1leave

for player 2, his optimization problem is giv en

max





(



− 

)=ln(

ln( − 

)

for which the ﬁrst order condition is







−

2(

− 

)

=0

which in turn imp lies that player 2’s best response function is,



(

2

which lea v es 

= 

− 

(



for consumption in the third

period. From part (a) w e kno w that 



so player 2 will c h oose





and 



. This is in contrast to what player 1 planned which

was 

= 



so pla yer 2 is o verconsuming relativ e to what player

1wouldhavewanted.¥

16. The Value of Commitmen t: Consider the three period example of a player

with hyperbolic discou nting described in section 8.3.4 with ln() utility in

eac h of the three periods and with discount facto rs  =1an d  =

.We

solved the optimal consump tio n plan of a sophisticat ed player 1.

(a) Imagine that an external entit y can enforce any plan of action that

player 1 chooses in  =1and will prevent player 2 from modifying it.

W ha t is the plan that player 1 wou ld c h oose to enforce?

Answ er: Player 1 wants to maximize,

max





( − 

− 



)=ln( − 

− 

)+ ln(

)+

ln(

)

=ln( − 

− 

ln(

)

162 8. Credibility and Sequential Rationality

when  =

and  =1.Thetwoﬁst order conditions are,





= −

 − 

− 

2

=0

and,





= −

 − 

− 

2

=0

Solving these two equatio ns yields the solution



= 





and using 

=  − 

− 

gives,







Th us, pla y er 1 would c hoose to enforce 



and 

= 



. ¥

(b) Assume that  =90. Up to ho w m uch of his initial budget  will player

1 be willing to pay the external entity in order to enforce the plan you

found in part (a)?

Answ er: If the external en tit y does not enforce the plan, then from the

analysis on pages 168-169 we know that pla yer 2 will choose 



=30

and 



=15, and player 1 will choose 



=45. The discounted

value of the stream of pa yoﬀs for pla yer 1 from this outcome is therefore,

ln(45) +

ln(30) +

ln(15) ≈ 686 

If, however, pla yer 1 can have the plan in part (a) abov e enforced then

his discoun ted value of the stream of payoﬀsis

ln(45) +

ln(225) +

ln(225) ≈ 692 

We can therefore solv e for the amount  of budget  =90that pla yer

1 w ou ld be willing to give up whic h is found b y the follo w ing equality,

ln(45 − )+

ln(225) +

ln(225) = 686 

whic h yields  ≈ 263 Hence, pla yer 1 will be willing to giv e up to 263

of his initial budget  =90in order to enforce the plan 

= 



=225. ¥

This is page 163

Printer: Opaque

Mu lti-S t a ge G a mes

1. Consider the following sim u ltan eou s mo ve game that is played tw ice (the

play er s observe the ﬁrst period outcome prior to the second period pla y):

pla yer 1

Play er 2









10,10 2,12 0,13

12,2 5,5 0,0

13,0 0,0 1,1

(a) Find all the pure strategy subgam e perfect equilibria with no discount-

ing ( =1). Be precise in deﬁn ing history conting ent strategies for both

pla yers.

Answ er: The simultaneous mo ve gam e has t wo pure strategy Nash

equilib r ia : ( )and( ), whic h implies that one of these has to be

pla yed in the second stage of the game. We kno w that an y unconditional

play of these Nash equilibria in each stage is a subgame perfect equilib-

rium of the m ultistag e game im plyin g four pure strategy Nash equilib ria

(e.g., pla yer 1 pla y s  follo wed by  regardless of what pla yer 2 chose

and player 2 plays  fo llow ed b y  regardless of what pla yer 1 did.)

164 9. Multi-Stage Games

We now construct other equilibria that are history conting ent in which

the pla yers will pla y the “reward” ( ) in the second period if they

followed t h e ﬁrst period proposed strategies giving each a p ayoﬀ of 5,

while they will play the “punishment” () if one of the players de-

viated from the proposed strategy and both will receiv e a pa yoﬀ of 1.

Note that the loss from not follow ing the ﬁrst stage proposed strategies

will be 5 − 1=4in the second period, and because  =1th en 4 is

also the discounted loss. It is therefore possible to support any pa y oﬀ

in the ﬁrst stage for whic h the best deviation is no greater then 4 with

 =1because the discounted loss from the second stage “punishm ent”

would be greater than the ﬁrst period gain. The only pair of pay o ﬀs

from which there is a greater gain than 4 is from (0 0) because one

of the players can deviate to (5 5). Hence, pick any pure strategy pair

( ) that is not ( ) or (). The follow ing is a subgame perfect

equilib r iu m: pla yer 1 plays  in the ﬁrst stage followed by  if ( )

was followed and  if it was not. Sim ilarly, player 2 plays  in the ﬁrst

stage follo wed b y  if ( ) was followed and  if it w as not. For  =1

this is a subg a me perfect equilibriu m. ¥

(b) For each of the equilibria y o u found above, ﬁnd the smallest discoun t

factor that supports it.

Answ er: The four subgame perfect equilibria that are just an uncon-

ditional sequence of one-stage Nash equilibria are equilibria for any dis-

count factor. The others, however, must guarantee that the discoun ted

loss from punishmen t is greater than the ﬁrst period gain for the player

whohasthemosttobeneﬁt from the deviation. For the ﬁrst stage

outcomes ( ) ∈ {() () ( }, the pla y er who gains most

can gain 3, and hence the discoun t factor must satisfy the inequal-

it y 3 − 4 ≤ 0 or  ≥

for these outcomes to be pla yed in the ﬁrst

stage of the subgame perfect equilibrium. For the other two possibili-

ties, ( ) ∈ {( ) ()} theplayerwhogainsmostcangainonly1,

andhencethediscountfactormustsatisfytheinequality1 − 4 ≤ 0 or

9. Multi-Stage Games 165

FIGURE 9.1. The Cen tipede Game

 ≥

for these outcomes to be played in the ﬁrst stage of the subgame

perfect equilib riu m. ¥

2. Cen tipedes revisited: Two players are playing two consecutive games.

First, they play the cen tipede game described in Figur e 9.1. After the cen-

tipede game they play the follo wing coordination game:

Player 1

Player 2



 1 1 0 0

 0 0 3 3

(a) What are the Nash equilibria of each stage game?

Answ er: The ﬁrst stage game has a unique Nash equilibrium outcome

in which player 1 plays  in the ﬁrst stage and payoﬀsare (1 1).This

can be supported in more than one M ash equilibrium (fo r ex am p le,

player 1 plays  alw ays and player 2 does as w ell, which is the subgame

perfect equilibrium, or player 1 player  always and player 2 plays 

ﬁrst and  later — there are more.) The second stage game have three

Nash equilibria. The t wo pure are ( ) and ( ) and the mixed one

has player 1 (respectively 2) play  (respectiv ely ) with probability

(b) Ho w man y pure strategies does each pla yer ha ve in the m ultistage game?

Answ er: The players have four pure strategies in the ﬁrst stage game

166 9. Multi-Stage Games

(t wo information sets with t wo actions in each). The second stage strate-

gies can be conditional on the outcomes of the ﬁrst stage, of which there

are4.(Wearedeﬁning an outcome is the pa y oﬀsoftheﬁrst stage and

not the strategies that players c h ose to obtain the payoﬀs. Unlike a ma-

trix game, these will be diﬀerent here because, as we sa w in part a.

above, there are diﬀerentcombinationsofpurestrategiesthatcanlead

to the same outcome.) Hence, there are 2

=16pure strategies for each

pla yer in the second stage, which can follo w each of the 4 ﬁrst stage

pure strategies, giving every pla yer a total of 64 pure strategies. ¥

counting ( =0). Be precise in deﬁning history contingent strategies for

both pla yers.

Answ er: In the second stage the pla yers m u st play either ( ) or

( ) for any history. With extreme discoun ting we cannot support

play in the ﬁrst stage that is not a Nash equilibrium because there is no

second stage “punishment” that can deter ﬁrst stage deviations. Hence,

in the ﬁrst stage the pla yers must play a subgame perfect equilibrium

of the ﬁrst stage game whic h is  alw a ys for pla yer 1 and  always for

pla yer 2. Hence, there are t wo possible outcomes that can be supported

b y a subgame prefect equilibrium, (1 1) followed b y (1 1) or by (3 3).

Ho w ever, there are 2

=16pure strategy subgam e perfect equilibria

because for each of the 4 outcomes of the ﬁrst stage th e players m u st

specify which of the 2 equ ilib r ia ( ) or () will be pla y ed in the

second stage. ¥

(d) No w let  =1. Fin d a subgam e perfect equilibrium for th e two-stag e

game in w hich the players receiv e the payoﬀs(2 2)intheﬁrst stage-

game.

Answ er: To get (2 2) in the ﬁrst stage pla yer 2 m ust overcome the

temp tation to choose  at his ﬁrstmoveandget3 instead. Hence, w e

can use the follo w ing conditional strategies in the second stage: player 1

(respectiv ely 2) pla ys  (respectively )iftheoutcomeoftheﬁrst stage

9. Multi-Stage Games 167

was (2 2) while they play  and  otherwise. In the ﬁrst stage player 1

will pla y  followed by  a nd player 2 will play  followed by .Player

1 has no reason to deviate in the ﬁrst stage, and neither does player 2

because the gain of 1 from deviating in the ﬁrst stage is less than the

loss of 2 in the second stage. ¥

(e) What is the lo west value of  for which the subgame perfect equilibriu m

y ou found in (d) survives?

Answ er: The pain from deviation will de ter pla yer 2 if and o nly if

1 − 2 ≤ 0 or  ≥

. ¥

(f) For  greater than the value y ou found in (e) abov e, are there other

outcomes of the ﬁrst stage cen tipede game that can be supported as

part of a subgame perfect equilibrium?

Answ er: Yes — the exact same idea can be used to support any of the

other outcome s because the player who is tempted to deviate will gain

1intheﬁrst period. ¥

3. Campaigning Adds: Two political candidates are sc hedu led to campaign

in two states, in one in period  =1andintheotherin =2. In each state

they can either choose a positiv e campa ign that promotes their ow n agenda

( for pla yer 1,  forplayer2)oranegativeonethatattackstheiropponent

( for player 1,  forplayer2).Residentsoftheﬁrst period state do not

mind negative campaigns, which are generally eﬀective, and payoﬀsinthis

state are given by the follow ing matrix:

Play er 1

Play er 2



 2 2 0 5

 5 0 3 3

168 9. Multi-Stage Games

In the second period state, residen ts dislike negative campaign s despite their

eﬀectiv en ess and the payoﬀs are given b y the follo w ing matrix:

Play er 1

Play er 2



 6 6 1 0

 0 1 2 2

(a) What are the Nash equilibria of e ach stage game? Find all the pure

strategy subgame perfect equilibria with extrem e discounting ( =0).

Be precise in deﬁnin g history con ting ent strategies for both players.

Answ er: The ﬁrst stage game has a unique dom inant strategy N ash

equilib r iu m () while the second stage game has two pure strategy

equilib r ia , ( ) and () and a mixed strategy equilibrium in which

each player c hooses the positive campa ign with probab ility

. In the sec-

ond stage the pla yers must play either ( ) or ( ) for any history in

a pure strategy subgam e perfect equilibrium. With extreme discounting

w e cannot support play in the ﬁrst stage that is not a Nash equilib-

rium because there is no second stage “punishm ent” that can deter ﬁrst

stage deviations. Hen ce, in the ﬁrst stage the pla yers must play ( ).

Hence, ( ) follo wed by either ( ) or ( ) will be the only out-

comes that can be supported as subg am e perfect equilibria. Howev er,

there are 2

=16pure strategy subgame perfect equilibria because for

each of the 4 outcomes of the ﬁrst stage the players m ust specify wh ich

of the 2 equilibria ( ) or ( ) will be played in the second stage. ¥

(b) No w let  =1. Fin d a subgam e perfect equilibrium for th e two-stag e

game in which the play ers c hoose ( )intheﬁrst stage-game.

Answ er: We can use the conditional second stage strategies in whic h

pla yer 1 (respectiv ely 2) plays  (respectively )ifthechoiceinthe

ﬁrst stage was ( ) while they play  and  other w ise. In the ﬁrst

stage neith er player w ants to deviate from ( ) because the gain of

switching actions is 3 (from 2 to 5) while the loss from the punishmen t

9. Multi-Stage Games 169

in the second stage is 4 (and it is not discoun ted so it’s value remains

4). ¥

y ou found in (b) survives?

Answ er: The discounted punishment m ust be at least as high as the

gain from deviation, so the inequalit y is 3 − 4 ≤ 0,andthesolutionis

 ≥

 ¥

(d) Can you ﬁnd an subgame perfect equilibrium of this game whe re the

players pla y something other than ( ) or ( ) in the ﬁrst stage?

Answ er: The same logic as that for parts b. and c. follows to support

the pairs of actions ( ) and () in the ﬁrst stage. In eac h of these

proﬁles one pla yer will gain 3 by devia ting to his preferred choice, and

the loss in the second stage with properly deﬁned contingen t strategies

is 4, so if  ≥

the punishmen t will suﬃce to support the desired ﬁrst

stage behavior. ¥

4. On lin e Gam in g: Consider a two-stage game between t wo ﬁrm s that produce

online games. In the ﬁrst stage, they play a Cournot competition game (eac h

chooses a quantity 



) with demand function  =100−, and zero marginal

production costs (



(



)=0for  =1 2) In the second stage, after observin g

the pa ir (



) and after proﬁts have been distributed , th e p layers play a

simultaneou s move “access” gam e where they can either keep their game

platforms closed, or eac h can open it’s platform to allo w players on the other

platform to play online with pla yers on their o wn platform ( for pla yer

1,  for pla yer 2), or choose to keep their platforms non-compatible ( for

player 1,  for player 2), in wh ich case each platform ’s pla yers can on ly pla y

with others on their platform. If they choose ( ) then second stage pa yoﬀs

are (0 0). If only one ﬁrm chooses to open its platform, it bears a cost of

(−10) with no beneﬁt since the other ﬁrm d id not allow to open access.

Finally, if both ﬁrms choose ( ) then eac h ﬁrm gets many more ey eballs

170 9. Multi-Stage Games

for advertising, and pay oﬀsforeachﬁrm are 2 500.Bothplayersusethe

same discount factor  to discount future payoﬀs.

(a) Find the unique Nash equilibrium in the ﬁrststageCournotGameand

all of the pure strategy Nash equilibria of the second stage access game.

Find all the pure strategy subgam e perfect equilibria with extreme dis-

counting ( =0). Be precise in deﬁning history contingent strategies for

both pla yers.

Answ er: The maximization problem in the Cournot game is

max









(







)=(100−



− 



)



and the ﬁrst order condition is 100 − 



− 2



=0resulting in the best

response function 



100−



, which in turn implies that the unique

Nash equilibrium is 

= 

=33

. The second stage game is giv en by

the follo wing matr ix:

Player 1

Player 2



 250 250 −10 0

 0 −10 0 0

and it is easy to see that both ( ) and () are Nash equilibria.

Oneofthesetwowillhavetobeplayedinthesecondstageofthegame

in any subg am e perfect equilibrium. When  =0the only ﬁrst stage

play that is possible in eq uilibr ium is the unique Courno t equilibrium.

Therefor e, only t wo outcom es can result from a subgam e perfect equi-

librium: choose 

= 

=33

in the ﬁrst stage and choose either ( )

or ( ) in the second stage.

There are inﬁnitely many strategy proﬁles that will supp ort this outcome and by a subgame p erfect equilib-

rium. For exam p le, h ave th e players each cho ose 



=33

in the ﬁrst stage followed by the following contingent

strategy for the second stage: if both 

and 

are b elow 

∗

then choose “open” ( and ) w h ile if eith e r 

or 

are ab ove 

∗

then choose “not-o p en” ( and ). Notic e th a t th is is a n e q u ilib r iu m for a ny val ue o f 

∗

.If

∗

 33

the n t he sec o n d sta g e e q u ilibrium will be ( ) while if 

∗

≥ 33

the n it will be ( ).

9. Multi-Stage Games 171

(b) No w let  =1. Fin d a subgam e perfect equilibrium for th e two-stag e

game in which the play ers c hoose the monopoly (total proﬁtmaximiz-

ing) quantities and split them equally (a symm e tr ic equilib rium).

Answ er: Monopoly proﬁts in the ﬁrst stage are giv en b y maximiz-

ing (100 − ) wh ich yields  =50, and an equal split means that



= 

=25with each ﬁrm making (100 − 50)25 = 1250 in the ﬁrst

stage. Ho wever, eac h ﬁrm  is tem pted to deviate given that 



=25.

Using the best response deriv ed in part a. we know that the best

deviation is 



100−



100−25

=375 and the deviator’s proﬁts

would be (100 − 625)375 = 14063. Hence, the gain from deviating is

14063 − 1250 = 1563. Given the t wo equilibria in the second stage

game we can prevent the pla yers from deviating by introducing the fol-

lo w ing cont ingent strategies: each pla yer will play “open” if both play ed





=33

in the ﬁrst stage and th ey will play “n ot open” if any other

c ho ices were made. With  =1the losses from the punishment out weigh

the gains from deviation and hence it is a subgame perfect equilibrium.

y ou found in (b) survives?

Answ er: Itmustbethecasethatthelossfromdeviationisatleastas

painful as the gain, that is, 1563 − 250 ≤ 0,or, ≥ 06252 ¥

(d) No w let  =04. Can you support a subgame perfect equilibrium for the

t wo-stage game in which the play ers choose the monopoly quantities and

split them equally? If not, w hat are the h ighest proﬁts that th e ﬁrm s

can make in a symm etric equilibrium?

Answ er: From the analysis in part c. we know that for 06252

w e cannot support the split of monopoly proﬁts as a subgame perfect

equilibr ium. Finding the highest proﬁts that can be supported is a bit

tric ky. The easy part is starting with the discounted punishment value

when  =04,whichis04×250 = 100. Next w e need to ﬁnd a symme tric

pair (



)=(

∗



∗

) for whic h the extra proﬁts from deviating to the

172 9. Multi-Stage Games

best response to 

∗

given that the other ﬁrm sticks to 

∗

is exactly equal

to 100. First, the proﬁts from sticking to 

∗

will be 

∗

=(100− 2

∗

)

∗

Next, the best response to 

∗

is 

100−

∗

and the proﬁts from this

deviation are 

=(100−

∗

−

100−

∗

)

100−

∗

,andtherefore

∗

must solve



− 

∗

=100,or

(100 − 

∗

−

100 − 

∗

)

100 − 

∗

− (100 − 2

∗

)

∗

=100

which yields the solution 

∗

=26

. Hence, for  =04 the best symmet-

ric equilibrium has both players earning (100 − 2(26

))26

=1244

the ﬁrst period followed by 250 in the second. ¥

5. Campaign Spending: Two political candidates are destined to play the

follo w ing t wo stage game. Assume throughout that there is no discounting

( =1). First, they compete in the primaries of their part y. Eac h candidate

 can spend 



≥ 0 reso urces on adds that reach out to voters, whic h in

turn increases the probability that candidate  wins the race. Giv en a pair

of spending choices (



), the probabilit y that candidate  wins is given by







+

. If neither spends any resources then eac h wins with probability

.Each

candidate values winning at a pa y oﬀ of 16  0, and the cost of spending 



equal to 



. After eac h player observes the resources spen t by the other, and

a winner in the primaries is selected, they can choose how to in teract. Each

can choose to be pleasant ( forplayer1and forplayer2)ornasty( and

 respectively). A t this stage, both pla yers prefer that they be nice to each

other rather than nast y, but if a pla yer is nast y then the other prefers to be

nasty too. The pa y oﬀs from this stage are given by the matrix where 0:

Play er 1

Play er 2



   −1 0

 0 −1 0 0

(a) Find the unique Nash equilibrium of the ﬁrst stage game and the t wo

pure strategy Nash equilibria of the second stage game.

9. Multi-Stage Games 173

Answ er: In the ﬁrst stage player  maximizes





(















+ 



16 − 



and the ﬁrst order condition is

16



(



+ 



)

− 1=0

which is of course symmetric for both pla yers and rep resents the best

response correspondence. Solving the tw o F OCs simultaneously yields



= 

=4as the unique N ash equilibrium of the ﬁrst stage game and

each candidate wins with probab ility

. The two pure strategy Nash

equilibria of the second stage game are ( ) and (). ¥

(b) What are the P areto optim al outcomes of eac h stage game?

Answ er: In the ﬁrst stage th e symmetric Pareto optimal outcome is

for both to choose 

= 

=0. This way they win with probability

each without w asting an y resources.

It is easy to see that the Pa reto

optim al outcome of the second stage is ( ). ¥

outcomes as a subgam e perfect equilibrium?

Answ er: Because there is no discounting ( =1)thenthevalueof

thethreatofcontingentpunishmentinthesecondstageis for each

player because the conditional strategies will be “we pla y ( ) if we

did the righ t thing in stage 1 and otherwise we pla y ( ).” If we wish

to support the Pa reto optimal action of 

= 

=0in the ﬁrst stage

w e need to see what the deviation payoﬀ is. A pla yer  who deviates to

an inﬁnitesimal value 



 0 will win for sure and get 16 −



instead of

Strictly sp eaking, there is no other Pareto optimal outcome because of the continuous action spaces. If player

 chooses 



=0then if player  chooses 



= 0 then this is b etter for player  than choosing 



=0, bu t it is

not Pareto optimal b ecause if instead player  chooses 





then he is better oﬀ without making player 1 worse

oﬀ. T his is a technicality in the sense that the “Pareto frontier” includes the p oint (



)=(0 0) but any other

pair of feas ib le s tr a te g ie s is Pa ret o do m i n a te d for a t le as t o n e player.

174 9. Multi-Stage Games

getting

×16−0=8, so that the gains from deviating are inﬁnites im ally

close to 8. Hence, if  =8then no player will wish to deviate from the

proposed path of play. ¥

(d) Assume that  =1. What is the “best” symmetric subgame perfect

equilibrium that the players can support?

Answ er: Themostseverethreatisthattheplayerslose =1so the

gainfromdeviatingcannotbemorethan1.Wearethereforelooking

for a symmetr ic ch oice in the ﬁst stage, 

= 

∗

such that if some

player  deviates to the best response to 



= 

∗

thenhisgainintheﬁrst

period is an expected payoﬀ of 1. First note that if both play e rs choose



∗

then eac h gets a payoﬀ in the ﬁrst stage of 8 − 

∗

because they win

with equal probabilit y. Now consider the ﬁrst order condition derived in

part a. abov e. From it we can derive the best response function of each

player to be 



(



)=4

√





− 



. This implies that the best response to





= 

∗

is 



(

∗

)=4

√



∗

− 

∗

, and if this is what player  deviates to

then his expected payoﬀ in the ﬁrst stage gam e is





(



(

∗

)

∗

√



∗

− 

∗

√



∗

− 

∗

+ 

∗

16 − (4

√



∗

− 

∗

)

=16− 8

√



∗

+ 

∗

The best symmetric equilibrium will be ach ieved when the gains from

deviating are exactly equal to 1, or

16 − 8

√



∗

+ 

∗

− (8 − 

∗

)=1,

which results in 

∗

− 2

√

2 ≈ 16716. ¥

(e) What happens to the best symmetric subgame perfect equilibrium that

the players can support as  c han ges? In what way is this related to

theroleplayedbyadiscountfactor?

Answ er: If  increases then w e can have a harsher punishment, and

this can allo w us to deter more attractive deviations that will happen

when we try to implemen t a smaller v alue of 

∗

in th e ﬁrst stage. As 

9. Multi-Stage Games 175

increases to wards 8 w e can get closer and closer to the Pareto optimal

outcome of 

∗

=0. This carries the same intuition as a higher discount

factor, whic h mak es the punishment more sev ere for deviations in the

ﬁrst stage. ¥

6. Augmented Competition: Consider two ﬁrmsplayingatwostagegame

with discoun t factor .Intheﬁrst stage they pla y a Cournot quan tity setting

game where eac h ﬁrm has costs 



(



)=10



for  ∈ {1 2} and the demand

is given by ()=100−  where  = 

+ 

. In the second stage, after

the results of the Cournot game are observ ed , the ﬁrms play the follow in g

standard setting game:

Player 1

Player 2



 100 100 0 0

 0 0 300 300

(a) Find the unique Nash equilibrium of the ﬁrst stage game and the t wo

pure strategy Nash equilibria of the second stage game.

Answ er: In the ﬁrst stage game each player  maximizes (100 − 



−





)



− 10



which yields the best response function 



90−



and the

unique Nash equilibrium is 



= 



=30. The two pure strategy Nash

equilibria in the second stage are ( ) and ( ). ¥

(b) As far as the t wo ﬁrm s are considered, what are the symm etric P areto

optimal outcomes of eac h stage game?

Answ er: In the ﬁrst stage game it is splitting the monopoly proﬁts and

in the second stage it is ( ) because 300  100. M onopoly proﬁts in

the ﬁrst stage are earned when w e maximize (100 − 



− 



)(



+ 



) −

10(



+ 



) which is obtained when 



+ 



=45. Hence, the symmetric

Pareto optimal outcome is 

= 

=225 in the ﬁrst stage, whic h yields

apayoﬀ of 10125 for each pla yer, and ( ) in the second, whic h yields

apayoﬀ of 300 for eac h pla yer. ¥

176 9. Multi-Stage Games

a subgame perfect equilibrium?

Answ er: In the P areto optimal outcome of the ﬁrst stage each ﬁrm

earns a proﬁtof is tempted to deviate giv en that 



=225.Using

the best response derived in part a. we kn ow that the best deviation

is 



90−



90−225

=3375 and the deviator’s proﬁts w ould be

(100−225−3375)3375−10(3375) = 11390625. Hence, the gain from

deviating is 11390625 −10125=1265625. Given the t wo equilibria in

the second stage game w e can try and prev ent the players from deviating

by using contingent strategies: each player will pla y  (or )ifboth

played 



=225 in th e ﬁrst stage and they w ill pla y  (or )ifany

other choices w ere made. This will cause the deviating pla y er a loss of

200 in the second stage, and for this to deter the best deviation in the

ﬁrst stage it must be th at 1265625 − 200 ≤ 0,andthesolutionis

 ≥ 063281. ¥

(d) Assume that  =05. W hat is the “best” sy m m etric subgam e perfect

equilibrium that the players can support?

Answ er: Finding the highest proﬁts (“best”) that can be supported as a

subgam e perfect equilibrium is a bit tricky. The easy part is starting with

the discounted punishment value when  =05,whichis05×200 = 100.

Next we need to ﬁnd a symm etric pair (



)=(

∗



∗

) for w hic h

the extra proﬁts from deviating to the best response to 

∗

given that

the other ﬁrm sticks to 

∗

is exactly equal to 100. First, the p roﬁts

from stick ing to 

∗

will be 

∗

=(100− 2

∗

)

∗

− 10

∗

. Next, the best

response to 

∗

is 

90−

∗

and the proﬁts from this deviation are



=(100− 

∗

−

90−

∗

)

90−

∗

− 10(

90−

∗

), and therefor e 

∗

must solve



− 

∗

=100,or

(100−

∗

−

90 − 

∗

)

90 − 

∗

−10

90 − 

∗

−[(100−2

∗

)

∗

−10

∗

]=100

9. Multi-Stage Games 177

and the solution is 

∗

=23

.Hence,for =05 the best symm etric

equilibr iu m has each pla yer earning (100−2(23

))23

−10(23

) = 1011

in the ﬁrst stage follo wed by 300 in the second. ¥

(e) What happens to the best symmetric subgame perfect equilibrium that

the players can support as  drops towards zero?

Answ er: As  d rop s towards zero the ability to punish becom es less

eﬀective and the best subgam e perfect equilibrium quan tities in the ﬁrst

stage will grow un til they reach the Nash (Co urn ot) equilibrium of the

ﬁrst stage, 

= 

=30. ¥

178 9. Multi-Stage Games

This is page 179

Printer: Opaque

Repeated Games

1. Medicare Drug Policy: In early 2005 there w a s a discussion of a proposed

policy of the US federal administratio n that supported the use of so called

“discount cards” that pharm aceutical ﬁrm s can oﬀer senior citizens for the

purchase of medication s. These cards will have a subscription fee, and they

will in return oﬀer discounts if prescription drugs are bought through the

issuing companies. The federal administration argued that any of the large

pharmaceutical companies can enter this market for discoun t cards, which in

turn will promote competition. To ensure this the governmen t has a website

with posted prices and posted discounts that go with eac h card. Som e con-

sume r adv ocates suggest that the compan ies will just hike up prices and oﬀer

a discount over this higher prices, resulting in less welfa re for consumers. The

administra tion argued that this does not make too m uch sense because there

is entry and competition. Can y ou argue, usin g some formal id eas on tacit

collusion, that the way things are set up it is in fact possible, and maybe even

easier, for the ﬁrmstosqueezemoreproﬁts at the expense of consumers?

Answ er: By having a central place in whic h prices are posted the govern-

ment mak es it easy for companies to monitor eac h other’s prices, and this in

turnmakesiteasiertosustaintacitcollusionbecauseitcompanieswhodevi-

180 10. Repeated Games

ate from the tacit agreement will be easily detected b y the other companies.

2. Grim Trigger: Consider the inﬁn itely repeated game with discoun t factor

1 of the following variant of the Prisoner ’s dilemma:

player 1

Play er 2









6 6 −1 7 −2 8

7 −1 4 4 −1 5

8 −2 5 −1 0 0

(a) For which values of the discoun t factor  can the play ers support the

pair of actions ( )playedineveryperiod?

Answ er: The grim trigger strategy is to revert to pla ying ( ) forever

yielding a discoun ted sum of payoﬀs(andanaveragepayoﬀ) equal to 0.

The discounted sum of pa yoﬀsfromstickingtothepair( ) forever is

1−

. A player who deviates gets 5 instead of 4 in the period of deviation,

but then gets 0 thereafter. Hence a deviatio n will not be proﬁtable if

1−

≥ 5,or ≥

 ¥

(b) For whic h values of the discoun t factor  can the pla yers support the pair

of actions () played in ev ery period? Why is y our answer diﬀeren t

from part (a) above?

Answ er: The discoun ted sum of payoﬀsfromstickingtothepair()

fore ver is

1−

. A player who deviates gets 8 instead of 6 in the period

of deviation, but then gets 0 thereafter using grim trigger. Hence a

deviation will not be proﬁtable if

1−

≥ 8,or ≥

 ¥

10. Repeated Games 181

3. Not so Grim Trigger:Considertheinﬁnitely repeated Prisoner’s Dilemma

with discount factor 1 described b y the follow ing m atrix:

Player 1

Player 2







4 4 −1 5

5 −1 1 1

Instead of using “grim trigger” strategies to support a pair of actions (



)

other than ( ) as a subgame perfect equilibriu m , assum e that the player

wish to c hoose a less draconian punishm ent called a “length  punishment”

strategy. Nam ely, if there is a deviation from (



) then the play ers will pla y

( )for periods,andthenresumeplaying(



). Let 



be the critical

discount factor so that if 



then the adequately deﬁned strategies will

implem ent the desired path of play with length  punishment as the threat.

(a) Let  =1. What is the critical value 

to support the pair of actions

() played in every period?

Answ er: T he proposed one period punishment mean s that instead of

getting 4 for the period after deviation, the players will get 1, and after-

wards will resort to getting 4 forev er. Hen ce, the punishment is of size

3 and the discounted value is 3. The gain from deviating in one period

is getting 5 instead of 4 so this will be deterred if 1 ≤ 3 or  ≥

(b) Let  =2. Wha t is the critical value 



to support the pair of actions

() played in every period?

Answ er: The proposed two period punishmen t means that instead of

getting 4 for the two periods after deviation, the players will get 1,

To see this using the w hole stream of payoﬀs, sticking to ( ) yields

1−

while dev iating w ith th e th reat

of a one p eriod punishment will yield 5+1+

1−

andthisisnotproﬁtable if

1−

≥ 5+1+

1−

,which

can b e rewritten as 4+4+

1−

≥ 5+1+

1−

 whichinturnreducesto3 ≥ 1.

Helpful hint: You should encounter an equation of the form 

− ( +1) +1=0 fo r which it is e a sy to s ee

that  =1is a roo t. In this case, you kn ow that th e equ ation can be w ritten in the form ( − 1)(

+  − 1) = 0

and solve for the other relevant ro ot of the cubic equation.

182 10. Repeated Games

and afterwards will resort to getting 4 forev er. H ence, the discoun ted

punishment is ( +

)3. The gain from deviating in one period is getting

5insteadof4sothiswillbedeterredif1 ≤ (+

)3,or ≥

√

7−

≈

026376.

diﬀer and what is the intuition for this?

Answ er: Thepunishmentinpartb. last for two periods which is more

severe than the one period pun ishm ent in part a. This means that it

can be supported with a lower discoun t factor because the in tensity of

the punishment is increasing either in the length or when we ha ve less

discounting. ¥

4. Trust oﬀ-th e-e q u ilib riu m-path : Recall the trust game depicted in Figure

10.1. We argued that for  ≥

the follow ing pair of strategies is a subgame

perfect equilibrium . For pla yer 1: “in period 1 I will trust player 2, and as as

long as there w ere no deviation s from the pair () in any period, then I

will continue to trust him. Once such a deviation occurs then I will not trust

him forever after.” For player 2: “in period 1 I will cooperate, and as as long

as there w ere no deviations from the pair () in any period, then I w ill

con tinue to do so. O n ce such a d eviation occurs then I w ill deviate forever

after.” Show that if instead pla yer 2 uses the strategy “as long as pla yer 1

trusts me I will cooperate” then the path () pla yed forever is a Nash

equilibrium for  ≥

but is not a subgam e perfect equilibriu m for any value

of .

Answ er: It is easy to see th at this is a Nash equilibrium: th e equilibrium

path is followed because neither player beneﬁts from deviating as they both

believe that a deviation will call for the continuation of grim trigger. To

see that it is not subgame perfect consider the subg am e that follow s after

To see this using the whole stream of payoﬀs, sticking to ( ) yields

1−

wh ile d e viatin g with t h e t h re a t o f

a two p erio d punishm ent w ill yield 5+1+

1+

1−

and this is not proﬁtable if

1−

≥ 5+1+

1+

1−

This can either b e solved as a cubic inequality or can b e rewritten as 4+4+

4+

1−

≥ 5+1+

1+

1−



whichinturnreducesto( + 

)3 ≥ 1.

10. Repeated Games 183

a deviation of pla yer 2 from  to . The strategy of pla yer 1 is to not

trust forever which will revert the payoﬀ to 0 in ev ery period, but pla yer 2’s

strategy is to cooperate as long as player 1 tru sts. So, after a deviation by

player 2, if player 1 believes that pla yer 2 will inde ed cooperate then player

1 should con tinue to trust. ¥

5. Negativ e Ad Cam p aigns (revisited): Reca ll the exer cise from cha pter

?? in whic h eac h one of two political parties can choose to b uy time on

commercial radio shows to broadcast negative ad campaigns against their

rival.Thesechoicesaremadesimultaneously.Duetogovernmentregulation

it is forbidden to buy more than 2 hours of negativ e campaign time so that

eac h party cannot choose an amount of negative campaigning above 2 hours.

Given a pair of choices (



),thepayoﬀ of part y  is given b y the follo w in g

function: 



(



)=



− 2



+ 







− (



)



(a) Find the uniqu e pure strategy Nash equilibrium of the one shot game.

Answ er: Eac h player maximizes 



(



)=



− 2



+ 







− (



)

resulting in the ﬁrst order optimality condition 1+



−2



=0, yieldin g

the best response function,





(



1+





Solving the two best response functions sim u ltaneously,



1+

and 

1+

yields the Nash equilibrium 

= 

=1, and this is the unique solution

to these equations implying that this is the unique equilibrium. Each

player obtains a payoﬀ of −1. ¥

(b) If the parties could sign a binding agreement on how much to campaign ,

what levels would they c hoose?

Answ er: They would choose 

= 

=0and each would obtain a

pa y oﬀ of 0. ¥

184 10. Repeated Games

demonstrates the choices and pa yoﬀs per period. For which discount fac-

tors  ∈ (0 1) can the levels you found in part (b) above be supported

as a subg ame perfect equilibrium of the inﬁnitely repeated game?

Answ er: Consider the grim trigger strategy where the pla yers will re-

v e rt to playing the one-shot Nash forever after a deviation. The temp -

tation to deviate from 0 is the v alue a play er gains when he chooses the

best response to 0, which is 



, whic h yields the one shot pay oﬀ of

Hence, the deviation will not be proﬁtable if

−

1−

≤ 0,or ∈ [

 1).

(d) Despite the parties ab ility to coordinate as you have demon strated in

y o ur answer to (c) abo ve, the government is concerned about the parties

ability to place up to 2 hours a day of negative cam p aign in g, and it is

consid erin g lim itin g negative campa ign ing to

hour a day so that now





∈ [0

] Is this a good policy to further limit negative campaign s?

Justify your answer with the relevant calculation s. Wha t is the intu ition

foryourconclusion?

Answ er: If this were just a one shot game then the government’s regu-

lation wo uld be beneﬁcial. Instead of c h oosing 

= 

=1they w ould

choose 

= 

and receive −

each in stea d of −1. Howev er, for

the repeated game this regulation makes the grim trigger threat less

severe, and cooperation on spending nothing can only be ac h ieved if

− 

05

1−

≤ 0,whichholdsfor ∈ [

 1). Hence, for  ∈ [



) the play-

ers will no longer be able to achieve the P ar eto optimal outcome using

repeated game cooperation, making this regulation a bad idea. ¥

6. Regulating Medications: Consider a ﬁrm(player1)thatproducesaunique

kind of drug that is used by a consumer (pla yer 2). This drug is regulated b y

the go vernmen t so that the price of the drug is  =6.Thispriceisﬁxed, but

the quality of the drug depends on the man ufacturing procedure. The “good”

() manufacturing procedure costs 4 to the ﬁrm , and yields a value of 7 to

10. Repeated Games 185

FIGURE 10.1.

theconsumer.The“bad”() man ufactur ing procedure costs 0 to the ﬁrm,

and yields a valu e of 4 to the consumer. The consumer can ch oose whether

to buy or not at the price , and this decision mu st be made before the ac-

tual manufa cturing procedure is revea led. However, after consumptio n, the

true qualit y is revealed to the consumer. The ch oice of m anufacturing pro-

cedure,andthecostofproduction,ismadebeforetheﬁrm kno w s whether

theconsumerwillbuyornot.

(a) Dra w the game tree and the matrix of this game , and ﬁnd all the Nash

equilib ria of this game.

Answ er: Let play er 1 be the ﬁrm who can c hoose  (good) or  (bad),

and player 2 is th e consumer who can choose  (purc hase) or  (not

purc hase). If, for example, the pla yers choose (  ) then the ﬁrm gets

6−4=2and the consumer gets 7−6=1. In a similar way the complete

matrix of this one shot game can be represen ted as follo w s:

Play er 1

Play er 2



 2 1 −4 0

 6 −2 0 0

The extensiv e form game tree is,¥

(b) No w assum e that the game described abo ve is repeated t w ice. (The con-

sumer learns the qualit y of the p roduct in each period only if he con-

186 10. Repeated Games

sumes.) Assume that each pla yer tries to maxim ize the (non-discoun ted )

sum of his stage pa yoﬀs. Find all the subg ame-perfect equilibr ia of this

game.

Answ er: Itiseasytoseethatplayer1hasadominantstrategyinthe

stage game: c hoose , and pla yer 2’s best response is to choose .This

unique N ash equ ilibriu m m ust be pla yed in the seco nd sta ge, and by

backward induction must also be pla yed in the ﬁrst stag e. hence, it is

theuniquesubgameperfectequilibrium.

that eac h pla y er tries to maximize the discounted sum of his or her

stage payoﬀs, where the discoun t rate is  ∈ (0 1). What is the range

of discount factors for whic h the good m anufa cturing procedure will be

used as part of a subgame perfect equilibrium?

Answ er: Consider the grim trigger strategies: pla yer 1 c hooses  and

contin ues to choose  as long as he chose  in the past and as long

as play er 2 purc hased . O therwise he chooses  forever after. P layer 2

c h ooses  and contin ues to c hoose  as long as he chose  and play er 1

chose . Otherwise he pla ys  forever after. Player 2 has no incentive

to deviate at any stage, but player 1 can gain 4 from switc h ing to  in

any period (get 6 instead of 2). He will no t have an incentive to deviate

if 4 ≤

1−

, whic h holds for  ∈ [

 1) ¥

(d) Consume r adv ocates are pushing for a lo wer price o f the drug, say 5.

The ﬁrm wants to approach the Federal trade Commission and argue

that if the regulated price is decreased to 5 then this may have dire

consequences for both consumers and the ﬁrm . Can you make a formal

argumen t using the parameters abo v e to support the ﬁrm? What about

the consumers?

Answ er: If the price of the drug is lowered to 5 then p layer 1 has a

stronger relative temptation to deviate from the grim trigger strategies

described in par t c. above. His gain from deviation is still 4, but the

gain from continuing to ch oose  is only 1 per period and not 2. Hence,

10. Repeated Games 187

hewillnothaveanincentivetodeviateif4 ≤

1−

,whichholdsfor

 ∈ [

 1). Hence, if the ﬁrm can argue that  ∈ [



) then increasing

the price from 4 to 5 w ill ca use the good equilibrium to colla ps e and

no trade will occur. The argument in favor of raising the price can be

made if  ∈ [

 1) because then the consumers beneﬁtattheexpenseof

the ﬁrm but there is enough surplus to support the good outcome. ¥

7. Diluted Happiness: Consider a relationship betw een a bartender and a

customer. The bartender serves bourbon to the customer, and c hooses  ∈

[0 1] whic h is the proportion of bourbon in the drink serv ed, while 1 − 

is the p roportion of water. T h e cost of su pply ing such a drink (standa rd 4

once glass) is  where 0. The C u stom er , w ithou t kn owin g , decides

on wheth er or not to bu y the drink at the m ar ket price .Ifhebuysthe

drink, his pay oﬀ is  − and the bartender’s payoﬀ is  −. Assum e that

,andallpayoﬀs are common knowledge. If the customer does not buy

thedrink,hegets0,andthebartendergets−(). because the customer

hassomeexperience,oncethedrinkisboughtandhetastesit,helearnsthe

value of , but this is only after he pays for the drink.

(a) Find all the Nash equilib r ia of this game.

Answ er: The customer has to buy the drink without knowing its con-

tent, implying that the bartender has a dominant strategy which is to

choose  =0once the customer p ays for the drink. Bu t anticipating

that, the customer would not buy the drink. Hence, the unique N ash

equilibrium is for the custom er not to buy and the bartender to c h oose

 =0if he does buy. ¥

(b) No w assume that the custom er is visiting town for 10 days, and this “bar

game” will be pla yed for eac h of the 10 evenings that the customer is in

to w n . A ssum e that eac h player tries to maxim ize the (non-discounted)

sum of his stage payoﬀs. Find all subgame -perfect equilibria of this

game.

Answ er: The game just unravels: in the last period they m ust play

188 10. Repeated Games

theuniqueNashinparta. above. But then they will do the same in

the pen ultimate period, and so in un til the beginning of the game. The

unique subgame perfect equilibrium is therefore for the customer not to

buy in an y of the 10 periods and for the bartender to choose  =0in a

period where the customer buys. ¥

game as repeated inﬁnitely many times. Assume that each player tries

to maxim ize the discounted sum of his or her stage payoﬀs, where dis-

countrateis ∈ (0 1). What is the range of prices  (expressed in the

param eters of the problem) for which there exists a subgame-perfect

equilibrium in which everyday the bartender chooses  =1and the

customer buys at the price ?

Answ er: For a transaction to occur both hav e to get a non-negativ e

pa y oﬀ, implying ﬁrst that  ∈ [ ]. We will consider a subgame perfect

equilibrium with grim trigger strategies that reverts to no-pur chase if

anyone ev er deviates. Notice that the customer has no incentive to ever

deviate if  ≤  because he gains nothing or loses some positive value

from not buying. The bartender does beneﬁt in the one shot game from

deviating to  =0and obtaining  instead of  −. Given some value of

, the bartender will not deviate if  ≤

−

1−

,or ≥





(which is of course

greater than ). Hence, if





≤  then for any price  ∈ [





] there exists

a subgam e perfect equilibrium in which every day the bartender chooses

 =1and the customer buys at the price .If,however,





then no

suc h price exists. ¥

(d) For which values of  (expressed in the parameters of the problem) can

suc h a price range that you found in (5) above exist?

Answ er: Th e condition for such a subgame perfect equilibrium is that





≤  wh ich implies that  must satisfy  ≥





. ¥

10. Repeated Games 189

8. Tacit C o llu sion : There are t w o ﬁrmsthathavezeromarginalcostandno

ﬁxed cost that produce some good, each producing 



≥ 0∈ {1 2}.The

demand for this good is giv en by  = 200 − ,where = 

+ 

(a) First consider the case of Cournot competition, where each ﬁrm chooses





, and that this game is inﬁnitely repeated with a discount factor 1.

Solve for the static stage-game Cournot-N a sh equilib riu m .

Answ er: Each ﬁrm solves max





(200 − 



− 



)



so the ﬁrst order

condition is 200 −2



−



=0and the best response is 



200−



.The

uniqu e N as h equilibrium is therefore 

= 

=66

. The proﬁts of each

ﬁrm would be 4 444

. ¥

(b) For which values of  can y ou su pport the ﬁrm s equally splitting

monopoly proﬁts in eac h period as a subgam e perfect equilibriu m that

uses “trigger stra tegies”? (i.e., after one deviates from the proposed

split, they resort to the static Cournot-Na sh equilibrium thereafter).

Note: be careful in deﬁning the strategies of the ﬁrm s.

Answ er: The monopoly proﬁts is obtained from maxim izin g (200−)

which occurs at  =100with combined proﬁts being 10 000 or 



=50

and proﬁts are 5 000 for each ﬁrm. If ﬁrm  is producing 50, however,

then the best deviation for ﬁrm  is giv en by the best response, 



200−50

=75,andﬁrm ’s proﬁts in the period when it deviates are

(200−75 −50)75 = 5 625. Consider trigger strategies of the form “start

by choosing 



=50andcontinuetochoosesoaslongasbothﬁrms

follow this path, yet if an y ﬁrm ever deviates form this path rev ert to





=66

forever after.” The devia tio n will not be wor thw h ile if

5625 +



1 − 

(4444

) ≤

5000

1 − 

which holds if  ∈ [

 1). ¥





≥ 0, where the lowest priced ﬁrm gets all the demand, and in case of a

190 10. Repeated Games

tie they split the market. Solv e for the static stage-game Bertrand-N ash

equilib r iu m .

Answ er: The static Bertrand -N ash equilibrium is for each form to

choose 



=0because they hav e zero mar ginal costs. Proﬁts will be

zero for each ﬁrm. ¥

(d) For which values of  can you support the ﬁrm s splitting m ono poly

proﬁts in each period as a subgam e perfect equilibrium that uses “trigger

strategies”? (i.e., after one deviates from the proposed split, they resort

to the static Ber tran d—N ash equilibrium thereafter). Note: be careful in

deﬁning the strategies of the ﬁrms!

Answ er: The monopoly proﬁts are o btained from choosing 



=100

with combined proﬁts being 10 000 and proﬁts are 5 000 for each ﬁrm if

they split production equally. If ﬁrm  is charges 100, however, then ﬁrm

 can deviate to some price 



=100−  for  inﬁnitesimally small and

ﬁrm ’s proﬁts in the period when it deviates will be inﬁnitesim ally close

to 10 000. Consider trigger strategies of th e form “start b y choosing





=100andcontinuetochoosesoaslongasbothﬁrms follow th is

path, yet if any ﬁrm ev er deviates form this path revert to 



forever after.” Th e deviat ion w ill not be w o rthwhile if

10000 +



1 − 

(0) ≤

5000

1 − 

which holds if  ∈ [

 1). ¥

(e) No w instead of using trigger strategies, try to support the ﬁrms equally

splitting monopoly proﬁts as a subgame perfect equilibrium where after

adeviation,ﬁrm s would resort to the static Bertrand competition for

only t w o periods. For which values of  will this w ork ? Why is this

diﬀerent than you r answer in (d) above?

Answ er: Because we are only punishing for two periods, the deviation

will not be worth w hile if

10000 + (0) + 

(0) + 

5000

1 − 

≤

5000

1 − 

10. Repeated Games 191

or 

+  −1 ≥ 0, which results in  ≥

√

5 −

≈ 0618.Thereasonwe

need a larger discount factor is that the punishment is less sev ere as it

lasts for only two periods and not inﬁnitely man y. ¥

9. Negative Externalities: Two ﬁrm s are located adjacent to one another

and each im poses an external cost on the other: the detergen t that Firm 1

uses in it’s laundry business makes the ﬁsh that ﬁrm 2 catches in the lake

tastefunny,andthesmokethatﬁrm 2 uses to smoke its caugh t ﬁsh m akes

the cloth es that ﬁrm 1 hand s o ut to dry smell funny. As a consequence,

eac h ﬁrms proﬁts are increasing it its o wn production and decreasing in the

production of its neigh boring ﬁrm. In particular, if 

and 

are the ﬁrms’

production levels then their per-period (stage gam e) proﬁts are given by



(



)=(30−

)

− 

and 

(



)=(30−

)

− 

(a) Dra w the ﬁrm s’ best response functions and ﬁn d the Na sh equilibriu m

of the stage game. How does this compa re to the P ar eto optimal stage-

game proﬁtlevels?

Answ er: Each ﬁrm maximizes 

(







)=(30−



)



−



and the ﬁrst

order condition is 30 −



−2



=0, resulting in the best response func-

tion 



30−



as dra wn in the following ﬁgure:

0 10 20 30

q_1

q_2

The uniq ue Na sh equ ilib rium is 

= 

=10giving eac h ﬁrm a proﬁt

of 100. To solve for the Pareto optimal outcome w e can maximize the

sum of proﬁts,

max





(



)=(30− 

)

− 

+(30− 

)

− 

192 10. Repeated Games

and the two ﬁrst order conditions are

(



)



=30− 

− 2

− 

(



)



=30− 

− 2

− 

and solving them together yields 

= 

and the proﬁts of each

ﬁrm are 112

. ¥

(b) For whic h levels of discount factors can the ﬁrms support the P areto

optim al lev el of quantities in an inﬁnitely repeated game?

Answ er: We consider grim trigger strategies of the form “I will c h oose





=75 and continue to do so as long as both c hose this value. If anyone

ever deviates I w ill revert to 



=10forev er.” The best deviation from





=75 giv en that 



=75 is to ch oose the best response to 75 which is

30−75

=1125,andtheproﬁtfromdeviatingis(30 −7

)11

−(11

)

2025

=126

. Thus, each player will not w ant to deviate if

126

+ 

100

1 − 

≤

112

1 − 

which holds for  ∈ [

 1). ¥

10. La w Merc hants (revisited): Consider the three person game described

in section ??. A subgame perfect equilibrium was constructed with a bond

equal to 2,andawagepaidbyeveryplayer



to player 3 equal to  =01,

anditwasshownthatitisindeedanequilibriumforanydiscountfactor

 ≥ 095. Show that a similar equilibrium, where players 



trust players 



who post bonds, play ers 



post bonds and cooperate, and pla yer 3 follow s

the contract in every period, for any discoun t factor 0 1.

Answ er: First notice that the bond need not be equal to 2 because player





only gains 1 from deviating. Hence, any bond of value 1+1 will

deter play er 



from c h oosing to defect instead of cooperate. Second , notice

that fo r any w age to the third party of 1 − 1 player 



still get a

10. Repeated Games 193

positive surplus 0 from engaging the services of the third part y. Hen ce,

for any value of  ∈ (0 1),postingabondof1+ and pa y ing the third part y

1 −  guaran tees that player 



will choose to emplo y the third party and

cooperates if trusted, and in turn, 



will c h oose to trust. We are left to see

whether the third party prefers to return the bond as promised or if he would

deviate and give up the future stream of all income. By deviating the third

party pockets the bon wo rth 1+, and giv es up the future series of w ag es

1 −  for all future periods. He nc e, he will not deviate if

1+ ≤



1 − 

(2 − )

which for  ∈ (0 1) holds for  ∈ (

1+

 1). Hence, for any 

there exists

a small enough 0 for whic h the inequality above holds. ¥

11. Trading Brand Names: Show that the strategies proposed in Section ??

constitute a subgame perfect equilibrium of the sequence of trust gam es.

Answ er: Conside r any pla yer 



, 1 Under the proposed strategies, if

trust was never abused and the name w as bought up till period  −1 then ()

b y buying the name and cooperating he is guaran teed a payoﬀ of 1, ()by

buying the name and defecting he receiv es 2 but cannot sell the name to the

next pla y er 2 and hence he gets 2 −

∗

 1,and() b y not buying the name

he gets 0. He nce, for any  the strategy of 



is a best response. Consider

player 

.Ifhe() by creating the name and cooperating he is guaranteed

apayoﬀ of 1+

∗

 2,() by not creating the name he gets 0. Hence, the

strategy of 

is a best r esponse. Last, it is easy to see tha t an y pla yer 1

can expect cooperation, and hence trusting is a best response conditional on

no one ever defecting and the name being created and transmitted. ¥

12. Folk Theorem (revisited): Consider the inﬁnitely repeated trust g am e

describedinFigure10.1.

(a) Draw the convex hull of average payoﬀs.

Answ er: ¥

194 10. Repeated Games

FIGURE 10.2.

(b) Are the average pa yoﬀs (

 

)=(−04 11) in the convex hull of av-

erage payoﬀs? Can they be supported by a pair of strategies that form

a subgame perfect equilibrium for a large enough discount factor ?

Answ er: The a v erage pa y oﬀs (



 

)=(−04 11) are in the con vex

h ull of average payoﬀs. It is easy to see that the point (−04 08) is

on the line that connects the poin t (−1 2) with (0 0),andthepoint

(−04 17) thelinethatconnectsthepoint(−1 2) with (1 1). It follo w s

that the point (−04 11) is in the interior of the convex hull of payoﬀs.

How ev er, these pa yoﬀs cannot be supported by a subgame perfect equi-

librium because pla yer 1 is expected to get an a verage pa yoﬀ of −04,

but he can guaran tee himself a pa yoﬀ of 0 b y c hoosing never to trust.

the t wo play ers that yields a v erage payoﬀsthatapproach(



 

(



) as  approac h es 1.

Answ er: Firstnotethatthepoint(



) thelinethatconnectsthepoint

(−1 2) with (1 1). Tha t is, it is a weighted a verage of the two points as

follow s:

(−1 2)+

(1 1) = (



). This suggests that the average payoﬀ

wearetryingtoachieveisa

weighted average between the pairs

of actions () and (). So, consider the the follo w ing strategies:

10. Repeated Games 195

Player 2 will play  twice and then  once, and repeat this pattern

(play  in  =1 2 4 5 7 and pla y  in  =3 6 9 ). P layer 1

will pla y  ever y period. If either pla yer deviates from these proposed

strategies then both players rev ert to playing ( ) forever after. The

pa y oﬀ for pla yer 1 is,



=(1− )(1 +  + 

(−1) + 

+ ···)

=(1− )(

1 − 



1 − 

−



1 − 

)

=(1− )

1+ − 

(1 − )(1 +  + 

)

1+ − 

1+ + 

and it follo ws that

lim

→1

1+ − 

1+ + 

Similarly,



=(1− )(1 +  + 

(2) + 

+ ···)

=(1− )(

1 − 



1 − 

2

1 − 

)

1+ +2

1+ + 

and it follo ws that

lim

→1

1+ +2

1+ + 

Hence, as  → 1 the average payoﬀs from this subgame perfect equilib-

rium conv erge to (



). ¥

196 10. Repeated Games

This is page 197

Printer: Opaque

Strategic Bargaining

1. Disagreement: Construct a pair of strategies for the ultimatum game ( =

1 bargaining gam e) that constitute a Nash equilibrium , which together sup-

port the outcome that there is no agreement reac hed b y the two players and

the payoﬀs are zero to each. Show that this disagreem ent outcome can be

supported b y a Nash equilibr ium regardless of the nu mber of bargain ing pe-

riods.

Answ er: Co nsider the following strategies: player 1 oﬀers nothing to player

2( =0) and pla yer 2 only accepts if he is oﬀered all of the surplus ( =1).

In this case both players are indiﬀerent (pla yer 1 is indiﬀerent between an y

oﬀer and play e r 2 is indiﬀerent bet ween accepting and rejecting), and both

receive zero. It is easy to see that repeating these strategies for any length of

the game will still constitute a N a sh equilibr ium. ¥

2. Hold Up: Co n side ring an ultim atu m game ( =1bargaining game) where

before pla yer 1 makes his oﬀer to play er 2, pla yer 2 can in vest in the size of the

pie. If player 2 c hooses a low level of in vestmen t () then the size of the pie

is small, equal to 



while if play e r 2 c h ooses a high level of investment ()

thenthesizeofthepieislarge,equalto



. The cost to pla yer 2 of choosing

198 11. Strategic Bargaining

 is 



, while the cost of ch oosing  is 



 Assume that 







 0,









 0 and 



− 







− 



(a) What is the unique subgame perfect equilibrium of this game? Is it

Pareto Optimal?

Answ er: Solvin g this ga m e bac kward, w e kno w that the ultimatum

game has a unique equilibrium in which player 1 will oﬀer nothing to

player 2 and pla yer 2 will accept the oﬀer. Working backwards, if player 2

ﬁrst chooses the lo w lev el of investment then his payoﬀ will be −



, while

he will be worse oﬀ if he chooses the high lev el of investm ent because

−



 −



. Hence, the unique subgam e perfect equilibrium has player

2 ﬁrst choose the lo w lev el of inv estmen t, then pla yer 1 oﬀering to keep

all the value 



to him self, and ﬁn ally player 2 accepting the oﬀer and

getting −



. ¥

(b) Can y ou ﬁnd a Nash equilibrium of the game that results in an outcome

that is better for both pla yers as compared to the unique subgame

perfect equilib riu m?

Answ er: Consider the following strategy for pla y er 2: ﬁrstchoosethe

high lev el of investment, and then accept any oﬀer that gives himself at

least 



−



− for  small. Given this strategy, pla yer 1’s best response

is to oﬀer to keep 



+  for him self and 



−



− for pla yer 2. Player

2’s payoﬀ is then 



−



− −



 −



for small enough ,andplayer

1’s pa yoﬀ is 



+ 



sotheplayersarebothbetteroﬀ. ¥

3. Ev en/Odd Symmetry: In section ?? we analyzed the alternating bargain-

ing game for a ﬁnite number of periods when  w as odd. Repeat the analysis

for  even.

Answ er: Consider the case with an even n umber of rounds ∞, imply in g

that player 2 has the last mo ver advan t ages. The following bac k ward induc-

tion argumen t applies:

-Inperiod , player 1 accepts any oﬀer, so player 2 oﬀers  =0and pay o ﬀs

11. Strategic Bargaining 199

are 

=0;

= 

 −1

-Inperiod −1 (odd period — pla yer 1 oﬀers), by bac kward induction pla yer

2 should accept an ything resulting in a pay oﬀ of 

≥ 

 −1

. If player 2 is of-

fered  in period  − 1 then 

= 

 −2

(1 − ); Th is imp lies that in period

 −1 player 2 will accept an y (1 −) ≥  and by bac kward induction player

1shouldoﬀer  =1− , whic h yields play er 1 a pa yoﬀ of 

=(1− )

 −2

and 

= 

 −1



-Inperiod −2 (ev en period), conditional on the analysis for  −1,player1’s

best response is to accept an y  that giv es him 

 −3

 ≥ (1 − )

 −2

 Pla yer

2’s best response to this is to oﬀer the smallest  that satisﬁes this inequa l-

it y, and solving it with equalit y yields play er 2’s best response:  =  − 



This oﬀer follo wed by 1’s acceptance yields 

= 

 −3

 = 

 −2

− 

 −1

and



= 

 −3

(1 − )=

 −3

− 

 −2

+ 

 −1

We can con tinue with this tedious exercise only to realize that a simple pat-

tern emerges. If w e consider the solution for an ev en period  −  ( being

ev en because  is assumed to be ev en) then the bac kward induction argume nt

leads to the sequentially rational oﬀer,



 −

=  − 

+ 

···−





while for a n odd period  −  ( being odd) then the bac kward induction

argument leads to the sequentia lly rationa l oﬀer ,



 −

=1−  + 

···−



We can use this Pattern to solve for the subgame perfect equilibr ium oﬀer in

the ﬁrst period, 

 whic h b y bac kward induction m ust be accepted by player

2, and it is equal to



=1−  + 

− 

+ 

···−

 −1

=(1+

+ 

+ ···+ 

 −2

) − ( + 

+ 

+ ···+ 

 −1

)

1 − 



1 − 

−

 − 

 +1

1 − 



1+



200 11. Strategic Bargaining

andthisinturnimpliesthat



∗

= 

1 − 



1+

 and 

∗

=(1− 

 + 



1+



4. Con stant Delay Cost: Consider a two player alternating bargaining gam e

where instead of the pie shrink ing by a discount factor 1, the pla yers

eac h pa y a cost 



 0,  ∈ {1 2} to advance from one period to another. So,

if player  receiv es a share of the pie that gives him a value of 



in period

 then his pa yoﬀ is 



= 



− ( − 1)



. If the game has  periods then a

sequence of rejections results in each player receiving 



= −( − 1)



(a) Assume that  =2. Find the subgame perfect equilibrium of the game

and sho w in whic h w ay it depends on the values of 

and 

Answ er: Inthelastperiodplayer2makestheoﬀer in an ultimatum

game and will oﬀer to k eep the whole pie: 

=0and 

=1 and

player 1 is will accep t (he’s indiﬀeren t). P ayoﬀswouldbe

= −

and 

=1− 

.Goingbackwardstoperiod1,player1hastooﬀer at

least 

=1− 

to player 2 for him to accept, so the unique subgame

perfect equ ilibriu m has pla yer 1 oﬀering 1 − 

to pla yer 2, and player

2 accepts a nticipating that he will oﬀer and get 

=1in the second

period follow ing rejection. Pa yoﬀsare

= 

and 

=1− 

.Payoﬀs

therefore do not depend on 

. ¥

(b) Are there Nash equilibria in the t wo period game that are not subgame

perfect?

Answ er: Yes. Just lik e in the game w e studied with a discount factor ,

any split can be supported by a Nash equilibrium . Con sider the follow in g

strategy by player 2: reject an ything but the whole pie in the ﬁrst period

and oﬀer to keep the whole pie in the second. Pla yer 1’s best response in

the ﬁrst period is to oﬀer exactly the whole pie to player 2 because that

way he is guara nteed 0, while if he believes that player 2 will follow the

11. Strategic Bargaining 201

proposed strategy and he oﬀers anyt hin g else then he will get 

= −

and sho w in whic h w ay it depends on the values of 

and 

Answ er: Inthelastperiodplayer1makestheoﬀer in an ultimatum

game and will oﬀer to k eep the whole pie: 

=1and 

=0,and

pla yer 2 is will accept (he’s indiﬀeren t). Payoﬀswouldbe

=1− 2

and 

= −2

.Goingbackwardstoperiod2,player2hastooﬀer at

least 

=1− 2

to player 1 for him to accept, so the pay oﬀsstarting

from the second period are 

=1−3

and 

=2

−

(player 1 gets

a piece of the pie equal to 1 −2

and because this is the second period

he incurs the cost 

from the ﬁrst period.) Finally, in period 1 play er 1

must oﬀer pla yer 2 at least 2

−

so he will oﬀer exactly that, pla yer

2 will accept the oﬀer , an d the payoﬀs will be 

=1− 2

+ 

and



=2

− 

. ¥

5. Asymmetric P atience 1: Conside r a 3-period sequen t ial (alternating) bar-

gainin g model where two pla yers ha ve to split a pie worth 1 (starting with

player1makingtheoﬀer). No w the pla yers ha ve diﬀerent discoun t factors,



and 

(a) Compute the outcom e of the unique subgam e perfect equilibrium .

Answ er: In the third period pla yer 1 will get the whole pie and hence

the pa yoﬀs will be 

= 

and 

=0. Moving back to the second period,

player 2 will oﬀer player 1 

and play er 1 will accept, so the pa yoﬀsare



= 

and 

= 

(1−

).Movingbacktotheﬁr s t period, player 1 will

oﬀer to keep  such that pla yer 2 will receiv e 

=(1− )=

(1 − 

)

implying that pla yer 1 gets 

=  =1−

(1 − 

)=1− 

+ 



. ¥

(b) Show that when 

= 

then player 1 has an advantage.

Answ er: In this case 

=1− 

+ 



=1−  + 

and 

=  − 

202 11. Strategic Bargaining

implying that



− 

=1−  + 

− ( − 

)=1− 2 +2

=(1−)

+ 

 0

implying that 



. ¥

and 

give play er 2 an advantage? Why?

Answ er: For pla yer 2 to get an advan ta ge it must be that 



which

implies using the answer in part a. above that 1 −

+ 





−



or 1  2

(1 −

). This condition means that 

has to be signiﬁcan tly

greater than 

, meaning that player 2 has to be signiﬁcantly more

patient for him to have an advantage. For example, if 

is very close

to 1, then 

has to be less than

for this condition to hold, and if





then player 2 will never ha ve an advantage. The patience has to

overcome the ﬁrst and la st mover advantag e that player 1 has in this

case. ¥

6. Asymmetric Patience 2: Consid er the analysis of the inﬁnite horizon bar-

gaining model in section 11.3 and a ssum e that the pla yers ha ve d iﬀeren t

discount factors 

and 

. Find the unique subgame perfect equilibrium us-

ing the same techniques, and sho w that as 

and 

becom e closer in values,

the solution you found converges to the solution derived in section 11.3 .

Answ er: Consider a subgame in which player 1 m a kes the oﬀer. P layer 2

will not accept an oﬀer that gives him less than 



,implyingthat



≤ 1 − 



 (11.1)

and play er 2 will accept an oﬀer that gives him at least than 



, implying

that



≥ 1 − 



 (11.2)

By sym m etry, when player 2 makes the oﬀer we obtain the symmetric in-

equalities,



≤ 1 − 



 (11.3)

11. Strategic Bargaining 203

and



≥ 1 − 



 (11.4)

Subtracting (11.2) from (11.1) yields



− 

≤ 

(

− 

)  (11.5)

and similarly, subtracting (11.4) from (11.3) yields



− 

≤ 

(

− 

)  (11.6)

But (11.5) and (11.6) together imply that



− 

≤ 

(

− 

) ≤ 



(

− 

) 

and because 



 1 it fo llows that 

= 

(= 

) and 

= 

(= 

Revisiting the inequalities above, (11.1) and (11.2) imp ly that



=1− 



and (11.3) and (11.4) imply that



=1− 



and from these last two equalities we obtain that in the unique subgame

perfect equilibrium , in the ﬁrst period pla yer 1 receives



∗

1 − 





and pla yer 2 receiv es 1 −

∗



(1−

)

1−



.Nowlet

= ,andlet

approac h 

The denominator approac hes 1−

=(1−)(1+) and we get that 

∗

1+

which is th e so lution w e obtained is section 11.3 for a symmetric discou nt

factor. ¥

7. Legislative Bargaining (revisited): Consider a ﬁnite  period v ersion of

the Baron and Ferejohn legislative bargaining game with an odd number 

of players and with a closed rule as described in section 11.4.1.

204 11. Strategic Bargaining

(a) Find the uniq u e sub ga m e perfect equilibriu m for  =1.Also,ﬁnd a

Nash equilibriu m that is not subga m e perfect.

Answ er: If  =1then follo w ing a failed v ot e (a majorit y reject s the

proposer’s proposal) all the players receive a pay o ﬀ of 0. Hence, lik e in

the Rubinstein game, the proposer will ask for all the surplus and a ma-

jorit y of pla yers will vote in fa vor. No other outcome can be supported

by a subgam e perfect equilibrium . There are many Nash equilibria. For

example, some player  asks for at least 

∗



∈ [0 1] of the surplus while

all other players will settle for nothing. Then any player  6=  will oﬀer

 the amo unt 

∗



, and nothing to the other players, and all the play ers

will v ote in favor of the proposal. ¥

(b) Find the unique subgame perfect equilibrium for  =2with a discoun t

factor 0 ≤ 1 Also, ﬁnd a N ash equilibrium tha t is no t su bga m e

perfect.

Answ er: If the proposal is not accepted in period 1 then period 2 will

ha v e the unique subgame perfect equilibrium described in part a. above.

This implies that in the ﬁrst period, ever y player has an expected surplus





because they will be the proposer with proba bility



and will get

thewholesurplusof1.Thismeansthattheplayerwhooﬀer s in the ﬁrst

period must oﬀer at least





−1

other pla yers to form a m ajorit y

and have the proposal accepted. Hence, the proposing player will keep

1−

−1





to himself in the unique subgam e perfect equilibrium. Just lik e

in par t a. above, we can support an arbitrary division of the surplus

in a Nash equ ilib riu m by having some players com mit to incredib le

strategies. ¥

fect equilibrium yo u fou nd in pa rt (b) abo ve to w ha t a ﬁrst period

proposer receiv es in the t wo-period t wo-person Rubinstein-Ståhl bar-

gaining gam e. Wh at intuitively accounts for the diﬀerence?

Answ er: In the two-period two-person Rubinstein-Ståhl bargainin g

game the proposing player 1 gets 1 −  because player 2 can get the

11. Strategic Bargaining 205

whole pie in the second period. Notice that the diﬀerence bet ween the

pa y oﬀ in the Baron-Ferejohn model and the Rubinstein-Ståhl model is,

1 −

 − 1





− (1 − )=

( +1)

2





As w e can see, the ﬁrstproposerhasalotmoresurplusintheBaron-

Ferejohn m odel. This is because the responder is not one player wh o

pla y s an ultimatum game in the secon d period, bu t a group of player

from whic h a majority needs to be selected. This lets the proposer pit

the responders against each other and capture more surplus. ¥

(d) Compare the subgame perfect equilibrium y ou found in part (b) above to

the solution of the inﬁnite horizon model in section ??. What intuitively

accoun ts for the similarity?

Answ er: The share receiv ed b y the ﬁrst proposer is the same as what

w e deriv ed in equation (11.8). The intuition is that the same forces are

at work: the larger the discount factor the more the proposer needs to

giveaway,andthemorepeoplethereare,themorehehastogiveaway.

Still, he gets to k e ep at least

because of the competitiv e nature of the

situation in whic h the responder s are put. ¥