Game Theory Lecture: Prisoner's Dilemma to Mixed Strategy Dominance

 76 min video

 2 min read

YouTube video ID: b7BAHSV1EBo

Source: YouTube video by MIT OpenCourseWareWatch original video

PDF

The lecture begins by moving from informal descriptions of strategic situations to the extensive‑form representation and finally to the strategic (normal) form. This progression compresses the decision tree into a payoff matrix that captures all relevant choices and outcomes. The instructor distinguishes two modes of analysis: a positive “will play” view that predicts how rational agents actually behave, and a normative “should play” view that recommends optimal strategies.

The Prisoner's Dilemma

The classic Prisoner’s Dilemma illustrates how rational behavior can be inefficient. Two prisoners each choose either to stay mum (M) or to fink (F). The payoff matrix is:

  • (M,M) = (2, 2)
  • (F,M) = (3, ‑1)
  • (M,F) = (‑1, 3)
  • (F,F) = (0, 0)

Because F yields a higher payoff regardless of the opponent’s action, F is a dominant strategy. Both players therefore select F, producing the (0, 0) outcome, which is strictly worse than the feasible (2, 2) outcome when both cooperate. As the instructor notes, “In games, rational behavior can lead to inefficiency.”

Formalizing Strategic Behavior

The lecture introduces notation for strategy profiles (S_i) and partial profiles (S_{-i}). A strategy (S_i) strictly dominates another (S_i') if it gives a higher payoff for every opponent profile (S_{-i}). Weak dominance requires at least one strict improvement and never a lower payoff. The instructor emphasizes that multiple weakly dominant strategies cannot coexist because the strict‑inequality condition would be violated.

Beliefs and Best Responses

A belief (\beta_{-i}) is a probability distribution (lottery) over opponent strategy profiles. Expected utility is linear in these beliefs, allowing the construction of a best response: a strategy that maximizes expected utility given a specific belief. The lecture shows that several strategies can be best responses to the same belief, and graphical analysis uses the “upper envelope” of linear expected‑utility functions to identify the best‑response region.

Mixed Strategies and the Fundamental Theorem

Randomization, or mixed strategies, becomes essential when no pure strategy is a best response to any belief. The instructor cites professional poker players as experts at randomizing with precisely the right probabilities, noting that “the people who are better at randomizing with exactly the right probabilities win a lot of money.”

A strategy is called reasonable if it is a best response to some belief; otherwise it is unreasonable and is strictly dominated by a mixed strategy. The fundamental theorem presented states:

For any finite strategic‑form game, a strategy is either a best response to some belief or it is strictly dominated by a mixed strategy (but never both).

This result links dominance logic to mixed‑strategy randomization: if a pure strategy never maximizes expected utility for any belief, a suitable mixture of other strategies will dominate it.

  Takeaways

  • The lecture shows how game analysis moves from informal descriptions through extensive form to the compact strategic form, distinguishing predictive “will play” from prescriptive “should play” perspectives.
  • In the Prisoner’s Dilemma, defecting (F) dominates cooperation (M), so rational players choose (F,F) even though (M,M) yields higher payoffs for both, illustrating rational inefficiency.
  • Strict dominance requires a strategy to yield higher payoffs against every opponent profile, while weak dominance allows equality except for at least one strict improvement, and multiple weakly dominant strategies cannot coexist.
  • A belief is a probability distribution over opponents’ strategies; a best response maximizes expected utility given that belief, and several strategies can be best responses to the same belief.
  • The fundamental theorem states that in any finite game a strategy is either a best response to some belief or is strictly dominated by a mixed strategy, never both, highlighting the essential role of randomization.

Frequently Asked Questions

Why does the Prisoner’s Dilemma produce an inefficient outcome despite rational behavior?

Rational players select the dominant strategy that maximizes their own payoff regardless of the other’s move. In the Prisoner’s Dilemma, defecting (F) dominates cooperation (M) for both, so each chooses F, resulting in the (0,0) outcome, which is worse than the mutually cooperative (2,2) payoff.

What does the fundamental theorem say about mixed strategies and best responses?

The theorem asserts that for any finite strategic‑form game a strategy is either a best response to some belief about opponents’ actions or it is strictly dominated by a mixed strategy, meaning randomization can replace any “unreasonable” pure strategy.

Who is MIT OpenCourseWare on YouTube?

MIT OpenCourseWare is a YouTube channel that publishes videos on a range of topics. Browse more summaries from this channel below.

Does this page include the full transcript of the video?

Yes, the full transcript for this video is available on this page. Click 'Show transcript' in the sidebar to read it.

Helpful resources related to this video

If you want to practice or explore the concepts discussed in the video, these commonly used tools may help.

Links may be affiliate links. We only include resources that are genuinely relevant to the topic.

PDF