Game Theory
Extensive Form Games
'The employer will fire the worker if the worker disagrees.' 'We'll impose sanctions if conditions aren't met.' Threats in negotiations are everywhere. But not all threats are equally convincing. The extensive form and backward induction give a mathematical answer: a threat only works if it's in the threatener's interest to carry it out. This insight transformed negotiation theory, diplomacy, and corporate strategy.
- **Salary negotiation:** who makes the first offer (anchoring effect), who has a better BATNA-all determined by the game tree structure and backward induction
- **Antitrust policy:** Entry Deterrence models why the threat of a price war sometimes deters competitors and sometimes doesn't (depends on credibility)
- **Poker AI (Libratus, Pluribus):** the Carnegie Mellon bot beat world-class players by solving the problem with information sets via Counterfactual Regret Minimization
Предварительные знания
The Game Tree
Consider a salary negotiation: the employer makes an offer first, then the candidate accepts or rejects. The normal form (matrix) doesn't show who moves first. The extensive form is a decision tree where each node is someone's turn to act, each edge is a choice, and the leaves are final payoffs.
Defined by: N (players), H (history nodes), Z (terminal nodes/leaves), χ(h) (available actions at node h), ρ(h) (whose turn it is), σ (information sets), u: Z → Rⁿ (payoffs at leaves).
The key advantage of the extensive form: it explicitly shows each player's **information**. In chess both players see the full board-perfect information. In poker the opponent's cards are hidden-imperfect information. The structure of information sets fundamentally changes what optimal strategies look like.
The Ultimatum Game and behavioral economics
In Ultimatum Game experiments (Güth et al., 1982), player 1 proposes a split of a sum; player 2 accepts or rejects (rejection gives both zero). Rationally, player 2 should accept any positive offer. In practice: offers below 20-30% are routinely rejected. People will pay to punish 'unfairness'-against the predictions of theory.
Normal and extensive forms always give the same equilibria
The normal form loses information about the order of moves-it can have more Nash Equilibria, including ones based on 'unrealistic' threats. SPE is only available in the extensive form.
Subgame perfect equilibrium (SPE) is a stronger concept than NE. It rules out threats that are not credible to carry out-and such threats are invisible in the matrix.
How does the extensive form differ from the normal form (matrix)?
Backward Induction
To find the optimal strategy in a game tree, reason from the end. At the final nodes the choice is clear: each player picks the action with the best payoff. Knowing this, the tree can be 'folded' backward-penultimate nodes are solved knowing what comes next. Continue to the root. This is backward induction.
1. Find all pre-terminal nodes (whose children are all leaves) 2. At each such node, the player picks the action with max payoff 3. Replace the node with the corresponding payoff 4. Repeat until the root Result: a strategy profile that is a subgame perfect equilibrium (SPE).
| Game example | Backward induction result | Intuition |
|---|---|---|
| Ultimatum Game | Player 1 offers minimum, player 2 accepts | Any offer > 0 beats rejection |
| Rubinstein bargaining | First proposer gets δ-share | Impatience weakens bargaining power |
| Finite PD | Both defect from round one | Backward unraveling of cooperation |
Backward induction requires that all decisions be rational-including at nodes that will never be reached. This 'perfectness' assumption rules out 'non-credible threats': promises to take actions that the promiser would not rationally carry out.
Backward induction always gives a unique solution
When intermediate nodes have tied payoffs, backward induction may produce multiple solutions. Uniqueness is only guaranteed with strict preferences.
If two actions at a node give the same payoff, both are rational, leading to multiple equilibria. Strict preferences are needed for uniqueness.
Backward induction solves a game tree starting from:
Subgame Perfect Equilibrium (SPE)
The normal form of the Ultimatum Game has many Nash Equilibria-for example, player 1 offers 10% and player 2 threatens to reject anything below 50%. But this NE is unrealistic: if player 1 actually offers 10%, player 2 is better off accepting (10% > 0%). The threat was 'empty'. Subgame perfect equilibrium (SPE) rules out such threats.
A subgame is a part of the tree starting from a single node and including all its descendants. A strategy profile is an SPE if it is a Nash Equilibrium in EVERY subgame. SPE ⊂ NE: every SPE is an NE, but not every NE is an SPE.
SPE was introduced by Reinhard Selten (Nobel 1994) as a refinement of NE for sequential games. It eliminates equilibria based on 'empty threats'-actions that a player would not rationally carry out when the time comes.
A convincing public threat always changes the opponent's behavior
A threat's effectiveness depends on its credibility-whether it's actually in the threatener's interest to carry it out. SPE formally rules out threats that are not rational to execute.
An incumbent can publicly threaten a price war, but if the entrant knows that war is costly for both sides, the threat is non-credible. SPE models exactly this reasoning.
SPE differs from ordinary NE in that:
Information Sets
In poker, a player bets without knowing the opponent's cards. In game-theoretic terms: the player cannot distinguish between several nodes in the tree-'did the opponent have an ace or not?' Such a group of indistinguishable nodes is called an information set. A strategy in a game with imperfect information is a plan of action for each information set.
Perfect information: every information set contains exactly one node-the player knows the history exactly (chess, Go). Imperfect information: some information sets contain multiple nodes (poker, negotiations). SPE applies strictly only to perfect information. For imperfect information: Perfect Bayesian Equilibrium (PBE).
Information sets explain why bluffing in poker is rational. If a player only bets with a strong hand, the opponent always folds when facing a bet. The optimal strategy: bluff at the right frequency to make the opponent indifferent between calling and folding. This is a classic mixed strategy equilibrium.
Games with hidden cards cannot be formally solved by game theory
Games with imperfect information are solved via Perfect Bayesian Equilibrium or NE in mixed strategies. This is exactly how modern poker bots work.
Libratus and Pluribus solve poker via approximation of NE using Counterfactual Regret Minimization-fully within the game-theoretic framework of information sets.
What is an information set in an extensive-form game?
Key ideas
- **Extensive form (game tree):** explicitly models the order of moves and each player's information
- **Backward induction:** solve from leaves to root-finds the optimal strategy in games with perfect information
- **SPE:** rules out 'unrealistic threats'-actions that are not profitable to carry out in the corresponding subgame
- **Information sets:** model imperfect information; bluffing in poker is a rational mixed strategy equilibrium
Related topics
The extensive form is the foundation for analyzing sequential negotiations and mechanisms:
- Nash Equilibrium — SPE is a refinement of NE for games with sequential moves
- Mechanism Design — Mechanisms often use sequential moves, and the revelation principle works via backward induction
- Auction Theory — Auctions are games with bids under incomplete information about opponents' valuations
Вопросы для размышления
- Rubinstein bargaining shows: the more impatient a player (high discount factor), the weaker their bargaining position. How does this apply to real salary negotiations or M&A deals?
- Empty threats in diplomacy are common. What are examples where a threat turned out to be non-credible? Why did that happen?
- Poker is a game with imperfect information. Why can a human professional beat a simple bot, yet lose to Libratus? What does the AI bot do fundamentally differently?