Prisoner’s Dilemma
The prisoner’s dilemma is a foundational concept in game theory that illustrates how individually rational decisions can produce a collectively worse outcome. It appears in economics, politics, environmental problems, and everyday social interactions whenever personal incentives conflict with the common good.
Classic setup and paradox
Two accomplices are arrested and interrogated separately. Each has two choices: cooperate with the other by remaining silent, or defect by testifying against the other. Typical penalties:
Explore More Resources
- Both remain silent (mutual cooperation): each gets 1 year.
- One defects, the other stays silent: defector goes free (0 years), silent accomplice gets 5 years.
- Both defect (mutual betrayal): each gets 3 years.
Each player’s dominant strategy is to defect: defecting yields a better personal outcome regardless of the other player’s choice. Yet when both follow that logic, they both receive 3 years, a worse combined result (6 years total) than mutual cooperation (2 years total). That tension between individual incentive and collective welfare is the dilemma.
Key concepts
- Dominant strategy: a move that yields a better payoff for a player no matter what the other does. In the classic dilemma, defecting is dominant.
- Nash equilibrium: a stable outcome where no player can improve their payoff by unilaterally changing strategy. Mutual defection is the Nash equilibrium here.
- Social optimum vs. individual rationality: the choice that maximizes group welfare (mutual cooperation) can be different from the individually rational choice (defection).
Real-world examples
- Tragedy of the commons: everyone benefits from exploiting a shared resource individually, which leads to overuse and depletion that harms the whole group.
- Cartels: members collectively benefit by restricting output to raise prices, but each has an incentive to cheat and increase production to gain extra profit.
- Everyday behaviors: littering, tax evasion, workplace free-riding, or arms races — situations where short-term self-interest harms collective outcomes.
Iterated prisoner’s dilemma and cooperation
When interactions repeat, strategies can condition behavior on past moves. The iterated prisoner’s dilemma allows punishment and reward over time, enabling cooperation to emerge. A widely studied strategy, “tit for tat,” begins by cooperating and then replicates the opponent’s previous move: cooperate if the opponent cooperated; retaliate if the opponent defected. Repetition, memory, and reciprocity can stabilize cooperative outcomes.
Explore More Resources
How to escape the dilemma
Mechanisms that shift incentives toward cooperation include:
- Repetition and reputation: repeated interactions create future costs for defection and incentives to maintain a cooperative reputation.
- Institutions and enforcement: laws, contracts, monitoring, and sanctions change payoffs so cooperation becomes individually attractive.
- Social norms and psychology: trust, reciprocity, group identity, and long-term orientation can bias decisions toward cooperation even when short-term incentives favor defection.
Practical implications
Understanding the prisoner’s dilemma helps explain behavior in markets, environmental policy, collective bargaining, and social problems. Designing rules, institutions, or repeated interactions that align individual incentives with collective goals is key to improving outcomes.
Explore More Resources
Bottom line
The prisoner’s dilemma shows why individually rational choices can produce suboptimal collective results and how repetition, enforcement, and social mechanisms can help overcome that gap to achieve better cooperative outcomes.