Optimism, Overconfidence, and Moral Hazard

I. Introduction

A vast literature in psychology and economics documents the prevalence of wrong beliefs.¹ In this paper, I seek to define, distinguish, and characterize two oft-conflated senses in which beliefs can be wrong: overconfidence and optimism. An agent is overconfident if she overestimates her ability to influence (the distribution of) a payoff-relevant outcome, which I shall call “output.” By contrast, she is optimistic if her expectation of the distribution of output is unrealistically high, in the sense of first-order stochastic dominance. In simple parametric models, these two concepts typically admit natural definitions in terms of the parameters.² In this paper, I provide general model-independent definitions of overconfidence and optimism in terms of choice behavior.

I explicate the content of my definitions by characterizing their meaning in the standard moral hazard model. This model is the natural benchmark because it represents the canonical formalism in economics for studying how an agent may influence the distribution of an observable outcome (output). The question of how overconfident and optimistic behavior manifest in the moral hazard model raises a more basic question: how does this model relate to choice behavior? I answer this question by characterizing what restrictions the moral hazard model imposes on choice data and delineating the extent to which its parameters may be recovered from such data.

The environment consists of a finite set S of possible output levels and a convex set $Π \subseteq R$ of possible monetary rewards. A contract $w : S \to Δ (Π)$ specifies the agent’s (possibly random) remuneration as a function of output. In the moral hazard model, the agent influences the distribution $P_{e} \in Δ (S)$ of output by choosing effort $e \in E$ at a cost $C (e) \geq 0$ . She chooses effort optimally, randomizing if desired (by selecting an effort distribution $μ \in Δ (E)$ ). The agent’s valuation of a contract $w : S \to Δ (Π)$ is therefore

$\sup_{μ \in Δ (E)} \int_{E} [- C (e) + \sum_{s \in S} u (w (s)) P_{e} (s)] μ (d e),$

where the utility function $u : Π \to R$ describes her risk attitude.³

I first ask what the moral hazard model’s empirical content is: what testable restrictions does this model impose on contract choice data? In other words, which preference relations ⪰ on the set of all contracts admit a moral hazard representation? I give the answer in terms of six axioms, which together exhaust the testable implications of the moral hazard model.

I then show that the moral hazard model’s parameters are to a significant extent identified off contract choice data. In particular, the agent’s utility function u is identified (up to positive affine transformations), and so is her minimum cost of producing any given output distribution $p \in Δ (S)$ : that is, the quantity

$\begin{matrix} (⋆) & c (p) ≔ \inf_{\begin{matrix} μ \in Δ (E) : \\ \int_{E} P_{e} μ (d e) = p \end{matrix}} \int_{E} C (e) μ (d e) for each p \in Δ (S) . \end{matrix}$

I next define relative confidence, motivated by the idea that a confident agent is one who believes that she can significantly influence output. Specifically, I call one preference ⪰ more confident than another preference ⪰′ if whenever ⪰′ chooses a contract w over a constant contract x (one under which pay does not vary with output), ⪰ also chooses w over x. I show that for moral hazard preferences, confidence shifts are equivalent to vertical shifts of the output distribution cost c defined in (⋆): greater confidence means precisely a pointwise lower c function (and an unchanged utility function u).

Finally, I define relative optimism, building on the idea that an optimistic agent is one who believes that output is likely to be high. In the simplest case, for two preferences ⪰ and ⪰′ that both have a risk-neutral expected utility attitude to risk, I call ⪰ more optimistic than ⪰′ if and only if whenever ⪰′ chooses a contract w over another contract w′ that is less steep, in the sense that $s \mapsto E (w (s)) - E (w^{'} (s))$ is increasing, ⪰ also chooses w over w′. The general definition is similar but adjusts for risk attitude. I show that in the moral hazard model, optimism shifts correspond to horizontal shifts of the output distribution cost c in (⋆): greater optimism means a shift of the c function toward the first-order stochastically higher distributions (and no shift of the utility function u).

My definitions of relative confidence and optimism are purely behavioral, making no reference to objective facts about the agent’s actual ability to influence output. If an objective description of the agent’s influence is given, then I may additionally define what it means for a preference to be (absolutely) overconfident or optimistic: namely, “more confident than objectively warranted” and “more optimistic than objectively warranted.”

The formal results in this paper are rooted in a simple but powerful insight: that the moral hazard model is closely related to the variational model of choice under uncertainty, which was introduced and axiomatized by Maccheroni, Marinacci, and Rustichini (2006). This close relationship allows me to leverage known results about the variational model to obtain my four main results about the moral hazard model (axiomatization, identification, characterization of “more confident than,” and characterization of “more optimistic than”). Specifically, I apply results from Maccheroni, Marinacci, and Rustichini (2006) and Dziewulski and Quah (2024).

A. Related Literature

The moral hazard model is the focus of a large literature, beginning with Mirrlees ([1975] 1999) and Holmström (1979).⁴ I contribute to the behavioral strand of this literature, which examines, among other things, how greater confidence or optimism on the part of the agent shapes optimal contracting.⁵ This literature takes the moral hazard model as given and studies its implications for optimal contracts; in this paper, I provide behavioral foundations for the model itself as well as for overconfidence and optimism.

My characterization of the testable implications of the moral hazard model is designed to respect the model’s two defining features: that effort is costly and that it is unobservable. This distinguishes my analysis from previous work on the behavioral content of the moral hazard model, which has either confined attention to the special case of costless effort (Drèze 1961, 1987) or else assumed that effort is observable (Karni 2006).

I also contribute to the theoretical literature on behavioral economics, which studies models of individual decision making that can accommodate various departures from standard economic behavior that have been documented in the lab (or sometimes in the field). One strand of this literature examines behavioral models from the axiomatic perspective of decision theory;⁶ another strand is concerned with overconfidence and optimism.⁷ This paper marries these two strands. Relative to other work in that general spirit,⁸ I give new and simple definitions that distinguish overconfidence from optimism and cash out their meaning in the moral hazard model.

Finally, at a high level, this paper provides axiomatic behavioral foundations for a standard economic model (namely, the moral hazard model). Other work in this spirit includes axiomatizations of Blackwell’s (1951, 1953) model of choice informed by a signal,⁹ of the canonical model of costly information acquisition,¹⁰ and of Kamenica and Gentzkow’s (2011) Bayesian persuasion model (Jakobsen 2021, 2025).

B. Road Map

The rest of this paper is arranged as follows. In section II, I describe the environment and the moral hazard model. I introduce a canonical parsimonious moral hazard model in section III. In section IV, I characterize the moral hazard model in terms of six empirically testable axioms. I establish the model’s identification properties in section V. Finally, in section VI, I propose behavioral definitions of “more confident than” and “more optimistic than” and characterize their meaning in the moral hazard model.

II. Setting

There is one agent and a nonempty convex set $Π \subseteq R$ of possible levels of monetary remuneration (prizes). The agent’s pay $π \in Π$ can be made contingent on the realization of a contractible signal. We follow convention by calling this signal “output.” The set S of possible output realizations is assumed to be nonempty and, for simplicity, finite.

Let Δ(Π) be the set of all finite-support probability distributions over monetary prizes Π; we call each $x \in Δ (Π)$ a random remuneration. A contract is a map $w : S \to Δ (Π)$ . The interpretation is that if realized output is $s \in S$ , then the agent is paid a random amount drawn from the distribution $w (s) \in Δ (Π)$ . We write W for the set of all contracts.

A. Definitions and Conventions

A contract $w \in W$ is constant if and only if $w (s) = w (s^{'})$ for all output levels $s, s^{'} \in S$ . As is standard, we identify each constant contract (an element of W) with the random remuneration at which it is constant (an element of Δ(Π)). Similarly, we identify each degenerate random remuneration (an element of Δ(Π)) with the prize at which it is degenerate (an element of Π).

Any (utility) function $u : Π \to R$ defined on monetary prizes Π extends naturally to a linear (expected utility) function $u : Δ (Π) \to R$ defined on random remunerations via

$u (x) ≔ \int_{Π} u (π) x (d π) for every x \in Δ (Π) .$

Throughout the paper, we hold fixed an arbitrary pair $π_{0} < π_{1}$ of reference prizes in Π, and we call a (utility) function $u : Π \to R$ such that $u (π_{0}) \neq u (π_{1})$ normalized if and only if ${u (π_{0}), u (π_{1})} = {0, 1}$ . This is merely an (arbitrary) choice of units in which to measure utility.

Let Δ(S) denote the set of all output distributions, that is, probability distributions on S. For any nonempty set A, call a function $ϕ : A \to [- \infty, \infty]$ grounded if and only if $\inf_{a \in A} ϕ (a) = 0$ . In the sequel, subsets of Euclidean spaces (such as the simplex Δ(S)) are always equipped with the Borel σ-algebra, and “increasing” always means “weakly increasing.”

B. The Standard Moral Hazard Model

In the moral hazard model (Mirrlees [1975] 1999; Holmström 1979), the agent chooses her effort from a nonempty, compact, and convex subset E of a Euclidean space, for example, $E = [0, \bar{e}]$ for some $\bar{e} \in (0, \infty)$ . Each effort level $e \in E$ incurs a cost $C (e) \geq 0$ and produces a distribution $P_{e} \in Δ (S)$ of output, where the cost function $C : E \to R_{+}$ is grounded and lower semicontinuous and the map $e \mapsto P_{e}$ is continuous. The distribution P_e need not be objective; all that matters is that the agent believes that output will realize according to the probability distribution P_e if she supplies effort e.

The agent’s payoff under a contract $w \in W$ given effort $e \in E$ is

$- C (e) + \sum_{s \in S} u (w (s)) P_{e} (s),$

where $u : Π \to R$ is a strictly increasing and normalized utility function whose curvature captures the agent’s risk attitude. The agent chooses effort optimally, randomizing if desired. Her payoff under a contract $w \in W$ is therefore

$\sup_{μ \in Δ (E)} \int_{E} [- C (e) + \sum_{s \in S} u (w (s)) P_{e} (s)] μ (d e),$

where Δ(E) denotes the set of all probability measures on E.

C. Data: Preferences over Contracts

The agent’s preference over contracts, ⪰, is a binary relation on W. As usual, we write $w ≻ w^{'}$ if and only if $w ⪰ w^{'} ⋡ w$ . We interpret the preference ⪰ as (potential) data on the agent’s choice behavior: in particular, her choices when presented with any given pair w, $w^{'} \in W$ of contracts. In other words, ⪰ is an empirical object.

A binary relation ⪰ on W is a moral hazard preference exactly if it may be represented by a moral hazard model: that is, if and only if there is a nonempty, compact, and convex (effort) set $E \subseteq R^{n}$ (where $n \in N$ ); a grounded and lower semicontinuous (cost) function $C : E \to R_{+}$ ; a continuous (belief) map $e \mapsto P_{e}$ carrying E into Δ(S); and a strictly increasing and normalized (utility) function $u : Π \to R$ , such that for any contracts $w, w^{'} \in W$ , $w ⪰ w^{'}$ holds if and only if

$\begin{aligned} \sup_{μ \in Δ (E)} \int_{E} [- C (e) + \sum_{s \in S} u (w (s)) P_{e} (s)] μ (d e) \\ \geq & \sup_{μ \in Δ (E)} \int_{E} [- C (e) + \sum_{s \in S} u (w^{'} (s)) P_{e} (s)] μ (d e) . \end{aligned}$

We assume that ⪰ is the only data that are available. In particular, following the moral hazard literature, we assume that the agent’s effort choices $e \in E$ are unobservable.

III. A Parsimonious Moral Hazard Model

In the moral hazard model, effort $e \in E$ is just an index: what the agent actually cares about is the output distribution $P_{e} \in Δ (S)$ and the cost C(e). We may therefore recast the moral hazard model as one in which the agent directly chooses the output distribution p at some cost c(p), as follows.

Lemma 1.

A relation ⪰ is a moral hazard preference if and only if there is a grounded, convex, and lower semicontinuous function $c : Δ (S) \to [0, \infty]$ and a strictly increasing and normalized function $u : Π \to R$ such that for any contracts $w, w^{'} \in W$ , $w ⪰ w^{'}$ holds if and only if

$\max_{p \in Δ (S)} [- c (p) + \sum_{s \in S} u (w (s)) p (s)] \geq \max_{p \in Δ (S)} [- c (p) + \sum_{s \in S} u (w^{'} (s)) p (s)] .$

We call any pair (c, u) that satisfies the properties in lemma 1 a parsimonious representation of the moral hazard preference ⪰ (“parsimonious” since it has just two parameters, c and u). In the remainder of the paper, we work with parsimonious moral hazard representations.

The main claim of lemma 1 is that the cost function c may be chosen to be convex. The fact that we may write “max” instead of “sup” follows from the lower semicontinuity of c. Note that although the representation allows the agent to choose any output distribution $p \in Δ (S)$ , constraints can still be captured by setting $c (p) = \infty$ for some distributions $p \in Δ (S)$ .

Proof. For the “only if” part, let ⪰ be a moral hazard preference, with effort set E, cost function C, belief map $e \mapsto P_{e}$ , and utility function u. For each output distribution $p \in Δ (S)$ , let c(p) be the least cost at which p may be produced:

$c (p) ≔ \inf_{\begin{matrix} μ \in Δ (E) : \\ \int_{E} P_{e} μ (d e) = p \end{matrix}} \int_{E} C (e) μ (d e),$

with the convention that $\inf \emptyset = \infty$ . The function c maps Δ(S) into [0, ∞], is convex and lower semicontinuous by construction,¹¹ and is grounded since C is. And clearly

$\begin{aligned} \sup_{μ \in Δ (E)} \int_{E} [- C (e) + \sum_{s \in S} u & (w (s)) P_{e} (s)] μ (d e) \\ = \sup_{p \in Δ (S)} \sup_{\begin{matrix} μ \in Δ (E) : \\ \int_{E} P_{e} μ (d e) = p \end{matrix}} [- \int_{E} C (e) μ (d e) + \sum_{s \in S} u (w (s)) p (s)] \\ = \sup_{p \in Δ (S)} [- c (p) + \sum_{s \in S} u (w (s)) p (s)] . \end{aligned}$

Finally, the supremum is attained since c is lower semicontinuous.

For the “if” part, let ⪰ admit parsimonious representation (c, u). We have

$\begin{aligned} \max_{p \in Δ (S)} [- c (p) + \sum_{s \in S} u (w (s)) p (s)] & = \sup_{μ \in Δ (Δ (S))} \int_{Δ (S)} [- c (p) + \sum_{s \in S} u (w (s)) p (s)] μ (d p) \end{aligned}$

since c is convex and $p \mapsto \sum_{s \in S} u (w (s)) p (s)$ is linear, so randomization is not strictly optimal. This is a moral hazard model, with effort $p \in Δ (S) = E$ , cost function $C = c$ , and belief map $P_{p} = p$ for every $p \in Δ (S)$ . QED

Remark 1.

This paper is concerned with the moral hazard model’s behavioral foundations rather than with its implications for optimal contracting. Still, it is worth pointing out that the parsimonious formulation of the moral hazard model is in some ways more tractable than the standard formulation described in section II.B, since the cost function c is so well behaved. Mirrlees and Zhou (2006) and Georgiadis, Ravid, and Szentes (2024) use this insight to obtain new results on implementability and optimal contracts.

IV. Empirical Content of the Moral Hazard Model

In this section, I give a characterization of the behavioral implications of the moral hazard model. I first introduce a number of axioms: testable properties of choice behavior that a given binary relation ⪰ on W may or may not satisfy. I then show that a binary relation ⪰ is a moral hazard preference if and only if it satisfies these axioms. In other words, these axioms exhaust the empirical implications of the moral hazard model.

We shall work with mixtures, defined standardly as follows. For any random remunerations $x, x^{'} \in Δ (Π)$ and any $α \in (0, 1)$ , we write $α x + (1 - α) x^{'}$ for the random remuneration under which the agent’s pay is drawn from x with probability α and drawn from x′ otherwise.¹² For any contracts $w, w^{'} \in W$ and any $α \in (0, 1)$ , the contract $α w + (1 - α) w^{'}$ is given by $[α w + (1 - α) w^{'}] (s) ≔ α w (s) + (1 - α) w^{'} (s)$ for each output level $s \in S$ .

A. Basic Axioms

Our first four axioms are standard. The first three express basic economic properties, and the fourth is a technical regularity condition.

Axiom 1 (Weak order).

⪰ is complete and transitive.

Axiom 2 (Monotonicity).

For any prizes π, $π^{'} \in Π$ , $π > π^{'}$ implies $π ≻ π^{'}$ .

Axiom 3 (Dominance).

If two contracts $w, w^{'} \in W$ satisfy $w (s) ⪰ w^{'} (s)$ for every output level $s \in S$ , then $w ⪰ w^{'}$ .

Axiom 4 (Continuity).

For any contracts w, w′, $w^{″} \in W$ , the sets { $α \in [0, 1] : α w + (1 - α) w^{'} ⪰ w^{″}$ } and { $α \in [0, 1] : w^{″} ⪰ α w + (1 - α) w^{'}$ } are closed.

It is easily verified (using lemma 1) that any moral hazard preference satisfies these four axioms.

The first of our two substantial axioms is as follows.

Quasiconvexity Axiom.

For any contracts $w, w^{'} \in W$ such that $w ⪰ w^{'} ⪰ w$ , it holds that $w ⪰ α w + (1 - α) w^{'}$ for all $α \in (0, 1)$ .

Quasiconvexity says precisely that the agent is averse to mixing contracts: if two contracts are equally good, then the contract obtained by randomly selecting one of them (using an α-biased coin toss) is weakly worse. In a parsimonious moral hazard representation, this axiom is satisfied because tailoring a single effort level $p^{″} \in Δ (Π)$ to the mixture of two contracts w and w′ is more difficult than tailoring effort separately to each of the two contracts.¹³ A version of this observation was first made by Drèze (1961).¹⁴

Mathematically, quasiconvexity is identical to Schmeidler’s (1989) definition of uncertainty seeking (the opposite of uncertainty aversion).

Example 1.

To see what quasiconvexity rules out, consider a moral hazard model in which effort is chosen (and costs are borne) by malevolent nature: let $c : Δ (S) \to [0, \infty]$ be grounded, convex, and lower semicontinuous; let $u : Π \to R$ be strictly increasing and normalized; and let ⪰ be the preference such that for any contracts $w, w^{'} \in W$ , $w ⪰ w^{'}$ holds if and only if

$\min_{p \in Δ (S)} [c (p) + \sum_{s \in S} u (w (s)) p (s)] \geq \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} u (w^{'} (s)) p (s)] .$

Then ⪰ violates quasiconvexity, except if c is trivial.¹⁵ Preferences like ⪰, called variational, will appear again in section IV.C.

B. Independence

Every moral hazard preference has an expected utility attitude to risk: random remunerations $x \in Δ (Π)$ (i.e., constant contracts) are evaluated via the expectation $u (x) = \int_{Π} u (π) x (d π)$ . By the expected utility theorem (von Neumann and Morgenstern 1947), a relation ⪰ that satisfies axioms 1–4 has this objective expected utility property if it satisfies the following axiom.

vNM Independence Axiom.

For any random remunerations $x, x^{'} \in Δ (Π)$ and any $α \in (0, 1)$ ,

$x ⪰ x^{'} implies α x + (1 - α) y ⪰ α x^{'} + (1 - α) y for any y \in Δ (Π) .$

Moral hazard preferences do not generally have an expected utility attitude to (Knightian) uncertainty. Equivalently (given axioms 1–4), moral hazard preferences do not generally satisfy the stronger Anscombe–Aumann independence axiom.¹⁶ They do, however, satisfy the following independence axiom from Maccheroni, Marinacci, and Rustichini (2006), which is stronger than vNM independence but weaker than Anscombe–Aumann independence.

MMR Independence Axiom.

For any contracts $w, w^{'} \in W$ and any $α \in (0, 1)$ ,

$\begin{aligned} α w + (1 - α) y ⪰ α w^{'} + (1 - α) y for some y \in Δ (Π) \\ implies & α w + (1 - α) y^{'} ⪰ α w^{'} + (1 - α) y^{'} for any y^{'} \in Δ (Π) . \end{aligned}$

MMR independence requires that mixing one random remuneration (y) into two contracts (w and w′) is much the same as mixing in another (y′): what matters is the probability ( $1 - α$ ) with which the output-independent remuneration is awarded instead of whatever output-dependent remuneration the contracts (w and w′) specify. The reason why the probability α matters is that it affects the slope of contracts, that is, how they vary with output $s \in S$ ; by contrast, whether remuneration is (constant and equal to) y or y′ with probability $(1 - α)$ does not affect slope. In a parsimonious moral hazard representation, MMR independence is satisfied because replacing y with y′ does not affect the optimal choice of effort $p \in Δ (S)$ .

Example 2.

To understand what MMR independence rules out, consider a moral hazard model with income effects: let $u : Π \to R$ be strictly increasing and normalized, let $V : Δ (S) \times u (Δ (Π)) \to [- \infty, \infty)$ be quasiconcave with V(p,⋅) strictly increasing for each $p \in Δ (S)$ and $\sup_{p \in Δ (S)} V (p, t) = t$ for each $t \in u (Δ (Π))$ , and let ⪰ be the preference such that for any contracts $w, w^{'} \in W$ , $w ⪰ w^{'}$ holds if and only if

$\max_{p \in Δ (S)} V (p, \sum_{s \in S} u (w (s)) p (s)) \geq \max_{p \in Δ (S)} V (p, \sum_{s \in S} u (w^{'} (s)) p (s)) .$

Then ⪰ violates MMR independence, except if V has the quasilinear form $V (p, t) = - c (p) + t$ for some (necessarily convex) $c : Δ (S) \to [0, \infty]$ .¹⁷

C. Axiomatic Behavioral Characterization

The following proposition characterizes the behavioral content of the moral hazard model.

Proposition 1.

For a binary relation ⪰ on W, the following are equivalent:

1. ⪰ satisfies axioms 1–4, MMR independence, and quasiconvexity.

2. ⪰ is a moral hazard preference.

To prove proposition 1, we begin with an observation. The inverse of a binary relation ⪰ on W is the binary relation ⊒ on W such that $w^{'} ⊒ w$ if and only if $w ⪰ w^{'}$ for all contracts $w, w^{'} \in W$ . Given a binary relation ⊒ on W, a grounded, convex, and lower semicontinuous function $c : Δ (S) \to [0, \infty]$ , and a nonconstant function $v : Π \to R$ , we say that (c, v) is a variational representation of ⊒ exactly if for any contracts $w, w^{'} \in W$ , $w^{'} ⊒ w$ holds if and only if

$\min_{p \in Δ (S)} [c (p) + \sum_{s \in S} v (w^{'} (s)) p (s)] \geq \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} v (w (s)) p (s)] .$

Observation 1.

For a binary relation ⪰ on W, a grounded, convex, and lower semicontinuous function $c : Δ (S) \to [0, \infty]$ , and a nonconstant function $u : Π \to R$ , the following are equivalent:

• (c, u) is a parsimonious moral hazard representation of ⪰.

• (c, −u) is a variational representation of the inverse of ⪰, and u is strictly increasing and normalized.

Proof of proposition 1. It is easily verified (using lemma 1) that point 2 implies point 1. For the converse, suppose that ⪰ satisfies point 1 and let ⊒ be its inverse. By inspection, ⊒ satisfies axioms 1, 3, and 4, MMR independence, nondegeneracy (there exist contracts ${w, w}^{'} \in W$ such that $w ≻ w^{'}$ ), and quasiconcavity (for any contracts $w, w^{'} \in W$ such that $w ⊒ w^{'} ⊒ w$ , it holds that $α w + (1 - α) w^{'} ⊒ w$ for all $α \in (0, 1)$ ). Thus, by theorem 3 in Maccheroni, Marinacci, and Rustichini (2006), ⊒ admits a variational representation (c, v). By axiom 2, −v is strictly increasing, so $u (\cdot) ≔ [v (π_{0}) - v (\cdot)] / [v (π_{0}) - v (π_{1})]$ is strictly increasing and normalized. Hence, by observation 1, (c, u) is a parsimonious moral hazard representation of ⪰, and thus ⪰ is a moral hazard preference by lemma 1. QED

V. Identification of the Moral Hazard Model

In this section, I show that the parsimonious moral hazard model is identified: both of its parameters, c and u, are recoverable from contract choice data.

Definition 1.

A binary relation ⪰ on W is unbounded if and only if there are random remunerations $x ≻ y$ in Δ(Π) such that for any $α \in (0, 1)$ , we may find random remunerations $z, z^{'} \in Δ (Π)$ that satisfy $y ≻ α z + (1 - α) x$ and $α z^{'} + (1 - α) y ≻ x$ .

It is easy to see that a moral hazard preference ⪰ is unbounded if and only if for any parsimonious representation (c, u) of ⪰, the function u is unbounded both above and below (equivalently, $u (Δ (Π)) = R$ ).

Proposition 2 (Identification).

Each unbounded moral hazard preference admits exactly one parsimonious representation.

Proof. By lemma 1, each moral hazard preference admits at least one parsimonious representation. By proposition 6 in Maccheroni, Marinacci, and Rustichini (2006) and observation 1, each unbounded moral hazard preference admits at most one parsimonious representation.¹⁸ QED

Proposition 2 asserts that observing the agent’s choices between pairs of contracts suffices to recover the parameters c and u. Concretely, u may be recovered in standard fashion from choice between random remunerations (i.e., constant contracts), whereupon c may be recovered as

$c (p) = \sup_{w \in W} (- u (x_{w}) + \sum_{s \in S} u (w (s)) p (s)) for each p \in Δ (S),$

where x_w denotes the unique random remuneration $x \in Δ (Π)$ that satisfies $x ⪰ w ⪰ x$ .

The standard four-parameter moral hazard model ( $E, C, e \mapsto P_{e}, u$ ) described in section II.B is only partially identified off contract choice data ⪰. In particular, it is not possible to disentangle the effort cost C from the effort-to-output map $e \mapsto P_{e}$ : the data ⪰ can reveal only the minimum cost c(p) of inducing any given output distribution $p \in Δ (S)$ . Richer data could help: for example, data on chosen effort $e \in E$ (see Karni 2006). Such data are rarely available in the field,¹⁹ but they can potentially be obtained in the lab.²⁰

VI. Confidence and Optimism

In this section, I propose behavioral definitions “more confident than” and “more optimistic than.” These definitions are couched entirely in terms of observed choice between contracts, making no reference to any objective facts about how the agent’s effort actually influences output. I show that in the parsimonious moral hazard model, increased confidence corresponds to vertical shifts (precisely, pointwise decreases) of the output distribution cost function c, while increased optimism corresponds to horizontal shifts of c in the direction of the first-order stochastically higher output distributions. I also discuss what data are required to distinguish empirically between “more confident than” and “more optimistic than.” I conclude by noting that my notions of relative confidence and optimism lead naturally to definitions of what it means for an agent to be (absolutely) overconfident or optimistic.

A. Confidence

A confident agent is one who believes she can significantly influence the distribution of output. In terms of contract choice behavior, greater confidence is thus expressed by a greater appetite for nonconstant contracts, under which pay depends on realized output. This motivates the following purely behavioral definition of relative confidence.

Definition 2.

Let ⪰ and ⪰′ be binary relations on W. ⪰ is more confident than ⪰′ if and only if whenever $w ⪰^{'} (≻^{'}) x$ for a contract $w \in W$ and a constant contract $x \in Δ (Π)$ , we also have $w ⪰ (≻) x$ .

Note that the definition of relative confidence makes no reference to any objective facts about the agent’s actual ability to influence the distribution of output. Greater confidence may therefore reflect an increased subjective confidence on the agent’s part about her ability to influence output, an actual improvement in her ability, or a combination of the two.

Mathematically, definition 2 is identical to the standard definition of “less uncertainty averse than” (Epstein 1999; Ghirardato and Marinacci 2002).

In the moral hazard model, greater confidence is equivalent to uniformly lower costs:

Proposition 3.

Let ⪰ and ⪰′ be moral hazard preferences, with parsimonious representations (c, u) and (c′, u′). Then ⪰ is more confident than ⪰′ if and only if $u = u^{'}$ and $c \leq c^{'}$ .

Equivalently, (c, u) is more confident than (c′, u′) exactly if $u = u^{'}$ and the cost level sets are nested:

${p \in Δ (S) : c (p) \leq k} \supseteq {p \in Δ (S) : c^{'} (p) \leq k} for every k \geq 0 .$

In other words, any output distribution that is (believed to be) attainable at cost ≤k under (c′, u′) is also (believed to be) attainable at cost ≤k under (c, u). The fact that $u = u^{'}$ is implied reflects a (conceptually desirable) separation of confidence from risk attitude.

Example 3.

Let output be binary, $S = {0, 1}$ , so that each distribution $p \in Δ (S)$ may be identified with a single number, $p \equiv \Pr (output = 1)$ . Let $c (p) ≔ α {(p - β)}^{2}$ for each $p \in [0, 1]$ , where $α \in R_{+}$ and $β \in [0, 1]$ are parameters. As α falls (holding β constant), c shifts vertically downward (see fig. 1). By proposition 3, the agent becomes more confident.

**Fig. 1.** As α falls, $p \mapsto α {(p - β)}^{2}$ shifts vertically downward.

Proof of proposition 3. The “if” part is trivial. For the “only if” part, let ⪰ and ⪰′ be moral hazard preferences with parsimonious representations (c, u) and (c′, u′), respectively, and let ⪰ be more confident than ⪰′. Write ⊒ and ⊒′ for the inverses of ⪰ and ⪰′, respectively. By observation 1, (c, −u) and (c′, −u′) are variational representations of ⊒ and of ⊒′, respectively. And by inspection, for any random remuneration $x \in Δ (Π)$ and any contract $w \in W$ , $x ⊒^{'} w$ implies $x ⊒ w$ . Thus, $u = u^{'}$ and $c \leq c^{'}$ by proposition 9 in Maccheroni, Marinacci, and Rustichini (2006). QED

B. Optimism

An optimistic agent is one who expects output to be high. To capture this, assume that output levels are ordered: $S = {s_{1}, s_{2}, \dots, s_{| S |}}$ , where $s_{1} < s_{2} < \dots < s_{| S |}$ .

In terms of contract choice behavior, greater optimism is manifested by a greater propensity to choose steeper contracts, which pay relatively more following high output realizations than following low output realizations.

The appropriate formalization of “steeper” requires correcting for risk attitude by using units of utility rather than of money. Toward a definition, recall that by the expected utility theorem (von Neumann and Morgenstern 1947), if a relation ⪰ satisfies axioms 1–4 and vNM independence, then there is exactly one strictly increasing and normalized function $u : Π \to R$ such that for any random remunerations $x, x^{'} \in Δ (Π)$ , $x ⪰ x^{'}$ holds if and only if $\int_{Π} u (π) x (d π) \geq \int_{Π} u (π) x^{'} (d π)$ . Given any such relation ⪰ and any two contracts w, $w^{'} \in W$ , we say that w is ⪰-steeper than w′ exactly if $s \mapsto u (w (s)) - u (w^{'} (s))$ is increasing. Equivalently, $(1 / 2) w (s) + (1 / 2) w^{'} (s^{'}) ⪰ (1 / 2) w (s^{'}) + (1 / 2) w^{'} (s)$ holds for all output levels $s \geq s^{'}$ in S. This latter formulation furnishes a general definition that is applicable for any relation ⪰:

Definition 3.

Fix a binary relation ⪰ on W and two contracts w, $w^{'} \in W$ . We say that w is ⪰-steeper than w′ if and only if $(1 / 2) w (s) + (1 / 2) w^{'} (s^{'}) ⪰ (1 / 2) w (s^{'}) + (1 / 2) w^{'} (s)$ holds for all output levels $s \geq s^{'}$ in S.

Definition 4.

Let ⪰ and ⪰′ be binary relations on W. ⪰ is more optimistic than ⪰′ if and only if whenever $w ⪰^{'} (≻^{'}) w^{'}$ for two contracts $w, w^{'} \in W$ such that w is ⪰-steeper than w′, we also have $w ⪰ (≻) w^{'}$ .

In other words, a more optimistic preference is one that is more disposed to prefer steeper contracts, where steepness is measured in utility units.

We shall characterize relative optimism in the moral hazard model in terms of upshiftedness of costs, defined as follows.

Definition 5.

Let c, $c^{'} : Δ (S) \to [0, \infty]$ be grounded, convex, and lower semicontinuous. We say that c is upshifted from c′ if and only if for any $p, p^{'} \in Δ (S)$ , there are $q, q^{'} \in Δ (S)$ such that p first-order stochastically dominates (FOSD) q′, q FOSD p′, $(1 / 2) p + (1 / 2) p^{'} = (1 / 2) q + (1 / 2) q^{'}$ , and $c (q) + c^{'} (q^{'}) \leq c (p) + c^{'} (p^{'})$ .

Despite its tricky definition, upshiftedness expresses a straightforward idea: that first-order stochastically higher output distributions are relatively cheaper under c than under c′. Indeed, c is upshifted from c′ exactly if under any contract $w \in W$ and for any strictly increasing and normalized $u : Π \to R$ , the optimal choice of effort $p \in Δ (S)$ in the parsimonious moral hazard model (c, u) is first-order stochastically higher than optimal effort $p^{'} \in Δ (S)$ in the parsimonious moral hazard model (c′, u) (see Dziewulski and Quah 2024, sec. 5.2).²¹

Example 3 (Continued).

Recall that $c (p) = α {(p - β)}^{2}$ for each $p \in [0, 1]$ and note that p FOSD p′ if and only if $p \geq p^{'}$ . As β rises (holding α constant), c shifts horizontally rightward (see fig. 2); formally, c upshifts.²²

There is an intuitive necessary (though not sufficient) condition for upshiftedness in terms of the cost level sets

$L_{k} ≔ {p \in Δ (S) : c (p) \leq k} and L_{k}^{'} ≔ {p \in Δ (S) : c^{'} (p) \leq k} .$

Observation 2.

Let $c, c^{'} : Δ (S) \to [0, \infty]$ be grounded, convex, and lower semicontinuous. If c is upshifted from c′, then for every $k \geq 0$ ,

$\begin{array}{l} for each p \in L_{k}, p FOSD p^{'} for some p^{'} \in L_{k}^{'}, and \\ for each p^{'} \in L_{k}^{'}, p FOSD p^{'} for some p \in L_{k} . \end{array}$

In other words, the set L_k of output distributions that are (believed to be) attainable at cost ≤ k under c is “FOSD higher” than the set $L_{k}^{'}$ of output distributions that are (believed to be) attainable at cost ≤k under c.²³

Proof. We prove the first half, omitting the analogous argument for the second half. Fix a $k \geq 0$ and a $p \in L_{k}$ . Since c′ is grounded and lower semicontinuous, we may choose a $p^{'} \in Δ (S)$ such that $c^{'} (p^{'}) = 0$ . By upshiftedness, there are $q, q^{'} \in Δ (S)$ such that p FOSD q′ and $c^{'} (q^{'}) \leq c (q) + c^{'} (q^{'}) \leq c (p) + c^{'} (p^{'}) \leq k$ , so $q^{'} \in L_{k}^{'}$ . QED

We now show that greater optimism is manifested in the moral hazard model by first-order stochastically higher distributions becoming relatively cheaper in the sense of upshiftedness.

Proposition 4.

Let ⪰ and ⪰′ be unbounded moral hazard preferences, with parsimonious representations (c, u) and (c′, u′). Then ⪰ is more optimistic than ⪰′ if and only if $u = u^{'}$ and c is upshifted from c′.

The proof, given in the appendix, turns on a result due to Dziewulski and Quah (2024).

C. Distinguishability of Confidence and Optimism

Relative confidence and relative optimism are logically independent: by example 3, a preference can be more confident than another without being more optimistic, and vice versa.²⁴ In other words, relative confidence is empirically distinguishable from relative optimism.

This conclusion rests in part on the richness of the data ⪰, which record the agent’s choices between all pairs $w, w^{'} \in W$ of contracts. On coarser data, distinguishability may fail. To explore this point, suppose that data are available only on choices between pairs of contracts belonging to $W^{⋆} \subseteq W$ , where W^⋆ contains the constant contracts (i.e., $W^{⋆} \supseteq Δ (Π)$ ). For binary relations ⪰ and ⪰′ on W, say that ⪰ is more confident on W^⋆ than ⪰′ if and only if whenever $w ⪰^{'} (≻^{'}) x$ for a contract $w \in W^{⋆}$ and a constant contract $x \in Δ (Π)$ , we also have $w ⪰ (≻) x$ ; and say that ⪰ is more optimistic on W^⋆ than ⪰′ if and only if whenever $w ⪰^{'} (≻^{'}) w^{'}$ for two contracts $w, w^{'} \in W^{⋆}$ such that w is ⪰-steeper than w′, we also have $w ⪰ (≻) w^{'}$ .

Whether “more confident on W^⋆ than” implies “more optimistic on W^⋆ than” or vice versa depends on $W^{⋆} \subseteq W$ . Consider, for example, the set W^↑ of all increasing contracts, meaning those $w \in W$ such that w(s) first-order stochastically dominates w(s′) whenever $s \geq s^{'}$ .

Observation 3.

For binary relations ⪰ and ⪰′ on W that satisfy axioms 1–4 and vNM independence, if ⪰ is more optimistic on W^↑ than ⪰′, then ⪰ is more confident on W^↑ than ⪰′.²⁵

Thus, data on increasing contracts alone are insufficient to distinguish relative confidence from relative optimism.

Proof. Fix a contract $w \in W^{⋆}$ and a constant contract $x \in Δ (Π)$ such that $w ⪰^{'} (≻^{'}) x$ ; we must show that if ⪰ is more optimistic on W^↑ than ⪰′, then $w ⪰ (≻) x$ . By the expected utility theorem (von Neumann and Morgenstern 1947), there is exactly one strictly increasing and normalized function $u : Π \to R$ such that for any random remunerations $x^{'}, x^{″} \in Δ (Π)$ , $x^{'} ⪰ x^{″}$ holds if and only if $\int_{Π} u (π) x^{'} (d π) \geq \int_{Π} u (π) x^{″} (d π)$ . Since u is strictly increasing and w is increasing, w is ⪰-steeper than x. Thus, if ⪰ is more optimistic on W^↑ than ⪰′, then $w ⪰ (≻) x$ . QED

D. Absolute Overconfidence and Optimism

A remaining question is what it means for an agent to be overconfident or optimistic in an absolute sense (rather than relative to some other agent). I conclude by offering one possible answer to this question, in the spirit of Epstein (1999) and Ghirardato and Marinacci (2002).

Whereas our definitions of “more confident than” and “more optimistic than” were purely behavioral, making no reference to objective facts about the agent’s ability to influence output, absolute overconfidence and optimism are concerned precisely with discrepancies between such objective facts and the agent’s subjective assessment of them. We must therefore fix an objective benchmark against which the agent’s behavior may be compared. Although it is natural to fix as our benchmark the first three parameters (E^⋆, C^⋆, $e \mapsto P_{e}^{⋆}$ ) of a standard four-parameter moral hazard model, what will actually matter is only the objective output distribution cost $c^{⋆} : Δ (S) \to [0, \infty]$ .²⁶

Definition 6.

Let $c^{⋆} : Δ (S) \to [0, \infty]$ be the objective output distribution cost, and let ⪰ be a binary relation on W that satisfies axioms 1–4 and vNM independence. Write $u : Π \to R$ for the (unique) normalized utility function of ⪰, and let ⪰^⋆ be the moral hazard preference whose parsimonious representation is (c^⋆, u). We say that ⪰ is overconfident (optimistic) if and only if it is more confident (more optimistic) than ⪰^⋆.

In other words, an overconfident (optimistic) preference is one that is more confident (more optimistic) than is objectively warranted.

Corollary 1.

Let $c^{⋆} : Δ (S) \to [0, \infty]$ be the objective output distribution cost, and let ⪰ be an unbounded moral hazard preference, with parsimonious representation (c, u). Then ⪰ is overconfident if and only if $c \leq c^{⋆}$ , and ⪰ is optimistic if and only if c is upshifted from c^⋆.

Analogous definitions and characterizations may be given for (absolute) underconfidence and pessimism.

Appendix. Proof of Proposition 4

The proof relies on an observation and a lemma. The lemma is inspired by and proved using a result due to Dziewulski and Quah (2024).

Observation 4.

If $u : Δ (Π) \to R$ is onto, then for every function $f : S \to R$ , there is a contract $w \in W$ such that $f (s) = u (w (s))$ for every $s \in S$ .

Lemma 2.

Let $c, c^{'} : Δ (S) \to [0, \infty]$ be grounded, convex, and lower semicontinuous. The following are equivalent:

a. c is upshifted from c′.

b. For any $f, f^{'} : S \to R$ such that $f - f^{'}$ is increasing,

$\max_{p \in Δ (S)} [- c^{'} (p) + \sum_{s \in S} f (s) p (s)] \geq (>) \max_{p \in Δ (S)} [- c^{'} (p) + \sum_{s \in S} f^{'} (s) p (s)]$

implies

$\max_{p \in Δ (S)} [- c (p) + \sum_{s \in S} f (s) p (s)] \geq (>) \max_{p \in Δ (S)} [- c (p) + \sum_{s \in S} f^{'} (s) p (s)] .$

A1. Proof of Proposition 4

Suppose that ⪰ is more optimistic than ⪰′. Then $u = u^{'}$ since for any constant contracts $x, x^{'} \in Δ (Π)$ , x is steeper than x′ and x′ is steeper than x. By observation 4 (applicable since ⪰ and ⪰′ are unbounded), property b holds. Hence, by lemma 2, c is upshifted from c′.

Suppose that $u = u^{'}$ and that c is upshifted from c′. Then property b holds by lemma 2. Since ⪰ and ⪰′ are unbounded, it follows by observation 4 that ⪰ is more optimistic than ⪰′. QED

A2. Proof of Lemma 2

Property b is equivalent to the following:

c. For any $f, f^{'} : S \to R$ such that $f - f^{'}$ is increasing,

$\begin{aligned} \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f (s) p (s)] \geq (>) \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f^{'} (s) p (s)] \\ \Rightarrow & \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} f (s) p (s)] \geq (>) \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} f^{'} (s) p (s)] . \end{aligned}$

We claim that property c is equivalent to the following:

d. For any $f, f^{'} : S \to R$ such that $f - f^{'}$ is increasing,

$\begin{aligned} \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} f (s) p (s)] - \min_{p \in Δ (S)} [c (p) + \sum_{s \in S} f^{'} (s) p (s)] \\ \geq & \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f (s) p (s)] - \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f^{'} (s) p (s)] . \end{aligned}$

It is clear that property d implies property c. Conversely, if property d fails for some f and f′ such that $f - f^{'}$ is increasing, then property c fails for $f - k$ and f′, where

$k ≔ \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f (s) p (s)] - \min_{p \in Δ (S)} [c^{'} (p) + \sum_{s \in S} f^{'} (s) p (s)] .$

Finally, property d holds if and only if c is upshifted from c′ by proposition 9 in Dziewulski and Quah (2024).²⁷ QED

Notes

This paper grew out of conversations with Jorge Padilla and Joe Perkins, whom I thank for their insights. I am grateful for comments from three anonymous referees, Johannes Abeler, Dağhan Carlos Akkar, Nemanja Antić, Ala Avoyan, Jean Baccelli, Aislinn Bohren, Roberto Corrao, Gregorio Curello, Eddie Dekel, Matteo Escudé, Bruno Furtado, George Georgiadis, Duarte Gonçalves, Tai-Wei Hu, Alex Jakobsen, Ian Jewitt, Jan Knoepfle, Meg Meyer, Salvatore Piccolo, Mauricio Ribeiro, Evgenii Safonov, Todd Sarver, Lorenzo Stanca, Quitzé Valenzuela-Stookey, and Mu Zhang as well as audiences at Compass Lexecon and the Second Southeast Theory Festival. This research was supported by Compass Lexecon. An earlier version of this paper bore the title “Optimism and Overconfidence.” This paper was edited by Eduardo Azevedo.

¹ For example, see the survey by DellaVigna (2009, sec. 3.1).

² For example, see Spinnewijn (2015).

³ For a random remuneration $x \in Δ (Π)$ , u(x) denotes the expected utility $\int_{Π} u (π) x (d π)$ .

⁴ For an overview, see Holmström (2017) and Georgiadis (2024).

⁵ For example, see de la Rosa (2011), Gervais, Heaton, and Odean (2011), and Spinnewijn (2015). See Kőszegi (2014, sec. 3.4) for a survey.

⁶ Examples include disappointment aversion (Gül 1991), temptation (Gül and Pesendorfer 2001; Dekel, Lipman, and Rustichini 2009), status quo bias (Masatlioglu and Ok 2005; Ortoleva 2010), framing effects (Salant and Rubinstein 2006, 2008; Ahn and Ergin 2010), heuristics (Manzini and Mariotti 2007, 2012), regret aversion (Sarver 2008), costly contemplation (Ergin and Sarver 2010), and reference dependence (Ok, Ortoleva, and Riella 2015).

⁷ Topics include the impact of overconfidence or optimism on credit markets (Manove and Padilla 1999), competitive screening (Sandroni and Squintani 2007), monopolistic screening (Eliaz and Spiegler 2008; Grubb 2009), moral hazard (see n. 5), political behavior (Ortoleva and Snowberg 2015), and learning (Heidhues, Kőszegi, and Strack 2018). See also Benoît and Dubra (2011).

⁸ For example, see Wakker (1990), Epstein and Kopylov (2007), Chateauneuf, Eichberger, and Grant (2007), Moore and Healy (2008), Dean and Ortoleva (2017), and Dillenberger, Postlewaite, and Rozen (2017).

⁹ See Azrieli and Lehrer (2008), Dillenberger et al. (2014), and Lu (2016).

¹⁰ See Van Zandt (1996), de Oliveira et al. (2017), and Ellis (2018).

¹¹ Convexity is easily verified. For lower semicontinuity, let Δ(Δ(S)) be the set of all probabilities on Δ(S), equipped with the topology of weak convergence. For each $p \in Δ (S)$ , $c (p) = \inf_{ν \in B (p)} \int_{Δ (S)} \tilde{c} (q) ν (d q)$ , where $B (p) ≔ {ν \in Δ (Δ (S)) : \int_{Δ (S)} q ν (d q) = p}$ and $\tilde{c} (q) ≔ \inf_{e \in E : P_{e} = q} C (e)$ for each $q \in Δ (S)$ , and $\inf \emptyset = \infty$ by convention. The map $\tilde{c} : Δ (S) \to [0, \infty]$ is lower semicontinuous by lemma 17.30 in Aliprantis and Border (2006), since C is lower semicontinuous and (by the continuity of $e \mapsto P_{e}$ ) the correspondence $q \mapsto {e \in E : P_{e} = q}$ is upper hemicontinuous and compact valued. Since the map $ν \mapsto \int_{Δ (S)} q ν (d q)$ is continuous (Phelps 2001, proposition 1.1), the correspondence $B : Δ (S) ⇉ Δ (Δ (S))$ is upper hemicontinuous and compact valued. Thus, c is lower semicontinuous by lemma 17.30 in Aliprantis and Border (2006). This argument is adapted from Lipnowski and Ravid (2020, app. sec. A.4.1).

¹² Explicitly, $[α x + (1 - α) x^{'}] (A) ≔ α x (A) + (1 - α) x^{'} (A)$ for each finite $A \subseteq Π$ .

¹³ Explicitly, writing $φ_{w} (p) ≔ - c (p) + \sum_{s \in S} u (w (s)) p (s)$ for any $w \in W$ and $p \in Δ (S)$ , we have $φ_{α w + (1 - α) w^{'}} (p) = α φ_{w} (p) + (1 - α) φ_{w^{'}} (p) \leq α \max_{Δ (S)} φ_{w} + (1 - α) \max_{Δ (S)} φ_{w^{'}}$ for every $p \in Δ (S)$ , so $\max_{Δ (S)} φ_{α w + (1 - α) w^{'}} \leq α \max_{Δ (S)} φ_{w} + (1 - α) \max_{Δ (S)} φ_{w^{'}}$ .

¹⁴ His postulate P2.1 is quasiconvexity. See also Drèze (1987, chap. 2) and Machina (1984).

¹⁵ Reasoning similar to that in n. 13 shows that ⪰ satisfies the reverse inequality (quasiconcavity) and that the inequality is strict for some $w, w^{'} \in W$ except if c is trivial.

¹⁶ The axiom: for any contracts $w, w^{'} \in W$ and any $α \in (0, 1)$ , $w ⪰ w^{'}$ implies $α w + (1 - α) w^{″} ⪰ α w^{'} + (1 - α) w^{″}$ for any contract $w^{″} \in W$ . Equivalence holds by the Anscombe and Aumann (1963) expected utility theorem.

¹⁷ Preferences like ⪰, except with V replaced by $(p, t) \mapsto - V (p, - t)$ and “max” replaced by “min,” were studied by Cerreia-Vioglio et al. (2011).

¹⁸ This argument, and thus proposition 2, remains valid if unboundedness is weakened to require only that u(Δ(Π)) be unbounded either above or below. The full force of unboundedness will be needed in sec. VI.B, however.

¹⁹ See Georgiadis (2024, sec. 3) for some notable exceptions.

²⁰ Two common methods are incentivized elicitation and measuring performance on a cognitive task (see, e.g., Charness, Gneezy, and Henderson 2018). A third method is measuring the total time spent on a task (e.g., Avoyan, Ribeiro, and Schotter 2025).

²¹ This statement assumes that optimal effort choices are unique. The general statement may be found in Dziewulski and Quah (2024, sec. 5.2).

²² Fix $α \in R_{+}$ and $β > β^{'}$ in [0,1]. For any $p, p^{'} \in [0, 1]$ , it is easily verified that $q ≔ \max {p, p^{'}}$ and $q^{'} ≔ \min {p, p^{'}}$ satisfy the properties in definition 5.

²³ This way of comparing sets is known as the weak set order (Topkis 1998, sec. 2.4).

²⁴ Explicitly, there exist binary relations⪰ and ⪰′ on W such that ⪰ is more confident but not more optimistic than ⪰′, and there exist relations ⪰ and ⪰′ on W such that ⪰ is more optimistic but not more confident than ⪰′; furthermore, these can be chosen to satisfy axioms 1–4, MMR independence, and quasiconvexity.

²⁵ I thank an anonymous referee for this observation.

²⁶ The relationship is $c^{⋆} (p) = \inf_{μ \in Δ (E^{⋆})} {\int_{E^{⋆}} C^{⋆} (e) μ (d e) : \int_{E^{⋆}} P_{e}^{⋆} μ (d e) = p}$ for every $p \in Δ (S)$ , where $\inf \emptyset = \infty$ by convention. This was shown in the proof of lemma 1.

²⁷ The statement of this proposition assumes that c and c′ are finite valued, but that assumption is not required for the proof. (The statement also assumes continuity, but that property is equivalent to lower semicontinuity given the other assumptions.)

Ahn, D. S., and H. Ergin. 2010. “Framing Contingencies.” Econometrica 78 (2): 655–95.
Aliprantis, C. D., and K. C. Border. 2006. Infinite Dimensional Analysis: A Hitchhiker’s Guide. 3rd ed. Berlin: Springer.
Anscombe, F. J., and R. J. Aumann. 1963. “A Definition of Subjective Probability.” Ann. Math. Statis. 34 (1): 199–205.
Avoyan, A., M. Ribeiro, and A. Schotter. 2025. “Choosing between and Allocating Time across Contracts: An Experimental Study.” Working paper.
Azrieli, Y., and E. Lehrer. 2008. “The Value of a Stochastic Information Structure.” Games and Econ. Behavior 63 (2): 679–93.
Benoît, J.-P., and J. Dubra. 2011. “Apparent Overconfidence.” Econometrica 79 (5): 1591–625.
Blackwell, D. 1951. “Comparison of Experiments.” In Berkeley Symposium on Mathematical Statistics and Probability, vol. 2, edited by J. Neyman, 93–102. Berkeley: Univ. California Press.
———. 1953. “Equivalent Comparisons of Experiments.” Ann. Math. and Statis. 24 (2): 265–72.
Cerreia-Vioglio, S., F. Maccheroni, M. Marinacci, and L. Montrucchio. 2011. “Uncertainty Averse Preferences.” J. Econ. Theory 146 (4): 1275–330.
Charness, G., U. Gneezy, and A. Henderson. 2018. “Experimental Methods: Measuring Effort in Economics Experiments.” J. Econ. Behavior and Org. 149:74–87.
Chateauneuf, A., J. Eichberger, and S. Grant. 2007. “Choice under Uncertainty with the Best and Worst in Mind: Neo-Additive Capacities.” J. Econ. Theory 137:538–67.
Dean, M., and P. Ortoleva. 2017. “Allais, Ellsberg, and Preferences for Hedging.” Theoretical Econ. 12 (1): 377–424.
Dekel, E., B. L. Lipman, and A. Rustichini. 2009. “Temptation-Driven Preferences.” Rev. Econ. Studies 76 (3): 937–71.
de la Rosa, L. E. 2011. “Overconfidence and Moral Hazard.” Games and Econ. Behavior 73 (2): 429–51.
DellaVigna, S. 2009. “Psychology and Economics: Evidence from the Field.” J. Econ. Literature 47 (2): 315–72.
de Oliveira, H., T. Denti, M. Mihm, and K. Ozbek. 2017. “Rationally Inattentive Preferences and Hidden Information Costs.” Theoretical Econ. 12 (2): 621–54.
Dillenberger, D., J. S. Lleras, P. Sadowski, and N. Takeoka. 2014. “A Theory of Subjective Learning.” J. Econ. Theory 153:287–312.
Dillenberger, D., A. Postlewaite, and K. Rozen. 2017. “Optimism and Pessimism with Expected Utility.” J. European Econ. Assoc. 15 (5): 1158–75.
Drèze, J. H. 1961. “Fondements Logiques de la Probabilité Subjective et de l’Utilité. In La Décision: Paris, 25–30 Mai 1960, Colloques Internat. CNRS, 73–87. Paris: CNRS.
———. 1987. Essays on Economic Decisions under Uncertainty. Cambridge: Cambridge Univ. Press.
Dziewulski, P., and J. K.-H. Quah. 2024. “Comparative Statics with Linear Objectives: Normality, Complementarity, and Ranking Multi-Prior Beliefs.” Econometrica 92 (1): 167–200.
Eliaz, K., and R. Spiegler. 2008. “Consumer Optimism and Price Discrimination.” Theoretical Econ. 3 (4): 459–97.
Ellis, A. 2018. “Foundations for Optimal Inattention.” J. Econ. Theory 173:56–94.
Epstein, L. G. 1999. “A Definition of Uncertainty Aversion.” Rev. Econ. Studies 6 (3): 579–608.
Epstein, L. G., and I. Kopylov. 2007. “Cold Feet.” Theoretical Econ. 2 (3): 231–59.
Ergin, H., and T. Sarver. 2010. “A Unique Costly Contemplation Representation.” Econometrica 78 (4): 1285–339.
Georgiadis, G. 2024. “Contracting with Moral Hazard: A Review of Theory & Empirics.” In Elgar Encyclopedia on the Economics of Competition, Regulation and Antitrust, edited by Michael D. Noel, 24–37. Cheltenham: Elgar.
Georgiadis, G., D. Ravid, and B. Szentes. 2024. “Flexible Moral Hazard Problems.” Econometrica 92 (2): 387–409.
Gervais, S., J. B. Heaton, and T. Odean. 2011. “Overconfidence, Compensation Contracts, and Capital Budgeting.” J. Finance 66 (5): 1735–77.
Ghirardato, P., and M. Marinacci. 2002. “Ambiguity Made Precise: A Comparative Foundation.” J. Econ. Theory 102 (2): 251–89.
Grubb, M. D. 2009. “Selling to Overconfident Consumers.” A.E.R. 99 (5): 1770–807.
Gül, F. 1991. “A Theory of Disappointment Aversion.” Econometrica 59 (3): 667–86.
Gül, F., and W. Pesendorfer. 2001. “Temptation and Self-Control.” Econometrica 69 (6): 1403–35.
Heidhues, P., B. Kőszegi, and P. Strack. 2018. “Unrealistic Expectations and Misguided Learning.” Econometrica 86 (4): 1159–214.
Holmström, B. 1979. “Moral Hazard and Observability.” Bell J. Econ. 10 (1): 74–91.
———. 2017. “Pay for Performance and Beyond.” A.E.R. 107 (7): 1753–77.
Jakobsen, A. M. 2021. “An Axiomatic Model of Persuasion.” Econometrica 89 (5): 2081–116.
———. 2025. “Temptation and Persuasion.” Working paper.
Kamenica, E., and M. Gentzkow. 2011. “Bayesian Persuasion.” A.E.R. 101 (6): 2590–615.
Karni, E. 2006. “Subjective Expected Utility Theory without States of the World.” J. Math. Econ. 42 (3): 325–42.
Kőszegi, B. 2014. “Behavioral Contract Theory.” J. Econ. Literature 52 (4): 1075–118.
Lipnowski, E., and D. Ravid. 2020. “Cheap Talk with Transparent Motives.” Econometrica 88 (4): 1631–60.
Lu, J. 2016. “Random Choice and Private Information.” Econometrica 84 (6): 1983–2027.
Maccheroni, F., M. Marinacci, and A. Rustichini. 2006. “Ambiguity Aversion, Robustness, and the Variational Representation of Preferences.” Econometrica 74 (6): 1447–98.
Machina, M. J. 1984. “Temporal Risk and the Nature of Induced Preferences.” J. Econ. Theory 33 (2): 199–231.
Manove, M., and A. J. Padilla. 1999. “Banking (Conservatively) with Optimists.” RAND J. Econ. 30 (2): 324–50.
Manzini, P., and M. Mariotti. 2007. “Sequentially Rationalizable Choice.” A.E.R. 97 (5): 1824–39.
———. 2012. “Categorize Then Choose: Boundedly Rational Choice and Welfare.” J. European Econ. Assoc. 10 (5): 1141–65.
Masatlioglu, Y., and E. A. Ok. 2005. “Rational Choice with Status Quo Bias.” J. Econ. Theory 121 (1): 1–29.
Mirrlees, J. A. (1975) 1999. “The Theory of Moral Hazard and Unobservable Behaviour: Part I.” Rev. Econ. Studies 66 (1): 3–21. Completed October 1975, published 1999.
Mirrlees, J. A., and Y. Zhou. 2006. “Principal-Agent Problems Revisited, Part I: Static Models.” Working paper.
Moore, D. A., and P. J. Healy. 2008. “The Trouble with Overconfidence.” Psychological Rev. 115 (2): 502–17.
Ok, E. A., P. Ortoleva, and G. Riella. 2015. “Revealed (P)Reference Theory.” A.E.R. 105 (1): 299–321.
Ortoleva, P. 2010. “Status Quo Bias, Multiple Priors and Uncertainty Aversion.” Games and Econ. Behavior 69 (2): 411–24.
Ortoleva, P., and E. Snowberg. 2015. “Overconfidence in Political Behavior.” A.E.R. 105 (2): 504–35.
Phelps, R. R. 2001. Lectures on Choquet’s Theorem. 2nd ed. Berlin: Springer.
Salant, Y., and A. Rubinstein. 2006. “A Model of Choice from Lists.” Theoretical Econ. 1 (1): 3–17.
———. 2008. “(A, f): Choice with Frames.” Rev. Econ. Studies 75 (4): 1287–96.
Sandroni, A., and F. Squintani. 2007. “Overconfidence, Insurance, and Paternalism.” A.E.R. 97 (5): 1994–2004.
Sarver, T. 2008. “Anticipating Regret: Why Fewer Options May Be Better.” Econometrica 76 (2): 263–305.
Schmeidler, D. 1989. “Subjective Probability and Expected Utility without Additivity.” Econometrica 57 (3): 571–87.
Spinnewijn, J. 2015. “Unemployed but Optimistic: Optimal Insurance Design with Biased Beliefs.” J. European Econ. Assoc. 13 (1): 130–67.
Topkis, D. M. 1998. Supermodularity and Complementarity. Princeton, NJ: Princeton Univ. Press.
Van Zandt, T. 1996. “Hidden Information Acquisition and Static Choice.” Theory and Decision 40 (3): 235–47.
von Neumann, J., and O. Morgenstern. 1947. Theory of Games and Economic Behavior. 2nd ed. Princeton, NJ: Princeton Univ. Press.
Wakker, P. P. 1990. “Characterizing Optimism and Pessimism Directly through Comonotonicity.” J. Econ. Theory 52 (2): 453–63.

Source link

What's Hot

Optimism, Overconfidence, and Moral Hazard

Silver Price Forecast: Bears retain near-term control as RSI slips below 50

USDT Dominance Rejection Could Fuel The Next Bitcoin Rally

Optimism, Overconfidence, and Moral Hazard

Biomass Demand, Global Conflict and the New Economics of Wood

To 2026 Graduates Facing a Bleak Job Market, 1991 Grads Have Some Advice

Sides continue to feud over massive data center proposed for Genesee County – WGRZ

The bond market is flashing a warning sign for the global economy

AI’s Macroeconomic Challenges and Promises

S&P Upgrades 7 Banks’ Ratings On Improved Macroeconomic Outlook

Optimism, Overconfidence, and Moral Hazard

Silver Price Forecast: Bears retain near-term control as RSI slips below 50

USDT Dominance Rejection Could Fuel The Next Bitcoin Rally

8 AI day trading bots for stocks in 2026: trade faster with intraday automation

The state of the housing market? It ‘sucks,’ Rocket CEO says

Long Path Partners Completes Take-Private Acquisition of Idox plc – PA Media

Teaching Economics – Econlib

Monthly Featured

Where to watch Bayern Munich vs. PSG live stream, TV channel, start time for Champions League semifinal second leg

Best aggressive hybrid mutual funds to invest in May 2026

The Chinese Bank Targeting Infrastructure Projects Across Central Asia

Latest Posts

Optimism, Overconfidence, and Moral Hazard

Silver Price Forecast: Bears retain near-term control as RSI slips below 50

USDT Dominance Rejection Could Fuel The Next Bitcoin Rally

SUBSCRIBE TO OUR NEWSLETTER

What's Hot

Optimism, Overconfidence, and Moral Hazard

I. Introduction

A. Related Literature

B. Road Map

II. Setting

A. Definitions and Conventions

B. The Standard Moral Hazard Model

C. Data: Preferences over Contracts

III. A Parsimonious Moral Hazard Model

IV. Empirical Content of the Moral Hazard Model

A. Basic Axioms

B. Independence

C. Axiomatic Behavioral Characterization

V. Identification of the Moral Hazard Model

VI. Confidence and Optimism

A. Confidence

B. Optimism

C. Distinguishability of Confidence and Optimism

D. Absolute Overconfidence and Optimism

Appendix. Proof of Proposition 4

A1. Proof of Proposition 4

A2. Proof of Lemma 2

Notes

Related Posts