I. Introduction
A vast literature in psychology and economics documents the prevalence of wrong beliefs.1 In this paper, I seek to define, distinguish, and characterize two oft-conflated senses in which beliefs can be wrong: overconfidence and optimism. An agent is overconfident if she overestimates her ability to influence (the distribution of) a payoff-relevant outcome, which I shall call “output.” By contrast, she is optimistic if her expectation of the distribution of output is unrealistically high, in the sense of first-order stochastic dominance. In simple parametric models, these two concepts typically admit natural definitions in terms of the parameters.2 In this paper, I provide general model-independent definitions of overconfidence and optimism in terms of choice behavior.
I explicate the content of my definitions by characterizing their meaning in the standard moral hazard model. This model is the natural benchmark because it represents the canonical formalism in economics for studying how an agent may influence the distribution of an observable outcome (output). The question of how overconfident and optimistic behavior manifest in the moral hazard model raises a more basic question: how does this model relate to choice behavior? I answer this question by characterizing what restrictions the moral hazard model imposes on choice data and delineating the extent to which its parameters may be recovered from such data.
The environment consists of a finite set S of possible output levels and a convex set of possible monetary rewards. A contract specifies the agent’s (possibly random) remuneration as a function of output. In the moral hazard model, the agent influences the distribution of output by choosing effort at a cost . She chooses effort optimally, randomizing if desired (by selecting an effort distribution ). The agent’s valuation of a contract is therefore
where the utility function describes her risk attitude.3
I first ask what the moral hazard model’s empirical content is: what testable restrictions does this model impose on contract choice data? In other words, which preference relations ⪰ on the set of all contracts admit a moral hazard representation? I give the answer in terms of six axioms, which together exhaust the testable implications of the moral hazard model.
I then show that the moral hazard model’s parameters are to a significant extent identified off contract choice data. In particular, the agent’s utility function u is identified (up to positive affine transformations), and so is her minimum cost of producing any given output distribution : that is, the quantity
I next define relative confidence, motivated by the idea that a confident agent is one who believes that she can significantly influence output. Specifically, I call one preference ⪰ more confident than another preference ⪰′ if whenever ⪰′ chooses a contract w over a constant contract x (one under which pay does not vary with output), ⪰ also chooses w over x. I show that for moral hazard preferences, confidence shifts are equivalent to vertical shifts of the output distribution cost c defined in (⋆): greater confidence means precisely a pointwise lower c function (and an unchanged utility function u).
Finally, I define relative optimism, building on the idea that an optimistic agent is one who believes that output is likely to be high. In the simplest case, for two preferences ⪰ and ⪰′ that both have a risk-neutral expected utility attitude to risk, I call ⪰ more optimistic than ⪰′ if and only if whenever ⪰′ chooses a contract w over another contract w′ that is less steep, in the sense that is increasing, ⪰ also chooses w over w′. The general definition is similar but adjusts for risk attitude. I show that in the moral hazard model, optimism shifts correspond to horizontal shifts of the output distribution cost c in (⋆): greater optimism means a shift of the c function toward the first-order stochastically higher distributions (and no shift of the utility function u).
My definitions of relative confidence and optimism are purely behavioral, making no reference to objective facts about the agent’s actual ability to influence output. If an objective description of the agent’s influence is given, then I may additionally define what it means for a preference to be (absolutely) overconfident or optimistic: namely, “more confident than objectively warranted” and “more optimistic than objectively warranted.”
The formal results in this paper are rooted in a simple but powerful insight: that the moral hazard model is closely related to the variational model of choice under uncertainty, which was introduced and axiomatized by Maccheroni, Marinacci, and Rustichini (2006). This close relationship allows me to leverage known results about the variational model to obtain my four main results about the moral hazard model (axiomatization, identification, characterization of “more confident than,” and characterization of “more optimistic than”). Specifically, I apply results from Maccheroni, Marinacci, and Rustichini (2006) and Dziewulski and Quah (2024).
A. Related Literature
The moral hazard model is the focus of a large literature, beginning with Mirrlees ([1975] 1999) and Holmström (1979).4 I contribute to the behavioral strand of this literature, which examines, among other things, how greater confidence or optimism on the part of the agent shapes optimal contracting.5 This literature takes the moral hazard model as given and studies its implications for optimal contracts; in this paper, I provide behavioral foundations for the model itself as well as for overconfidence and optimism.
My characterization of the testable implications of the moral hazard model is designed to respect the model’s two defining features: that effort is costly and that it is unobservable. This distinguishes my analysis from previous work on the behavioral content of the moral hazard model, which has either confined attention to the special case of costless effort (Drèze 1961, 1987) or else assumed that effort is observable (Karni 2006).
I also contribute to the theoretical literature on behavioral economics, which studies models of individual decision making that can accommodate various departures from standard economic behavior that have been documented in the lab (or sometimes in the field). One strand of this literature examines behavioral models from the axiomatic perspective of decision theory;6 another strand is concerned with overconfidence and optimism.7 This paper marries these two strands. Relative to other work in that general spirit,8 I give new and simple definitions that distinguish overconfidence from optimism and cash out their meaning in the moral hazard model.
Finally, at a high level, this paper provides axiomatic behavioral foundations for a standard economic model (namely, the moral hazard model). Other work in this spirit includes axiomatizations of Blackwell’s (1951, 1953) model of choice informed by a signal,9 of the canonical model of costly information acquisition,10 and of Kamenica and Gentzkow’s (2011) Bayesian persuasion model (Jakobsen 2021, 2025).
B. Road Map
The rest of this paper is arranged as follows. In section II, I describe the environment and the moral hazard model. I introduce a canonical parsimonious moral hazard model in section III. In section IV, I characterize the moral hazard model in terms of six empirically testable axioms. I establish the model’s identification properties in section V. Finally, in section VI, I propose behavioral definitions of “more confident than” and “more optimistic than” and characterize their meaning in the moral hazard model.
II. Setting
There is one agent and a nonempty convex set of possible levels of monetary remuneration (prizes). The agent’s pay can be made contingent on the realization of a contractible signal. We follow convention by calling this signal “output.” The set S of possible output realizations is assumed to be nonempty and, for simplicity, finite.
Let Δ(Π) be the set of all finite-support probability distributions over monetary prizes Π; we call each a random remuneration. A contract is a map . The interpretation is that if realized output is , then the agent is paid a random amount drawn from the distribution . We write W for the set of all contracts.
A. Definitions and Conventions
A contract is constant if and only if for all output levels . As is standard, we identify each constant contract (an element of W) with the random remuneration at which it is constant (an element of Δ(Π)). Similarly, we identify each degenerate random remuneration (an element of Δ(Π)) with the prize at which it is degenerate (an element of Π).
Any (utility) function defined on monetary prizes Π extends naturally to a linear (expected utility) function defined on random remunerations via
Throughout the paper, we hold fixed an arbitrary pair of reference prizes in Π, and we call a (utility) function such that normalized if and only if . This is merely an (arbitrary) choice of units in which to measure utility.
Let Δ(S) denote the set of all output distributions, that is, probability distributions on S. For any nonempty set A, call a function grounded if and only if . In the sequel, subsets of Euclidean spaces (such as the simplex Δ(S)) are always equipped with the Borel σ-algebra, and “increasing” always means “weakly increasing.”
B. The Standard Moral Hazard Model
In the moral hazard model (Mirrlees [1975] 1999; Holmström 1979), the agent chooses her effort from a nonempty, compact, and convex subset E of a Euclidean space, for example, for some . Each effort level incurs a cost and produces a distribution of output, where the cost function is grounded and lower semicontinuous and the map is continuous. The distribution Pe need not be objective; all that matters is that the agent believes that output will realize according to the probability distribution Pe if she supplies effort e.
The agent’s payoff under a contract given effort is
where is a strictly increasing and normalized utility function whose curvature captures the agent’s risk attitude. The agent chooses effort optimally, randomizing if desired. Her payoff under a contract is therefore
where Δ(E) denotes the set of all probability measures on E.
C. Data: Preferences over Contracts
The agent’s preference over contracts, ⪰, is a binary relation on W. As usual, we write if and only if . We interpret the preference ⪰ as (potential) data on the agent’s choice behavior: in particular, her choices when presented with any given pair w, of contracts. In other words, ⪰ is an empirical object.
A binary relation ⪰ on W is a moral hazard preference exactly if it may be represented by a moral hazard model: that is, if and only if there is a nonempty, compact, and convex (effort) set (where ); a grounded and lower semicontinuous (cost) function ; a continuous (belief) map carrying E into Δ(S); and a strictly increasing and normalized (utility) function , such that for any contracts , holds if and only if
We assume that ⪰ is the only data that are available. In particular, following the moral hazard literature, we assume that the agent’s effort choices are unobservable.
III. A Parsimonious Moral Hazard Model
In the moral hazard model, effort is just an index: what the agent actually cares about is the output distribution and the cost C(e). We may therefore recast the moral hazard model as one in which the agent directly chooses the output distribution p at some cost c(p), as follows.
Lemma 1.
A relation ⪰ is a moral hazard preference if and only if there is a grounded, convex, and lower semicontinuous function and a strictly increasing and normalized function such that for any contracts , holds if and only if
We call any pair (c, u) that satisfies the properties in lemma 1 a parsimonious representation of the moral hazard preference ⪰ (“parsimonious” since it has just two parameters, c and u). In the remainder of the paper, we work with parsimonious moral hazard representations.
The main claim of lemma 1 is that the cost function c may be chosen to be convex. The fact that we may write “max” instead of “sup” follows from the lower semicontinuity of c. Note that although the representation allows the agent to choose any output distribution , constraints can still be captured by setting for some distributions .
Proof. For the “only if” part, let ⪰ be a moral hazard preference, with effort set E, cost function C, belief map , and utility function u. For each output distribution , let c(p) be the least cost at which p may be produced:
with the convention that . The function c maps Δ(S) into [0, ∞], is convex and lower semicontinuous by construction,11 and is grounded since C is. And clearly
Finally, the supremum is attained since c is lower semicontinuous.
For the “if” part, let ⪰ admit parsimonious representation (c, u). We have
since c is convex and is linear, so randomization is not strictly optimal. This is a moral hazard model, with effort , cost function , and belief map for every . QED
Remark 1.
This paper is concerned with the moral hazard model’s behavioral foundations rather than with its implications for optimal contracting. Still, it is worth pointing out that the parsimonious formulation of the moral hazard model is in some ways more tractable than the standard formulation described in section II.B, since the cost function c is so well behaved. Mirrlees and Zhou (2006) and Georgiadis, Ravid, and Szentes (2024) use this insight to obtain new results on implementability and optimal contracts.
IV. Empirical Content of the Moral Hazard Model
In this section, I give a characterization of the behavioral implications of the moral hazard model. I first introduce a number of axioms: testable properties of choice behavior that a given binary relation ⪰ on W may or may not satisfy. I then show that a binary relation ⪰ is a moral hazard preference if and only if it satisfies these axioms. In other words, these axioms exhaust the empirical implications of the moral hazard model.
We shall work with mixtures, defined standardly as follows. For any random remunerations and any , we write for the random remuneration under which the agent’s pay is drawn from x with probability α and drawn from x′ otherwise.12 For any contracts and any , the contract is given by for each output level .
A. Basic Axioms
Our first four axioms are standard. The first three express basic economic properties, and the fourth is a technical regularity condition.
Axiom 1 (Weak order).
⪰ is complete and transitive.
Axiom 2 (Monotonicity).
For any prizes π, , implies .
Axiom 3 (Dominance).
If two contracts satisfy for every output level , then .
Axiom 4 (Continuity).
For any contracts w, w′, , the sets {} and {} are closed.
It is easily verified (using lemma 1) that any moral hazard preference satisfies these four axioms.
The first of our two substantial axioms is as follows.
Quasiconvexity Axiom.
For any contracts such that , it holds that for all .
Quasiconvexity says precisely that the agent is averse to mixing contracts: if two contracts are equally good, then the contract obtained by randomly selecting one of them (using an α-biased coin toss) is weakly worse. In a parsimonious moral hazard representation, this axiom is satisfied because tailoring a single effort level to the mixture of two contracts w and w′ is more difficult than tailoring effort separately to each of the two contracts.13 A version of this observation was first made by Drèze (1961).14
Mathematically, quasiconvexity is identical to Schmeidler’s (1989) definition of uncertainty seeking (the opposite of uncertainty aversion).
Example 1.
To see what quasiconvexity rules out, consider a moral hazard model in which effort is chosen (and costs are borne) by malevolent nature: let be grounded, convex, and lower semicontinuous; let be strictly increasing and normalized; and let ⪰ be the preference such that for any contracts , holds if and only if
Then ⪰ violates quasiconvexity, except if c is trivial.15 Preferences like ⪰, called variational, will appear again in section IV.C.
B. Independence
Every moral hazard preference has an expected utility attitude to risk: random remunerations (i.e., constant contracts) are evaluated via the expectation . By the expected utility theorem (von Neumann and Morgenstern 1947), a relation ⪰ that satisfies axioms 1–4 has this objective expected utility property if it satisfies the following axiom.
vNM Independence Axiom.
For any random remunerations and any ,
Moral hazard preferences do not generally have an expected utility attitude to (Knightian) uncertainty. Equivalently (given axioms 1–4), moral hazard preferences do not generally satisfy the stronger Anscombe–Aumann independence axiom.16 They do, however, satisfy the following independence axiom from Maccheroni, Marinacci, and Rustichini (2006), which is stronger than vNM independence but weaker than Anscombe–Aumann independence.
MMR Independence Axiom.
For any contracts and any ,
MMR independence requires that mixing one random remuneration (y) into two contracts (w and w′) is much the same as mixing in another (y′): what matters is the probability () with which the output-independent remuneration is awarded instead of whatever output-dependent remuneration the contracts (w and w′) specify. The reason why the probability α matters is that it affects the slope of contracts, that is, how they vary with output ; by contrast, whether remuneration is (constant and equal to) y or y′ with probability does not affect slope. In a parsimonious moral hazard representation, MMR independence is satisfied because replacing y with y′ does not affect the optimal choice of effort .
Example 2.
To understand what MMR independence rules out, consider a moral hazard model with income effects: let be strictly increasing and normalized, let be quasiconcave with V(p,⋅) strictly increasing for each and for each , and let ⪰ be the preference such that for any contracts , holds if and only if
Then ⪰ violates MMR independence, except if V has the quasilinear form for some (necessarily convex) .17
C. Axiomatic Behavioral Characterization
The following proposition characterizes the behavioral content of the moral hazard model.
Proposition 1.
For a binary relation ⪰ on W, the following are equivalent:
1. ⪰ satisfies axioms 1–4, MMR independence, and quasiconvexity.
2. ⪰ is a moral hazard preference.
To prove proposition 1, we begin with an observation. The inverse of a binary relation ⪰ on W is the binary relation ⊒ on W such that if and only if for all contracts . Given a binary relation ⊒ on W, a grounded, convex, and lower semicontinuous function , and a nonconstant function , we say that (c, v) is a variational representation of ⊒ exactly if for any contracts , holds if and only if
Observation 1.
For a binary relation ⪰ on W, a grounded, convex, and lower semicontinuous function , and a nonconstant function , the following are equivalent:
• (c, u) is a parsimonious moral hazard representation of ⪰.
• (c, −u) is a variational representation of the inverse of ⪰, and u is strictly increasing and normalized.
Proof of proposition 1. It is easily verified (using lemma 1) that point 2 implies point 1. For the converse, suppose that ⪰ satisfies point 1 and let ⊒ be its inverse. By inspection, ⊒ satisfies axioms 1, 3, and 4, MMR independence, nondegeneracy (there exist contracts such that ), and quasiconcavity (for any contracts such that , it holds that for all ). Thus, by theorem 3 in Maccheroni, Marinacci, and Rustichini (2006), ⊒ admits a variational representation (c, v). By axiom 2, −v is strictly increasing, so is strictly increasing and normalized. Hence, by observation 1, (c, u) is a parsimonious moral hazard representation of ⪰, and thus ⪰ is a moral hazard preference by lemma 1. QED
V. Identification of the Moral Hazard Model
In this section, I show that the parsimonious moral hazard model is identified: both of its parameters, c and u, are recoverable from contract choice data.
Definition 1.
A binary relation ⪰ on W is unbounded if and only if there are random remunerations in Δ(Π) such that for any , we may find random remunerations that satisfy and .
It is easy to see that a moral hazard preference ⪰ is unbounded if and only if for any parsimonious representation (c, u) of ⪰, the function u is unbounded both above and below (equivalently, ).
Proposition 2 (Identification).
Each unbounded moral hazard preference admits exactly one parsimonious representation.
Proof. By lemma 1, each moral hazard preference admits at least one parsimonious representation. By proposition 6 in Maccheroni, Marinacci, and Rustichini (2006) and observation 1, each unbounded moral hazard preference admits at most one parsimonious representation.18 QED
Proposition 2 asserts that observing the agent’s choices between pairs of contracts suffices to recover the parameters c and u. Concretely, u may be recovered in standard fashion from choice between random remunerations (i.e., constant contracts), whereupon c may be recovered as
where xw denotes the unique random remuneration that satisfies .
The standard four-parameter moral hazard model () described in section II.B is only partially identified off contract choice data ⪰. In particular, it is not possible to disentangle the effort cost C from the effort-to-output map : the data ⪰ can reveal only the minimum cost c(p) of inducing any given output distribution . Richer data could help: for example, data on chosen effort (see Karni 2006). Such data are rarely available in the field,19 but they can potentially be obtained in the lab.20
VI. Confidence and Optimism
In this section, I propose behavioral definitions “more confident than” and “more optimistic than.” These definitions are couched entirely in terms of observed choice between contracts, making no reference to any objective facts about how the agent’s effort actually influences output. I show that in the parsimonious moral hazard model, increased confidence corresponds to vertical shifts (precisely, pointwise decreases) of the output distribution cost function c, while increased optimism corresponds to horizontal shifts of c in the direction of the first-order stochastically higher output distributions. I also discuss what data are required to distinguish empirically between “more confident than” and “more optimistic than.” I conclude by noting that my notions of relative confidence and optimism lead naturally to definitions of what it means for an agent to be (absolutely) overconfident or optimistic.
A. Confidence
A confident agent is one who believes she can significantly influence the distribution of output. In terms of contract choice behavior, greater confidence is thus expressed by a greater appetite for nonconstant contracts, under which pay depends on realized output. This motivates the following purely behavioral definition of relative confidence.
Definition 2.
Let ⪰ and ⪰′ be binary relations on W. ⪰ is more confident than ⪰′ if and only if whenever for a contract and a constant contract , we also have .
Note that the definition of relative confidence makes no reference to any objective facts about the agent’s actual ability to influence the distribution of output. Greater confidence may therefore reflect an increased subjective confidence on the agent’s part about her ability to influence output, an actual improvement in her ability, or a combination of the two.
Mathematically, definition 2 is identical to the standard definition of “less uncertainty averse than” (Epstein 1999; Ghirardato and Marinacci 2002).
In the moral hazard model, greater confidence is equivalent to uniformly lower costs:
Proposition 3.
Let ⪰ and ⪰′ be moral hazard preferences, with parsimonious representations (c, u) and (c′, u′). Then ⪰ is more confident than ⪰′ if and only if and .
Equivalently, (c, u) is more confident than (c′, u′) exactly if and the cost level sets are nested:
In other words, any output distribution that is (believed to be) attainable at cost ≤k under (c′, u′) is also (believed to be) attainable at cost ≤k under (c, u). The fact that is implied reflects a (conceptually desirable) separation of confidence from risk attitude.
Example 3.
Let output be binary, , so that each distribution may be identified with a single number, . Let for each , where and are parameters. As α falls (holding β constant), c shifts vertically downward (see fig. 1). By proposition 3, the agent becomes more confident.

Fig. 1. As α falls, shifts vertically downward.
Proof of proposition 3. The “if” part is trivial. For the “only if” part, let ⪰ and ⪰′ be moral hazard preferences with parsimonious representations (c, u) and (c′, u′), respectively, and let ⪰ be more confident than ⪰′. Write ⊒ and ⊒′ for the inverses of ⪰ and ⪰′, respectively. By observation 1, (c, −u) and (c′, −u′) are variational representations of ⊒ and of ⊒′, respectively. And by inspection, for any random remuneration and any contract , implies . Thus, and by proposition 9 in Maccheroni, Marinacci, and Rustichini (2006). QED
B. Optimism
An optimistic agent is one who expects output to be high. To capture this, assume that output levels are ordered: , where .
In terms of contract choice behavior, greater optimism is manifested by a greater propensity to choose steeper contracts, which pay relatively more following high output realizations than following low output realizations.
The appropriate formalization of “steeper” requires correcting for risk attitude by using units of utility rather than of money. Toward a definition, recall that by the expected utility theorem (von Neumann and Morgenstern 1947), if a relation ⪰ satisfies axioms 1–4 and vNM independence, then there is exactly one strictly increasing and normalized function such that for any random remunerations , holds if and only if . Given any such relation ⪰ and any two contracts w, , we say that w is ⪰-steeper than w′ exactly if is increasing. Equivalently, holds for all output levels in S. This latter formulation furnishes a general definition that is applicable for any relation ⪰:
Definition 3.
Fix a binary relation ⪰ on W and two contracts w, . We say that w is ⪰-steeper than w′ if and only if holds for all output levels in S.
Definition 4.
Let ⪰ and ⪰′ be binary relations on W. ⪰ is more optimistic than ⪰′ if and only if whenever for two contracts such that w is ⪰-steeper than w′, we also have .
In other words, a more optimistic preference is one that is more disposed to prefer steeper contracts, where steepness is measured in utility units.
We shall characterize relative optimism in the moral hazard model in terms of upshiftedness of costs, defined as follows.
Definition 5.
Let c, be grounded, convex, and lower semicontinuous. We say that c is upshifted from c′ if and only if for any , there are such that p first-order stochastically dominates (FOSD) q′, q FOSD p′, , and .
Despite its tricky definition, upshiftedness expresses a straightforward idea: that first-order stochastically higher output distributions are relatively cheaper under c than under c′. Indeed, c is upshifted from c′ exactly if under any contract and for any strictly increasing and normalized , the optimal choice of effort in the parsimonious moral hazard model (c, u) is first-order stochastically higher than optimal effort in the parsimonious moral hazard model (c′, u) (see Dziewulski and Quah 2024, sec. 5.2).21
Example 3 (Continued).
Recall that for each and note that p FOSD p′ if and only if . As β rises (holding α constant), c shifts horizontally rightward (see fig. 2); formally, c upshifts.22

Fig. 2. As β rises, shifts horizontally rightward.
There is an intuitive necessary (though not sufficient) condition for upshiftedness in terms of the cost level sets
Observation 2.
Let be grounded, convex, and lower semicontinuous. If c is upshifted from c′, then for every ,
In other words, the set Lk of output distributions that are (believed to be) attainable at cost ≤ k under c is “FOSD higher” than the set of output distributions that are (believed to be) attainable at cost ≤k under c.23
Proof. We prove the first half, omitting the analogous argument for the second half. Fix a and a . Since c′ is grounded and lower semicontinuous, we may choose a such that . By upshiftedness, there are such that p FOSD q′ and , so . QED
We now show that greater optimism is manifested in the moral hazard model by first-order stochastically higher distributions becoming relatively cheaper in the sense of upshiftedness.
Proposition 4.
Let ⪰ and ⪰′ be unbounded moral hazard preferences, with parsimonious representations (c, u) and (c′, u′). Then ⪰ is more optimistic than ⪰′ if and only if and c is upshifted from c′.
The proof, given in the appendix, turns on a result due to Dziewulski and Quah (2024).
C. Distinguishability of Confidence and Optimism
Relative confidence and relative optimism are logically independent: by example 3, a preference can be more confident than another without being more optimistic, and vice versa.24 In other words, relative confidence is empirically distinguishable from relative optimism.
This conclusion rests in part on the richness of the data ⪰, which record the agent’s choices between all pairs of contracts. On coarser data, distinguishability may fail. To explore this point, suppose that data are available only on choices between pairs of contracts belonging to , where W⋆ contains the constant contracts (i.e., ). For binary relations ⪰ and ⪰′ on W, say that ⪰ is more confident on W⋆ than ⪰′ if and only if whenever for a contract and a constant contract , we also have ; and say that ⪰ is more optimistic on W⋆ than ⪰′ if and only if whenever for two contracts such that w is ⪰-steeper than w′, we also have .
Whether “more confident on W⋆ than” implies “more optimistic on W⋆ than” or vice versa depends on . Consider, for example, the set W↑ of all increasing contracts, meaning those such that w(s) first-order stochastically dominates w(s′) whenever .
Observation 3.
For binary relations ⪰ and ⪰′ on W that satisfy axioms 1–4 and vNM independence, if ⪰ is more optimistic on W↑ than ⪰′, then ⪰ is more confident on W↑ than ⪰′.25
Thus, data on increasing contracts alone are insufficient to distinguish relative confidence from relative optimism.
Proof. Fix a contract and a constant contract such that ; we must show that if ⪰ is more optimistic on W↑ than ⪰′, then . By the expected utility theorem (von Neumann and Morgenstern 1947), there is exactly one strictly increasing and normalized function such that for any random remunerations , holds if and only if . Since u is strictly increasing and w is increasing, w is ⪰-steeper than x. Thus, if ⪰ is more optimistic on W↑ than ⪰′, then . QED
D. Absolute Overconfidence and Optimism
A remaining question is what it means for an agent to be overconfident or optimistic in an absolute sense (rather than relative to some other agent). I conclude by offering one possible answer to this question, in the spirit of Epstein (1999) and Ghirardato and Marinacci (2002).
Whereas our definitions of “more confident than” and “more optimistic than” were purely behavioral, making no reference to objective facts about the agent’s ability to influence output, absolute overconfidence and optimism are concerned precisely with discrepancies between such objective facts and the agent’s subjective assessment of them. We must therefore fix an objective benchmark against which the agent’s behavior may be compared. Although it is natural to fix as our benchmark the first three parameters (E⋆, C⋆, ) of a standard four-parameter moral hazard model, what will actually matter is only the objective output distribution cost .26
Definition 6.
Let be the objective output distribution cost, and let ⪰ be a binary relation on W that satisfies axioms 1–4 and vNM independence. Write for the (unique) normalized utility function of ⪰, and let ⪰⋆ be the moral hazard preference whose parsimonious representation is (c⋆, u). We say that ⪰ is overconfident (optimistic) if and only if it is more confident (more optimistic) than ⪰⋆.
In other words, an overconfident (optimistic) preference is one that is more confident (more optimistic) than is objectively warranted.
Corollary 1.
Let be the objective output distribution cost, and let ⪰ be an unbounded moral hazard preference, with parsimonious representation (c, u). Then ⪰ is overconfident if and only if , and ⪰ is optimistic if and only if c is upshifted from c⋆.
Analogous definitions and characterizations may be given for (absolute) underconfidence and pessimism.
Appendix. Proof of Proposition 4
The proof relies on an observation and a lemma. The lemma is inspired by and proved using a result due to Dziewulski and Quah (2024).
Observation 4.
If is onto, then for every function , there is a contract such that for every .
Lemma 2.
Let be grounded, convex, and lower semicontinuous. The following are equivalent:
a. c is upshifted from c′.
b. For any such that is increasing,
implies
A1. Proof of Proposition 4
Suppose that ⪰ is more optimistic than ⪰′. Then since for any constant contracts , x is steeper than x′ and x′ is steeper than x. By observation 4 (applicable since ⪰ and ⪰′ are unbounded), property b holds. Hence, by lemma 2, c is upshifted from c′.
Suppose that and that c is upshifted from c′. Then property b holds by lemma 2. Since ⪰ and ⪰′ are unbounded, it follows by observation 4 that ⪰ is more optimistic than ⪰′. QED
A2. Proof of Lemma 2
Property b is equivalent to the following:
c. For any such that is increasing,
We claim that property c is equivalent to the following:
d. For any such that is increasing,
It is clear that property d implies property c. Conversely, if property d fails for some f and f′ such that is increasing, then property c fails for and f′, where
Finally, property d holds if and only if c is upshifted from c′ by proposition 9 in Dziewulski and Quah (2024).27 QED
Notes
This paper grew out of conversations with Jorge Padilla and Joe Perkins, whom I thank for their insights. I am grateful for comments from three anonymous referees, Johannes Abeler, Dağhan Carlos Akkar, Nemanja Antić, Ala Avoyan, Jean Baccelli, Aislinn Bohren, Roberto Corrao, Gregorio Curello, Eddie Dekel, Matteo Escudé, Bruno Furtado, George Georgiadis, Duarte Gonçalves, Tai-Wei Hu, Alex Jakobsen, Ian Jewitt, Jan Knoepfle, Meg Meyer, Salvatore Piccolo, Mauricio Ribeiro, Evgenii Safonov, Todd Sarver, Lorenzo Stanca, Quitzé Valenzuela-Stookey, and Mu Zhang as well as audiences at Compass Lexecon and the Second Southeast Theory Festival. This research was supported by Compass Lexecon. An earlier version of this paper bore the title “Optimism and Overconfidence.” This paper was edited by Eduardo Azevedo.
1 For example, see the survey by DellaVigna (2009, sec. 3.1).
2 For example, see Spinnewijn (2015).
3 For a random remuneration , u(x) denotes the expected utility .
4 For an overview, see Holmström (2017) and Georgiadis (2024).
5 For example, see de la Rosa (2011), Gervais, Heaton, and Odean (2011), and Spinnewijn (2015). See Kőszegi (2014, sec. 3.4) for a survey.
6 Examples include disappointment aversion (Gül 1991), temptation (Gül and Pesendorfer 2001; Dekel, Lipman, and Rustichini 2009), status quo bias (Masatlioglu and Ok 2005; Ortoleva 2010), framing effects (Salant and Rubinstein 2006, 2008; Ahn and Ergin 2010), heuristics (Manzini and Mariotti 2007, 2012), regret aversion (Sarver 2008), costly contemplation (Ergin and Sarver 2010), and reference dependence (Ok, Ortoleva, and Riella 2015).
7 Topics include the impact of overconfidence or optimism on credit markets (Manove and Padilla 1999), competitive screening (Sandroni and Squintani 2007), monopolistic screening (Eliaz and Spiegler 2008; Grubb 2009), moral hazard (see n. 5), political behavior (Ortoleva and Snowberg 2015), and learning (Heidhues, Kőszegi, and Strack 2018). See also Benoît and Dubra (2011).
8 For example, see Wakker (1990), Epstein and Kopylov (2007), Chateauneuf, Eichberger, and Grant (2007), Moore and Healy (2008), Dean and Ortoleva (2017), and Dillenberger, Postlewaite, and Rozen (2017).
9 See Azrieli and Lehrer (2008), Dillenberger et al. (2014), and Lu (2016).
10 See Van Zandt (1996), de Oliveira et al. (2017), and Ellis (2018).
11 Convexity is easily verified. For lower semicontinuity, let Δ(Δ(S)) be the set of all probabilities on Δ(S), equipped with the topology of weak convergence. For each , , where and for each , and by convention. The map is lower semicontinuous by lemma 17.30 in Aliprantis and Border (2006), since C is lower semicontinuous and (by the continuity of ) the correspondence is upper hemicontinuous and compact valued. Since the map is continuous (Phelps 2001, proposition 1.1), the correspondence is upper hemicontinuous and compact valued. Thus, c is lower semicontinuous by lemma 17.30 in Aliprantis and Border (2006). This argument is adapted from Lipnowski and Ravid (2020, app. sec. A.4.1).
12 Explicitly, for each finite .
13 Explicitly, writing for any and , we have for every , so .
14 His postulate P2.1 is quasiconvexity. See also Drèze (1987, chap. 2) and Machina (1984).
15 Reasoning similar to that in n. 13 shows that ⪰ satisfies the reverse inequality (quasiconcavity) and that the inequality is strict for some except if c is trivial.
16 The axiom: for any contracts and any , implies for any contract . Equivalence holds by the Anscombe and Aumann (1963) expected utility theorem.
17 Preferences like ⪰, except with V replaced by and “max” replaced by “min,” were studied by Cerreia-Vioglio et al. (2011).
18 This argument, and thus proposition 2, remains valid if unboundedness is weakened to require only that u(Δ(Π)) be unbounded either above or below. The full force of unboundedness will be needed in sec. VI.B, however.
19 See Georgiadis (2024, sec. 3) for some notable exceptions.
20 Two common methods are incentivized elicitation and measuring performance on a cognitive task (see, e.g., Charness, Gneezy, and Henderson 2018). A third method is measuring the total time spent on a task (e.g., Avoyan, Ribeiro, and Schotter 2025).
21 This statement assumes that optimal effort choices are unique. The general statement may be found in Dziewulski and Quah (2024, sec. 5.2).
22 Fix and in [0,1]. For any , it is easily verified that and satisfy the properties in definition 5.
23 This way of comparing sets is known as the weak set order (Topkis 1998, sec. 2.4).
24 Explicitly, there exist binary relations⪰ and ⪰′ on W such that ⪰ is more confident but not more optimistic than ⪰′, and there exist relations ⪰ and ⪰′ on W such that ⪰ is more optimistic but not more confident than ⪰′; furthermore, these can be chosen to satisfy axioms 1–4, MMR independence, and quasiconvexity.
25 I thank an anonymous referee for this observation.
26 The relationship is for every , where by convention. This was shown in the proof of lemma 1.
27 The statement of this proposition assumes that c and c′ are finite valued, but that assumption is not required for the proof. (The statement also assumes continuity, but that property is equivalent to lower semicontinuity given the other assumptions.)
-
Ahn, D. S., and H. Ergin. 2010. “Framing Contingencies.” Econometrica 78 (2): 655–95.
-
Aliprantis, C. D., and K. C. Border. 2006. Infinite Dimensional Analysis: A Hitchhiker’s Guide. 3rd ed. Berlin: Springer.
-
Anscombe, F. J., and R. J. Aumann. 1963. “A Definition of Subjective Probability.” Ann. Math. Statis. 34 (1): 199–205.
-
Avoyan, A., M. Ribeiro, and A. Schotter. 2025. “Choosing between and Allocating Time across Contracts: An Experimental Study.” Working paper.
-
Azrieli, Y., and E. Lehrer. 2008. “The Value of a Stochastic Information Structure.” Games and Econ. Behavior 63 (2): 679–93.
-
Benoît, J.-P., and J. Dubra. 2011. “Apparent Overconfidence.” Econometrica 79 (5): 1591–625.
-
Blackwell, D. 1951. “Comparison of Experiments.” In Berkeley Symposium on Mathematical Statistics and Probability, vol. 2, edited by J. Neyman, 93–102. Berkeley: Univ. California Press.
-
———. 1953. “Equivalent Comparisons of Experiments.” Ann. Math. and Statis. 24 (2): 265–72.
-
Cerreia-Vioglio, S., F. Maccheroni, M. Marinacci, and L. Montrucchio. 2011. “Uncertainty Averse Preferences.” J. Econ. Theory 146 (4): 1275–330.
-
Charness, G., U. Gneezy, and A. Henderson. 2018. “Experimental Methods: Measuring Effort in Economics Experiments.” J. Econ. Behavior and Org. 149:74–87.
-
Chateauneuf, A., J. Eichberger, and S. Grant. 2007. “Choice under Uncertainty with the Best and Worst in Mind: Neo-Additive Capacities.” J. Econ. Theory 137:538–67.
-
Dean, M., and P. Ortoleva. 2017. “Allais, Ellsberg, and Preferences for Hedging.” Theoretical Econ. 12 (1): 377–424.
-
Dekel, E., B. L. Lipman, and A. Rustichini. 2009. “Temptation-Driven Preferences.” Rev. Econ. Studies 76 (3): 937–71.
-
de la Rosa, L. E. 2011. “Overconfidence and Moral Hazard.” Games and Econ. Behavior 73 (2): 429–51.
-
DellaVigna, S. 2009. “Psychology and Economics: Evidence from the Field.” J. Econ. Literature 47 (2): 315–72.
-
de Oliveira, H., T. Denti, M. Mihm, and K. Ozbek. 2017. “Rationally Inattentive Preferences and Hidden Information Costs.” Theoretical Econ. 12 (2): 621–54.
-
Dillenberger, D., J. S. Lleras, P. Sadowski, and N. Takeoka. 2014. “A Theory of Subjective Learning.” J. Econ. Theory 153:287–312.
-
Dillenberger, D., A. Postlewaite, and K. Rozen. 2017. “Optimism and Pessimism with Expected Utility.” J. European Econ. Assoc. 15 (5): 1158–75.
-
Drèze, J. H. 1961. “Fondements Logiques de la Probabilité Subjective et de l’Utilité. In La Décision: Paris, 25–30 Mai 1960, Colloques Internat. CNRS, 73–87. Paris: CNRS.
-
———. 1987. Essays on Economic Decisions under Uncertainty. Cambridge: Cambridge Univ. Press.
-
Dziewulski, P., and J. K.-H. Quah. 2024. “Comparative Statics with Linear Objectives: Normality, Complementarity, and Ranking Multi-Prior Beliefs.” Econometrica 92 (1): 167–200.
-
Eliaz, K., and R. Spiegler. 2008. “Consumer Optimism and Price Discrimination.” Theoretical Econ. 3 (4): 459–97.
-
Ellis, A. 2018. “Foundations for Optimal Inattention.” J. Econ. Theory 173:56–94.
-
Epstein, L. G. 1999. “A Definition of Uncertainty Aversion.” Rev. Econ. Studies 6 (3): 579–608.
-
Epstein, L. G., and I. Kopylov. 2007. “Cold Feet.” Theoretical Econ. 2 (3): 231–59.
-
Ergin, H., and T. Sarver. 2010. “A Unique Costly Contemplation Representation.” Econometrica 78 (4): 1285–339.
-
Georgiadis, G. 2024. “Contracting with Moral Hazard: A Review of Theory & Empirics.” In Elgar Encyclopedia on the Economics of Competition, Regulation and Antitrust, edited by Michael D. Noel, 24–37. Cheltenham: Elgar.
-
Georgiadis, G., D. Ravid, and B. Szentes. 2024. “Flexible Moral Hazard Problems.” Econometrica 92 (2): 387–409.
-
Gervais, S., J. B. Heaton, and T. Odean. 2011. “Overconfidence, Compensation Contracts, and Capital Budgeting.” J. Finance 66 (5): 1735–77.
-
Ghirardato, P., and M. Marinacci. 2002. “Ambiguity Made Precise: A Comparative Foundation.” J. Econ. Theory 102 (2): 251–89.
-
Grubb, M. D. 2009. “Selling to Overconfident Consumers.” A.E.R. 99 (5): 1770–807.
-
Gül, F. 1991. “A Theory of Disappointment Aversion.” Econometrica 59 (3): 667–86.
-
Gül, F., and W. Pesendorfer. 2001. “Temptation and Self-Control.” Econometrica 69 (6): 1403–35.
-
Heidhues, P., B. Kőszegi, and P. Strack. 2018. “Unrealistic Expectations and Misguided Learning.” Econometrica 86 (4): 1159–214.
-
Holmström, B. 1979. “Moral Hazard and Observability.” Bell J. Econ. 10 (1): 74–91.
-
———. 2017. “Pay for Performance and Beyond.” A.E.R. 107 (7): 1753–77.
-
Jakobsen, A. M. 2021. “An Axiomatic Model of Persuasion.” Econometrica 89 (5): 2081–116.
-
———. 2025. “Temptation and Persuasion.” Working paper.
-
Kamenica, E., and M. Gentzkow. 2011. “Bayesian Persuasion.” A.E.R. 101 (6): 2590–615.
-
Karni, E. 2006. “Subjective Expected Utility Theory without States of the World.” J. Math. Econ. 42 (3): 325–42.
-
Kőszegi, B. 2014. “Behavioral Contract Theory.” J. Econ. Literature 52 (4): 1075–118.
-
Lipnowski, E., and D. Ravid. 2020. “Cheap Talk with Transparent Motives.” Econometrica 88 (4): 1631–60.
-
Lu, J. 2016. “Random Choice and Private Information.” Econometrica 84 (6): 1983–2027.
-
Maccheroni, F., M. Marinacci, and A. Rustichini. 2006. “Ambiguity Aversion, Robustness, and the Variational Representation of Preferences.” Econometrica 74 (6): 1447–98.
-
Machina, M. J. 1984. “Temporal Risk and the Nature of Induced Preferences.” J. Econ. Theory 33 (2): 199–231.
-
Manove, M., and A. J. Padilla. 1999. “Banking (Conservatively) with Optimists.” RAND J. Econ. 30 (2): 324–50.
-
Manzini, P., and M. Mariotti. 2007. “Sequentially Rationalizable Choice.” A.E.R. 97 (5): 1824–39.
-
———. 2012. “Categorize Then Choose: Boundedly Rational Choice and Welfare.” J. European Econ. Assoc. 10 (5): 1141–65.
-
Masatlioglu, Y., and E. A. Ok. 2005. “Rational Choice with Status Quo Bias.” J. Econ. Theory 121 (1): 1–29.
-
Mirrlees, J. A. (1975) 1999. “The Theory of Moral Hazard and Unobservable Behaviour: Part I.” Rev. Econ. Studies 66 (1): 3–21. Completed October 1975, published 1999.
-
Mirrlees, J. A., and Y. Zhou. 2006. “Principal-Agent Problems Revisited, Part I: Static Models.” Working paper.
-
Moore, D. A., and P. J. Healy. 2008. “The Trouble with Overconfidence.” Psychological Rev. 115 (2): 502–17.
-
Ok, E. A., P. Ortoleva, and G. Riella. 2015. “Revealed (P)Reference Theory.” A.E.R. 105 (1): 299–321.
-
Ortoleva, P. 2010. “Status Quo Bias, Multiple Priors and Uncertainty Aversion.” Games and Econ. Behavior 69 (2): 411–24.
-
Ortoleva, P., and E. Snowberg. 2015. “Overconfidence in Political Behavior.” A.E.R. 105 (2): 504–35.
-
Phelps, R. R. 2001. Lectures on Choquet’s Theorem. 2nd ed. Berlin: Springer.
-
Salant, Y., and A. Rubinstein. 2006. “A Model of Choice from Lists.” Theoretical Econ. 1 (1): 3–17.
-
———. 2008. “(A, f): Choice with Frames.” Rev. Econ. Studies 75 (4): 1287–96.
-
Sandroni, A., and F. Squintani. 2007. “Overconfidence, Insurance, and Paternalism.” A.E.R. 97 (5): 1994–2004.
-
Sarver, T. 2008. “Anticipating Regret: Why Fewer Options May Be Better.” Econometrica 76 (2): 263–305.
-
Schmeidler, D. 1989. “Subjective Probability and Expected Utility without Additivity.” Econometrica 57 (3): 571–87.
-
Spinnewijn, J. 2015. “Unemployed but Optimistic: Optimal Insurance Design with Biased Beliefs.” J. European Econ. Assoc. 13 (1): 130–67.
-
Topkis, D. M. 1998. Supermodularity and Complementarity. Princeton, NJ: Princeton Univ. Press.
-
Van Zandt, T. 1996. “Hidden Information Acquisition and Static Choice.” Theory and Decision 40 (3): 235–47.
-
von Neumann, J., and O. Morgenstern. 1947. Theory of Games and Economic Behavior. 2nd ed. Princeton, NJ: Princeton Univ. Press.
-
Wakker, P. P. 1990. “Characterizing Optimism and Pessimism Directly through Comonotonicity.” J. Econ. Theory 52 (2): 453–63.
