Insurance contracts with imprecise probabilities and adverse selection

(1)

Insurance Contracts with Imprecise Probabilities

and Adverse Selection

∗

Meglena Jeleva

LEN-C3E, Université de Nantes, EUREQua

106-112, Bld de l’Hôpital

75647 Paris Cedex 13

France

Bertrand Villeneuve

†

IDEI Université de Toulouse 1

Place Anatole France

31042 Toulouse cedex

France

April 11, 2003

Abstract

This article deals with optimal insurance contracts in the framework of imprecise probabilities and adverse selection. Agents diﬀer not only in the objective risk they face but also in the perception of risk. In monopoly, a range of configurations that VNM preferences preclude appears: a pooling contract may be optimal, incomplete coverage may be oﬀered to high risks, low risks may be better covered.

Keywords: Imprecise Probabilities; Insurance Markets; Adverse Selec-tion.

JEL: D81, D82, G22

∗_{Useful suggestions by Emmanuelle Auriol, Bernard Bensaïd, Michèle Cohen, Isaac Meilijson}

and the anonymous referee are acknowledged.

(2)

1 Introduction

Following Rothschild and Stiglitz [7] and Stiglitz [10], asymmetric information in insurance markets is generally understood as a situation in which consumers are better informed than insurers. However, the general perception in insurance com-panies is quite diﬀerent: statistical discrimination and risk assessment have made considerable progress, whereas individuals often have limited and biased experience. One could even estimate that adverse selection is eradicated in certain markets (see Chiappori and Salanié [1] for evidence). If we go further and assume that insurers are better informed on risk, then the issue would rather be to inform people correctly on their risks rather than design sophisticated menus. Along these lines, Villeneuve [11] proposes an analysis of the better informed insurer in which the question of information transmission is explored.

Both the adverse selection hypothesis and its reverse contain some truth: insurers know risk better, at least as far as the classifications they use are concerned, whereas consumers know something personal and unobservable that is also relevant to risk. This paper is a modest attempt to reconcile these two points of view. On the one hand, we assume that customers, though “knowing their types”, may not correctly process all the information attached to this knowledge. On the other hand, insurers are able to define risk classes and measure which “types” are pooled in a given statistical class and what risk they bear; however they are assumed to be unable to assign a type to a given customer.

How can insurers know these stable traits of their clientele? Experimentation, whether optimized or involuntary, combined with revealed preference arguments, justifies that statistical parameters of interest be inferred from actual choices. This knowledge can be capitalized to improve contract oﬀers to new groups of consumers. We shall assume that we are at the final stage of this learning process that we do not model.

Several arguments can justify our assumption concerning risk perception by con-sumers. Consumers very often have limited experience, imprecise sources of infor-mation, and consequently their beliefs remain durably inaccurate. A young driver may know that his accident probability is high without being able to quantify it precisely, and this may produce biases in his choice of insurance coverage.

(3)

More-over, even if agents have perfect knowledge of their true probability distribution, a disparity between beliefs and objective probabilities is still rationalizable. A now classical representation is the rank dependent expected utility (Quiggin [6], Yaari [12]).

To summarize, we are in a situation where asymmetric information has two sides. Insurers do not distinguish ex ante high risk consumers from low risk consumers, whereas the policyholders do not necessarily behave as they should if they were perfectly informed or just expected utility maximizers. To simplify matters and avoid diﬃculties with equilibrium concepts that arise in the competitive case, we consider here a monopolistic insurer.

The results of the paper are as follows. A difference between beliefs and prob-abilities in an insurance model with adverse selection, in addition to making the model closer to reality, explains a wider range of configurations. A striking fact is that, though the single-crossing property remains satisfied everywhere, the optimal offer of the monopolistic insurer can be a pooling contract over a continuum of para-meters. The reason for this is that the marginal values for a better coverage across types may be ranked by types and by insurers in opposite ways: the most costly type may be the one with the least willingness to pay for coverage. This “disagree-ment” cannot always be resolved with differentiated offers. The distinction between beliefs and probabilities introduces the possibility of the low risk (in terms of ob-jective probability) being more covered than the high risk. We obtain an important difference with Stiglitz’ model [10]; our findings stress the fact that it is crucial for insurers, when they decide the contracts they offer, to take into account the possible differences between agents’ beliefs and probabilities. After a theorem giving the general properties of our model, we characterize the particularities of the various possible configurations, and we show exactly how the equilibrium moves when the distribution of types is changed.

Section 2 briefly recalls several models of decision making under risk and ambi-guity justifying the existence of a diﬀerence between beliefs and “true”probabilities. In section 3, we define and examine the first-best optimum when insurance appli-cants’ beliefs diﬀer from the true probabilities, but insurers can perfectly categorize applicants according to the true risk they present. Section 4 sets the problem of the monopolistic insurer confronted with a heterogenous population in terms of beliefs

(4)

and probabilities when the insurer cannot distinguish applicants according to their true risks. We establish a theorem that sets the general necessary properties of the optimal menu in this second-best context. In section 5, we consider diﬀerent pos-sible configurations of the parameters and give a complete characterization of the optimal solutions. Proofs are relegated to the Appendix.

2 Beliefs and objective probabilities

In this section, we briefly present several distinct reasons why insurers’ and policy-holders’ risk assessments may diﬀer. Though meanings diﬀer, most of these cases are formally identical, which enables us to include them in the same mathematical framework.

A consumer with initial wealth W0 incurs a loss d with probability p which

represents the “true” (or objective) probability as estimated by the insurer. An insurance contract C = (W1, W2) is fully characterized by the individual’s wealth

in all states of nature: W2 if loss occurs, and W1 otherwise. C0 represents the null

“contract” (W0, W0−d), i.e. no insurance. We denote by V the function representing

consumer’s preferences on contracts.

When consumers know their objective loss probability, a disparity between V and expected utility (EU) is rationalizable by the rank dependent expected utility model (RDEU) proposed by Quiggin [6] and Yaari [12].1 _{Preferences depend on the}

one hand on a utility function (reflecting perception of wealth) and on the other hand on a probability transformation function (reflecting perception of loss distribution). The whole characterizes attitude towards risk. In our simple context, the utility can be expressed:

VRDEU(C) = ½

(1_{− ϕ(1 − p))u(W}2) + ϕ(1− p)u(W1) if W2 ≤ W1

(1_{− ϕ(p))u(W}1) + ϕ(p)u(W2) if W1 ≤ W2 (1)

where ϕ is an increasing probability transformation with ϕ(0) = 0 and ϕ(1) = 1. _{A consumer with concave u is risk averse if ϕ(x) < x, ∀x ∈]0, 1[, i.e. if he} underestimates the probabilities of higher wealth and overestimates that of lower wealth. Risk seeking corresponds to ϕ(x) > x.

(5)

When consumers lack precise information (ambiguity), several preference repre-sentations are possible.

In the general case where information doesn’t take a particular form, it is pos-sible, under certain axioms, to represent preferences using the subjective expected utility (SEU) model of Savage [8]. We get:

VSEU(C) = qu(W2) + (1− q)u(W1), ∀ W1, W2 (2)

where q is the subjective loss probability.

Experimental violations of EU axioms (in particular the independence axiom) inspired the development of new, more general models of decision making under uncertainty. The Choquet expected utility (CEU) proposed by Schmeidler [9] is one of the most widely accepted models in this class. Consumers’ preferences depend on the one hand on a utility function and, on the other hand, on an increasing set function, a capacity, defined on events. This representation separates attitude towards uncertainty and attitude towards wealth. Denote by ω2 the event {loss

occurs} and by ω1 the event {no loss}; CEU is representable by

VCEU(C) = ½

(1_{− ν(ω}1))u(W2) + ν(ω1)u(W1) if W2 ≤ W1

(1_{− ν(ω}2))u(W1) + ν(ω2)u(W2) if W1 ≤ W2

(3)

where ν is a capacity with ν(∅) = 0 and ν(Ω) = 1 (Ω is the set of events). In a two-state context, an individual with concave utility function is “ambiguity averse” if ν(ω1) + ν(ω2) < 1, i.e. if he underestimates the likelihood of an event associated

with high wealth compared to event associated with low wealth.

If available information is just a probability interval (imprecise probabilities), we can refer to Jaﬀray [2]. The model, which generalizes EU, can be used whenever the information is that the true probability distribution belongs to a given set satisfying certain properties. Preferences are represented by a utility function and a pessimism-optimism index reflecting attitude towards ambiguity. With this Jaﬀray imprecise probabilities (JIP) model, if the consumer knows that p ∈ [q0, q00], the preference representation function can be expressed:

VJIP(C) = ½ ((1_{− α)q}0_{+ αq}00_)u(W 2) + (1− (1 − α)q0 − αq00)u(W1) if W2 ≤ W1 (1_{− αq}0_{− (1 − α)q}00_)u(W 1) + (αq0 + (1− α)q00)u(W2) if W1 ≤ W2 (4) where α is the Hurwicz pessimism-optimism index. By definition, α is such that the decision maker is indiﬀerent between: (i) obtaining at least the lowest wealth

(6)

and at most the highest wealth with no information about the probabilities and (ii) obtaining either the lowest wealth with probability α or the highest wealth with probability 1 − α. If α = 1, the individual is a pure pessimist who assigns all the chances to the worst wealth.2

In the following, we assume the so-called indemnity principle: we rule out con-tracts for which the policyholder’s wealth is higher when he has an accident. This precaution prevents ex post moral hazard or incentives for destruction. In addition, “negative” insurance contracts (i.e. such that W1 − W2 > d) are forbidden to

sim-plify the analysis of corner solutions. We denote by C the set of contracts verifying these restrictions.

Under these assumptions, as we have W2 ≤ W1, all the previous models boil

down to one, where the consumer’s preferences can be represented by the following functional:

V (C) = qu(W2) + (1− q)u(W1) (5)

with q being a synthetic belief having a specific meaning in each of the models. To simplify the terminology, a consumer will be called pessimist if q > p, optimist if p > q, and neutral if p = q.

3 Insurance contracts

This section is devoted to the characterization of optimal contracts offered by a fully informed monopolistic insurer to a policyholder whose beliefs and objective probabilities differ. Note that a policyholder’s beliefs are assumed here not to be influenced by the insurer’s offer. Indeed, in case of ambiguity, the policyholder is assumed not to be aware of the insurer’s informational advantage, or at least unable to update his beliefs after observation of the offer he is made, and more generally to learn. If the policyholder knows the objective probability, but has RDEU preferences, the question of learning is obviously irrelevant.

The contract (W1, W2)oﬀered by a risk neutral monopolistic insurer maximizes

its expected profit under the participation constraint of the consumer:

max

(W1,W2)∈C

(1_{− p) (W − W}1) + p (W − d − W2) (6)

(7)

subject to

(1_{− q) u(W}1) + q u(W2)≥ (1 − q) u(W ) + q u(W − d). (7)

Introducing λ as the Lagrange multiplier for the constraint, the first-order conditions can be expressed:

−(1 − p) + λ(1 − q)u0(W1) = 0, (8)

−p + λqu0(W2) = 0,

λ [(1_{− q) u(W}1) + q u(W2)− (1 − q) u(W ) − q u(W − d)] = 0.

Stiglitz [10] proved that, with symmetric information regarding risk, a monopolistic insurer maximizing its profit offers a full coverage contract such that the policyholder is indifferent between purchasing the policy and having no insurance. The optimum is different here. Define

θ_≡ µ q 1_{− q} ¶ Áµ p 1_{− p} ¶ . (9)

The optimal contract C∗ _{= (W}∗

1, W2∗) verifying W2 ≤ W1 is such that:

   u0(W1∗) u0_(W∗ 2) = µ q 1_{− q} ¶ Áµ p 1_{− p} ¶ if θ < 1 W∗ 1 = W2∗ otherwise (10)

with the binding participation constraint:

(1_{− q) u(W}₁∗) + q u(W₂∗) = (1_{− q) u(W ) + q u(W − d).} (11) An insurance contract is eﬃcient in our framework if it maximizes profit for a given utility level (or if it maximizes utility for a given profit level), that is whenever the marginal rate of substitution of the consumer equals the marginal rate of substitu-tion of the insurer, except if this rule leads to a contradicsubstitu-tion with the indemnity principle. In short, a contract is said to be eﬃcient if and only if

u0(W1)

u0_(W₂₎ = min{θ, 1}. (12)

In the following, I(θ) represents the set of efficient contracts (or equivalently, of contracts offering efficient coverage).

The contract oﬀered by a monopolistic insurer having complete information about the consumer’s loss probability and belief is therefore the contract in I(θ)

(8)

which is equivalent, for the consumer, to no insurance. If θ < 1 (p > q), the policy-holder is optimistic, the eﬃcient contract line is below the full insurance line, which corresponds to partial coverage. If θ ≥ 1 (p ≤ q) then full insurance is eﬃcient, as in Stiglitz [10].

Figure 1 shows the optimal contracts for two possible values of θ. Constant profit lines are parallel to π = 0, the indiﬀerence curves passing through C0 for the two

values are given.

INSERT FIGURE 1 HERE

4 Optimal menu under adverse selection:

Gen-eral properties

Under the adverse selection hypothesis, consumers are assumed to be identical, except for the probability of having an accident: two risk classes coexist in the population, denoted by H and L. Their beliefs are respectively qH and qL with

qH > qL, and the true probabilities of loss are pH and pL; θH and θL are the ratios

of the perceived odds of loss to the actual odds as defined above. H denotes the type with the highest belief which is not necessarily the type with the highest true probability.3 _{This choice simplifies considerably the exposition because preferences}

only (not objective probabilities) matter for choice and self-selection. Probabilities and beliefs are not necessarily positively correlated, but even if they are, a low risk (in terms of probability) could still be more pessimistic than the high risk.

The insurer knows true probabilities and beliefs in the population, as well as the type proportions denoted by λH and λL (λH+ λL = 1). We denote by CH and CL

the contracts that would be proposed to H and L, respectively, in the absence of adverse selection (they are the profit maximizing eﬃcient contracts). With qH > qL,

as we assume, this allocation cannot be implemented with adverse selection.

The Revelation Principle being applicable, the problem of the monopolist is to propose a pair of contracts, CLand CH,that are incentive compatible and preferred

(weakly) to no insurance. Obviously, neither CL nor CH need to be diﬀerent from

C0,nor do CLand CH need to be diﬀerent from each other. The second-best optimal

3_{In case of equality, the problem is to find the optimal pooling contract and the distinctions we}

(9)

contracts of a monopolistic insurer are solutions of the following program: max CL,CH∈C λLπL(CL) + λH πH(CH) (13) such that _       CL ºL CH CL ºL C0 CH ºH CL CH ºH C0 (14)

where ºi, with i = L, H, represents the preferences of type i on contracts; πi gives

the per capita profit made by the monopolist if type i selects the specified contract. The problem is similar to Stiglitz [10] or Landsberger and Meilijson [4],[5] hence-forth LM). However the methodologies that they adopt (which significantly diﬀer) are not suited here because individually eﬃcient contracts depend on type. We recall the most important features of Stiglitz’ [10] solution.

Proposition 1 (Stiglitz) In case of pure risk:

1. High risk consumers get full insurance.

2. Low risk consumers are indiﬀerent between their contract and no insurance.

3. High risk consumers are indiﬀerent between their contract and the low risk consumers’ one.

4. The two types never buy the same contract.

5. There is a number strictly between 0 and 1 such that if λH is higher, then the

oﬀer degenerates to the symmetric information insurance contract of high risk CH, and low risk remains uninsured.

All these five points require reconsideration in our model. Concerning what LM calls the structure of indiﬀerence, we have:

Lemma 1 The second-best optimal pair of contracts oﬀered by a monopolistic in-surer is unique and:

(10)

2. Type H consumers are indiﬀerent between their contract and the type L con-sumers’ contract.

Predictions are weaker than in Stiglitz [10]. The lemma is “tight” in that predic-tions of Stiglitz that we exclude are in effect violated with well-chosen parameters. For example, type H may have a binding participation constraint (in this case, no insurance at all is the optimal offer); moreover, type L consumers may not exhibit a strict preference for their contract, which justifies pooling contracts. Another ma-jor difference is that it is not necessarily true that one type at least gets efficient insurance. Section 5 provides conditions for all these cases.

Remark If type L consumers receive insurance, then type H consumers obtain an informational rent, or more precisely, their utility is higher than it would be if they were the only type in the market.

This result is a direct consequence of the preceding lemma. Indeed, every time type L obtains nontrivial coverage, type H consumers’ participation constraint is not binding and thus their utility is higher than with CH.

Thanks to Lemma 1, we can reduce the number of unknowns in the insurer’s program from four (two per contract) to one, since once CL has been chosen among

contracts that are equivalent to C0 from type L’s viewpoint (choice is in a

one-dimensional set), CH is uniquely determined as the profit maximizing contract that

is equivalent to CL for H.

The choice of the parameter is arbitrary. To keep smooth objectives, we proceed in the following way. Let W be the certainty equivalent for H of C0. Let W be

the certainty equivalent for type H of CL. We have W ≤ W . To each t ∈ [0, 1],

we associate W (t) = (1 − t) W + t W . For each level of constant wealth W (t), we define CL(t) as the unique contract (W1(t), W2(t)) which is equivalent to W (t) for

H and equivalent to no-insurance for L: ½

(1_{− q}L) u(W1(t)) + qLu(W2(t)) = (1− qL) u(W0) + qLu(W0− d)

(1_{− q}H) u(W1(t)) + qHu(W2(t)) = u(W (t))

(15)

Given CL(t), CH(t) is defined as the most profitable incentive compatible contract

that the insurer can oﬀer to type H :

CH(t) = arg max (W1,W2)∈C

(11)

subject to ½

(1_{− q}L) u(W1(t)) + qLu(W2(t)) ≥ (1− qL) u(W1) + qLu(W2)

(1_{− q}H) u(W1) + qHu(W2) ≥ (1 − qH) u(W1(t)) + qHu(W2(t)) (17)

The menus M (t) = {CH(t), CL(t)} , t ∈ [0, 1], all satisfy the properties of the lemma.

Menus are continuously parameterized, and CL(t)oﬀers to type L a coverage strictly

increasing in t. See Figure 2.

For a given menu M (t), we can define profits πL(t) and πH(t). Our

parameteri-zation is such that πL(·) is strictly increasing (CL(t)gets closer to the optimal CL as

t increases) while πH(·) is strictly decreasing (CH(t) gets farther from the optimal

CH as t increases). The objective of the monopoly can be expressed as the choice

of t∗(λL):

t∗(λL) = arg max

t∈[0,1]λLπL(t) + (1− λL)πH(t) (18)

This program represents the classical monopoly trade-oﬀ between minimizing infor-mational rents given to H and maximizing profits realized on type L.

The theorem is the basis of the comparative statics we provide in the following section:

Theorem 1 At the optimum, the monopolistic insurer oﬀers M (t∗(λL)), where λL

is the proportion of type L consumers, and

1. t∗₍_{·) is a continuous function.}

2. There are ξ₀ and ξ₁ (with 0 < ξ₀ < 1 and ξ₀ < ξ₁) such that:

(a) if λL ≤ ξ0, then t∗(λL) = 0 : H receives CH (no informational rent) and

L receives no insurance,

(b) if λL ≥ ξ1, then t∗(λL) = 1 : L receives CL and H receives the maximal

informational rent.

(c) if θL≤ 1, then ξ1 = 1 and if θL> 1, then ξ1 < 1.

3. Between extremes (ξ0 < λL< ξ1), t∗(·) is strictly increasing: as the proportion

(12)

Though Stiglitz’ [10] remains silent on this question, it is impossible in his model that types be pooled at CL. The reason is as follows: taking t slightly smaller than

1 has second-order eﬀects on πL(t) (we just move away from the optimum for type

L) whereas it has first-order eﬀects on πH(t) (CH(1) is not optimal for type H).

Consequently, whatever the respective proportions of types, t yields a higher profit than 1.

In our model, if type L is suﬃciently pessimistic (θL> 1), the optimal contract

for type L, CL, is only a constrained optimum (since type L would prefer

over-insurance), meaning that the eﬀect of t on πL(t) at t = 1 is of the first order.

This implies that types may be pooled at CL if there are suﬃciently many type L

policyholders.

The threshold ξ₀is always between 0 and 1. Indeed, moving t from 0 to a slightly positive value has first order eﬀects on the profitability of both types, negative on type H and positive on type L. If λL is small, then it is more profitable to choose

t = 0, in other words to oﬀer insurance to type H only.

5 Equilibrium configurations

The equilibrium configuration depends on the proportions of types in the population (λH and λL) and on the parameters θL and θH determining eﬃcient insurance

cov-erage. Theorem 1 shows that the proportions directly affect the optimal trade-off between offering the most profitable contract to L while not conceding too large informational rent to H. This is true for almost any monopoly models with asym-metric information, including Stiglitz’.

The novelty of our results originate in the type specific parameters θH and θL.

The gap between probabilities and beliefs, i.e. between θH or θL on the one hand

and 1 on the other, determines whether or not at least one type is fully covered in equilibrium, and the diﬀerence between θH and θL determines whether or not types

are separated in equilibrium.

A general remark is necessary at this point. The equilibrium allocation must be interpreted carefully when the higher belief individuals (who are willing to pay more for complete coverage) obtain a higher coverage than the low belief individuals. Indeed, individuals with the higher belief may be, objectively, the low risk consumers

(13)

(pH < pL): thus, in this model, it is not always true that the best coverage and the

informational rent are obtained by the high objective risk type; risk perceptions, and consequently, behaviors, are the critical factors.

5.1 Two pessimistic types

When both types are pessimistic (min{θH, θL} ≥ 1), overinsurance being precluded,

the eﬃcient coverage for both is full insurance. Stiglitz’ equilibrium configuration (separation of the types, full coverage for one of the types and partial coverage for the other) again applies. This situation is shown in Figure 3.

However, for a proportion of L close to 1, a minor diﬀerence has to be mentioned. It appears directly, from point 2 of Theorem 1 that if θL > 1, then threshold ξ1

is strictly lower than 1, which means that types are pooled on CL, the contract

extracting maximal rent on type L. This absence of discrimination of the types would not remain true if overinsurance was permitted.

5.2 At least one optimistic type

Equilibrium configurations specific to our model appear when at least one of the types is optimistic (θH or θL is lower than 1), especially where:

• All types may obtain partial coverage; • A pooling contract may be an equilibrium. This definition is used in the following.

Definition 1 A type i (i = H, L) is over-covered (under-covered) by a contract (W1, W2) if the coverage oﬀered (measured by u0(W1)/u0(W2)) is larger (respectively

smaller) than the eﬃcient one θi.

5.2.1 L more optimistic than H (θL< θH and θL < 1)

In this case, I(θL) is below I(θH) and below the full insurance line. If θH < 1,

(14)

absent, full coverage would be profit maximizing for neither of the two types. This case is significant in practice since beliefs and probabilities are positively correlated: type H represents high risk in terms of preferences and in terms of cost.

The general characteristics of the equilibrium configurations are as follows.

Proposition 2 If θL < θH and θL < 1, then:

1. Types are always separated.

2. Type H obtains optimal insurance:

(a) if θH ≥ 1, type H receives full coverage

(b) if θH < 1, optimal coverage is partial.

3. Type L never gets eﬃcient insurance (except if λH = 0).

4. There is ξ₀ (0 < ξ₀ < 1) such that type L is oﬀered insurance if and only if λL > ξ0, and then, type L’s coverage is continuously strictly increasing with

respect to λL.

To obtain the first two points, we start from a contract on the participation constraint of L and try to determine the most profitable contract for H which is incentive compatible. This contract corresponds to an eﬃcient contract (on I(θH))

such that type H consumers are indiﬀerent between their contract and type L’s contract. If θH ≥ 1, then eﬃcient contracts for type H correspond to full coverage

and if θH < 1, they correspond to partial coverage. The latter situation is depicted

on Figure 4.

The other points of the proposition are direct consequences of Theorem 1. Note that, stated as it is, the proposition is true only if the two types participated if they were alone on the market: min{θH , θL} ≥ θ with θ ≡ u0(W0)/u0(W0− d). If

neither type participated, then the market would not be active at all. If type L only did not participate, there is de facto no adverse selection and type H is optimally insured.

(15)

5.2.2 H more optimistic than L (θL> θH and θH < 1)

This configuration corresponds to a situation when the low belief type’s optimal coverage is higher. Type H can objectively be the low risk, though we don’t think that this negative correlation between beliefs and risks is the most realistic situation. The characterization of equilibria in this situation is given in the following propo-sition.

Proposition 3 Assume both types need insurance (θL> θand θH > θ). If θL> θH

and θH < 1, then there are thresholds denoted by ξ0, ξ0 and ξ1 with 0 < ξ0 < ξ0 < ξ1

such that

1. If λL < ξ0 then (a) types are separated, (b) type H obtains an eﬃcient coverage,

(c) type L is oﬀered insurance if and only if λL > ξ0 (coverage is increasing

with respect to λL).

2. If λL= ξ0 then (a) types are pooled, (b) type H obtains eﬃcient coverage.

3. If ξ0 < λL < ξ1 then (a) types are pooled, (b) neither type obtains eﬃcient

coverage, (c) coverage of the pooling contract is strictly increasing with respect to λL.

4. If λL≥ ξ1 then types are pooled on CL, type L’s profit maximizing insurance.

The origin of optimality of the pooling contract is as follows: on the one hand self-selection constraints force type L to receive less coverage, but on the other hand, type L more highly values coverage. This conflict can only be solved with pooling, which is impossible in Stiglitz’ model.

This pooling occurs only if the proportion of type L is suﬃciently large, i.e. if the insurer considers that leaving rents to type H to gain profits on type L is valuable. Under this condition, as it is diﬃcult to separate types, it is therefore optimal to accept that type H receives over-coverage and type L under-coverage. More precisely, when the proportion of type L becomes high, the monopoly is induced to approach CL as close as possible. Given that θH < θL, contracts that give more

(16)

than CL, but such contracts are also more attractive for type L than CL

(single-crossing condition). The monopoly is forced to pool types on CL.

The participation problem is somewhat more complex here than in paragraph 5.2.1. If, in the absence of adverse selection, type H did not participate whereas type Lparticipated, the optimal oﬀer to the latter always attracts the former. In this case where participation constraints matter, we obtain a truncated version of the previous proposition: only some of the equilibrium configurations remain. Namely, the fact that negative insurance is not allowed creates the possibility that no insurance at all be oﬀered in equilibrium, though one type at least (here L) would like to obtain some form thereof. This occurs for high proportions of type H.

The main results in this case, obtained directly from Theorem 1 are as follows:

• there is a threshold ξ0 such that if λL ≤ ξ0 then no insurance is oﬀered in

equilibrium.

• for λL > ξ0, there is ξ1 such that if ξ0 < λL < ξ1, then types are pooled and

neither type gets optimal insurance, and for λL≥ ξ1,types are pooled on type

L’s optimal insurance.

The result is that, provided the type requiring insurance (here L) is suﬃciently numerous, then the other type can easily be subsidized through a unique insurance contract oﬀered to all.

6 Conclusion

To summarize, two categories of situations can be distinguished: “pessimism ratios” (θL and θH) and costs are either positively or negatively correlated. In the case of

positive correlation, results are, by and large, quite similar to those of Stiglitz: the high risk wants and receives more coverage; adverse selection forces a rationing of the low risk in terms of the quality of insurance, meaning that the low risk remains less covered.

The case of negative correlation is less standard and still significant in practice: even if people are aware of their relative riskiness (qL < qH), pL< pHand θL> θH at

(17)

the same time is a plausible situation. In the absence of adverse selection, everybody would be proposed a fair rate and low risk people would undertake more insurance. In a situation of adverse selection, this possibility is precluded by the propensity of the high risk to prefer cheaper contracts. In equilibrium, we have the strikingly robust prediction that the optimal offer of the monopolistic insurer can be a pooling contract. In Landsberger and Meilijson [5], a pooling, if it is optimal, systematically offers full insurance for all; here a pooling is typically optimal for nobody, since it represents a compromise between covering low risk consumers sufficiently (because it is socially desirable) while not covering high risk consumers too much (because it is socially costly).

The introduction of a gap between beliefs and probabilities in an insurance model with adverse selection, in addition to making the model closer to reality, explains a wider range of optimal configurations. For given true probabilities, the quality of the information consumers use to make their choice is a parameter that should not be overlooked. If our model is more realistic than the standard approach as we know it from Rothschild and Stiglitz [7] or Stiglitz [10], whether the disparity comes from a lack of information on the part of the public or a violation of the independence axiom is the critical question. Our paper remains agnostic with respect to the various interpretations of non-expected utility that could be given. In a more comprehensive game, an eﬀective public policy (or the strategy of a monopoly) should be founded on a clear identification of the origin of the observed behavior.

A

Appendix

A.1 Proof of Lemma 1

Uniqueness. Assume that there are two diﬀerent solutions (CL, CH) and (CL0, CH0 ).

Define a pair of randomized contracts (Cε

L, CHε)with 0 < ε < 1 in the following way:

with Cε

L,whether there is an accident or not, the policyholder receives the wealth

prescribed by contract CL with probability ε and wealth prescribed by contract CL0

with probability 1−ε. This randomization is consistent with incentive compatibility and doesn’t change the expected profit of the insurer (each pair gives the same value of the objective). The following explains why we used (temporarily) contracts that were not permitted in C.

(18)

What is interesting here is that the contingent lotteries are perceived identically by diﬀerent types: e.g., conditional on loss, CLε is worth εu(WL2) + (1− ε)u(WL20 ),

with WL2 (WL20 ) the wealth in case of loss for contract CL (CL0). This lottery can be

replaced by its certainty equivalent Wε

L2,where u(WL2ε ) = εu(WL2) + (1− ε)u(WL20 ).

When implemented on all conditional lotteries, this replacement preserves incen-tive compatibility and creates deterministic contracts. Due to the concavity of the agents’ utility functions, this transformation yields a pair of contracts in C which is more profitable than those with which we started. This proves that two optima were not diﬀerent.

1. Assume that every type strictly prefers its contract to the uninsured posi-tion. (i) If both types are indiﬀerent between the two contracts, then the contracts are identical (single-crossing property). By lowering (slightly enough) the reim-bursement in case of accident (or by increasing the premium), we can maintain participation and increase profits, which contradicts that profits were maximal. (ii) If a type strictly prefers its contract, we make the same operation on this contract, so that incentives are not reversed. This also contradicts that profits were maximal. In consequence, one type’s rationality constraint at least is binding. The single-crossing property and the fact that negative insurances are forbidden ensure that this type is L.

2. (i) Assume now that every type strictly prefers its own contract to the con-tract designed for the other type. Each type is thus necessarily on its rationality constraints, else the type which is not constrained would see its benefits cut (profits would increase and participation would be preserved). The single-crossing property helps once more: each type being indifferent between its contract and no insurance, then necessarily type L gets no insurance. This is a contradiction with the assump-tion that each type strictly prefers its contract. (ii) There is a type at least which is indifferent between both contracts. If type H strictly prefers its contract, we know that type H is strictly better off with this contract than without insurance. Therefore, by lowering reimbursement slightly in case of accident, we maintain par-ticipation and type H will not switch to type L’s contract. The aggravation of type H’s contract in terms of first-order stochastic dominance also ensures that type L does not switch.

(19)

A.2 Proof of Theorem 1

We need the following lemma.

Lemma 2 πL(·) and πH(·) are C1(derivatives exist and are continuous) and π0L(·) >

0 and π0

H(·) < 0.

Proof. Equations in (15) are linear with respect to u(W1(t))and u(W2(t)).Since

qL 6= qH and u is strictly monotonic and continuous, the solution exists, is unique

and has a continuous derivative with respect to t. Given that

πL(t) = (1− pL) (W0− W1(t)) + pL(W0− d − W2(t)), (19)

it follows that

π0_L(t) =_{−(1 − p}L)W10(t)− pLW20(t). (20)

Diﬀerentiating the first equation in (15) leads to

W10(t) =−

qLu0(W2(t))

(1_{− q}L) u0(W1(t))

W20(t), (21)

which we plug in the second equation (after diﬀerentiation) to get W0

2(t) > 0 and

thus W0

1(t) < 0. By a simple calculation, from (15) and (20), we obtain:

sgn {π0L(t)} = sgn ½ W₂0(t)[u 0_(W 2(t)) u0_(W₁_(t))− 1 θL ] ¾ . (22)

For the chosen parameterization, CL(t)is always on or below I(θL)(under-coverage).

This gives the sign of the right-hand side, and the result: π0

L(·) > 0 for all t ∈ (0, 1)

and is continuously diﬀerentiable with respect to t; moreover, π0

L(1) = (>)0 if and

only if θL≤ (>)1.

From the parametrization, πH(t) is the value function of the following program:

πH(t) = max (W1,W2)∈C

(1_{− p}H) (W0− W1) + pH(W0− d − W2) (23)

subject to (17).

We rewrite the program as follows:

πH(x, y) = max U1,U2

(20)

subject to

(1_{− q}L) U1+ qLU2 ≤ x, (25)

(1_{− q}H) U1+ qHU2 ≥ y. (26)

After this transformation, the objective function is still concave (u concave ⇒ u−1

convex ⇒ −u−1 _{concave). Constraints are linear and form a convex set. Hence, the}

solution of the program exists and is unique for every x and y. We transform the pro-gram to make it linear in probabilities by allowing random W1 and W2 (the support

is assumed compact but suﬃciently large). Hence the objective becomes concave and the constraints convex. Moreover, the constraint qualification requirement is always satisfied since the objective and the constraints are linearly independent: the solution exists and is necessarily unique for every x and y. It follows that πH(x, y)is

diﬀerentiable with respect to x and y. Replacing x by (1−qL) u(W1(t))+qLu(W2(t))

and y by (1 −qH) u(W1(t)) + qHu(W2(t)), which are both continuously diﬀerentiable

functions of t (see the first part of the proof), yields that πH(t) is continuously

dif-ferentiable. π0

H(t) < 0 comes directly from the fact that πH(t) and πL(t) vary in

opposite directions and we proved that π0

L(t) > 0.

We can now prove our Theorem.

1. Continuity comes from a variety of the Theorem of the Maximum. As-sume that there is a value of λL denoted by ξ where t∗(·) is not continuous,

mean-ing that there is a sequence (ξs)s≥0 converging to ξ (the limit exists and is finite

since t∗₍_{·) takes its values in a compact set), such that lim}

s→+∞t∗(ξs) < t∗(ξ) or

lim_s→+∞t∗_(ξ

s) > t∗(ξ). As the reasoning is essentially the same for the two

diﬀer-ent case, we argue as if the first inequality were fulfilled. We denote the limit by t_∞ < t∗(ξ). Consider now the value of the objective for ξs: ξs πL(t∗(ξs)) + (1−

ξs) πH(t∗(ξs)). The limit of this value when s goes to infinity is necessarily equal to

ξ πL(t∗(ξ)) + (1− ξ) πH(t∗(ξ)) (else, there is a contradiction with the fact that the

objective is a continuous function). Hence, two values (t_∞ and t∗_{(ξ)) maximize the}

objective given ξ. A contradiction with uniqueness. This proves the continuity of t∗₍_·).

3. (3 is proved before 2) Monotonicity. We know from the symmetric information case that if λL= 0, then t∗ = 0, and if λL= 1, then t∗ = 1. Continuity ensures that

(21)

interior solution can be expressed:

λLπ0L(t∗(λL)) + (1− λL) π0H(t∗(λL)) = 0 (27)

Thus, any t ∈]0, 1[ determines entirely the corresponding λL as:

λL 1_{− λ}L =₋π 0 H(t) π0 L(t) > 0 (28)

Consequently, two diﬀerent ratios cannot correspond to the same menu if 0 < t∗ _{< 1.}

This proves that t∗₍_{·) is a strictly increasing function wherever its value diﬀers from}

0 or 1.

2. Corner Solution. From (28), we see that ξ₀ such that ξ0

1−ξ0 =−π

0

H(0)/π0L(0)

has the announced properties. The oﬀer t∗ = 1 will be made for λL ≥ ξ1 such that 1−ξ1

ξ₁ = −π0L(1)/π0H(1). This and (22) imply that ξ1 < 1 if θL > 1 and ξ1 = 1 if

θL ≤ 1. Remark also that the monotonicity of t∗ proved in the previous paragraph

proves that ξ₁ > ξ₀.

A.3 Proof of Proposition 3

The intersection between the reservation indiﬀerence curve of type L, and the opti-mal insurance set I(θH)is located in the parameterized domain {CL(t) : t∈ (0, 1)}

since θL > θH. Therefore, this point corresponds to a certain t ∈ (0, 1), denoted

by t0_{. We define ξ}0 _{such that} ξ0

1−ξ0 ≡ −π0H(t0)/π0L(t0). The order between ξ0 and ξ1

comes from the monotonicity of the ratio of marginal profits (Theorem 1).

If λL < ξ0, the situation is similar to that of Proposition 2, which explains 1. If

λL > ξ0, the insurer is constrained to pool the two types. Indeed, the oﬀer made

to type L, say CL, gives more coverage than desired (at the optimum) by type H.

As a consequence of Lemma 1, the oﬀer to type H, say CH, is equivalent to CL

for type L. If CH covers more than CL, then the insurer could increase its profit

by oﬀering only CL (CL over-covers type H, but less). If CH covers less than CL,

then the single-crossing property ensures that type L prefers CH to CL,which is not

incentive compatible. Types are therefore necessarily pooled. The rest of the comparative statics derives from Theorem 1.

(22)

References

[1] Chiappori, P.A., Salanié, B.: Testing for asymmetric information in insurance markets. Journal of Political Economy 108, 56-78 (2000)

[2] Jaﬀray, J.Y.: Généralisation du critère de l’utilité espérée aux choix dans l’incertain régulier. Recherche Opérationelle 23, 237-267 (1989)

[3] Jeleva, M.: Demand for insurance with imprecise probabilities. Finance 18, 101-114 (1997)

[4] Landsberger, M., Meilijson, I.: Monopoly insurance under adverse selection when agents diﬀer in risk aversion. Journal of Economic Theory 63, 392-407 (1994)

[5] Landsberger, M., Meilijson, I.: A general model of insurance under adverse selection. Economic Theory 14, 331-352 (1999)

[6] Quiggin, J.: A theory of anticipated utility. Journal of Economic Behavior and Organization 3, 324-343 (1982)

[7] Rothschild, M., Stiglitz, J.: Equilibrium in competitive insurance markets: an essay on the economics of imperfect information. Quarterly Journal of Eco-nomics 90, 629-649 (1976)

[8] Savage, L.: The foundation of statistics. New York: J. Wiley 1954

[9] Schmeidler, D.: Subjective probability and expected utility without additivity. Econometrica 57, 571-587 (1989)

[10] Stiglitz, J.: Monopoly, Non-linear pricing and imperfect information: the in-surance market. Review of Economic Studies 44, 407-30 (1977)

[11] Villeneuve, B.: The consequences for a monopolistic insurance firm of eval-uating risk better than customers: the adverse selection hypothesis reversed. Geneva Papers on Risk and Insurance Theory 25, 65-79 (2000)

(23)

W₂ W₁ C₀ I(θ), θ < 1θ), θ < 1 Figure 1 I(θ), θ ≥≥ 11θ), θ C (θ < 1)θ < 1) C(θ θ ≥≥ 1)1) W₂ W₁ C₀ I_H I_L θθ_L<θθ_H< 1 t = 0 t = 1 L H C_L(t) C_H(t) π=0

(24)

Figure 3 W₂ W₁ L H C₀ C_H C_L θθH ≥≥ θθL≥≥ 1 or θθL ≥≥ θθH ≥≥ 1 I_H= I_L I_L I_H L H t = 1 C_L(0 < t < 1) C_H

Profit increasing direction (per contract)

Figure 4

t = 0 θθL< θθH< 1

(25)

I_L

I_H

L

H

t = 1

Profit increasing direction (per contract) t’(corresp. to ξ’)

Pooling contract

Figure 5

t = 0 11> θθL> θθH

Insurance contracts with imprecise probabilities and adverse selection