Social Insurance, Information Revelation, and Lack of Commitmentâ
Transcription
Social Insurance, Information Revelation, and Lack of Commitmentâ
Social Insurance, Information Revelation, and Lack of Commitment Mikhail Golosov Princeton Luigi Iovino Bocconi March 2015 Abstract We study the optimal provision of insurance against unobservable idiosyncratic shocks in a setting in which a benevolent government cannot commit. A continuum of agents and the government play an in…nitely repeated game. Actions of the government are constrained only by the threat of reverting to the worst perfect Bayesian equilibrium (PBE). We construct a recursive problem that characterizes the resource allocation and information revelation on the Pareto frontier of the set of PBE. We prove a version of the Revelation Principle and …nd an upper bound on the maximum number of messages that are needed to achieve the optimal allocation. Agents play mixed strategies over that message set to limit the amount of information transmitted to the government. The central feature of the optimal contract is that agents who enter the period with low implicitly-promised lifetime utilities reveal no information to the government and receive no insurance against current period shock, while agents with high promised utilities reveal precise information about their current shock and receive insurance as in economies with full commitment by the government. We thank Mark Aguiar, Fernando Alvarez, V.V. Chari, Hugo Hopenhayn, Ramon Marimon, Stephen Morris, Chris Sleet, Pierre Yared, Sevin Yeltekin, Ariel Zetlin-Jones for invaluable suggestions and all the participants at the seminars in Bocconi, Carnegie Mellon, EIEF, Georgetown, Minnesota, Norwegian Business School, Paris School of Economics, Princeton, University of Lausanne, University of Vienna, Washington University, the SED 2013, the SITE 2013, the ESSET 2013 meeting in Gerzensee, 12th Hydra Workshop on Dynamic Macroeconomics and 2014 Econometric Society meeting in Minneapolis. Golosov thanks the NSF for support and the EIEF for hospitality. We thank Sergii Kiiashko and Pedro Olea for excellent research assistance. 1 1 Introduction The major insight of the normative public …nance literature is that there are substantial bene…ts from using past and present information about individuals to provide them with insurance against shocks and incentives to work. A common assumption of the normative literature is that the government is a benevolent social planner with perfect ability to commit. The more information such a planner has, the more e¢ ciently she can allocate resources.1 The political economy literature has long emphasized that such commitment may be dif…cult to achieve in practice.2 Self-interested politicians and voters –whom we would broadly refer to as “the government” –are tempted to re-optimize over time and choose new policies. When the government cannot commit, the bene…ts of providing more information to the government are less clear. Better informed governments may allocate resources more e¢ ciently as in the conventional normative analysis but may also be more tempted to depart from the ex-ante desirable policies. The analysis of such environments is di¢ cult because the main analytical tool to study private information economies –the Revelation Principle –fails when the decision maker cannot commit. In this paper we study optimal information revelation and resource allocation in a simple model of social insurance – the unobservable taste shock environment of Atkeson and Lucas (1992). This environment, together with closely related models of Green (1987), Thomas and Worrall (1990), Phelan and Townsend (1991), provides theoretical foundation for a lot of recent work in macro and public …nance. This set up and its extensions was used to study design of unemployment and disability insurance (Hopenhayn and Nicolini (1997), Golosov and Tsyvinski (2006)), life cycle taxation (Farhi and Werning (2013), Golosov et al. (2011)), human capital policies (Stantcheva (2014)), …rm dynamics (Clementi and Hopenhayn (2006)), military con‡ict (Yared (2010)), international borrowing and lending (Dovis (2009)). In the key departure from that literature we assume that resources are allocated by the government, which, although benevolent, lacks commitment. We study how information revelation a¤ects incentives of the government to provide insurance in such settings and characterize the prop1 The seminal work of Mirrlees (1971) started a large literature in public …nance on taxation, redistribution and social insurance in presence of private information about individuals’types. Well known work of Akerlof (1978) on “tagging” is another early example of how a benevolent government can use information about individuals to impove e¢ ciency. For the surveys of the recent literature on social insurance and private informaiton see Golosov et al. (2006) and Kocherlakota (2010). 2 There is a vast literature in political economy that studies frictions that policymakers face. For our purposes, work of Acemoglu (2003) and Besley and Coate (1998) is particularly relevant who argue that ine¢ ciencies in a large class of politico-economic models can be traced back to the lack of commitment. Kydland and Prescott (1977) is the seminal contribution that was the …rst to analyze policy choices when the policymaker cannot commit. 1 erties of the optimal contract. Formally, we study a repeated game between the government and a continuum of atomless citizens in the spirit of Chari and Kehoe (1990) and focus on the perfect Bayesian equilibria (PBE) that are not Pareto-dominated by other PBE. The economy is endowed with a constant amount of perishable good in each period and the government allocates that endowment among agents. Agents receive privately observable taste shocks that follow a Markov process, which is assumed to be iid in the baseline version of our model. Agents transmit information about their shocks to the government by sending messages. The government uses these messages to form posterior beliefs about realization of agents’types and allocate resources. The main friction is that ex-post, upon learning information about the agent’s type, the government has the temptation to allocate resources di¤erently from what is required ex-ante to provide incentives to reveal information. The more precise information the government has about the agents’types, the higher its payo¤ from deviation is. Our paper makes three contributions. First, we construct a recursive formulation of our problem. The key di¢ culty that we need to overcome is that the value of the worst equilibrium depends on the implicit promises made to all agents. The standard recursive techniques, that characterize optimal insurance for each history of past shocks in isolation from other histories, do not apply directly since information that any agent reveals a¤ects the government’s incentives to renege on the promises made to other agents. We make progress by showing how to construct an upper bound for the value of deviation which coincides with that value if all agents play the best PBE and takes weakly higher value for any other strategy. This upper bound can be represented as a history-by-history integral of functions that depends only on the current reporting strategy of a given agent, and thus can be represented recursively. Since the original best PBE strategies still satisfy all the constraints of this modi…ed, tighter problem, the solution to this recursive problem allows us to uncover the best PBE. The resulting recursive problem is very simple. It is essentially a standard problem familiar from the recursive contract literature with two modi…cations: agents are allowed to choose mixed rather than pure strategies over their reports and there is an extra term in planner’s objective function that captures “temptation” costs of revealing more information. Our second contribution is to show a version of the Revelation Principle for our settings. It is well known that standard direct revelation mechanisms, in which agents report their types truthfully to the government, are not e¢ cient in this type of settings. Bester and Strausz (2001) showed that, in a repeated game between one principle and one agent whose type can take N values, the message space can be restricted to only N messages over which agents play 2 mixed reporting strategies. These techniques do not apply to economies with more than one agent.3 We show that in our baseline version of the model with continuum of agents and iid shocks it is without loss of generality to restrict attention to message spaces with at most 2N 1 messages. This result follows from the fact that with iid shocks our recursive characterization satis…es single-crossing property, and therefore all messages that give any type n the highest utility can be partitioned in three regions: those that also give the highest utility to types n + 1 and n 2N 1 and those that give the highest utility to only type n: There can be at most 1 of such partitions and convexity of our problem implies that it is su¢ cient to have one message per each partition. Our third contribution is the characterization of the properties of the optimal contract and the e¢ cient information revelation. As in the full commitment case of Atkeson and Lucas (1992), it is optimal to allocate resources to agents with temporality high taste shocks by implicitly promising to increase future lifetime utility of agents with temporarily low taste shocks. As agents experience di¤erent histories of shocks, there is a distribution of promised utilities at any given time. We show that it is e¢ cient for agents who enter the period with di¤erent promised utilities to reveal di¤erent amounts of information. In particular, we show that under quite general conditions agents who enter the period with low promised utilities reveal no information about their idiosyncratic shock in that period and receive no insurance. In contrast, under some additional assumption on the utility function and the distribution of shocks, agents who enter the period with high promised utilities reveal their private information fully to the government. The optimal insurance contract for such agents closely resembles the contract when the government can commit. The intuition for this result can be seen from comparing costs and bene…ts of revealing information to the government. The costs are driven by the temptation to deviate from the exante optimal plan and re-optimize. When the government deviates from the best equilibrium, it reneges on all the past implicit promises and allocates consumption to each agent based on its posterior beliefs about agent’s current type. Therefore, the incentives for the government to deviate depend only on the total amount information it has but not on which agents reveal it. On the contrary, the bene…ts of information revelation depend on the e¢ ciency gains from better information on the equilibrium path, which in turn depend on the implicit promises with which an agent enters the period. The general lesson is that the agents for whom better 3 See also Skreta (2006) who extends the analysis to an arbitrary type space and show the optimal selling mechanism in a game between one buyer and one seller. Bester and Strausz (2000) provide an example of an economy with two agents to illustrate that that with more than one agent message spaces with only N elements are restrictive. 3 information leads to the highest e¢ ciency gains on the equilibrium path should send precise signals to the government. In Atkeson and Lucas (1992) multiplicative taste shock environment those are the agents with high promised utility. Participation constraints of the government prevent the emergence of the extreme inequality, known as immiseration, which is a common feature of environments with commitment. The optimal contract exhibits mean reversion, so that agents with low current promised utilities receive in expectation higher future promises, and vice versa. This implies that in an invariant distribution no agent is stuck in the no-insurance region forever, and in a …nite number of steps he reaches a point when he reveals some information about his shock and some insurance is provided. Thus, we show that in the invariant distribution there is generally an endogenous lower bound below which agent’s promised utility never falls. Our paper is related to a relatively small literature on mechanism design without commitment. Roberts (1984) was one of the …rst to explore the implications of lack of commitment for social insurance. He studied a dynamic economy in which types are private information but do not change over time. More recently, Sleet and Yeltekin (2006), Sleet and Yeltekin (2008), Acemoglu et al. (2010), Farhi et al. (2012) all studied versions of dynamic economies with idiosyncratic shocks closely related to our economy but made various assumptions on commitment technology and shock processes to ensure that any information becomes obsolete once the government deviates. In contrast, the focus of our paper is on understanding incentives to reveal information and their interaction with the incentives of the government. Our paper is also related to the literature on the ratchet e¤ect, e.g. Freixas et al. (1985), La¤ont and Tirole (1988). These authors pointed out that without commitment it is di¢ cult to establish which incentive constraints bind, which signi…cantly inhibited further development of this literature. Some papers have thus focused on the analysis of suboptimal but tractable incentive schemes. We show that the problem substantially simpli…es when the principal interacts with a continuum of agents since any agent’s report does not a¤ect the aggregate distribution of the signals received by the principal. In this case we can obtain a tight characterization of the optimal information revelation for high and low values of promised utilities. Our results about e¢ cient information revelation are also related to the insights on optimal monitoring in Aiyagari and Alvarez (1995). In their paper the government has commitment but can also use a costly monitoring technology to verify the agents’reports. They characterize how monitoring probabilities depend on the agents’promised values. Although our environment and theirs are very di¤erent in many respects, we both share the same insight that more information should be revealed by those agents for whom e¢ ciency gains from better information are the 4 highest. Work of Bisin and Rampini (2006) pointed out that in general it might be desirable to hide information from a benevolent government in a two period economy. Finally, our recursive formulation builds on the literature on recursive contracts with private information and a patient social planner. Work of Farhi and Werning (2007) provides recursive tools that are particularly useful in our settings. The rest of the paper is organized as follows. Section 2 describes our baseline environment with iid shocks. Section 3 derives the recursive characterization and shows a version of the Revelation Principle. Section 4 analyzes e¢ cient information revelation and optimal insurance. Section 5 extends our analysis to general Markov shocks. 2 The model The economy is populated by a continuum of agents of total measure 1 and the government. There is an in…nite number of periods, t = 0; 1; 2; ::: The economy is endowed with e units of a perishable good in each period. Agent’s instantaneous utility from consuming ct units of the good in period t is given by tU (ct ) where U : R+ ! R is an increas- ing, strictly concave, continuously di¤erentiable function. The utility function U satis…es Inada conditions limc!0 U 0 (c) = 1 and limc!1 U 0 (c) = 0 and it may be bounded or un- bounded. All agents have a common discount factor v = limc!0 U (c) = (1 . Let v = limc!1 U (c) = (1 ) and ) be the upper and lower bounds on the lifetime expected utility of the agents (v and v may be …nite or in…nite). For our purposes it is more convenient to work with utils rather than consumption units. Let C The taste shock we assume that t t takes values in a …nite set U 1 be the inverse of the utility function. with cardinality j j: For most of the analysis are iid across agents and across time, but we relax this assumption in Section 5. Let ( ) > 0 be the probability of realization of 2 : We assume that 1 < ::: < j j and P normalize ( ) = 1: We use superscript t notation to denote a history of realization 2 of any variable up to period t, e.g. realization of history t t = ( 0 ; :::; t ). Let t t denote the probability of : We assume that types are private information. Each agent is identi…ed with a real number v 2 [v; v] which we soon interpret as a lifetime promised utility. All consumers with the same v are treated symmetrically. The distribution of promises v is denoted by . As it will become clear shortly, the initial distribution gives us a ‡exible way to trace the Pareto frontier of the set of perfect Bayesian equilibria. The government collects reports from agents about their types and allocates consumption subject to the aggregate feasibility in each period. The government is benevolent and its payo¤ is given by the sum of expected lifetime utilities of all agents. 5 2.1 The game between the government and agents The physical structure of our environment corresponds exactly to the model of Atkeson and Lucas (1992). The constrained e¢ cient allocations that they characterize can be achieved by the government which is able to commit in period 0 to in…nite period contracts. We focus on the environment without commitment. We model the interaction between the government and individuals along the lines of the literature on sustainable plans (Chari and Kehoe (1990), Chari and Kehoe (1993)). Before formally de…ning the game we brie‡y outline its structure. The government does not observes agents’types, so it collects information by having the agents submit messages. More speci…cally, at the beginning of any period t the government chooses a message set Mt from some space M: We assume that all the sets in M are endowed with a metric, but we will not refer to this metric explicitly. All agents observe their types and submits messages mt 2 Mt . The government allocates consumption to agents as a function of current and past reports. To capture the main trade-o¤ in information revelation we assume that the government cannot commit even within a period: it can pick any feasible allocation of resources after collecting agents’reports. This allows for a simple and transparent analysis in our benchmark case of iid shocks. As we discuss in Section 5, much of the analysis carries through to the case when shocks are persistent.4 We now formally de…ne our game. We start with information sets. Let S t summary of information available to the government at the end of period t set recursively with S 1 = : St 1 is the space of public histories 1 be the 1: We de…ne this St 1: Each period has three stages. In the …rst stage the government chooses a message set Mt : S t 1 ! M. In the second stage agents send reports about their types. Each agent i observes the realization of his type in period t; i;t ; and of a payo¤ irrelevant random variable zi;t uniformly distributed on Z = [0; 1]. Each agent chooses a reporting strategy Mt as a function of the aggregate history realizations hit Realizations of hti 1 1 = vi ; mit t i 1 ; zit 1 S t 1 ; Mt i;t over ; his past history of reports and sunspot ; current realization zi;t ; and the history of his shocks are privately observed and we call them private history. Realizations of are privately observed and we call them personal history and denote the space of hti by Hit 1 t i: vi ; mti : To economize on notation, de…ne hti 1 1 ; zit and let Hit be the space of all 4 A small recent literature on social insurance without commitment sidesteps the possibility that the government may misuse information when it deviates from equilibrium strategies. For example, Acemoglu et al. (2010) allow agents to stop interacting with the government if it deviates, while Sleet and Yeltekin (2008), Farhi et al. (2012) assume that shocks are iid but there is commitment within the period. All these environment are constructed so that the information about individuals becomes obsolete if the government deviates. 6 hti : Formally, the reporting strategy is de…ned as : St i;t 1 Hit M t i ! (Mt ) where (Mt ) is a set of all Borel probability measures on Mt . We assume that all agents i with the same history choose the same strategy i;t ; so that we can drop the subscript i from the histories, and that the law of large numbers holds so that strategies i;t generate a distribution of reports ~ t over Mt : Finally, in the last stage of the period the government chooses utility allocations ut for each personal history ht as a function of S t = S t space Mt , the distribution or reports ~ t , the function ut ; and history at the beginning of the next period, S t : Let t and and t G;t be the spaces of feasible to denote the whole in…nite sequence of t and fxt g1 t=0 : G;t G;t St 1 1; M ; t ~ t : The message constitute the aggregate be the government strategy in period respectively. We use boldface letters x Note that the public history contains the realized distribution of reports ~ t . The latter implies that only the agents’ deviations that a¤ect ~ t will be detected and punished by the government and the other agents. In particular, since agents are atomistic, a deviation by a single agent has no consequences for ~ t and, hence, will not be detectable by the other players. For our purposes it is useful to keep track of two objects, the distribution of the agents’private histories and that of personal histories. For any S t = S t t as the measure of agents with histories ht ; Borel set At of [v; v] where At 1 Mt t set A of t is a Borel set of H t some Borel set of Mt Let Zt At jS t = t t t (h jS ) H t as Z Z At t 1 : It is de…ned recursively. Let 1 jS 1 Bm 1 = : Any Bz B and Bm ; Bz ; B are the mt -, z-, and -sections of : The measure dat t t t t (h ; jS ) ~ t ; ut de…ne can be represented as a product At = At 1 t 1 1; M ; t t 1 t ht ; t jS t over space H t dz (d ) t dmjS t 1 ; Mt ; at t is de…ned as 1 ; z; : be the measure of agents with personal histories ht ; de…ned for each Borel t AjS t = Z t t A t jS t : (1) We require the government’s strategy to be feasible, so that it satis…es for any S t = St 1; u t Z C ut ; St d Ht t Citizens’strategies induce posterior beliefs pt over ht and St jS t e: t : These beliefs are de…ned for all and satisfy Bayes’rule, i.e. that for any Borel set A of H t and any t ; Z t t pt t jS t ; h t dhjS t = t A jS whenever t AjS t > 0: A 7 (2) (3) Finally, we de…ne continuation payo¤s for agents and the government. Any pair ( ; generates a stochastic process for utility allocation u: We use E( pectation of the random variable xt : St Ht measures generated by strategies ( ; G) ; t and E( St ; ; G) G) xt to denote the ex- ! R with respect to the probability G) Ht [xt jA] to denote the conditional ex- t: pectation given some Borel subset A of The lifetime utility of agent v is P1 t then E( ; G ) t ut jv : Since the government is benevolent and utilitarian, its payo¤ t=0 R P1 t P1 t is E( ; G ) E( ; G ) (dv) : t ut = t ut jv t=0 t=0 De…nition 1 A Perfect Bayesian Equilibrium (PBE) is a strategy pro…le ( ; G) and a belief system p such that conditions (2) and (3) are satis…ed, and both agents and the government choose and G as their best responses, i.e. Agents’ best response: for every S T 1 ; MT ; hT ; T ; " 1 # X t T T 1 T T E( ; ) ; MT ; h ; E(( 0 ; t ut S i G i ); G t=T 0 i for all 2 E( ; G ) 1 X t T t ut S T 1 T ; MT ; h ; t=T (4) and i. n Government’s best response: for every S~T 2 S T " ) " 1 X t T t ut S~T t=T # Z E( ; 0 G ) " 1; ST 1 X o t T ; t ut S~T t=T # for all 0 G 2 G: (5) We focus on equilibria that maximize the payo¤ of the utilitarian government subject to delivering lifetime utility of at least v to an agent that belongs to family v: We call it a best equilibrium and de…ne it as follows De…nition 2 A triple ( ; G; p E( ; ) is a best equilibrium if it is a PBE such that " 1 # X t v for all v; t ut v G) (6) t=0 and there is no other PBE ( 0 ; government, E( 0; 0 G ) 0 ; p0 ) G "1 X t that satis…es (6) and delivers a higher payo¤ to the t ut t=0 Throughout the paper we assume that assume that # > E( ; G ) " 1 X t=0 t t ut # : is such that a best equilibrium exists. We further is such that (6) holds with equality in a best equilibrium. This assumption 8 T # simpli…es the recursive formulation since all the “promise keeping” constraints can be de…ned as equalities in that case. Without this assumption our period 0 recursive formulation would have to be de…ned as an inequality, which would make notation more bulky but not a¤ect any of our results. 3 The recursive problem and the Revelation principle In this section we discuss two intermediate results that are necessary for our analysis of optimal information revelation. First, we derive the recursive problem solution which characterizes the optimal reporting strategy and consumption allocation in best PBEs. Recursive techniques are the standard tool to solve dynamic contracting problems, which essentially allow to solve for optimal insurance after any given history of shocks separately from the other histories. In our game-theoretic settings the standard approach to constructing Bellman equations is not applicable: the government’s incentive to deviate from a best equilibrium depends on the reporting strategies of all agents making the usual history-by-history separation of incentives is impossible. To overcome this di¢ culty, we consider a constrained problem in which we replace the function that characterizes the value of deviation for the government with another function which is linear in the distribution of the past histories of reports. We show that this function can be chosen such that a solution to the original problem is also a solution to the constrained problem. The advantage is that the linearity of the constrained problem in the distribution of past reports allows us to solve for the optimal contract recursively, history-by-history. The key insight for this construction is that the Lagrange multiplier on resource constraint in the best one shot deviation from best equilibrium strategies summarizes all relevant information. The second result of this section characterizes the minimum dimension of the message space M that is needed to support best PBEs. Reduction of an arbitrarily message space M to a smaller dimension object is important for tractability of the maximization problem. This step is similar in spirit to invoking the Revelation Principle in standard mechanism design problems with commitment, and we refer to this result as the (generalized) Revelation principle. Although these steps are crucial for our ultimate goal – characterization of e¢ cient information revelation – these tools are of interest in themselves. A lot of environments without commitment share the feature that value of deviation for the “principal”depends on actions of many agents, and the recursive techniques developed in our settings should also be applicable to such environments. To the best of our knowledge, there is no version of the Revelation Principle for economies with multiple agents when the principal lacks commitment. A well known paper by Bester and Strausz (2001) developed a version the Revelation Principle for an 9 economy with one agent, but their approach and the main result do not extend to multi-agent settings. 3.1 The recursive problem We start by describing a PBE that delivers the lowest payo¤ to the government, to which we refer as the worst PBE. It is easy to show that there are worst equilibria in which no information is revealed.5 To see this, let m? (M ) be an arbitrary message from a set M 2 M. Consider a strategy pro…le ( w; w) G in which all agents report message m? (M ) with probability 1 for all M , and the government allocates the same utility uw t =C The beliefs pw Lemma 1 ( that satisfy (3) are given by w; w ; pw ) G pw t t jht ; S t 1 (e) = t for any message m it receives. t for all ht ; S t : is a worst PBE. Proof. The best response of the government to agents’reporting strategy w is to allocate the same utility to all agents since its beliefs pw do not depend on m and C is strictly convex. Given this allocation rule, playing w is optimal for agents, therefore ( that delivers the government payo¤ U (e) = (1 ) : Strategy w G w; w; w ; pw ) G is a PBE is feasible for any reporting strategy that agents choose, so the government can attain payo¤ U (e) = (1 which proves that ( w ; pw ) G ) in any PBE, is a worst PBE for the government. Standard arguments establish that best equilibria are supported to reverting to the worst PBE following any deviation of the government from its equilibrium strategies. We split our problem in two parts. First, we …x any sequence of message sets M and describe the maximization problem that characterizes the best PBE given that sequence of message sets, which we call the best equilibria given M: We then show that the same payo¤ can be achieved with a much simpler message set that has only a …nite number of messages. That set is independent of M and hence without loss of generality it can be chosen as the message set in the best PBE. We start with several preliminary observations that simplify the notation. Since there is no aggregate uncertainty the aggregate distribution of reports and allocations is a deterministic sequence. Therefore, it is redundant to keep track of the aggregate history S t in best equilibria and we drop this explicit dependence from our notation. Moreover, without loss of generality we can restrict attention to reporting strategies 5 t that depend on ht 1; zt and t; but not In fact, a stronger statement is true that in any worst equilibrium no information is revealed. This statement follows from our Lemma 2. 10 on the past history of shocks since as long as ht ; t t t 1 6 : This results holds more generally with Markov shocks, t 1 follows a Markov process, is payo¤ irrelevant and agents with the same can replicate each other strategies for all past t 1 : This implies that all information about agents at the beginning of period t can be summarized by the distribution of reports t 1: We now turn to setting up the maximization problem that characterizes best equilibria given M: Since any deviation by the government from a best equilibrium triggers a switch to the worst PBE, it is su¢ cient to consider only the best one-shot deviation from equilibrium strategies by the government. The government that observes the distribution of personal ~ t ( t ) de…ned as histories t can achieve the maximum payo¤ W ~ t ( t) = W subject to Z sup Ht fu(h)gh2H t Z E C (u (h)) d t [ jh] u (h) d e: t Ht t (7) (8) Therefore, any best equilibrium given M is a solution to 1 X max E u; t t ut (9) t=0 subject to (2), E 1 X s t ~ t ( t) + W s us s=t E 1 X t t ut E 0 t=0 and 1 X U (e) for all t; 1 t t ut for all 0 (10) (11) t=0 E " 1 X t t ut t=0 # v = v: (12) The main di¢ culty in the analysis of problem (9) is that the sustainability constraint (10) ~ t ; which is a non-linear function of the past reports embedded in the probability depends on W ~ t with a function that (i) is linear in t 1 ; distribution t 1 : To make progress, we replaced W ~ t for all (ii) is weakly greater than W t; ~ t at a best equilibrium’s distribution of and (iii) equals W reports. This de…nes a more constrained maximization problem for which the best equilibrium In the Supplementary Material we show that for any PBE ( 0 ; delivers the same payo¤ to all agents and the government as ( 0 ; ( 0 ; 0G ; p0 )) and has the property that t does not depend on t 6 11 0 0 G; p ) 0 0 G; p ) 1 : there is another PBE ( ; G ; p) which (i.e. ( ; G ; p) is payo¤ -equivalent to is still a solution. The linearity of the constraining function in t 1 allows us to write the constrained problem recursively. Let (u ; by : Let ) be a solution to (9) and w t be the distribution of personal histories induced 0 be a Lagrange multiplier in problem (7) given t: For any mapping : (Mt ) de…ne Wt ( ) = max fu(m)gm2M t Z w t C ( u (m) Mt w t e: (u (m))) (dmj ) (d ) + E [ jm]. Function Wt plays an important role in our analysis: We show that R (13) w 0 w t C (u We use fuw (m)gm2Mt to denote the solution to (13), which is given by equation ! Wt ( t ) dzt d (m)) = t 1 is an upper bound for Wt ( t ) that satis…es the three properties needed for recursive characterization. Before formally proving this result, it is useful to introduce some de…nitions. We say that P ( ) = 1 a.s., i.e. for almost every reporting strategies are uninformative if E [ jm] = message m with respect to the measure generated by . Uninformative reporting strategies reveal no additional information other than the unconditional expectation of : All other strategies are called informative. We also need to generalize the notion of the envelope theorem for Wt : For any and 0 de…ne a derivative of Wt ( ) in the direction @Wt ( ) @ 0 Wt ((1 lim 0) ) + Wt ( ) #0 0 as : With these de…nitions we can state the following lemma, which is important for all subsequent analysis. w t Lemma 2 The multiplier and, therefore, uw is uniformly bounded away from 0 and belongs to a compact set (m) belongs to a compact set in the interior of [(1 ) v; (1 ) v] for all : The family of fWt gt and all the families of its directional derivaties are uniformly bounded. Function Wt is well de…ned, convex, and is minimized if and only if For any t; t 1 ~ t ( t) W with equality if t; t 1 = Z Wt t Ht t; t 1 : 0 is well-de…ned The derivative @Wt ( ) =@ Z @Wt ( ) = ( uw (m) @ 0 Mt w t C jht ; for all (uw (m))) 12 dzd 0 is uninformative. (14) t 1 and satis…es 0 (dmj ) (dmj ) (d ) : (15) Consider now a modi…ed maximization problem (9) in which we replace (10) with Z 1 X s t Wt t jht 1 ; dzd t 1 + E U (e) : s us 1 t H s=t The constraint set is tighter in the modi…ed problem, but (u ; straints, therefore (u ; (16) ) still satis…es all the con- ) is a solution to the modi…ed problem. One can then use standard techniques (see, e.g. Marcet and Marimon (2009) and Sleet and Yeltekin (2008)) and use Lagrangian multipliers to re-write it as L = max E u; 1 X t [ t ut tC (ut ) t Wt ] (17) t=0 1 subject to (11) and (12) for some non-negative sequences t ; t ; t t=0 , with a property that ^ ; with strictly inequality if and only if constraint (16) binds in period t:7 We t t= t 1 interpret ^ as the e¤ective discount factor of the government in the best equilibrium. It is t greater than agents’discount factor because the government also needs to take into account how its actions a¤ects the sustainability constraints in the future. This is a general result about the role of the lack of commitment on discounting that was highlighted, for example, by Sleet and Yeltekin (2008) and Farhi et al. (2012). The maximization problem (17) shows costs and bene…ts of information revelation. The more information the agents reveal, the easier it is for the planner to maximize the costweighted utility function of the agents, t ut tC (ut ) : At the same time, better information also increases the temptation to deviate, captured by the term t Wt : This trade-o¤ is at the heart of our characterization in Section 4. We make two assumptions on the properties of these Lagrange multipliers which we maintain throughout the analysis, lim sup t > 0 and lim inf t > 0: These conditions simplify technical arguments and in the Supplementary material we provide su¢ cient conditions for C that ensure that these assumptions are satis…ed. Since the Langrange multipliers should satisfy these properties much more generally – for example whenever the economy converges to an invariant distribution and the long-run immiseration is a feature of the optimal contract with full commitment –we opted to make assumptions on the multipliers directly.8 An important simplifying feature of the modi…ed maximization problem and the Lagrangian (17) is that the distribution of past reports t 1 enters the objective function and the con- 7 This representation is possible since allowing the government and agents to condition their strategies on z convexi…es the maximization problem (9). The formal arguments are straightforward but cumbersome, we present their sketch in the Supplementary Material. 8 Immiseration is a common feature of long-term contracts with commitment. See Ljunqvist and Sargent (2004) for a textbook discussion and Phelan (2006) and Hosseini et al. (2013) for recent contributions to the literature. We discuss existence of the invariant distribution in Section 5.1. 13 straints linearly. This allows us to solve for the optimal allocations and reporting strategies separately for each history ht 1: We do so by extending the recursive techniques developed by Farhi and Werning (2007) who study an economy with commitment but in which the principal is more patient than agents. Let k0 (v) be the value of the objective function (17) for family v; de…ned as " 1 # X 1 k0 (v) = max E t ( t ut t C (ut ) t Wt ) v 0 u; subject to (11) and (12). The Lagrangian de…ned in (17) satis…es L = Similarly we can de…ne9 kt (v) = 1 max E t u; (18) t=0 " 1 X s us t+s t+s C (us ) t+s Ws s=0 R v 0 k0 (v) # (dv) : (19) subject to (11) and (12). The next proposition shows the relationship between kt (v) and kt+1 (v) : Proposition 1 The value function kt (v) is continuous, concave, di¤ erentiable with limv!v kt0 (v) = 1: If utility is unbounded below, then limv! 0 1 kt (v) = 1; if utility is bounded below, then limv!v kt0 (v) = 1. The value function kt (v) satis…es the Bellman equation h ^ kt (v) = max E u t C (u) + t+1 kt+1 (w) fu(m;z);w(m;z); ( jz; )g(m;z)2M t ( jz; )2 (Mt ) Z t Wt i (20) subject to h i E ^u + wj^; z v = E [ u + w] h i E 0 ^u + wj^; z for all z; ^; (21) 0 (22) Without loss of generality, the optimum is achieved by randomization between at most two points, i.e. one can choose z 2 [0; 1] and set (u ( ; z 0 ) ; w ( ; z 0 ) ; ( jz 0 ; )) = (u ( ; z 00 ) ; w ( ; z 00 ) ; ( jz 00 ; )) for all z 0 ; z 00 z and all z 0 ; z 00 > z: Most of the proof of this proposition follows the arguments of Farhi and Werning (2007) and is provided in the Supplementary materials. The solution to this Bellman equation is attained by some functions uv;t (m; z) ; wv;t (m; z) ; and v;t (mjz; ). In what follows we will drop explicit dependence on t whenever this will not cause confusion. We can use these policy functions to generate recursively ut and t: Standard arguments establish when these (u; ) are a solution to the original problem (9). Variables (u; ) = fut ; t g1 t=0 are de…ned over slightly di¤erent spaces in problems (18) and (19). The 1 sequence of message sets is fMt g1 t=0 in (18), while it is fMs+t gt=0 in (19): 9 14 Proposition 2 If (u; ) obtains a maximum to (18), then it is generated by (uv ; wv ; (u; ) generated by (uv ; wv ; v) v) : If is such that t lim E t!1 vt = 0 (23) then it is a solution to (18). Condition (23) can be veri…ed in the same way as in Farhi and Werning (2007). 3.2 The Revelation Principle The recursive problem (20) is similar to the usual recursive dynamic contract formulation with commitment with two modi…cations. First, agents play a mixed reporting strategy over set M rather than a pure reporting strategy over set t Wt : Second, there is an additional term that captures the cost of information revelation. In this section we further simplify the problem by showing that incentive constraints (22) impose a lot of structure on agents’ reporting strategies and simplify the message space that needs to be considered in best PBEs. Since incentive constraints (22) hold for each z; we drop in this section the explicit dependence of strategies on the payo¤ irrelevant variable, and our arguments should be understood to apply for any z: Let fu (m) ; w (m)gm2Mt be any allocation that satis…es (22). Let Mt ( ) be the subset of messages that give type the highest utility, i.e. u m0 + w m0 u (m) + w (m) for all m 2 Mt ( ) ; m0 2 Mt : Sum this inequality with an analogous condition for type 0 0 u (m) u m0 Mt 0 to get for all ; 0 ; m 2 Mt ( ) ; m0 2 Mt 0 : (24) One immediate implication of this equation is that any incentive compatible allocation is monotone in types, i.e. > 0 implies u (m) u (m0 ) and w (m) w (m0 ) : It follows that each message set Mt ( i ) can be partitioned into three regions: the messages that also belong to Mt ( i+1 ) ; the messages that also belong to Mt ( i 1 ), and the messages that do not belong to any other Mt ( j ) : The next proposition shows that it is without loss of generality to assume that agents send only one message per each partition. Since there can be at most 2j j 1 of such partitions, it also gives an upper bound on the cardinality of the message space needed in best PBEs. Proposition 3 Any best PBE is payo¤ equivalent to a PBE in which agents report no more than 2j j 1 messages after any history ht : Furthermore such PBE can be constructed so that if m0 6= m00 then the utility allocation in equilibrium, ut , satis…es ut ht ; m 6= ut ht ; m ^ : 15 We brie‡y sketch the key steps of the arguments. Consider any partition Mt ( i )\Mt ( i+1 ) : Equation (24) implies that (u (m) ; w (m)) must be the same for all m 2 Mt ( i ) \ Mt ( i+1 ) : Therefore, all reporting strategies that keep the same probability of reporting messages m 2 = h i ^ kt+1 (w) . Mt ( i )\Mt ( i+1 ) have the same on-the-equilibrium-path payo¤ E u C (u) + t+1 t Thus, in a best BPE the reporting of messages in Mt ( i ) \ Mt ( i+1 ) must minimize the o¤- the-equilibrium-path payo¤ E [ t Wt ]. Convexity of Wt implies that such reporting strategy keeps posterior beliefs of the government the same for all m 2 Mt ( i ) \ Mt ( without loss of generality to allow only one message for each Mt ( i ) \ Mt ( i+1 ) : Then it is i+1 ) : The reverse arguments allow to reach the same conclusion for the subset of Mt ( i ) that does not intersect with any other Mt ( j ) ; which we denote Mtexcl ( i ) : Conditional on observing any message sent with a positive probability from subset Mtexcl ( i ) ; the government learns that the sender is type i with certainty. Therefore the o¤-the-equilibrium-path payo¤ does not depend on the exact probability with each type i reports messages in Mtexcl ( i ) : Strict convexity of the on-the-equilibrium-path payo¤ implies that the government chooses that same (u (m) ; w (m)) for all m 2 Mtexcl ( i ), which again implies that it is without loss of generality to restrict attention to only one message. Proposition 3 is very useful for the characterization of the optimal incentive provision and information revelation in the next section. It also allows us to prove a version of the Revelation Principle in our economy. Let M be any set of cardinality 2j j 1: Corollary 1 Any best PBE given M is payo¤ equivalent to a best PBE given Mt = M for all t: The incentive constraint (22) for such equilibrium can be written, for each z; as (m2i 1 j 2i 1 ) > 0 and i u (m2i 1 ) (mj i ) [( i u (m2i 1) + w (m2i + w (m2i 1 )) 1) i u (m) + w (m) ; (25) ( i u (m) + w (m))] = 0 for all i = 1; ::; j j; m 2 M : This result shows that in general the cardinality of message space with a continuum of agents is …nite but bigger than the cardinality of the message space with only one agent. In the latter case Bester and Strausz (2001) showed that without loss of generality one can restrict attention to a message space M = : Their result does not extend to economies with multiple agents.10 Corollary 1 shows that even with a continuum of agents the dimensionality of the message space remains small. 10 See Bester and Strausz (2000) for a counterexample with two agents. 16 4 Characterization In this section we characterize properties of the e¢ cient information revelation. We start with a simpli…ed environment that illustrates the main trade-o¤s very transparently. We then extend the insights developed by this example to our general settings. 4.1 A simpli…ed environment We make two simplifying assumptions in this section. First, we assume that the government can deviate from its implicit promises in period t only if it deviates in period 0. This simpli…es the analysis of the best equilibrium as it is su¢ cient to ensure that government’s sustainability constraint (10) holds only in period 0. This case is isomorphic to our Lagrangian formulation with t = 0 for all t > 0: Secondly, we assume that agents’preferences can be represented by a utility function U (c) = ac1=a when 1=a < 1; with U (c) = ln c when 1=a = 0: Under the …rst simplifying assumption the Lagrangian (17) can be written, up to a constant, as ( L = max E u; 1 X t tC (ut ) 0 W0 t=0 ) subject to (11) and (12). First, consider the sub-game starting from t = 1: Since the sustainability constraints do not bind for all t 1; the standard Revelation principle applies and the maximization problem that characterizes the optimal allocations is identical to that in Atkeson and Lucas (1992). It is easy to show that the value function with CRRA preferences takes the form 1 (v) = const jvja when 1=a 6= 0 and For any reporting strategy 0 (v; 2 )= 1 (v) (M0 ) in period 0, let 0 (v; 0C (u) + max fu(m);w(m)gm2M subject to (21) and (22). Function = const exp (v) when 1=a = 0: 0 (v; E [ ) be de…ned as 1 (v)] 0 ) is interpreted as the value function for the govern- ment on the equilibrium path if an agent plays reporting strategy : It inherits the homogeneity property of function 1; namely 0 (v; ) = d ( ) jvja for some constant d ( ) : If no public randomization is allowed, the actual value function 0 (v) = max 0 (v; ) 0 W0 ( 0 (v) is given by ): With public randomization the maximum is taken over the convex hull of the function on the right hand side. Consider …rst the case when j j = 2 and agents can only play two reporting strategies: a full information revelation strategy in ; when each type reports a distinct message with 17 un ; probability 1, and a no information revelation strategy when each type randomizes between all messages with the same probability. Since the former strategy has higher payo¤ both onand o¤- the equilibrium path, d Figure 1 plots 0 (v; ) in 0 W0 ( > d( ) for un ) in ; 2 ization. The key observation is that function un ) only 0 W0 ( un gives higher in and W0 un ; as well as in ; 0 > W0 ( 0 W0 un ) : 0 (v) in with public random- intersects 0( ; un ) once, from below at some point v2 : This shows that the uninformative strategy in value to the government than for all v < v2 and lower value for all v > v2 : Allowing public randomization may further increase the government’s payo¤ and create an intermediate region (region (v1 ; v3 ) on the graph) in which the reporing strategy is stochastic but it does not alter the main conclusion: no information revelation is optimal for low values of v and full information is optimal for high values of v. To understand the intution for this result, it is useful to compare the on the equilibium path bene…ts of information revelation, costs W0 in W0 ( un ) : 0 in ; 0( ; un ) ; to the o¤ the equilibrium path Given our CRRA assumption, it is easy to see that the bene…ts are monotonically increasing in v; equal to zero as v ! v and to in…nity as v ! v: The o¤ the equilibrium path costs of information revelation do not depend on v: Therefore, there must exist a region of low enough v where the costs of better information revelation exceed the bene…ts and a region of high enough v where the opposite conclusion holds. This conclusion does not depend on the fact that we restricted attention to only two reporting strategies. When j j = 2; any the government satis…es d un and in un ) : > d ( ) and W0 ( ) > W0 ( have higher payo¤ than This implies that reporting for low and high values of v respectively. κ(v) strategies in that reveals some but not all information to ⇐ no info ⇒ full info v 1 v Figure 1: Value functions in the simpli…ed example. Solid line: in in un ; dashed: un 0 (v) 0 0 (v) 0W 0W ( 0 ) : When j j > 2; it is still true that W0 ( ) > W0 ( 18 v 2 un ) 0 3 (v) with public randomization, dotted: and therefore no information revelation is optimal for low enough v. However, full information revelation no longer gives strictly higher value than any other reporting strategy on the equilibrium path. The reason for it is that bunching might be optimal even in standard mechanism design problems with commitment. Our result for high values of v extends if the distribution of types is such that no bunching is desirable with commitment (equation (33) provides a su¢ cient condition for that). More generally, it can be shown that some information revelation is desirable for high v, in particular it is optimal for 1 to reveal his type perfectly. Finally, we comment on the role of the assumption of constant relative risk aversion. Our arguments for no information revelation for low values of v only used the fact that un provides the lowest value to the government o¤ the equilibrium path. This is true for any preferences. The arguments for e¢ ciency of full information revelation rely on the fact that 0 (v; 0 v; in ) becomes unboundedly large for high values of v: A su¢ cient condition for this is that the absolute risk aversion goes to zero as consumption goes to in…nity, and our conclusion holds for all preferences satisfying this condition. This example illustrates the general principle that goes beyond the particular taste-shock model that we consider in this paper. Since the o¤ the equilibrium path costs of information revelation do not depend on past promises (the principal simply reneges on them when deviates) but the bene…ts do, it is optimal to reveal less information for those agents for whom the on the equilibrium gains are lowest. The gains from better information are monotone in v in the taste shock economy that we consider in this paper; which allows us to get sharp cut-o¤s, but the general principle holds regardless of this property. 4.2 Characterization of the general case In this section we study e¢ cient information revelation in our baseline model of Section 3. Let (uv ; wv ; v) be a solution to (20). To streamline the exposition, in the body of the paper we focus on interior optimum when the lower bound U (0) does not bind; many arguments are simpler if the lower bound binds. We start with two key optimality conditions. The …rst order conditions for uv and wv can be re-arranged as ^ t+1 E v 0 kt+1 (wv ) jz = kt0 (v) (26) C 0 (uv ) jz : (27) and kt0 (v) = 1 tE v The …rst condition describes the law of motion for promised utilities. Since ^ t+1 19 with strict inequality when the sustainability constraint binds in period t + 1; it shows a version of meanreversion in promised utilities. Since kt and kt+1 are concave functions with interior maxima, it implies that period t + 1 promises have a drift towards the value that maximizes kt+1 : This drift component implies higher future promises for low-v agents and lower promises for high-v agents. This mean-reverting drift helps the government to relax sustainability constraints in future periods. The second condition provides a linkage between promised and current utility allocations. For our purposes it is important to establish bounds for uv and wv : The optimality conditions (see Appendix for details) imply that 1 kt0 (v) tC 1 0 (uv (m; z)) 1 kt0 (v) j j; (28) and % 1 kt0 (v) + 1 ^ t+1 ! 1 0 kt+1 (wv (m; z)) = ^ t+1 ; % for some numbers %; % with a property that % kt0 (v) + % 1 1 = ^ t+1 ! 0 as ^ j j t+1 ! 1 (29) ! 0: Equation (28) shows the role of private information. If agents’ types were observable, the planner would equalize the marginal costs of providing 1 util to type , 1 1 tC 0 (u) ; to the shadow cost of past promises, kt0 (v) ; while allocating the same promised utility w to all types. This would give more consumption to higher types, which is not incentive compatible. With private information the dispersion of current period consumption allocations is smaller, which can be seen by the bounds (28). Equation (29) shows, in addition, how the sustainability constraints in future periods interact with the provision of incentives in the current period. It shows that bounds for future promises 1 0 kt+1 (wv ) shrink as v ! v: When future sustainability constraints bind, the government realizes that more dispersion of inequality tomorrow makes it harder to sustain e¢ cient outcomes, which limits the dispersion of promises that the government gives and prevents the long run immiseration. Conditions similar to (26)-(29) also appear in the problems in which the government can fully commit (see Atkeson and Lucas (1992) and Farhi and Werning (2007)). Our analysis generalizes those conditions to the environments with no commitment. A novel feature of such environments is that complete information revelation is generally suboptimal. We next turn to the characterization of the e¢ cient information revelation. Suppose that it is optimal for type to randomize between messages m0 and m00 . The 20 optimality for the reporting strategy uv m00 ; z t 00 uw v m ;z w t C together with the envelope theorem yields11 uv m00 ; z tC uv m0 ; z = v tC uv m0 ; z 00 uw v m ;z + ^ t+1 kt+1 wv m00 ; z (30) + ^ t+1 kt+1 wv m0 ; z 0 uw v m ;z w t C 0 uw v m ;z : Equation (30) captures bene…ts and costs of information revelation. The left hand side of the equation captures the bene…ts of better information revelation on the equilibrium path. It compares the “cost-weighted” utility that the government obtains for type messages m0 and m00 : This term is the general form of the expression v; 0 if it receives in 0 (v; un ) that appeared in our simpli…ed example in Section 4.1. The right hand side of the equation captures the cost of information revelation o¤ the equilibrium path. If message m00 reveals more information about further from than m0 ; in a sense that the expectation of conditional on m00 is than the expectation conditional on m0 ; this expression is positive. The right hand side of (30) is the analogue of 0 W0 in W0 ( un ) in Section 4.1. Equation (30) develops the key intuition for e¢ cient information revelation. First, consider information revelation for low values of v: Bounds (28) and (29) imply the expression on the left hand side of (30) converges to zero as v ! v; so that the gains from information revelation disappear for low values of v. Therefore, the optimal reporting strategy must converge to a strategy that minimizes the o¤ the equilibrium path cost of information revelation. Such strategy must be uninformative by Lemma 2. Note that the uninformative strategy assigns the same posterior belief for any reported message, so that that right hand side of (30) is zero. In the Appendix we prove a stronger result that the uninformative strategy is optimal for all v su¢ ciently low. Theorem 1 Suppose that the sustainability constraint (10) binds in periods t and t + 1: There exists vt > v such that for all v 2 v; vt strategy v the allocation (uv ; wv ; v) does not depend on z; is uninformative and (uv (m) ; wv (m)) = (uv (m0 ) ; wv (m0 )) for all m; m0 : We now turn to the analysis of information revelation for high values of v: Consider the case when j j = 2: By Proposition 3 we can restrict attention to at most 3 messages, such that reporting two of those messages reveals the type of the sender perfectly. Therefore the left hand side of (30) compares on-the-equilibrium-path bene…ts from a message that reveals full information about a sender and a message that reveals only some of the information. Bounds Formally, we take a derivative of v in the direction 0 ; de…ned as 0 (m0 j ; z) = (m00 j ; z) = 0 and 0 (mj ; z) = v (mj ; z) for all other m; ; and apply (15). 11 0 21 v (m0 j ; z) + v (m00 j ; z) ; (28) and (29) can be used to show that as long as the coe¢ cient of absolute risk aversion, U 00 (c)=U 0 (c) goes to zero for high c; the informational gains from better information must go to in…nity as v ! v: Since the informational costs are bounded by Lemma 2, this implies that equation (30) cannot hold and each agent must reveal his type fully for all v high enough. Formally, suppose that utility satis…es Assumption 1 (decreasing absolute risk aversion) U is twice continuously di¤ erentiable and lim U 00 (c) =U 0 (c) = 0: (31) c!1 We say that messages as type v reveals full information about type i with positive probability. i if no type j 6= i sends the same Theorem 2 Suppose that Assumption 1 is satis…ed and j j = 2: Then there exists vt+ < v vt+ ; such that for all v v reveals full information about each type. Previous arguments can also be extended for the …rst and the last type of arbitrary message spaces. Corollary 2 Suppose that Assumption 1 is satis…ed. 1. (no informational distortions at the top). Then 1 v reveals full information about for all v su¢ ciently high. 2. (no informational distortions at the bottom). Suppose in addition that % j j 1 j j information about j j 1 j j > j j 1 + j j j j 1 j j 2 : Then v 0 and reveals full for all v su¢ ciently high. The …rst part of this Corollary shows that full information for the lowest type 1 is optimal for high v: It is an informational analogue of the “no distortion at the top” result from the mechanism design literature (see, e.g. Mirrlees (1971)). In the mechanism design version of our environment, the incentive constraints bind upward, so that that no other type wants to pretend to be 1: 1 is the “top type”in a sense This property of the top types drives both the “no distortion at the top” result in standard mechanism design models with commitment and part 1 of Corollary 2 in our environment. Part 2 of Corollary 2 shows that informational distortions are suboptimal for the highest j j as long as the types are not too close to each other. The extra condition is needed to 22 rule out bunching. When some types are close, it might be optimal to give them the same allocations even if they reveal their type fully (as, for example, in models with commitment) in order to provide incentives for other types. Since bunching is suboptimal with two types, no additional conditions were necessary in Theorem 2. More insight into the behavior of the interior types can be gained from assuming that the utility function satis…es U (c) = ac1=a for a > 1: In this case it is easy to …nd the limiting allocations towards (uv ; wv ; v) converge as v ! v: In particular, we can use homogeneity of C and boundedness from below of u to show that for any v; kt (v0 v) =v a ! k~t (v) as v0 ! v; where function k~t (v) is de…ned by h i ^ k~t (v) = max E (32) t C (u) + t+1 kt+1 (w) fu(m);w(m); (mj )gm2M ( j )2 (M ) subject to (22) and (21). Moreover, v converges to a solution ~ of this problem. The limiting problem (32) has no cost of information revelation. It is equivalent to a standard mechanism design problem in which agents can send reports over a redundantly large message set M : If it is suboptimal to bunch di¤erent types when agents report over , it is also suboptimal to bunch types when agents report over M : A su¢ cient condition to rule out bunching in mechanism design problems is an assumption that types are not too close, which we state as Assumption 2. It turns out to be a su¢ cient condition for full information revelation for high enough v in our economy without commitment. Assumption 2 The distribution of ( n 1) [ n n 1] satis…es ( n 1 n 2) j j X ( i) 0 for all n > 2: (33) i=n 1 Note that this assumption is satis…ed both in Theorem 1 and Corollary 2. Suboptimality of bunching in the limit implies that information revelation is suboptimal for all su¢ ciently high v: The proof of this result is a tedious but mostly straightforward application of the Theorem of Maximum and we leave it for the Supplementary material. Proposition 4 Suppose that U (c) = ac1=a for a > 1 and Assumption 2 is satis…ed. Then there exists vt+ < v such that for all v vt+ ; v reveals full information about each type. Remember that we take the distribution of initial promises as given. The following lemma then shows that, depending on the initial utility he will be allocated by the government, an agent can remain in the no-information region or in the full-information region for an arbitrary long time. 23 Lemma 3 (i) Assume utility is bounded below by 0. For any T > 0, there exists v0 that vt 1 ;t 0 such ( jz; ) is uninformative for all (z; ) for t = 1; :::; T . Suppose that the sustainability constraint (10) binds in periods t = 0; :::; T + 1, then v0 can be chosen strictly greater than 0. (ii) Suppose the assumptions of Theorem 1 (Corollary 2) are satis…ed, then for any T > 0 there exists v0 > v0+ such that and 5 j j) vt 1 ;t ( jz; ) reveals full information about each type (types 1 with positive probability for t = 1; :::; T . Extensions We discussion two extensions of the baseline analysis. 5.1 Invariant distribution In the previous sections we used arbitrary distributions of initial lifetime utilities : In this section we brie‡y discuss two implications for our analysis if is an invariant distribution. ^ ^ In this case multipliers ; and do not depend on t, with > ; > 0 needed to prevent immiseration.12 One implication of the invariant distribution is that no agent is stuck forever in the region in which no information is revealed. To see this, note that equation (29) shows that agent’s promised utility strictly increases with positive probability if vt < v and strictly decreases with positive probability if vt > v ; where v is given by k 0 (v ) = 0: Suppose there is no information revelation for all v v + a where a 0: Then the invariant distribution must assign all the mass to v and, moreover, no information about agents is revealed. But in this case the sustainability constraint is slack, which is a contradiction. Therefore the “no information” region lies strictly below v . Another implication of Theorem 1 and bounds (29) in this case is that there must exist an endogenous lower bound beyond which agents’lifetime utility never falls. Once an agent’s utility reaches that bound, he bounces from it with a positive probability. This dynamics resembles that of Atkeson and Lucas (1995), except that in their case the utility bound was set exogenously. Near the endogenous lower bound agent may reveal no information and receive no insurance against that period’s shocks. For given ^ ; ; one can follow the arguments of Farhi and Werning (2007) to show that there exists initial distibution ~ with a property that t generated by a solution to (20) is such that t = ~ for all t: This distribution is invariant if feasibility and sustainabilty constraints hold with equalities. We leave it for future research to explore under which conditions these constraints are satis…ed. 12 24 5.2 Persistent shocks We chose to focus on iid shocks in our benchmark analysis. Most of our key results extend with few modi…cations to Markov shocks. When the shocks are Markov, let the probability of realization of shock assume that j > 0 for all ; conditional on shock conditional on shock t : Let j denote in the previous period. We be the probability of realization of j being realized t periods ago. Many arguments in the persistent case are straightforward extentions of our previous analysis. We brie‡y sketch them in this section leaving the details for the Supplementary material. Following the same steps as before, we can show that in the worst equilibrium there is no information revelation for the government. The payo¤ in that equilibrium, unlike the iid case, depends on the prior beliefs that the government has. The maximum payo¤ that the government can achieve in any period t is given by Z ~ Wt ( t ) = sup fut+s (h)gh2H t ;s 0 Ht 1 X s s=0 Ept+s [ jh] ut+s (h) d t subject to the feasibility constraint (8) holding for all t + s and Z s pt+s jht = j pt d jht for s > 0: ~ t ( t ) with a function that is linear in Similarly to the iid case, we can bound W t 1: De…ne the analogue of (13) as Wt ( ; p) = max fut+s (m)gs 0 Mt Z 1 X s=0 0 s@ Z s (d s j ) s ut+s (m) w t;t+s (C (ut+s (m)) 1 e)A (dmj ) d j then Lemma 2, and in particular bound (14), can be extended to Wt ( t ; pt ) : It is straightforward then to obtain the analogue of Lagrangian (17). This Lagrangian can then be rewritten recursively using the techniques of Fernandes and Phelan (2000), who studied an economy with persistent shocks and commitment. v= v period, 1 ; :::; v j j In their formulation the state is a vector of promised utilities and the realization of the shock in the previous : In our environment v remains a state variable with an analogue of the promise keeping constraints (21) holding for each type, Z v i = [ u (m; z) + w (m; z; )] (dmjz; ) dz d j i E u + wj i . (34) In the environments with commitment each agent plays a pure reporting strategy and is known along the equilibrium path. In our settings 25 is not known, but the planner forms p d ; posterior beliefs about : Therefore, the only di¤erence from the commitment environment is that the posterior belief p replaces as the state variable. The recursive formulation is given by kt (v; p) = max (u;w; ) ( j )2 (M );p0 2 ( ) X p 2 E h u t C (u) + ^ t+1 kt+1 i 0 w; p j t Z Wt dz (35) subject to (34), the incentive constraint E [ u + w ( ; ; ) j ; z] E 0 [ u + w ( ; ; ) j ; z] for all z; ; 0 ; and the Bayes rule condition that requires p0 to satisfy, whenever de…ned,13 P (mj ) j p 2 p0 ( jm; z) = P : j p ( ; )2 2 (mj ) The analysis of dynamic mechanism design problems with persistent shocks is more involved since the recursive problem with additional state variables is more complicated. The broad lessons about desirability of information revelation for high and low promised values (now in the sense of increasing each element of vector v) still holds. First of all, the example in Section 4.1 remains virtually unchanged. More speci…cally, in the Supplementary material we follow similar steps as in the iid case and de…ne functions 1 (v; p) that both these functions are homogenous in v: for any x > 0; 0 (xv; ) = xa 0 (v; ). Note that v and 1 (xv) 0 (v; = ) : We show xa 1 (v; p) and 0 if 1=a 2 (0; 1) and v < 0 if 1=a < 0: We can then conclude that no information revelation is optimal for low values of xv (i.e. x ! 0 if 1=a 2 [0; 1) and x ! 1 if 1=a < 0) and, when no bunching is desirable with commitment, full information is optimal for high values of xv. We can also generalize Theorem 1 to the persistent case (see the Supplementary material for details). Proposition 5 Suppose that utility is bounded. Let v;p be a solution to (35) and mative strategy. If the sustainability constraint (10) binds in periods t; then Pr ( as v ! 0 for all p. 6 an uninforv;p ! )!1 Final remarks In this paper we took a step towards developing of theory of social insurance in a setting in which the principal cannot commit. We focused on the simplest version of no commitment 13 Strictly speaking m should be replaces with any Borel set A of 26 (Mt ) analogously to de…nition (3). that involves a direct, one-shot communication between the principal and agents, and showed how such model can be incorporated into the standard recursive contracting framework with relatively few modi…cations. Our approach can be extended to other types of communication. For example, a literature starting with a seminar work of Myerson (1982) considered communication protocols that involve a mediator. Such protocols may further increase welfare (see, e.g. Bester and Strausz (2007)) but they require some ability to commit by the policy maker to use the mediating device. The key di¤erence is that, after sending any message m; the agent receives a stochastic allocation, so that the incentive constraint (25) holds in expectation rather than message-by-message. Two main insights of the paper – recursive characterization and Theorems 1 and Theorems 2 –depend little on the detailed structure of the incentive constraints. We believe that our analysis can be extended to such settings in a relatively straightforward way and leave it to future research. Another important direction that needs to be explored in future work is how the allocations in best equilibria can be decentralized through a system of taxes and transfers. Some of the most fruitful way to proceed may be to extend our basic set up to incorporate such margins as labor supply or job search e¤ort, and consider decentralizations along the lines of labor taxation in Albanesi and Sleet (2006) and unemployment insurance in Atkeson and Lucas (1995). 27 7 Appendix 7.1 Proof of Lemma 2 w t gt First we show that sequences f is uniformly bounded away from 0. The objective function w t in (7) is concave and the constraint set is convex, therefore there exists t solution to (7), uw t h w t ht 2H t is given by t ; satis…es C 0 uw t h Z C C0 1 1 wE t Ht Since E t [ jh] 2 in…nity as w t 1; j j t 1 = E w t [ jh] d jht : The Lagrange multiplier t t = e: (36) ; the left hand side of (36) is continuous in w t goes in…nity and 0 respectively. Therefore 0 such that the w t and goes to 0 and is bounded away from 0 from below and bounded above, and these bounds can be chosen independently of t. This also implies that the supremum to (7) is obtained. The same arguments applied to (13) establish that Wt is well de…ned, uw (m) satis…es 1 C 0 (uw (m)) = 1 w; t [ jm] 2 wE t j j w t ; (37) and the family fWt gt is uniformly bounded. To see that Wt is convex, for any 2 [0; 1] ; 0 Wt + (1 00 ) = Z max fu(m)gm2M Mt t + (1 ) Z max fu(m)gm2M = 0 Wt Z max fu(m)gm2M + (1 t Z ) Wt (u(m))] (u(m))] w t C [ u (m) Mt t ) w t C [ u (m) Mt + (1 w t C [ u (m) w t C 0 00 and (dmj ) (d ) (dmj ) (d ) + (u(m))] [ u (m) Mt 00 00 0 0; w t e (dmj ) (d ) (u(m))] 00 (dmj ) (d ) + : To see that Wt is minimized if and only if a strategy is uninformative, consider any uninformative . From (37), the solution to (13) given Wt ( ) = C 0 w t C 1 1 w t C0 1 1 w t : Let message set M and note that for any Wt ( ) = max Wt ( ) Z fu(m)gm2M t = max fu(m)gm2M t Mt Z Mt [( u (m) w t C f(E [ jm] u (m) (u (m))) w t C 1 satis…es C 0 (uw a.s. and hence t (m)) = w t R (A) = (Aj ) (d ) for any Borel A of ( uw t (u (m))) 28 w t C (uw t ))] (dmj ) (d ) (E [ jm] uw t w t C (uw t ))g (dm) : w t e The expression in curly bracket is non-negative, which implies that Wt ( ) all : Moreover for informative jE [ jA] , there is a set of messages A with Wt ( ) for (A) > 0 such that 1j > 0: For all such messages the expression in curly brackets is strictly positive since uw t does not satisfy the optimality condition (37) for m 2 A: Since expression above is strictly positive therefore any informative (A) > 0; the cannot be a minimum. We prove (15) by using Theorem 3 in Milgrom and Segal (2002). For completeness, we state it here adapted to our set up Theorem 3 Let V (t) = supx2X f (x; t) where t 2 [0; 1] ; and let x (t) be a solution to this problem given t and ft be the derivative of f with respect to t: Suppose that (i) for any t a solution x (t) exists, (ii) fft (x; )gx2X is equicontinuous, (iii) supx2X jf (x; t) j < 1 at t = t0 , and (iv) ft (x (t) ; t0 ) is continuous in t at t = t0 : Then V 0 (t0 ) exists and V 0 (t0 ) = ft (x (t0 ) ; t0 ) : In our case, f (fug ; ) = Z ( u (m) Mt w t C (u (m))) (1 can be maximized pointwise. For each m, Z w ( u (m) t C (u (m))) (1 0 ) (dmj ) + ) (mj ) + 0 (mj ) (dmj ) (d ) (d ) is continuous is u (m) and by (37) the latter can be restricted to a compact set, hence conditions (i) and (iii) of this theorem are satis…ed. The solution, given by (37), is integrable and, thus, so is the continuous and bounded transformation uw (m) w t C Theorem 11.18 and Remark 11.23.) The derivative with respect to Z w 0 f (fug ; ) ( u (m) (dmj ) t C (u (m))) (uw (m)) (Rudin (1976), is (dmj ) (d ) ; Mt and it does not depend on , therefore ff (fug ; )gfug is equicontinuous. Also, if either R R 0 (mj ) (d ) > 0 or (mj ) (d ) > 0 for some m, the solution uw t (m; ) is unique for 2 (0; 1) and, by the Theorem of the Maximum, there is a solution that is continuous in for R R 0 2 [0; 1]. Finally, if (mj ) (d ) = (mj ) (d ) = 0 for some m, we can always …nd a solution uw t (m; ) that is continuous in for 2 [0; 1]. Since uw t (m; ) is continuous in for all m 2 Mt , we can use the dominated convergence theorem to show that f (fu ( )g ; is continuous in at = 0; 0) which veri…es condition (iv). Since the sequence f t gt is uniformly bounded (away from zero from below), equation (37) implies that uw t (m) is bounded and those bounds do not depend on t: Therefore all the families of directional derivatives f@Wt ( ) =@ 0 gt are uniformly bounded. 29 We prove (14) next. By Lagrange Duality theorem (Luenberger (1969), Theorem 1, p. ~ t ( t ) can be written as a minmax problem 224), W Z ~ Wt ( t ) = min max (E t [ jh] u (h) C (u (h))) d t + e 0 fu(h)gh2H t H t Z w w max (E t [ jh] u (h) t C (u (h))) d t + t e fu(h)gh2H t = = max fu(h)gh2H t Z Ht Z Wt Ht 1 Ht t Z 1 Mt jht 1 n ; u ht dzd 1 By de…nition of u(ht 1 ; m) o t mjht 1 ; dm (d ) dzd t 1: where the inequality follows from the fact that w t w t C ;m w t may not be a minimizer for an arbitrary this inequality becomes equality at t; t: which establishes (14). Finally, if we can restrict Mt to be …nite (as proved in Proposition 3) then Wt ( ) is continuous in . This is immediate as we can restrict fu (m)g to lie in a compact subset of RjMt j and use the Theorem of the Maximum. 7.2 Proof of Proposition 3 First we prove some preliminary results. Fix z and consider any incentive compatible allocation that satis…es (22). Let Mt ( ) be as de…ned in the text. For any i let Mt ( i ) = Mt ( i ) \ Mt ( i 1 ), Mt+ ( i ) = Mt ( i ) \ Mt ( i+1 ) and Mtexcl ( i ) = Mt ( i ) n ([j6=i Mt ( j )) : Lemma 4 For all i; Mt ( i ) = Mt ( i ) [ Mtexcl ( i ) [ Mt+ ( i ) : Proof. Suppose m 2 Mt ( i ) nMtexcl ( i ) : Then there exists j such that m 2 Mt ( j ) : Without loss of generality let i+1 u j > i: For any m0 2 Mt ( m0 + w m0 i+1 ) i+1 u (m) + w (m) and i u (m) + w (m) iu m0 + w m0 ; j u (m) + w (m) ju m0 + w m0 : The sum of the …rst the second inequalities implies u (m0 ) the third inequalities implies u (m0 ) Lemma 5 Let (u ; w ; u (m), the sum of the …rst and u (m) ; therefore u (m0 ) = u (m) and m 2 Mt ( i+1 ) : ) be a solution to (20) for a given z: Then (u (m0 ) ; w (m0 )) = (u (m00 ) ; w (m00 )) for all m0 ; m00 2 Mt ( t ) ; for all m0 ; m00 2 Mt+ ( t ) and for almost all (with respect to measure ) m0 ; m00 2 Mtexcl ( i ) : 30 t 1 Proof. The …rst part of the lemma follows immediately from the discussion in the text, so u (m) jm 2 Mtexcl ( ) ; w ^=E we prove the second part. Let u ^=E and consider an alternative (u0 ; w0 ; ) de…ned as (u0 (m) ; w0 (m)) = (u (m) ; w (m)) if m 2 = Mtexcl ( ) ; and (u0 (m) ; w0 (m)) = (^ u; w) ^ if m 2 Mtexcl ( ) : For any 0 therefore 0 u m0 + w m0 > 0 u (m0 ) + w (m0 ) > w (m) jm 2 Mtexcl ( ) 0 0 6= ; m0 2 Mt 0 u (m) + w (m) for all m 2 Mtexcl ( ) ; u ^ + w, ^ which implies that (u0 ; w0 ; ) satis…es (22). It also satis…es (21) by construction, since m 2 Mtexcl gets the same utility from reporting any message h i ^ kt+1 (w) is strictly concave ( ) : The objective function E u C (u) + t t+1 in (u; w) ; and would take a strictly higher value at (u0 ; w0 ) than at (u ; w ) unless (u ; w ) is constant on Mtexcl ( i ) a.s. Lemma 6 Let (u ; w ; ) be a solution to (20) for a given z: Suppose there exists a set P ~j ~ M ( ) > 0 and u (m0 ) = u (m00 ) for almost all (with respect to M Mt s.t. ~ . If t > 0 then E [ jm0 ] = E [ jm00 ] a.s: measure ) m0 ; m00 2 M ~ and consider an alternative strategy 0 de…ned as 0 (mj ) = Proof. Fix any m ^ 2 M ~ and 0 (mj ~j (mj ) if m 2 = M ^ ) = M for all : Any agent reports any subset of messages with a positive probability with reporting strategy with a positive probability with 0 : Therefore the strategy pro…le 2 [0; 1]. Let h fm E u ^ ( ) only if he reports that subset = (1 ) t Wt ( ): + 0 satis…es (21) and (22) for any ^ t C (u ) + t+1 kt+1 (w ) Since u (m0 ) = u (m00 ) ; w (m0 ) = w (m00 ) ; 0 fm ^ ( )j =0 = = i @Wt ( ) @ w w ~) f(E 0 [ jm] ^ uw (m) ^ ^ Pr(m 2 M t C(u (m))) Zt w w ( uw (m) t C(u (m))) (mj )dm (d )g: t ~ M 0 ( )j Optimality of = 0 implies that fm ^ h i w w ~ uw (m) E jM ^ ^ E t C (u (m)) 0 for all m; ^ so that it can be written as h i w w ~ E [ jm] uw (m) ^ t C(u (m))jM all m: =0 ~ of both sides, we obtain E Taking expectations conditional on M h i ~ which implies that cov (E [ jm] ; uw (m)) E [ jm] uw (m)jM E optimality (37) implies that uw h (m) is monotonically increasing in E 31 i ~ jM E h ~ uw (m)j ^ M i 0: On the other hand, [ jm] ; and therefore cov (E a.s. [ jm] ; uw (m)) 0: The two conditions can be satis…ed only if E Proof of Proposition 3 and Corollary 1. any solution to (20) can take at most 2j j [ jm0 ] = E Lemmas 5 and 6 show that for a given z; 1 distinct values of (u (m) ; w (m) ; E except for possible values that occur with probability 0 under reporting strategy 6 is stated for the case t > 0; but when t [ jm00 ] [ jm]) (Lemma = 0 the problem is isomorphic to the standard mechanism design problem and usual arguments apply to show that j j distinct allocations represent a solution). The outcomes that occur with probability zero can be replaced with any outcome from that set of 2j j 1 distinct values. Therefore without loss of generality the set M can be restricted to at most 2j j 1 messages in (20), and hence, by Proposition 2, to M in the modi…ed problem (17). It remains to verify that it is a best equilibrium since not every solution to (17) needs to be a PBE. Let ( ; G; p ) be a best PBE that generates (u ; M ) along the equilibrium path. By the arguments leading to construction of problem (17), (u ; ) is a solution to that prob- lem given M : It is easy to show that any equilibrium is payo¤-equivalent to one in which the P1 s t us jht 1 E allocations in period t depends only on the expected utility wt ht 1 s=t but not on the particular realizations of shocks in the previous t the Supplementary materials) and so we assume that (u ; 1 periods (Lemma 10 in ) satis…es this property. Pre- vious arguments imply that after any history ht there can be at most 2j j 1 distinct n h i o values of ut ht ; mt ; wt ht ; mt ; E jht ; mt that occur with probability 1 given mt o n jht ; : Therefore it is possible to construct another strategy 0 which is incentive t 2 compatible and delivers the same payo¤ to the agents as tribution of histories any Borel set A of 0 with a property that Pr ([(1 ) v; (1 E C (ut ) ; which veri…es that ( 0 ; ) v] G; p G) generates a dis- 2 A) = Pr ((ut ; E t ) 2 A) for ~ t ( ) and E 0 C (ut ) = ~ t ( 0t ) = W ) : Therefore W t 0 ((ut ; E : The pair ( 0 ; t) ) is a best PBE and proves Proposition 3. If removing all non-distinct messages leaves fewer than jM j messages, extra messages can be added and agents strategies extended to keep posterior probabilities for such messages the same as for some of the original messages. By appropriately choosing which messages to add it is possible construct strategies that satisfy (25). This argument extends for all histories, and hence any value that k0 (v) achieves given any M; can also be achieved given Mt = M for all t: This completes the proof of Corollary 1. 32 7.3 Proofs for Section 4 First, we use Proposition 3 to simplify our problem. We drop explicit dependence of all variables on z and throughout this section all the results are understood to hold for all z: By Proposition 3 we can restrict attention to messages in a set Mv of cardinality no greater than 2j j 1 such that no two distinct reports give the agent the same allocation. We use Nv to denote cardinality of Mv and Mv ( ) to denote the set of messages in Mv that maximize agent ’s utility. The incentive constraints (22) can be written as u m0 + w m0 ; u (m ) + w (m ) m0 j u m0 + w m0 ( u (m ) + w (m )) The triple (uv ; wv ; =0 8 is a solution to h ^ max E (1 v) u t C (u) + t+1 kt+1 (w) (38) ; m 2 Mv ( ) ; m0 2 Mv : 2 v) v u;w; subject to (38) where v w i t Wt ( ) (39) = kt0 (v) :14 Without loss of generality we arrange messages in Mv so that uv (m1 ) < ::: < uv (mNv ) ; which also implies that wv (m1 ) > ::: > wv (mNv ) : Let 0 ( ; m ; m0 ) and 00 ( ; m ; m0 ) be Lagrange multipliers on the …rst and second constraint (38). We set ( ; m; m0 ) = 0 for all m 2 = Mv ( ) and ( ; m; m) = 0 for all m; so that 0 ; 00 are well de…ned for all ( ; m; m0 ) : The next lemma establishes bounds on optimal allocations that are important for establishing our main results in this section. Lemma 7 (uv ; wv ; (29) hold, with % = v) satis…es (26): For all v such that kt0 (v) 1+ ^ j j 1 t+1 1 ;%= j j +1 1 ^ t+1 j j : Proof. The …rst order conditions for wv (m) are " # X ^ t+1 X 0 kt+1 (wv (m)) v (mj ) ( )+ v 2 X ( ;m0 )2 Sum over m to obtain X ( ;m)2 Mv " ^ ( ;m0 )2 0 t+1 0 kt+1 (wv 0 # (m)) v ; m; m0 + v (mj ) 00 ; m; m0 Mv ; m0 ; m + Mv 1, equations (27), (28) and v m0 j (mj ) ( ) = v 0 ; m0 ; m = 0: (40) = k 0 (v) ; 14 To see this, let v be the Lagrange multiplier on the promise keeping constraint in the original, convexi…ed Bellman equation. By Proposition 1 and the envelope theorem it satis…es v = kt0 (v) : Form a Lagrangian with v and observe that it can be maximized for each z separately, which yields (39). 33 which is expression (26). For what follows, it is useful to establish bounds on the Lagrange multipliers. Consider 0 a message m1 : If Nv = 1; then trivially Nv > 1 and let 0 be the largest strictly suboptimal for any with 0 > 0 : Moreover, 0 > 0 ; 00 and are two scalars equal to zero. Suppose such that m1 2 Mv ( ) : Since sending message m1 is ( ; m0 ; m1 ) = 0 uv (m1 ) + wv (m1 ) v (m1 j ) 00 ( ; m0 ; m1 ) = 0 for all ( ; m0 ) uv (m) + wv (m) and uv (m1 ) < uv (m) for all m 6= m1 ; which implies u (m1 ) + w (m1 ) > u (m) + w (m) for all m 6= m1 if Therefore, 0 ( ; m1 ; m0 ) = v write (40) for m1 as (m0 j ) ^ 00 ( ; m1 ; m0 ) = 0 for all ( ; m0 ) with t+1 0 kt+1 (wv (m1 )) v ! 0 < < 0 : : Thus we can = # (m1 ) ; (41) where # (m1 ) = P m0 2Mv 0 0 ; m1 ; m0 + m0 j v 0 00 0 0 0 ; m1 ; m0 ; m0 ; m1 + P v (m1 j ) ( ) 2 Since w is decreasing in m and kt+1 is concave, v; ^ t+1 ^ 0 kt+1 (wv (m1 )) t+1 E v v m1 j 0 00 0 0 kt+1 (wv ) = which implies that # (m1 ) Similar steps for mNv establish # (mNv ) 0: (42) 0: The …rst order conditions for uv (m) are X (1 v) tC 0 (uv (m)) 2 X ( ;m0 )2 where (m) v (mj ) ( )+ X ( ;m0 )2 0 ; m0 ; m + Mv v 0 ; m; m0 + v 00 (mj ) ; m; m0 Mv m0 j 00 ; m0 ; m 0 is the Lagrange multipliers on the constraint u (m) = (m) ; (43) U (0) : It is zero if utility is unbounded below. Moreover, since we assumed uv (m1 ) < ::: < uv (mNv ) ; it can only be strictly positive for m1 even in the case when utility is bounded below. We guess and verify that if kt0 (v) = v 1; then (m1 ) = 0:15 Suppose v (m1 ) = 0: Then the …rst order condition for uv (m1 ) becomes X (1 v) tC 0 (uv (m1 )) 2 15 More precisely, for a …xed su¢ cient. v; v (m1 j ) ( ) = # (m1 ) X 2 v ! (m1 j ) ( ) 0 : (44) maximization problem (39) is convex and thus necessary conditions are also 34 ; m0 ; m1 : Combining it with (42), it implies that tC 0 (uv (m1 )) (1 v) E [ jm1 ] v 0 and veri…es that the lower bound for uv (m1 ) does not bind. Analogous arguments establish (1 v) E [ jmNv ] v tC 0 (uv (mNv )) ; which, together with the monotonicity assumption on uv (m) ; gives (28). For the rest of the proof consider v that satis…es kt0 (v) 1: Summing (43) over m when (m) = 0 implies (27). Finally, combining (41) with (44) we obtain ^ t+1 0 kt+1 (wv (m1 )) = v + v + 1 1 1 tC 0 (u v (m1 )) v tE v 0 0 where we used v 0 v ( E v [ jm1 ] E v [ jm1 ] 1) + 1 1 0 1 0 + + 1 1 0 tC 0 (uv (m1 )) 1 0 v v; [C 0 (uv )] = 1 v to get the second inequality. Re- arrange it to get 1 0 kt+1 (wv 0 (m1 )) ^ ^ 1 0 t+1 +1 1 j j (1 +1 v) (1 v) 1 t+1 + 1 + ^ 1 t+1 ^ ! t+1 ! : This, together with monotonicity of wv (m) ; establishes the upper bound in (29). Analogous manipulation of the …rst order condition for mNv establishes the lower bound in (29). Finally, note that % and % 1 j j ^ t+1 +1 ! 0 as j j ^ = t+1 ^ ! 0 as 1 1 1 j j 1 j j 1 t+1 j j +1 1 1 +1 1 1 ! 0 since we assume that E [ ] = 1. Therefore, ! 0. Analogous arguments prove that % ^ t+1 ! 0 as j j 1 ! 0. We use this lemma to establish bounds that we use to characterize information revelation for low v: 35 Lemma 8 (a) If utility is unbounded below and constraint (10) binds in period t+1; then there are scalars Au;t > 0; Aw;t > 0 and vt such that for all v vt any solution to (39) satis…es 0 uv (m) uv m0 Au;t for all m; m0 ; 0 wv (m) wv m0 Aw;t for all m; m0 : (b) If utility is bounded below, without loss of generality by zero, then limv!v uv (m) = 0 and limv!v wv (m) = 0 for all m: Proof. (a) When (10) binds in t + 1; ^ t+1 > is unbounded below, limv! 1 kt0 (v) vt such that 1 2 1 ^ t+1 ! 1 ^ > 0: When utility t+1 = 1 and therefore expression (29) implies that there exists 0 kt+1 (wv 1 and therefore 3 2 (m)) 1 ^ t+1 ! for all m; v vt : This establishes bounds for wv (m) : The incentive constraint 1 uv (m1 ) + wv (m1 ) 1 uv (mNv ) + wv (mNv ) together with monotonicity wv (m1 ) > ::: > wv (mNv ) ; uv (m1 ) < ::: < uv (mNv ) imply that (wv (m1 ) w (mNv )) 1 uv (mNv ) uv (m1 ) 0 establishing bounds for uv : (b) When utility is bounded, Proposition 1 shows that limv!v kt0 (v) = 1: For any the largest v that satisfy kt0 (v ) close to v = 0: Let v~ = E v : By choosing arbitrarily high we can set v arbitrarily [ uv + wv ] : Due to the possibility of randomization, in general v ) : Therefore v~ v~ 6= v but kt0 (v) = kt0 (~ v …nd v~ = E E v v if v v : Then [ uv + wv ] (45) [ 1 uv + wv ] X ( 1) (mj 1 ) [ 1 uv (m) + wv (m)] v m2Mv ( 1) ( 1 ) [ 1 uv (m1 ) + wv (m1 )] ( 1 ) wv (m1 ) 0: Here the second, third and …fth lines follows from nonnegativity of u and w; and the fourth line follows from the fact that if 1 (mj 1 ) > 0; then (uv (m) ; wv (m)) gives the same utility to as (uv (m1 ) ; wv (m1 )) : Since v limv!v wv (m1 ) = 0: Since wv (m1 ) can be chosen to be arbitrarily close to 0, this implies that wv (m) ; this implies that limv!v wv (m) = 0 for all m: Analogous arguments show that limv!v uv (m) = 0: 36 7.3.1 Proofs for low v We now can prove the …rst limiting result about optimal information revelation. Lemma 9 Suppose that sustainability constraint (10) binds in periods t and t + 1. Then v converges to an uninformed strategy as v ! v: Proof. Let be any uninformative strategy. Let uv (m) = E uv ; wv (m) = E v wv for v all m: Note that since uv (m) is increasing in m uv (m1 ) = uv (m1 ) E uv (m) v uv mj j E v = uv mj j ; and an analogues relationship holds for wv : Pro…le (uv ; wv ; ) is incentive compatible and satis…es E v [ uv + wv ] = E [ uv + wv ] : The value of the objective function (39) evaluated as (uv ; wv ; v) (46) should be higher than eval- uated at (uv ; wv ; ) ; E v h uv uv tC tC (uv ) + ^ t+1 kt+1 (wv ) (uv ) + ^ t+1 kt+1 (wv ) t Wt ( v ) t Wt ( ): i (47) First, suppose that utility is bounded below, without loss of generality by zero. From Lemma 8, uv (m) ! 0 and wv (m) ! 0 for any m as v ! 0; and therefore uv and wv also converge to zero. Hence in the limit equation (47) becomes lim sup v!v Since Wt ( v) t (Wt ( Wt ( ) by Lemma 2 and exists and satis…es limv!v Wt ( v) t ) Wt ( v )) 0: (48) > 0 when (10) binds in periods t, limv!v Wt ( = Wt ( ) : Wt ( ) is continuous in v) and achieves its mini- mum only at uninformative reporting strategies by Lemma 2, therefore v must converge to some uninformative strategy. Now suppose that utility is unbounded below. By the mean value theorem t (C ^ (uv (m)) t+1 (kt+1 (wv ) C (uv )) = tC 0 (~ uv (m)) (uv (m) uv ) ; 0 kt+1 (wv )) = ^ t+1 kt+1 (w ~v (m)) (wv (m) wv ) ; for some u ~v (m) that takes values between uv (m) and uv and for some w ~v (m) that takes values between wv (m) and wv : By construction, uv (m1 ) 37 uv uv (mNv ) and wv (m1 ) wv wv (mNv ), therefore, uv (m1 ) From (28) and (29) limv! 1C w ~v (m) 0 (~ uv uv (mNv ) and wv (m1 ) (m)) = 0 and limv! w ~v (m) wv (mNv ). 0 ~v (m)) = = ^ t+1 for all m: 1 kt+1 (w We have lim E v lim E v v! 1 = v! 1 = 0; h (uv h tC uv ) 0 t (C 0 uv ) + ^ t+1 kt+1 (w ~v ) (~ uv ) (uv i kt+1 (wv )) i wv ) C (uv )) + ^ t+1 (kt+1 (wv ) (uv ) (wv where the second line uses the mean value theorem and (46), and the last line uses the fact that (uv (m) uv ) and (wv (m) wv ) are bounded for low v by Lemma 8. This implies that (48) holds when utility is unbounded and that v converges to an uninformative strategy. We now can prove Theorem 1. Proof of Theorem 1. Suppose Nv > 1 for some realization of z and consider (uv ; wv ; that solve (39) for such z: Let allocation when priors Wt ( = v) max + [E m m2 : Let uw be the optimal w t C [( u (m) ( uw (u (m))) w t C (uw ))] v (mj ) ( ) m; 2Mv 8 < = max [E fu(m)gm2Mv : X 0 v be the largest such that m1 2 Mv are uninformative: Then Wt ( ) X fu(m)gm2Mv 0 v v v) v w t C [ jm1 ] u (m1 ) [ jm] u (m) w t C (u (m1 )) (u (m)) (E v (E v [ jm1 ] uw [ jm] uw w t C w t C 0 (uw ))] @ 0 (uw ))] @ X 0 v X 0 v v (m1 j ) ( )A 19 = A (mj ) ( ) v ; All terms in square brackets are non-negative since the choice u (m) = uw is feasible. They are strictly positive if E v [ jm] 6= 1; since uw is the optimal allocation for E [ jm] = 1: From Lemma 9 the left hand side of this expression goes to zero as v ! v; therefore the right hand side should also go to zero. This is possible either if (a) E v [ jm1 ] ! 1 (and hence 0v ! j j ) and P P 0 0 v (mj ) ( ) ! 0 for all m > m1 ; or (b) v (m1 j ) ( ) ! 0; E v [ jm2 ] ! v v P 0 1 and 0 v (mj ) ( ) ! 0 for all m > m2 (and hence v ! 1 and m2 2 Mv ( 1 )). The v P other possibilities are ruled out since if E v [ jm] ! 1 and 0 v (mj ) ( ) 9 0 for some v P 0 0 m > m2 ; then for some m0 m2 we would have 0 v (m j ) ( ) 9 0 and E v [ jm ] 9 v 1.16 Since it is impossible to have 16 Note that all m > m2 . 1 j j and 1 to be indi¤erent between more than two distinct can be indi¤ererent between at most two distict allocations and therefore 38 v 1 (mj 1 ) = 0 for allocations in the optimum, for any " > 0 there must be some vt such that for all v solution has Nv = 2 and either (a) m1 2 Mv ", E E v E v j j , v [ jm1 ] arbitrarily close to 1, or (b) m2 2 Mv ( 1 ) ; m2 j v j j (m1 j 1 ) v "; E [ jm2 ] = v 0 uw v (m1 ) ! C 1 (1= w ) ; t 0 and in case (b) uw v (m1 ) = C 1( w 1= t ) j j [ jm1 ] = 0 [ jm2 ] arbitrarily close to 1. In case (a) (37) implies that uw v (m2 ) = C 1 and uw v (m2 ) ! vt the 1 > 1 and < 1 and w and j j= t 0 1 C (1= w t ): We can now rule cases (a) and (b) for v low enough. Consider case (b), case (a) is ruled out analogously. In this case v (30) is 1 (uv (m2 ) (m1 j 1 ) < 1 for v low enough and the optimality condition uv (m1 )) + ^ t+1 [kt+1 (wv (m2 )) = [( 1 uw v (m2 ) Function 1u w t C w t C t (C (uv (m2 )) C (uv (m1 ))) (49) kt+1 (wv (m1 ))] (uw v (m2 ))) ( 1 uw v (m1 ) w t C (uw v (m1 )))] : (u) is strictly concave and achieves its maximum u ^ at u ^ = C0 1( w 1= t ) = w w uw v (m1 ) : Since uv (m2 ) is bounded away from uv (m1 ) ; the right hand side of (49) is strictly negative and bounded away from 0. When utility is bounded below, Lemma 8 established that all terms on the left hand side of (49) go to zero as v ! v; yielding a contradiction. When utility is unbounded below, substitute the indi¤erence condition 1 (uv (m2 ) uv (m1 )) = (wv (m1 ) wv (m2 )) into (49) and apply the mean value theorem t (C (uv (m2 )) C (uv (m1 ))) = tC 0 (~ u) (uv (m2 ) uv (m1 )) and ^ = t+1 (kt+1 (wv ^ 0 ~ t+1 kt+1 (w) (m2 )) kt+1 (wv (m1 ))) (wv (m2 ) (wv (m2 ) wv (m1 )) wv (m1 )) for some u ~ 2 (uv (m1 ) ; uv (m2 )) and w ~ 2 (wv (m2 ) ; wv (m1 )) : The terms (uv (m2 ) uv (m1 )) C 0 (~ u) and (wv (m2 ) wv (m1 )) are bounded for small v by Lemma 8 and limv! 1 t = 0; 0 (w) ~ = 0 by Lemma 7. Therefore, the left hand side of (49) converges limv! 1 ^ t+1 kt+1 to zero for utilities unbounded below, yielding a contradiction. This proves that Nv = 1 and v is uninformative for all su¢ ciently low v independently of the realization of z: For uninformative v; there is a unique (uv ; wv ) that solves (39) by strict convexity of u 39 tC (u) : Proofs for high v 7.3.2 We prove Corollary 2, since Theorem 2 is a special case of it. Corollary 2. Suppose v is su¢ ciently high so that case that 1 and v = kt0 (v) < 1: We …rst rule out the send the same message with probability 1 for high v: From (41) and (44) 2 we have that there is #v 0 such that ^ t+1 0 kt+1 (wv (m1 )) = #v ; v and tC where ~ = ( 1 and 2 1 1 + 0 (uv (m1 )) = (1 2 2) = ( 1 + 2) v) E > 1: v [ jm1 ] + #v (1 v) ~ + #v ; The last inequality follows from the fact that types play message m1 with probability 1: De…ne function f as f (x) 1 (uv (m1 ) x) tC x) + ^ t+1 kt+1 wv (m1 ) + (uv (m1 ) This function is concave with 0 f (0) = tC 1 0 ^ (uv (m1 )) + Let x ^v be a solution to f 0 (^ xv ) = (1 1 v) t+1 0 kt+1 (wv ~ ! (m1 )) > (1 v) 1 x : ~ =2 and let (^ uv ; w ^v ) = uv (m1 ) 1 1 : x ^v ; wv (m1 ) + Since f is concave, x ^v > 0 and u ^v < uv (m1 ) ; w ^v > wv (m1 ) : Claim 1. Allocation (^ uv ; w ^v ) satis…es bounds (28) and (29). Let xv be a solution to f 0 (xv ) = 0: By concavity, 0 < x ^v < xv ; so we establish the claim by proving the stronger statement that uv (m1 ) and (29). By de…nition, " 1 1 ^ t+1 0 kt+1 wv (m1 ) + xv and wv (m1 ) + 1 # xv = tC 0 1 (uv (m1 ) xv satisfy bounds (28) xv ) and by xv > 0 tC ^ t+1 0 kt+1 0 (uv (m1 ) wv (m1 ) + xv ) < 1 tC ^ xv 0 (uv (m1 )) ; t+1 0 kt+1 (wv (m1 )) : These conditions imply that 1 (1 1 (1 v) v) tC 1 0 " (uv (m1 ) ^ 1 xv ) < t+1 0 kt+1 j j (1 wv (m1 ) + 40 v) ; 1 xv # < (1 v ) j j; 1 x ^v : establishing the bounds (28) and (29). Claim 2. (1 ^v v) x ! 1 as v ! v: In Supplementary material we showed (Lemma 13) that Assumption 1 implies that there ^v such that are numbers Bv and B C 0 (uv (m1 )) 0 kt+1 (wv (m1 )) Bv 2 C 0 (uv (m1 )) x ^v ; [C 0 (uv (m1 ))]2 ^v B 2 0 kt+1 (w ^v ) x ^v ; 2 1 0 ^v ) 1 kt+1 (w C 0 (^ uv ) 0 kt+1 (w ^v ) ^v = 1 ) v; wv (m1 ) ! v then Bv = [C 0 (uv (m1 ))]2 ! 0 and B and if u ^v ! (1 The bounds from Claim 1 establish that u ^v ! (1 ^ t+1 ^ 0 E v kt+1 (wv ) t+1 (50) 0 kt+1 (w ^v ) 2 ! 0: kt0 (v) ) v as v ! v: Since by (27) = 0 kt+1 (wv (m1 )) ; this implies that wv (m1 ) ! v as v ! v: Therefore by Lemma 13 the …rst terms on the right hand side of these expression go to zero as v ! v:17 We have ~ 1 1 2 1 = v 1 1 ( f 0 (0) " t v f 0 (^ xv ) 0 C (uv (m1 )) tC C 0 (uv (m1 )) Bv 1 [C 0 (uv (m1 ))]2 v From Claim 1, both C 0 (uv (m1 )) = (1 v) 0 ^ (^ uv ) + 2 + 1 t+1 0 kt+1 (wv ^v B 1 and 1 1 0 kt+1 (w ^v ) 2 0 kt+1 (w ^v ) = (1 0 kt+1 (w ^v ) 1 v v) ^v v) x the expression in curly brackets goes to 0 as v ! v: Therefore (1 Claim 3. f (^ xv ) (m1 )) f (0) ! 1 as v ! v: 0 kt+1 (w ^v ) 2 ) (1 # ^v : v) x are …nite, therefore ! 1: Applying the mean value theorem, f (^ xv ) f (0) = f 0 (~ xv ) (1 1 v ^v v) x for some x ~v 2 (0; x ^v ) : Since f is concave; f 0 (0) > f 0 (~ xv ) > f 0 (^ xv ) : Therefore h i 1 ~ ~ : The result follows from Claim 2. 1 ; 1 2 We are now ready to show that it is not optimal for types 1 and 2 f (~ xv ) 1 v 2 to send the same message with probability 1 for high enough v: Suppose it is. Consider an alternative strategy ^ v , where ^ v (mj ^ 1 ) = 1 for some message m ^ that gives allocation (^ uv ; w ^v ) ; and ^ v (mj ) = 17 v (mj ) for When kt is twice di¤erentiable, this result can be established without using Lemma 13. In this case kt00 (v) 2 2 00 00 0 00 kt0 (v)] = )v C (u) = [C (u)] = 0 and limv!v kt (v) = [1 t C (uv ) ; and condition (31) implies that limu!(1 0: Claim 2 can then be established by appling the mean value theorem to the left hand side of (50). 41 all 6= 1 ; m: Since u ^ < uv (m1 ) ; w ^ > wv (m1 ) and 1 uv (m1 ) + wv (m1 ) = ^v 1u + w ^v ; this allocation is incentive compatible and delivers utility v to the agent. Then n h E^v u = ^ t C (u) + kt+1 (w) ( 1 ) ff (^ xv ) f (0)g + i o W (^ ) t v t t fWt ( v ) n E Wt (^ v )g : v h i ^ kt+1 (w) C (u) + t u t Wt ( The second term is …nite since Wt is bounded by Lemma 2, therefore this expression is positive for high v from Claim 3. This contradicts the optimality of (uv ; wv ; Using these arguments we can also rule out type 1 v) : sending the same message as 2 with any positive probability. Then the optimality condition (30) implies that n 1 uv (m1 ) o n ^ 1 uv (m2 ) t C (uv (m1 )) + kt+1 (wv (m1 )) o ^ t C (uv (m2 )) + kt+1 (wv (m2 )) is bounded because the expression on the right hand side of (30) is bounded for all : Therefore the value of this strategy can exceed the value of strategy when 1 and 2 send message m2 with probability 1 by only a …nite amount. But then the arguments of the previous paragraph lead to a contradiction. Part 2 of Corollary 2 is proven using analogous arguments. Here we consider function F (x) When j j uv mjMv j + x j j 1 j j that F 0 (0) = const (1 establish that F (^ xv ) allocation for type type j j: j j 1 v) tC > and % uv mjMv j + x + ^ t+1 kt+1 wv mjMv j j j 1 + j j j j 1 j j 2 j j 1 x : holds, we can show 0 ensures the boundary conditions. Previous arguments F (0) ! 1: Note that the perturbation we consider leaves the same j j 1, and gives an allocation uv mjMv j + x ^v ; wv mjMv j j j 1 x ^v to This is incentive compatible but gives a higher value than v: The contradiction then follows from the fact that kt (v) is a decreasing function for high enough v: Proof of Lemma 3. Part (i). If v0 = 0 then uv0 ;0 (m; z) = wv0 ;0 (m; z) = 0 for all (m; z). Thus, if v0 = 0, then also vt = 0 for all t > 0. The optimal t and, by Lemma 2, t thus minimizes Wt ( ) for all is uninformative for all t. Suppose that the sustainability constraint (10) binds in periods t = 0; :::; T + 1. By Theorem 1, for all t = 1; :::; T there exists vt > 0 such that if vt vt vt ;t ( jz; ) is uninformative and wvt ;t (m; z) is independent of z. By Lemma 8, if v0 ! 0 then v1 = wv0 ;0 (m; z) ! 0 for all (m; z). In turn, the latter implies that v2 = wv1 ;1 (m; z) ! 0. Proceeding recursively, we can therefore …nd v0 such that vt < vt for t = 1; :::; T . Part (ii). Suppose the assumptions of Theorem 1 are satis…ed (the other case is analogous.) 42 o v) Then there exists v0+ such that v0 ;0 reveals full information about each type for all v0 v0+ . Also, the FOCs imply ^ and, moreover, k00 (v) ! t+1 E v0 ;0 k10 (wv0 ;0 ) jz = k00 (v) 1 as v ! v. The latter implies that there cannot be a constant K < v such that wv0 ;0 (m; z) K with probability 1, otherwise, the FOCs would be violated for a high enough v0 . Thus, we can always …nd a high enough v0 such that wv0 ;0 (m; z) > v1+ with positive probability. Following analogous steps as in part (i), we can guarantee that vt = wvt 1 ;t 1 (m; z) > vt+ with positive probability for all t = 1; :::; T . 43 References Acemoglu, Daron, “Why not a political Coase theorem? Social con‡ict, commitment, and politics,” Journal of Comparative Economics, December 2003, 31 (4), 620–652. , Mikhail Golosov, and Aleh Tsyvinski, “Political economy of mechanisms,” Econometrica, 2008, 76 (3), 619. , , and , “Dynamic Mirrlees Taxation under Political Economy Constraints,” Review of Economic Studies, 2010, 77 (3), 841–881. Aiyagari, S. Rao and Fernando Alvarez, “E¢ cient Dynamic Monitoring of Unemployment Insurance Claims,” Mimeo, University of Chicago, 1995. Akerlof, George A., “The Economics of "Tagging" as Applied to the Optimal Income Tax, Welfare Programs, and Manpower Planning,” The American Economic Review, 1978, 68 (1), pp. 8–19. Albanesi, Stefania and Christopher Sleet, “Dynamic optimal taxation with private information,” Review of Economic Studies, 2006, 73 (1), 1–30. Atkeson, Andrew and Robert E. Lucas, “On E¢ cient Distribution with Private Information,” Review of Economic Studies, 1992, 59 (3), 427–453. and , “E¢ ciency and Equality in a Simple Model of E¢ cient Unemployment Insurance,” Journal of Economic Theory, June 1995, 66 (1), 64–88. Benveniste, L M and J A Scheinkman, “On the Di¤erentiability of the Value Function in Dynamic Models of Economics,” Econometrica, May 1979, 47 (3), 727–32. Besley, Timothy and Stephen Coate, “Sources of Ine¢ ciency in a Representative Democracy: A Dynamic Analysis,” American Economic Review, March 1998, 88 (1), 139–56. Bester, Helmut and Roland Strausz, “Imperfect Commitment and the Revelation Principle: the Multi-agent Case,” Economics Letters, November 2000, 69 (2), 165–171. and , “Contracting with Imperfect Commitment and the Revelation Principle: The Single Agent Case,” Econometrica, July 2001, 69 (4), 1077–98. and , “Contracting with imperfect commitment and noisy communication,” Journal of Economic Theory, September 2007, 136 (1), 236–259. 44 Bisin, A. and A.A. Rampini, “Markets as bene…cial constraints on the government,”Journal of Public Economics, 2006, 90 (4-5), 601–629. Chari, V V and Patrick J Kehoe, “Sustainable Plans,” Journal of Political Economy, August 1990, 98 (4), 783–802. and , “Sustainable Plans and Debt,” Journal of Economic Theory, December 1993, 61 (2), 230–261. Clementi, Gina Luca and Hugo A Hopenhayn, “A Theory of Financing Constraints and Firm Dynamics,” The Quarterly Journal of Economics, 02 2006, 121 (1), 229–265. Dovis, Alessandro, “E¢ cient Sovereign Default,” Mimeo, Penn State, 2009. Farhi, Emmanuel and Ivan Werning, “Inequality and Social Discounting,” Journal of Political Economy, 2007, 115 (3), 365–402. and , “Insurance and Taxation over the Life Cycle,” Review of Economic Studies, 2013, 80 (2), 596–635. , Christopher Sleet, Ivan Werning, and Sevin Yeltekin, “Non-linear Capital Taxation Without Commitment,” Review of Economic Studies, 2012, 79 (4), 1469–1493. Fernandes, Ana and Christopher Phelan, “A Recursive Formulation for Repeated Agency with History Dependence,” Journal of Economic Theory, 2000, 91 (2), 223–247. Freixas, Xavier, Roger Guesnerie, and Jean Tirole, “Planning under Incomplete Information and the Ratchet E¤ect,” Review of Economic Studies, April 1985, 52 (2), 173–91. Golosov, Mikhail, Aleh Tsyvinski, and Iván Werning, “New dynamic public …nance: A user’s guide,” NBER Macroeconomics Annual, 2006, 21, 317–363. and , “Designing optimal disability insurance: A case for asset testing,” Journal of Political Economy, 2006, 114 (2), 257–279. , Maxim Troshkin, and Aleh Tsyvinski, “Optimal Dynamic Taxes,” NBER working paper 17642, 2011. Green, E. J., “Lending and the Smoothing of Uninsurable Income,” in E. C. Prescott and N. Wallace, eds., Contractual Arrangements for Intertemporal Trade, Minneapolis: University of Minnesota Press., 1987. 45 Hopenhayn, Hugo A and Juan Pablo Nicolini, “Optimal Unemployment Insurance,” Journal of Political Economy, April 1997, 105 (2), 412–38. Hosseini, Roozbeh, Larry E. Jones, and Ali Shourideh, “Optimal Contracting with Dynastic Altruism: Family Size and per Capita Consumption,”Journal of Economic Theory, 2013, 148 (5), 1806–1840. Kocherlakota, Narayana, The New Dynamic Public Finance, Princeton University Press, USA, 2010. Kydland, Finn E and Edward C Prescott, “Rules Rather Than Discretion: The Inconsistency of Optimal Plans,” Journal of Political Economy, University of Chicago Press, June 1977, 85 (3), 473–91. La¤ont, Jean-Jacques and Jean Tirole, “The Dynamics of Incentive Contracts,” Econometrica, September 1988, 56 (5), 1153–75. Ljunqvist, Lars and Thomas Sargent, Recursive Macroeconomic Theory, MIT Press, 2004. Luenberger, David, Optimization by Vector Space Methods, Wiley-Interscience, 1969. Marcet, Albert and Ramon Marimon, “Recursive Contracts,”Mimeo, European University Institute, 2009. Milgrom, Paul and Ilya Segal, “Envelope Theorems for Arbitrary Choice Sets,” Econometrica, March 2002, 70 (2), 583–601. Mirrlees, James, “An Exploration in the Theory of Optimum Income Taxation,” Review of Economic Studies, 1971, 38 (2), 175–208. Myerson, Roger B., “Optimal Coordination Mechanisms in Generalized Principal-Agent Problems,” Journal of Mathematical Economics, June 1982, 10 (1), 67–81. Phelan, Christopher, “Opportunity and Social Mobility,”Review of Economic Studies, 2006, pp. 487–505. and Robert M Townsend, “Computing Multi-period, Information-Constrained Optima,” Review of Economic Studies, Wiley Blackwell, October 1991, 58 (5), 853–81. Roberts, Kevin, “The Theoretical Limits of Redistribution,” Review of Economic Studies, April 1984, 51 (2), 177–95. 46 Rockafellar, R.T., Convex Analysis Princeton mathematical series, Princeton University Press, 1972. Royden, H.L., Real Analysis Mathematics and statistics, Macmillan, 1988. Rudin, Walter, Principles of Mathematical Analysis, third ed., New York: McGraw-Hill Book Co., 1976. Skreta, Vasiliki, “Sequentially Optimal Mechanisms,”Review of Economic Studies, 2006, 73 (4), 1085–1111. Sleet, Christopher and Sevin Yeltekin, “Credibility and endogenous societal discounting,” Review of Economic Dynamics, July 2006, 9 (3), 410–437. and , “Politically credible social insurance,” Journal of Monetary Economics, 2008, 55 (1), 129–151. Stantcheva, Stefanie, “Optimal Taxation and Human Capital Policies over the Life Cycle,” working paper, 2014. Stokey, Nancy L., Robert E. Lucas, and Edward C. Prescott, Recursive Methods in Economic Dynamics, Cambridge, MA: Harvard University Press, 1989. Thomas, J. and T. Worrall, “Income Fluctuation and Asymmetric Information: An Example of a Repeated Principal-agent Problem,” Journal of Economic Theory, 1990, 51 (2), 367–390. Yared, Pierre, “A dynamic theory of war and peace,”Journal of Economic Theory, September 2010, 145 (5), 1921–1950. 47 8 Supplementary material 8.1 Proofs for Section 3 Lemma 10 (a) Any PBE ( ; is payo¤ -equivalent to a PBE ( 0 ; G ; p) 0 ; p0 ) G 0 where sat- is…es St 0 t 1 t ; Mt ; ht ; ~ (b) Suppose ( ; ^ t; h ~t 2 Ht ries h " 1 X s E( ; G ) G ; p) t 1 1 ; St 0 t = t 1 t ; Mt ; ht ; ^ s us S t 1 ^t ; Mt ; h # = E( ; G) " 1 X 0 ; p0 ) payo¤ equivalent G t+s S S t 1 ; Mt ; Mt+s for all (zt+1 ; mt+1 ; :::; zt+s ) ; all u0t+s S t+s 1 1 ^ t ; zt+1 ; mt+1 ; :::; zt+s ; Mt+s ; h t : St ! t t 1 0 t S Ht B; d t 1 t 1 St t ; Mt ; B; ^ 1 ; M ; B; t R ~ t 1 ;^ ; t 1 ~ B; d for some ^ St R t A ; ^ 1 t 1 B t 1 ; St t R ; t 0 ;~ 1 ; M ; ht ; t ; Mt ; h # : (51) with a property that ~ t ; zt+1 ; mt+1 ; :::; zt+s ; ; Mt+s ; h ~ t ; zt+1 ; mt+1 ; :::; zt+s ; mt+s : ; Mt+s ; h analogously to the de…nition t; 0 t t ht ; ^ 1 ; t t 1 ; t 1 ht ; d t St 0 t = 0; set t t 1 B; d t ; ; jS t t 1; M t t 1; M t jS t ^t 1 ; M ; B; t 1 ; t = : t 1 ~ t 1 ; = t t 1 A ; ; t t ht ; ~ St 0 t 1 ; M ; ht ; t > 0 only if 1 ; t ^t 1 ; t t 1 A ; ^ for all ; >0 t can replicate the strategy of t and achieve the same payo¤ as that agent, and t is the optimal choice of agent ht ; ^ t S ~t by 1 ; M ; ht ; t t 1; M t jS t t 1 t 1 : Since any agent with a history the agent with a history s us t 1 1 1 ; t: and personal histo- G ; p) jS t+s Ht M ! for all ^ t 0 t 1 to ( ; = u0t+s S t+s > 0 de…ne strategy : For any Borel set A of t 1 t = t ; : St t 1 ;^ in the text. Let B be a Borel set of H t : For all B such that 1; M t t t 1 By construction, t 1 jS t t 1 For B such that 0 t ; t t 1 1; M t s t 1 0 t+s = ^ t ; zt+1 ; mt+1 ; :::; zt+s ; mt+s ; Mt+s ; h Proof. (part a). De…ne of R t for all ~ t s=t+1 Then there exists a PBE ( 0 ; jS t+s ; is a PBE and suppose that for some S t s=t+1 0 t+s 1 1 ; ; the new strategy t 0 satis- …es agents’best response constraint (4). Strategies 0 induce distribution government strategy G 0 which satis…es 0 t = satis…es feasibility (2) if agents play i t for all S t 0: 1; M t ; hence the Any posterior belief p0 that t all ^ 1 : satis…es (3) also satis…es payo¤ is E( 0; t ut jS G) t R = t 1 Z Ht = Z Ht and therefore ( 0 ; 0 ; p0 ) G p0t Z Z t 1 ; j d t 1 = R p0t jS t ; ht d ut ht pt jS t ; ht d ut ht d t j d G ; p) : is a PBE that is payo¤ equivalent to ( ; 0 G by 00 00 (z) = z= and : ( ; 1] ! [0; 1] by o n ^ t; h ~ t ; t+s as for all s 1, ht 2 h 0 t+s if zt+1 jht ; zt+1 ; mt+1 ; :::; zt+s ; t+s (z) = (z = t+s t ut jS G) 0 G t ; gives a 0 G when agents play : Therefore G ; p) : 0 and assume that 0 : [0; ] ! [0; 1] ) : De…ne strategies ( 0 ; ) = (1 ~ t; = ut+s h u0t+s ht ; zt+1 ; mt+1 ; :::; mt+s ; because otherwise (part b). For simplicity we drop explicit dependence on S t ; Mt+1 ~t ; ^ t > 0: Let ~t = ~t + ^t = t h and de…ne h t h t h t h 0 : Government’s No strategy when agents play G t 1 dht jS t = E( t would give a higher payo¤ to the government that strategy 0 ; p0 ) G ; dht jS t 0 t achieves the same payo¤ as ( ; higher payo¤ to the government that strategy ( 0; t 1 pt t 1 0 ) G (zt+1 ) ; mt+1 ; :::; mt+s ; ~ t; jh 0 (zt+1 ) ; mt+1 ; :::; zt+s ; t+s and ^ t; = ut+s h u0t+s ht ; zt+1 ; mt+1 ; :::; mt+s 0 t+s if zt+1 > jht ; zt+1 ; mt+1 ; :::; zt+s ; and u0s = us ; t+s = t+s 00 ^ t; jh (zt+1 ) ; mt+1 ; :::; mt+s ; 00 (zt+1 ) ; mt+1 ; :::; zt+s ; t+s 0 s = us for all other histories and periods s: ~ t; h ^ t could have replicated each other strategies after period t in Agents with histories h so they must have been indi¤erent between them. Pro…le 0 gives them the n o ^ t; h ~ t leaving all other histories unchanged, therefore same utility for all histories following h PBE ( ; G ; p), it is incentive compatible, i.e. satis…es (4). Strategy pro…le probability for any realization of ut as any p0 consistent with Bayes’rule, E Therefore ( 0 ; 8.2 0 ; p0 ) G 0 It assigns the same ; therefore feasibility constraint (2) is satis…ed. For t jh t =E 0 t jh t for all ht 2 H t hence (5) is satis…ed. is a PBE which is payo¤ equivalent to ( ; G ; p) : Arguments for maximization problem (17) and recursive formulation We …rst discuss convexity assumptions in problem (17). We have ut : 0: induces Ht ! [(1 ) v; (1 ) v] : Let t t : Ht ! be a space of all such ( t ; ut ), and let ii (Mt ) ; t be a probability measure on t : For any t and random variable x : t ! R we can deR …ne E t x = x (ut (m) ; t ; ) t (dmj ) d d and for an arbitrary 0t let E t 0t x = t Mt R x (ut (m) ; t ; ) 0t (dmj ) d d : As before let to denote the in…nite sequence f t g1 t=0 t Mt and extend the de…nition of expectation E 1 to V M1 1: With this notation, the modi…ed problem (9) can be written as 1 X max E t u t=0 subject to E C (u) E 1 X s t u E Wt + s=t E 1 X t u E (52) U (e) for all t; 1 1 X 0 t=0 and e t (53) 0 u for all (54) t=0 E " 1 X t u v t=0 # v: (55) Note that we wrote the promise keeping constraint (55) as inequality. We do this more general version to prove some technical results on properties of the Lagrange multipliers. We still maintain the assumption that distribution is such that in the optimum it holds with equality. The objective function and constraints are linear in and t t t and therefore convex. Let t t t be the Lagrange multipliers on (52) and (53). Then the constrained maximization problem can be written as max E 1 X t t=0 " 1+ t X s=0 s ! subject to (54), (55). Rede…ning variables Pt Pt t= 1 + s=0 s and t = t = 1 + s=0 u t t C (u) = t # 1+ E 1 X t t Wt (56) t=0 Pt s=0 s ; ^t = t= t 1; t = 0 we obtain (17) in a more general form. > 0 implies ^ t > and t > 0: If lim supt!1 t > s From the de…nitions of these variables, t P 0 then ts=0 s ! 1 (Theorem 3.23 in Rudin (1976)) and t = t ! 1. The arguments of P P1 t P1 t Sleet and Yeltekin (2008) establish that 1 t and t are all …nite, and t=0 t ; t=0 t=0 P1 P1 therefore t=0 t t and t=0 t t are also …nite. The Lagrange multiplier t > 0 in any …nite t: If it is not the case, then it is possible to give lifetime utility v to all families, which violates feasibility. The technical arguments simplify if, in addition, lim inf t > 0: Su¢ cient conditions for this result are given in the following lemma iii Lemma 11 Let be the best PBE. Suppose either that ution or that U is unbounded above and lim inf u!1 CC(u) 0 (u) t converges to an invariant distrib- > 0: Then lim inf > 0: t Proof. We …rst observe that it is incentive compatible to increase utility allocation for all histories by > 0 and that this increase satis…es (55). First, suppose that U is unbounded above. For > 0 de…ne and perturbation (1 ) 1+ t + t (56) evaluated at t ; i.e. "Z " From the de…nition of by t t X s=0 ( t ; ut + ) = t s t ( t ; ut ) for all ( t ; ut ) : Since t is optimal is feasible, this perturbation cannot increase the value of ! u t C (u) # # h t t C (u)] d t 0: (dmj ) d d i 0 t; 1+ t X s=0 s ! t Z [C (u + ) Since it should be true for all ; it implies that 1 E t C 0 (u) Suppose lim inf u!1 all u e C(u) C 0 (u) : t > 0; which implies that there is u ~ and > 0 such that C(u) C 0 (u) for u ~: Feasibility implies E t C (u) = E E t t [C (u) ju < u ~] P C (u) 0 C (u) ju C 0 (u) u ~ P t t (u (u < u ~) + E u ~) E t t C (u) 0 C (u) ju C 0 (u) C 0 (u) ju u ~ P t u ~ P (u t (u u ~) u ~) : Therefore, E t C 0 (u) = E t C 0 (u) ju < u ~ P max C 0 (u) P u2[0;~ u] max C 0 (u) + t t (u < u ~) + E (u < u ~) + t e C 0 (u) ju u ~ P t (u u ~) e u2[0;~ u] From strict convexity of C, C 0 (~ u) = maxu2[0;~u] C 0 (u), therefore, t C 0 (~ u) + e 1 > 0: This proves the second part of the lemma. Analogous arguments establish our result for invariant distribution since in the invariant distribution E C 0 (u) must be …nite. iv If assumption lim sup > 0 is not satis…ed, economy converges to that of Atkeson and t Lucas (1992), who showed that for a wide range of cost functions C (including those satisfying conditions of Lemma 11) all the mass of the lifetime utilities eventually gets concentrated arbitrarily close to the lower bound v; the result known as the immiseration. Immiseration violates constraint (16) and therefore lim sup > 0 in all speci…cations considered by Atkeson t and Lucas (1992). 8.2.1 Sketch of proofs of Propositions 1 and 2 We now adapt the arguments of Farhi and Werning (2007) to write problem (17) recursively. When t is strictly positive with lim inf t > 0; the series f t gt are bounded away from zero uniformly in t. The arguments of Lemma A2 of Farhi and Werning (2007) extend with minimal modi…cations to problem (17) to show that it can be written in a recursive form kt (v) = max E 2 t h ^ t C (u) + t+1 kt+1 (w) u t Wt i subject to v = E [ u + w] E [ u + w] E 0 [ u + w] for all 0 ; where kt (v) is convex. To achieve convexity of this problem, it is su¢ cient to randomize between only two points in t (the arguments are identical to the proof of Lemma 3 in Acemoglu et al. (2008)), and hence this problem can be written as Z i h 0 0 0 ^ kt (v) = max z u t C u + t+1 kt+1 w 0 0 0 (u ;w ; ); (u00 ;w00 ; 00 ); z2[0;1] + (1 z) (dmj ) d ( ) t Wt 0 00 (dmj ) d ( ) t Wt 00 Mt Z h Mt subject to Z v=z 0 0 u + w 0 0 u00 tC u00 + ^ t+1 kt+1 w00 (dmj ) d ( ) + (1 Mt z) Z i u00 + w00 00 (dmj ) d ( ) Mt and Z Z u0 + w0 0 (dmj ) d ( ) Mt Mt u00 + w00 00 (dmj ) d ( ) Z ZMt Mt v u0 + w0 ~ (dmj ) d ( ) for all ~ u00 + w00 ~ (dmj ) d ( ) for all ~ This is the Bellman equation (20). Then Proposition 1 and 2 can be established by direct adaptation of arguments in Farhi and Werning (2007). We show all the results for k0 ; for all other kt the arguments are identical. For brevity we drop explicit conditioning of all expectations on v: Continuity Since k0 (v) is concave, it is continuous on (v; v) : To show that it is also continuous on the boundaries, de…ne the value function k0 (v) = 1 max E ;u 0 1 X t [ t ut tC 1 X (ut )] + max t=0 subject to t t Wt t=0 1 X v=E t ! t ut : t=0 We have k0 (v) k0 (v) and k0 (v) is continuous. If utility is bounded below, then at v = v; the solution to k0 sets ut = U (0) for all t and to minimize Wt : This allocation is incentive compatible, therefore, k0 (v) = k0 (v) : Then continuity of k0 (v) at v implies continuity of k0 (v) at v: If utility is bounded above, then C (ut ) ! +1 as v ! v: Therefore limv!v k0 (v) = which implies limv!v k0 (v) = 1: Di¤ erentiability 1, The proof is an application of the Benveniste and Scheinkman theorem (Benveniste and Scheinkman (1979)). First, suppose that utility is unbounded. Fix any interior v0 and let (u ; ) be the optimal allocation for that v: For any v; let u ~0 = u0 + (v v0 ) ; u ~t = ut for all t > 0. Since utility is unbounded, this perturbation is feasible. Then (~ u; ) satis…es (11) and (12) for v: Let V (v) = 1 E 0 [ 1 0 We have k0 (v) ~t t [ tu tC (~ ut ) t Wt ] 0C (u0 + (v (57) t=0 = E + 1 X 0 (u0 + 1 X E (v v0 )) t [ t ut tC (ut ) v0 )) 0 W0 ] t Wt ] t=1 V (v) : Function V is di¤erentiable with V 0 (v0 ) = 1 0E C 0 (u0 ) : Since k0 is concave, by Benveniste-Scheinkman theorem (see Theorem 4.10 in Stokey et al. (1989)), k00 (v) exists and satis…es k00 (v) = 1 0E C 0 (u0 ) 1: To …nd the values of k00 (v) in the limit as v approaches K (v) = max E u; 1 X t=0 vi t [ t ut tC 1, de…ne a function (ut )] subject to (12). It it easy to show that K (v) is concave and limv! limv!1 K 0 (v) = 1: Let t = max 2 ;c 0 [ U (c) 1 1 X k0 (v) t c] t t 1 = 0 t=0 E 1 X t and ! t = 1 X E 0 [ t ut t C (ut ) t] 0 (v) = 1 and Wt ( ) : Then t [ t ut t C (ut ) t] 1 X E t=0 t=0 K (v) t min 1K t t Wt t=0 1 X t!t t=0 1 X t t t=0 1 X t!t; t=0 where the …rst inequality follows from the fact that the expression in square brackets is negative and t = 0 P1 t [ t ut E t=0 Since k0 (v) t and the second inequality follows from the fact that K (v) maximizes tC (ut )] without incentive constraints. K (v)+const and K (v) is concave, limv!1 K 0 (v) = 1: Since k00 (v) 1 and limv! 1K 0 (v) = 1; limv! 0 1 k0 (v) = 1: 1 implies limv!1 k00 (v) = Now suppose that utility is bounded below but unbounded above. Without loss of general- ity, assume that U (c) v v0 u 0. Then for any v0 ; the allocation is feasible), incentive compatible and attains v: Let V (v) = 1 0 E ; is feasible (since vv0 ut > 0 h P1 v v t v0 ut t C v0 ut t=0 t V (v) is concave, di¤erentiable and satis…es k0 (v0 ) = V (v0 ) ; therefore, k0 (v) is di¤erentiable at v0 by the Benveniste-Scheinkman theorem. A symmetric argument works if utility is bounded above (without loss of generality by 0) but not below. If utility is bounded above and below, a function V can be constructed separately for v v0 and v > v0 : This shows that k0 (v) is di¤erentiable. To establish the value of the derivatives in the limits, note that when utility is bounded, function K (v) still provides an upper bound to k0 (v) and limv!v K 0 (v) = that limv!v k00 (v) = v 1; which implies 1: When utility is unbounded below, we can de…ne V as in (57) for v0 , which shows that k00 (v0 ) 0 0 E C (u0 ) limv! 1 k00 (v) = 1: limv"v0 V (v0 ) = 1 arguments as for unbounded utility establish that 1: Then the same It remains to show the value of limv!v k00 (v) when utility is bounded below, without loss of generality by 0. Let 1 K (v) = subject to (12). Let max 1 X t 0 ut 2R+ t=0 v " X ( ) 2 ! ut # t C (ut ) 1 1 X t!t 0 t=0 be a Lagrange multiplier on (12) for a given v: The …rst order condition for ut is 1 0 t C (ut ) vii t t= 0 v; (58) t Wt i : where the inequality sign is due to the non-negativity constraint on ut : This expression implies that then 1: Therefore for the utilities bounded below, limv!v k00 (v) v t t= 0 ! 0 and there is some T such that 1 T T= 0 v 1: If lim sup t > 0, > 0: For such T the optimality condition (58) is satis…ed only for uv;T > 0: This is impossible since limv!v uv;t = 0 for all t; therefore if lim sup t > 0, then limv!v v = 1: From the envelope theorem, K 0 (v) = K (v) and k0 (v) = Since the solution to this problem is incentive compatible, k0 (v) K (v) : Therefore limv!v k00 (v) 0 limv!v K (v) 1 with limv!v k00 (v) v: = 1 if lim sup t > 0: The arguments for the proof of Proposition 2 mirror the proof of Theorem 2 in Farhi and Werning (2007). 8.3 Intermediate steps used in the proof of Theorem 2 and Corollary 2 We start with preliminary results. Lemma 12 Suppose that f is continuous on some interval [a; b] and one of its Dini derivatives is bounded. Then f is Lipschitz continuous on [a; b] : Proof. Without loss of generality suppose that D+ f (t) ; de…ned as D+ f (t) lim sup h!0+ is bounded by D: Let 1 (t) f (t + h) h f (t) ; = f (t) + Dt: It is continuous since f is continuous and D+ D+ f (t)+D 0: By Proposition 5.2 in Royden (1988) implies f (t00 ) f (t0 ) D (t00 1 = is nondecreasing, and therefore t00 > t0 t0 ) : Applying the same arguments to combining with the previous result, we establish jf (t00 ) 1 (t) f (t0 )j 2 (t) D jt00 = f (t) + Dt and t0 j for all t00 ; t0 2 [a; b] : Lemma 13 If Assumption 1 is satis…ed, then lim u!(1 In particular, for any v (1 C 0 (^ u) )v C 00 (u) = 0: [C 0 (u)]2 ) < a < b < v (1 C 0 (~ u) Ba;b j^ u (59) ) there exists a real number Ba;b such that u ~j for all u ^; u ~ 2 [a; b] : Moreover, for any " > 0; there is a such that Ba;b = (C 0 (b))2 < " for all b > a (60) a: For any v < a < b < v such that kt0 (a) < 1; function kt0 is Lipschitz continuous on [a; b] ^a;b such that and there exist a real number B v) kt0 (^ kt0 (~ v) ^a;b j^ B v v~j for all v^; v~ 2 [a; b] : ^a;b = (1 Moreover, for any " > 0; there is a such that B viii kt0 (b))2 < " for all b > a a: Proof. By de…nition C (U (c)) = c for all c: Di¤erentiate twice C 0 (U (c)) U 0 (c) = 1 and C 00 (U (c)) U 0 (c) 2 + C 0 (U (c)) U 00 (c) = 0: (61) Substitute the …rst expression into the second and regroup U 00 (c) : U 0 (c) C 00 (U (c)) = [C 0 (U (c))]2 If Assumption 1 is satis…ed, we obtain (59). Since U 00 is continuous, so is C 00 from (61). For any u ^; u ~ 2 [a; b] with u ~<u ^; 0 C (^ u) 0 C (~ u) = Z u ^ C 00 (u) du (^ u u ~) max C 00 (u) ; u2[a;b] u ~ where maximum is well de…ned since C 00 is continuous. Let u ^a;b = arg maxu2[a;b] C 00 (u) and Ba;b = C 00 (^ ua;b ) : Since C 00 (^ ua;b ) = [C 0 (^ ua;b )]2 a ! (1 C 00 (^ ua;b ) = [C 0 (b)]2 and u ^a;b ! (1 ) v; condition (59) establishes (60). ) v as Since function kt is concave and di¤erentiable, kt0 is continuous on [a; b] (Corollary 25.5.1 in Rockafellar (1972)). Let D+ be the right upper Dini derivative of kt0 , de…ned at each v0 as D+ kt0 (v0 ) lim sup v!v0+ kt0 (v) v kt0 (v0 ) : v0 Claim 1. D+ kt0 (v0 ) satis…es 0 D+ kt0 (v0 ) V 00 (v0 ); where V (v) is de…ned in (57). Note that by construction V is twice di¤erentiable with V 00 (v0 ) = tE v0 [C 00 (uv0 )], V (v) kt (v) for all v with equality for v = v0 and V 0 (v0 ) = kt0 (v0 ) : Since kt0 is decreas- ing, 0 D+ kt0 (v0 ) by de…nition. Suppose D+ kt0 (v0 ) < V 00 (v0 ): Then there exists v^ > v0 ; such that for all v 2 (v0 ; v^) ; kt0 (v) < V 0 (v) : If this is not the case, there must exist a sequence vn ; with vn ! v0+ ; such that kt0 (vn ) kt0 (vn ) vn V 0 (vn ) or kt0 (v0 ) v0 V 0 (vn ) vn V 0 (v0 ) for all vn : v0 Taking limits and invoking twice di¤erentiability of V; D+ kt0 (v0 ) kt0 (vn ) vn n!1 lim sup ix kt0 (v0 ) v0 V 00 (v0 ) ; which contradicts the assumption. If kt0 (v) < V 0 (v) for all v 2 (v0 ; v^) ; then Z Z v^ 0 kt (v) dv < v^ V 0 (v) dv; v0 v0 where the integrals are well de…ned since kt and V are concave and hence absolutely continuous by Proposition 5.17 in Royden (1988). Integrating and using the fact that kt (v0 ) = V (v0 ) ; we obtain kt (^ v ) < V (^ v ) ; establishing the contradiction. Therefore D+ kt0 (v0 ) V 00 (v0 ): Claim 2. kt0 is Lipschitz continuous on [a; b]. It is su¢ cient to show that V 00 (v0 ) = tE v0 [C 00 (uv0 )] is bounded on [a; b] and apply Lemma 12. From (28), kt0 (a) 1 tC 1 0 (uv0 ) 1 kt0 (b) j j for all v0 2 [a; b] : (62) Since kt0 (a) < 1; this bounds uv0 : C 00 achieves a maximum at that set, say at a point u ^a;b , ^a;b = t C 00 (^ ua;b ) : which implies that V 00 (v0 ) is bounded by B ^a;b satis…es the condition that for any " > 0; there is a such Claim 3. Lipschitz bound B 2 ^a;b = (1 kt0 (b)) < " for all b > a a: that B As a ! v; kt0 (a) ! close to (1 1 and therefore equation (62) implies that u ^a;b gets arbitrarily ) v for all a su¢ ciently high. By the …rst part of the lemma, this implies that C 00 (^ ua;b ) = [C 0 (^ ua;b )]2 approaches zero for high a: Hence ^a;b B [1 kt0 (b)]2 = 00 u ) a;b t C (^ [C 0 (^ ua;b )]2 C 0 (^ ua;b ) 1 kt0 (b) 2 00 u ) a;b t C (^ [C 0 (^ ua;b )]2 j j 2 t also approaches 0 as a ! v: 8.4 Proof of Proposition 4 Suppose C satis…es C (u) = 1 a u for a > 1: a (63) For all x > 0; de…ne a function kt (v; x) = 1 max E t u; " 1 X s us x s+t a 1 s+t C (us ) xa s=0 subject to (11) and (12). For x = 0 we set kt (v; 0) = 1 max E t u; " 1 X s=0 We prove several preliminary result …rst. x s+t s+t C (us ) # : s+t Ws # Lemma 14 Suppose C satis…es (63). Then kt (v; x) is continuous in (v; x) : Proof. For interior (v; x) it is immediate, so we show our result for boundary: if (vn ; xn ) ! (v; 0) then kt (vn ; xn ) ! kt (v; 0) : Since jkt (vn ; xn ) jkt (v; xn ) kt (v; 0) j kt (v; 0) j; and kt (v; x) is continuous in v for all x jkt (vn ; xn ) kt (v; xn ) j + 0 by standard arguments, it is su¢ cient to establish that kt (v; xn ) ! kt (v; 0) as xn ! 0: We show our result for k0 (v; x) ; the arguments are analogous for other periods. Let 1 X K (v; x) = max E u; t xa 1 t ut tC (ut ) t=0 subject to (12). Analogously with the proof of Proposition 1, K (v; x) is …nite for all x k0 (v; x) K (v; x) + xa 0 and const, therefore k0 (v; x) is bounded from above, and that bound can be chosen to be uniform for all x in the neighborhood of x = 0. Function k0 (v; x) is bounded below because u 0; C (u) 0 and Wt is bounded. Moreover, that bound can be chosen to be uniform for all x in the neighborhood of x = 0. Let (ux ; x) be a solution to k0 (v; x) for a given x: We show next that for all x in the neighborhood of x = 0: Since such that and b00 tC (u) b0t 00 j j bt u (u) is convex, there are for all u: Since t x t ut is bounded reals b0t and b00t > 0 zero, we can pick b0 t=0 is bounded away from to be independent of t: Then 0 k0 (v; x) = E b0 b0 x 1 X t xa 1 < 0 t=0 1 X this yields E x t uxt t=0 Since (ux ; x) 1 x t C (ut ) uxt xa E x 1 X t t Wt t=0 t+E b00 1 X xa t=0 1 X t+E t=0 For tC P1 x x 1 X t=0 1 X t t xa xa 1 1 uxt j 00 x j b ut uxt b00 xa E xa E b00 1 1 X t t Wt t=0 x 1 X t t Wt : t=0 t=0 b0 xa x 1 X t xa E x t=0 1 X t t Wt 0 k0 (v; x) : t=0 are optimal for x; incentive compatible and provide utility v to agent, "1 # "1 # X X 0 x E 0 E x t t C ut t( t C (ut )) ; t=0 t=0 P x where the right hand side expression is well de…ned since k0 (v; x) and E x 1 t=0 t ut are P1 x …nite, which implies that k0 (v; 0) lim supx!0 E x t C (ut )) : At the same time t=0 t ( "1 # "1 # X X x a 1 x 0 a 1 0 E 0 k (v; x) = E x xa t Wt xa t Wt t ut x t ut x t t C (ut ) t t C ut t=0 t=0 xi which implies that lim inf k0 (v; x) = lim inf E x!0 x x!0 where again we used boundedness of E x all v: " P1 1 X t( tC t=0 t t=0 # (uxt )) k (v; 0) ; uxt : Therefore limx!0 k0 (v; x) = k0 (v; 0) for First, we analyze the limiting case of x = 0: It has a recursive structure h kt (v; 0) = max E u;w; i ^ kt+1 (w; 0) C (u; 0) + t t+1 subject to (22) and (21). Any conditioning on z is redundant since C ( ; u) is strictly convex and then the Revelation principle implies strict convexity of kt ( ; 0). One can also easily show that kt ( ; 0) is di¤erentiable and decreasing. Let u0v (m) ; wv0 (m) ; 0 v (mj ) ;m be a solution to this problem. Lemma 15 Suppose that u0v (m0 ) ; wv0 (m0 ) 6= u0v (m00 ) ; wv0 (m00 ) for some m0 ; m00 ; 0 v 0 and (m0 j ) > u0v m0 + wv0 m0 = u0v m00 + wv0 m00 : Then tC and 0 v u0v m0 + ^ t+1 kt+1 wv0 m0 > tC u0v m00 + ^ t+1 kt+1 wv0 m00 (64) (m00 j ) = 0: Proof. Suppose tC u0v m0 + ^ t+1 kt+1 wv0 m0 Then setting ~ 0v (m00 j ) = 0 v (m00 j ) + < 0 v tC u0v m00 + ^ t+1 kt+1 wv0 m00 : (m0 j ) ; ~ 0v (m0 j ) = 0 and leaving all other re- porting strategies unchanged satis…es (22) and (21) and delivers a strictly higher value of h i ^ kt+1 (w; 0) ; contradicting optimality of 0 (m0 j ) > 0: E C (u) + t t+1 v Suppose (64) holds with equality. De…ned as u ~ = wv0 (m0 ) + (1 ) wv0 (m00 ) for some u0v (m0 ) ; wv0 (m0 ) u0v (m0 ) + (1 ) u0v (m00 ) and w ~ = 2 (0; 1) : The new allocation satis…es (21) since u0v (m00 ) ; wv0 (m00 ) indi¤erent between and any ^ 6= and m ^ that ^ sends with a positive probability, 0 ^u0 (m) ^ v ^ + wv (m) : It also satis…es (22) since for ^u0 m0 + w0 m0 v v xii is and 0 ^u0 (m) ^ v ^ + wv (m) ^u0 m00 + w0 m00 v v and therefore 0 ^u0 (m) ^ v ^ + wv (m) For any , by strict concavity of tC ^u ~ + w ~ : C and kt+1 (u ) + ^ t+1 kt+1 (w ) > tC u0v m0 + ^ t+1 kt+1 wv0 m0 = tC u0v m00 + ^ t+1 kt+1 wv0 m00 (65) : Augment the message space Mv with a message m? : De…ne ~ (m? j ) = 0v (m0 j ) + 0 (m00 j ) and ~ m j^ = 0 for all ^ = 6 ; and ~ (mj ) = 0v (mj ) for m 2 = fm0 ; m00 ; m? g, ? v ~ mj^ = 0 v mj^ for all ^ 6= and all m 6= m? : Similarly let (~ u (m? ) ; w ~ (m? )) = (u ; w ) 2 (0; 1) and (~ u (m) ; w ~ (m)) = u0v (m) ; wv0 (m) for all m 6= m? : That is we consider for any an augmented state space and a strategy in which type reports m? and receives (u ; w ) in all states in which she reported m0 ; m00 leaving all other strategies and allocations unchanged. The 3-tuple (~ u; w; ~ ~ ) is incentive compatible and delivers the same payo¤ v to the agent, but by (65) and the fact that ~ (m? j ) > 0 delivers strictly higher value to the planner. Therefore 0 v u0v ; wv0 ; cannot be optimal, leading to a contradiction. This lemma shows that each type can receive only one distinct allocation. Under some additional conditions requiring types to be su¢ ciently spread out, we can also show that each receives a distinct allocation from other types. Lemma 16 Suppose condition (33) is satis…ed. Then if 0 v 0 mi j = 0 for all 0 6= : 0 v (mi j ) > 0 for some ; then Proof. Previous lemma established that types can receive at most j j distinct allocations, so we restrict attention to only j j messages. Without loss of generality u0v (m1 ) u0v mj j : Suppose that there is an allocation u0v (mn ) ; wv0 (mn ) ::: that two types receive with a positive probability. Due to the previous lemma, they must receive it with probability 1. Pick the highest type that receives an allocation which is also received by some lower type. To simplify notation, call that type receives u0v (mn u0v (mn 1 ) ; wv0 (mn 1 ) = n and the single crossing property implies that u0v (mn ) ; wv0 (mn ) . For now assume that 1) : First, observe that it must be true that 0 n+1 uv (mn+1 ) + wv0 (mn+1 ) > xiii 0 n+1 uv (mn ) + wv0 (mn ) : u0v n 1 (mn also 2) < Otherwise, if this inequality is weak, the fact that u0v (mn+1 ) ; wv0 (mn+1 ) 6= u0v (mn ) ; wv0 (mn ) implies 0 n uv (mn ) + wv0 (mn ) > 0 n uv (mn+1 ) + wv0 (mn+1 ) : But then wv0 (mn+1 ) can be decreased and wv0 (mn ) increased while keeping 0 n+1 ) wv (mn+1 ) ( ( 0 n ) wv (mn ) + constant. For small changes that will be incentive compatible, and strict concavity of kt+1 ( ; 0) will imply that the perturbed allocation gives a higher value, contradicting optimality. Choose " > 0 small enough so that 0 n+1 uv Let 2 (") = (mn+1 ) + wv0 (mn+1 ) > n 1 "; 3 (") 1 (") = 1 = ( n 1 n n 2 1) [ 0 n+1 uv (mn ) + wv0 (mn ) + ( n+1 ( n) n 1) ": " and n n 1] " + n 1 n 2 n X2 ( i ) ": i=1 By construction, all 1 (") are positive and O (") ; and 8 1< ( n 1) [ n ( 3 (") = n 1] : n 2) n 1 j j X i=n 1 if condition (33) is satis…ed. Consider an allocation (~ u; w) ~ de…ned as u ~ (mn ) = u0v (mn ) + u ~ (mn 1) = u0v (mn 1) ( ( "; n 1) n) u ~ (mi ) = u0v (mi ) for i 2 = fn "; 1; ng ; and w ~ ( i ) = wv0 ( i ) + 3 (") w ~ ( i ) = wv0 ( i ) 1 (") 1 (") for i for i > n; n 1) w ~ ( n ) = wv0 ( n ) 2 (") ( n) w ~ ( n 1 ) = wv0 ( n 1 ) + 2 (") 1 (") : ( xiv n 1 (") ; 2; 9 = ( i) " ; 0 First, observe that (~ u; w) ~ satisfy (21) since j j X j j X ( i) [ iu ~ (mi ) + w ~ (mi )] i=1 = ( n 1) ( n n 1) " + 3 (") 0 i uv ( i) i=1 n X2 ( i) (mi ) + wv0 (mi ) 1 (") i=1 = ( n 1) ( n n 1) " +( n 1 n 2) " n X2 ( i) ( n 1) [ n n 1] " +( n 2) n 1 i=1 = 0: j ( i) " i=1 It also satis…es incentive compatibility. Note that for small " we have u ~ (m1 ) u ~ mj n X2 ::: and it su¢ ces to check local downward incentive compatibility. We have ~ (mn 1 ) n 1u = n 1 + w ~ (mn 1 ) n ( n 1) "+ 1+ 1+ ( n) so the incentive constraint for type ~ (mn 2 ) n 2u = n 2" so the incentive for type +( n 2 n 1 n 1 ~ (mn ) 1u ( w ~ (mn ) n 1) ( 2 (") n) = 0; is satis…ed. Also + w ~ (mn 2) n 2) " ~ (mn 1 ) n 2u n 1" w ~ (mn 1) = 0; is satis…ed. Similar arguments hold for all the other incentive constraints. Finally j j X ( i) i=0 = n X2 h i ^ C (~ u (m )) + k ( w ~ (m )) i i t t+1 t+1 0 ( i ) kt+1 (w ~ (mi )) ( 3 (") 1 (")) X ( i) h 0 0 ^ t C uv (mi ) + t+1 kt+1 wv (mi ) 0 ( i ) kt+1 (w ~ (mi )) ( 1 (")) + o (") : i=n+1 0 Since kt+1 < 0 and under condition (33) ( 3 (") positive for " small enough. This shows that If u0v (mn i=0 j j + i=0 j j X 2) = u0v (mn 1) ; 1 (")) u0v ; wv0 0; the expression above is strictly cannot be optimal. then the same steps as before go through if u0v (mi ) is reduced by " for all i such that u0v (mi ) = u0v (mn 1) and are adjusted accordingly. We are now ready to prove Proposition 4. Proof of Proposition 4. The solution to Bellman equation (20) may involve random- ization over several v~v (z) that in expectations deliver v; but v~v (z) ! 1 for all z as v ! 1. Consider maximization problem for a given z and assume that agents send reports over the message set M for all v: Using homogeneity properties of kt (v) ; if (u0v ; w0 ; xv v) is a solution i ! to kt (v), then (ux ; wx ; x=v x) 1 max E u;w; subject to (22) and v h xa 1 1 u0v ; v 1 wv0 ; v is a solution to the following problem for ^ t C (u) + t+1 kt+1 (w; x) u i xa t Wt ( ) 1 = E [ u + w] : Since u; w are bounded from below by 0, arguments analogous to (45) establish that h i 2 0; ( ( 1 ) ) 1 and that ux (m) lies in a compact set. Since Lemma 14 estab- wx (m) lished that kt+1 (w; x) is continuous in x; the Theorem of Maximum applies and solution correspondence (ux ; wx ; x) is u.h.c. in x: We show that there cannot be several types that send the same message m with a positive probability for low x; which establishes the result of the Proposition. First, observe that there must be some threshold x; such that for all x x no two types send the same message with probability 1. If this is not the case, we can choose a sequence xn ! 0 with solution xn satisfying such property, which by u.h.c. of 0 would imply that xn satis…es this property, violating Lemma 16. Next we rule out that several types send the same message with positive probability. Suppose that for any x we can …nd some x < x with this properties. There must be some type who is indi¤erent between messages m0 and m00 : In this case condition (30) holds and takes the form h xa 1 ux m0 h xa 1 ux m00 = xa t tC uw m00 + ^ t+1 kt+1 wx m0 ux m0 i tC ux m00 + ^ t+1 kt+1 wx m00 w t C uw m00 uw m0 w t C i uw m0 : Since ux ; uw lie in a compact set, taking sequence xn ! 0 we get, invoking upper-hemicontinuity again, h tC 0 u 0 m +^ t+1 kt+1 which violates Lemma 15. 8.5 w 0 0 m i h tC 0 u 00 m +^ t+1 kt+1 w 0 m 00 i = 0; Arguments for case with persistent types In the following proofs we assume that the set of feasible utility is bounded and, without loss of generality, we set the lower bound to 0. Also, we assume that the message set Mt chosen by the planner is …nite for all t. The proof that in the worst equilibrium there is no information xvi revelation to the government is the same as in the iid case. The payo¤ of this equilibrium depends on the government’s information that dissipates slowly due to the persistence of the shocks. Thus, the best payo¤ for a government that deviates at time t is ~ t ( t) = W sup fut+s (h)gh2H t ;s 0 Z 1 X s H t s=0 Ept+s [ jh] ut+s (h) d t (66) subject to the feasibility constraint (8) holding for all t + s and Z s j pt d jht for s > 0: pt+s jht = This payo¤ is a generalization of (7) in the iid case. Similarly to that case, we can bound ~ Wt ( t ) with a function that is linear in t 1 : Given p 2 ( ), de…ne the analogue of (13) as R s ! Z 1 (d s j ) s ut+s (m) X s : Wt ( ; p) = sup (dmj ) d j p d w fut+s (m)gs 0 (C (u (m)) e) t+s s=0 t;t+s Mt (67) As in the iid case, let uw t+s (m) denote the solution to (67), which is given by equation Ept+s [ jm] = w t;t+s C uw t+s (m) . By Langrange duality we can prove the analogue of Lemma 2 in the iid case. Lemma 17 The multiplier w t;t+s is uniformly bounded away from 0 and belongs to a com- pact set for all t and s. Therefore, uw t+s (m) belongs to a compact set in the interior of [(1 ) v; (1 ) v] for all m; ; t; s: Function Wt is well de…ned, continuous, convex in t, if and only if t 1 is uninformative. For any Z ~ t ( t) Wt W Ht with equality if t ; pt ; t 1 = t ; pt ; jht ; t Z ^t ; t;p t 1 uniformly bounded in t, and is minimized , ; pt ht 1 d t 1 dz; (68) : Proof. The objective function (66) is concave and the constraint set is convex and we can ~ t ( t ) as use the Lagrange duality and rewrite W R ( ) P1 s Ept+s [ jh] ut+s (h) d t dz t Z s=0 H ~ t ( t) = R P1 s W min sup : e f t;s 0gs 0 fut+s (h)g t;t+s s=0 t H t Z C (ut+s (h)) d t dz h2H ;s 0 (69) Let w t;t+s be the solution to the minimization problem. Since after deviating, the government ~ t period by period. doesn’t provide incentives, we can maximize W Z s w w sup Ept+s [ jh] ut+s (h) t;t+s C (ut+s (h)) + t;t+s e d t dz: fut+s (h)gh2H t ;s 0 Ht Z xvii w t;t+s By using the same argument as in the iid case we can show that is uniformly bounded away from 0 and uniformly bounded above for all s. This also proves that supremum in (69) is achieved. Also, ~ t ( t) = W f min t;s 0gs max fut+s (h)gh2H t ;s 0 max fut+s (h)gh2H t ;s 0 Z 0 R ( 1 X s H t Z s=0 P1 s=0 Ht Z P1 s t;t+s s=0 : Since t 1 max fut+s (h)gh2H t ;s = Z Ht 1 0 Z Ht max Z fut+s (m)gs 0 Mt 1 X s Z s=0 Z w t;t+s g Ht Z ) d j C (ut+s (h)) d t dz w t;t+s (C (ut+s (h)) e ) e) d t dz may not be a minimizer for an arbitrary w t;t+s (C Ept+s [ jh] ut+s (h) t (dmj R Ept+s [ jh] ut+s (h) d t dz Ept+s [ jh] ut+s (h) where the inequality follows from the fact that f t ; pt ; s 1 X pt d s=0 equation (68) must hold. (ut+s (h)) 2 s4 R e) d t dz s (d sj ) 3 ut+s (m) 5 d w (C (u (m)) e) t+s t;t+s s t 1 dz The same arguments show that we can maximize (67) period by period, thus, if we let R s ! Z (d s j ) s ut+s (m) ^ t+s ( t ; pt ) = max W d j pt d ; t (dmj ) w fut+s (m)g e) t;t+s (C (ut+s (m)) Mt the optimal uw t+s (m) satis…es 1 C 0 (ut+s (m)) = w t;t+s Z p d E jm; 2 ^ t+s is uniformly bounded in t + s, which implies that W 1 ; w t;t+s j j w t;t+s ; t, and pt and that we can re^ t+s by the Theorem strict ut+s (m) to a compact set. The latter implies continuity of W ^ t+s is uniformly bounded, we have that Wt ( t ; pt ) = of the Maximum. Finally, since W P1 s ^ Wt+s ( t ; pt ) is also bounded and continuous. s=0 The proof that Wt is convex in t follows exactly the same steps as the in the iid case. To see that Wt is minimized at an uninformative signal, let be uninformative and note that for any Wt ( t ; pt ) = max fut+s (m)gs max fut+s gs 0 0 Z Mt Z t (dmj d j pt d ) d j 1 X s=0 = Wt ( t ; pt ) xviii pt d 1 X s=0 s s ut+s 2 s4 w t;t+s (C R 3 u (m) s t+s 5 w e) t;t+s (C (ut+s (m)) s (d (ut+s ) sj ) e) Let (A) = R (Aj ) (d ) for any Borel A of message set Mt and let uw t+s be the optimal solution when the strategy is uninformative, then Wt ( t ; pt ) = max fut+s (m)gs 0 Wt ( t ; pt ) Z t (dmj ) Mt 20 Z 1 X s 4@ s s=0 (d s j ) d j pt d 1 s A ut+s (m) w t;t+s (C (ut+s (m)) The expression in square bracket is non-negative. Moreover, if is a set of messages A with (A) > 0 such that jE [ jA] uw t+s e) t w t;t+s C uw t+s is informative, then there 3 e 5 1j > 0: For all such messages the expression in square brackets is strictly positive since uw t does not satisfy the optimality condition 1 C 0 (ut (m)) = for m 2 A: Since mative (A) > 0; Wt ( t ; pt ) w E t;t [ jA] Wt ( t ; pt ) is strictly positive and, hence, any infor- cannot be a minimum. Similarly to the iid case, we replace the incentive constraint for the government E ~ t ( t ) with W E 1 X t s Z t ut Wt Ht t=s t Z jht ; ; pt ht 1 d t 1 dz P1 t s t=s (70) and use standard techniques to derive the analogue of the Langrangian (17) for the persistent case. The problem becomes 1 X L = max E u; subject to pt for all ht 1 ; ht t jh =R R t [ t ut tC (ut ) t Wt ] (71) t=0 mjht ; t t j mjht ; pt d j 1 pt jht d 1 d and m; whenever de…ned, the incentive constraint # " 1 # " 1 X X t t for all E 0 u E u t=0 1 ; jht 0 ; 1 ; (72) t=0 and E " 1 X t # u v = v; t=0 xix (73) t ut 1 for some non-negative sequences of Lagrange multipliers t ; t ; t t=0 with the property that ^ with strict inequality if and only if (70) binds in period t. t t= t 1 Finally, we adapt the arguments in Fernandes and Phelan (2000) with minor modi…cations and obtain a recursive representation of (71). In particular, we …rst rewrite the constranit set in a recursive form by adding the promise-keeping constraint (34) for each type . Since it is no longer optimal for agents to always report their types truthfully, unlike in the case studied in Fernandes and Phelan (2000), the agent’s previous type, , cannot be used as a state variable. Instead, the recursive formulation keeps track of the planner’s posterion beliefs about . Therefore, kt (v; p) = max fu;w; g ( j )2 (M );p0 2 ( ) Z p d E h i 0 ^ t C (u) + t+1 kt+1 w; p j u t Z Wt dz (74) subject to (34), R 0 p ( jm; z) = R (mj ; z) (mj ; z) j p d d j p d whenever de…ned, the incentive constraint E [ u + w ( ; ; ) j ; z] E 0 [ u + w ( ; ; ) j ; z] for all z; ; 0 : (75) We now extend the arguments of the simple example in Section 4.1 to the persistent case. As in the iid case, the assumption that the government can deviate at time t 1 only if it deviates at t = 0 implies that the Lagrangian (71) can be written as ( 1 ) X t L = max E t C (ut ) 0 W0 u; t=0 subject to (72) and (73). Consider the subgame starting from t = 1. From past reports, the planner has beliefs p 2 not bind at t ( ). Also, as in the iid case, since the sustainability constraint does 1, the standard Revelation principle applies and the agent will report his type truthfully (let m denote the message corresponding to type .) Thus, as in Fernandes and Phelan (2000), from time t type. At time 1, let 1 (v; p) 2 we can replace the posterior of the planner with the agent’s 1 (v; p) = be de…ned as max fu(m);w(m; )gm2M 1; 2 subject to v i = X m; j i X p i E 1C (u) + 2 (w; i u (m ) + w (m ; ) j xx i ; 8i )j i and u m0 + w m0 ; u (m ) + w (m ; ) The constraints are linear in (u; w), hence, if (u; w), 2 (w; ; 8m ; m0 ; : ), and equation for some (v; p), it is immediate to see that (x u; x w), also solve the Bellman equation for (x v; p) ; x > 0. Thus, 1 (v; p) xa 1 (v; p) solve the Bellman 2 (w; ), and xa 1 (v; p) is homogenous in v. Also, since the objective function is linear in p and the constraints are independent of p, 1 (v; p) is convex in p. At time 0, for any reporting strategy 0 (v; )= 2 (M0 ) ; let max E [ fu(m);w(m; )gm2M 0 (v; 0C ) be de…ned as (u) + 1 (w; p)] 0; 2 subject to p ( jm) = P (mj ) ( ; whenever de…ned, (34), and (75), where linear in (u; w) and If = un )2 P 2 2 (mj ) j j is the planner’s initial prior. Again, constraints are 1 (v; p) is homogenous in v, thus, 0 (v; ) will also be homogenous in v. P P un (mj ) j > 0, then is uninformative and Pr (m) = 2 2 p ( jm) = un (mj ) P 2 j = Pr (m) X 2 j : and the optimal allocation (uun ; wun ) is uun (m) = u and wun (m; ) = w ( ) for all . Suppose instead that is informative, then there exist m0 , m00 , and By convexity of 1 in p, E 1 wun ; pin E [ 1 (w un such that pin ( jm0 ) 6= pin ( jm00 ). ; pun )] : Finally, if the distribution of types is such that no bunching is desirable under commitment, the latter implies 0 (v; )> 0 (v; un ). To prove Proposition 5 we need the following Lemma. Lemma 18 Pr (uv;p (m; z) ! 0) ! 1 and Pr (wv;p (m; z; ) ! 0) ! 1 as v ! 0, for all m; ; and p. xxi Proof. From (34) for all p we have Z d j v = v;p (dmj ) Z Mt Z d j [ uv;p (m; z) + = Z Z d j [ uv;p (m; ~ z) + Z Z [ uv;p (m; ~ z) + wv;p (m; ~ z; ZZ wv;p (m; z; ) dz, 8m; : [ uv;p (m; z) + wv;p (m; z; )] dz wv;p (m; z; )] dz, m 2 Supp ( v;p ( j )) wv;p (m; ~ z; )] dz, 8m ~ )] dz, 8m; ~ . Z Here the fourth and …fth lines follow from nonnegativity of u and w; and the third line follows R from (75). Thus, limv!0 Z wv;p (m; z; ) dz = 0, 8m; ; p: Analogous arguments prove that R limv!0 Z uv;p (m; z) dz = 0, 8m; p. Since uv;p and wv;p are bounded below by 0; the latter implies that Pr (uv;p (m; z) ! 0) ! 1 and Pr (wv;p (m; z; ) ! 0) ! 1 for all m; ; and p. Proof of Proposition 5. Let Z uv;p (m; z) = Mt be any uninformative strategy. Let d j d v;p (dmj ) uv;p (m; z) and wv;p (m; z; ) = 1 Z v;p (dmj ) [ uv;p (m; z) + wv;p (m; z; )] 1 uv;p (m; z) : Mt Pro…le (uv;p ; wv;;p ; ) is incentive compatible and satis…es Z d j [ uv;p (m; z) + wv;p (m; z; )] dz v;p (dmj ) Mt Z Z [ uv;p (m; z) + wv;p (m; z; )] dz = d j Z Z Z = d j uv;p (m; z) + v;p (dmj ) [ uv;p (m; z) + wv;p (m; z; )] Z Mt Z = d j [ uv;p (m; z) + wv;p (m; z; )] dz = v v;p (dmj ) uv;p (m; z) dz Mt Z Therefore the value of the objective function (74) evaluated at (uv;p ; wv;p ; v;p ) should be higher than evaluated at (uv;p ; wv;p ; ) ; Z Z p d E p d E Z Z uv;p v;p v;p h uv;p tC (uv;p ) + ^ t+1 kt+1 (wv;p ; p0 ) t Wt ( v;p ; p) 0 ^ t C (uv;p ) + t+1 kt+1 wv;p ; p xxii dz t Wt ( ; p) j (76) i dz From Lemma 18, Pr (uv;p (m; z) ! 0) ! 1 and Pr (wv;p (m; z; ) ! 0) ! 1 for all m; ; and p and, thus, Pr (uv;p (m; z) ! 0) ! 1 and Pr (wv;p (m; z; ) ! 0) ! 1 for all m; ; and p: Also, similarly to the iid case, Lemma 3 in Acemoglu et al. (2008) implies that it is su¢ cient to randomize among only a …nite number of points fuv;p ; wv;p ; Z v;p (dmj ) d j Mt Z t Z (Wt ( v;p ; p) R t Z for all p. Since Wt ( Pr ( v;p (uv;p (m) uv;p (m)) C (uv;p (m))) t (C (uv;p (m)) 0 ^ + t+1 (kt+1 (wv;p ; p ) kt+1 (wv;p ; p0 )) Wt ( ; p)) dz (Wt ( v;p ; p) Wt ( ; p)) dz. Hence, in the limit equation (76) becomes Z lim sup t (Wt ( ; p) Wt ( v;p ; p)) dz 0; v!0 uous in Therefore, for each p Z converges to Pr (Wt ( p d v;p g. v;p ; p) v;p ; p) Z Wt ( ; p) by Lemma 17 and t > 0 by assumption, ! Wt ( ; p)) ! 1 as v ! 0 for all p: By Lemma 17, Wt ( ; p) is contin- and achieves its minimum only at uninformative reporting strategies, therefore ! ) ! 1 for all p: xxiii dz