A Theory of the Effects of Privacy
Abstract
The growth of information technologies has intensified concerns about individual privacy and highlighted the importance of consumer privacy rights. At the same time, these technologies enable significant personalization of communications and offerings. We develop a theory of the effects of privacy based on the concavity of an individual’s derived payoff function with respect to market beliefs: When this function is concave, privacy is valuable. We identify market conditions that lead to concavity or convexity in the derived payoff function. The framework is applied to contexts such as product choice, price discrimination, data breaches, and health insurance.
This paper was accepted by Raphael Thomadsen, marketing.
Funding: A. Bonatti acknowledges financial support through the National Science Foundation [Grant SES-1948692].
Supplemental Material: The data files are available at https://doi.org/10.1287/mnsc.2024.05354.
1. Introduction
The development of information technologies makes it possible to track citizens’ searches, communications, movements, and purchases in unprecedented detail. The potential for tracking means a loss of privacy, and understanding when tracking harms or benefits individuals has become a central policy and managerial issue.
However, the existing literature shows contrasting effects of privacy. Within a fixed market environment, the welfare effects of privacy can vary depending on market structure and which dimensions of information are revealed. For example, in simple price discrimination settings, a loss of privacy generally harms consumer welfare when firms learn about consumer preferences (Robinson 1933, Schmalensee 1981), but a loss of privacy can improve consumer welfare when firms are in competition or learn about cost differences across market segments (Thisse and Vives 1988, Chen and Schwartz 2015). Across environments, privacy can have opposite effects despite similar information being revealed. For example, information about consumer preference tends to harm consumers in price discrimination settings but can help in targeted advertising settings through improved match quality between products and preferences (Athey and Gans 2010, de Cornière and de Nijs 2016).
The divergent results within and across market environments call for a unifying framework that accommodates general information structures and market actions. This paper’s main idea is that a loss of privacy increases the precision of observers’ posterior beliefs about an individual’s preferences or behavior. In particular, a loss of privacy leads to a mean-preserving spread in the distribution of their beliefs. An individual’s welfare response therefore hinges on the curvature of their derived payoff in the observers’ beliefs: Concavity implies that a loss of privacy is harmful, and convexity that it is beneficial. This single mechanism offers a unified explanation for existing results and clarifies when new information helps or hurts consumers.
A familiar benchmark illustrates the logic. When the observer is the individual herself, the case of a single decision maker under uncertainty, more information is always valuable; the decision maker’s payoff is convex in their beliefs. In terms of privacy losses, if the observer is perfectly aligned with the individual (i.e., they act in their best interest), a loss of privacy is always beneficial to the individual.
This paper derives conditions on the indirect utility function of an individual and on observers’ responses to information about the individual’s type of behavior such that the individual’s derived payoff function is concave or convex. These conditions depend on three effects that collectively explain when and why privacy is beneficial or harmful within and across different market environments. First, the observers of the information may react in a way that is beneficial or harmful to the individual. Second, as more information leads the observers to react more steeply to any signals, the indirect utility function of the individual with respect to their actions may be convex or concave, which has implications for the derived payoff function in terms of the posterior beliefs. Third, the actions of the observers of the information may react to their posterior beliefs in a concave or convex way.1
Applying the framework clarifies several long-standing puzzles and, more broadly, offers a unifying theory of the effects of privacy, which has thus far proven elusive. Indeed, several existing results on privacy in diverse settings can be reinterpreted through this lens. For example, it explains some conditions for price discrimination to benefit or harm consumers. If price discrimination is based on consumer preference, the first effect typically dominates. That is, higher types suffer more from higher prices, and markets charge higher prices for segments with higher types. In contrast, if price discrimination is based on marginal costs, the third effect may dominate. When the optimal prices are concave in marginal costs,2 more information leads to lower average prices, which benefits consumers. We further consider the application of this framework to other settings, including various forms of targeting effects, health insurance, hacking, and the case of privacy being a value in itself. We show how the insights apply to these settings by identifying the shape of the individual’s derived payoff function and the dominating effect in each application.
This unified framework offers implications for both managers and policymakers. For managers, it provides a systematic approach to predict the environments when individuals will resist sharing information and when they might welcome personalization, guiding data-collection strategies. For policymakers, the framework helps differentiate environments where privacy regulations should be stringent (when information harms consumers) from environments where flexibility might be beneficial (when information benefits consumers).
As discussed in Acquisti et al. (2016), privacy has been defined in different ways, from the protection of an individual’s personal space (Warren and Brandeis 1890), to the control over one’s personal information (Westin 1967), to a dimension of dignity and academic freedom (Schoeman 1992). Related to the change in beliefs, Eilat et al. (2021) measure privacy as the Kullback–Leibler divergence between the prior distribution of beliefs over the types of an agent and the posterior beliefs induced by the equilibrium actions. In terms of this paper, we concentrate only on the aspects of observation by third parties of certain characteristics or behaviors of individuals (the effect on posterior beliefs). If privacy is seen as control over that information, then this paper can be understood as studying whether individuals should give access to that information, at what price, and to whom. If privacy is regarded as a dimension of dignity and academic freedom, then it can be considered a value in itself, and the paper also considers that case as an application, interpreting this possibility in terms of the framework presented here.
Regarding privacy as a value in itself, Farrell (2012) notes that privacy can be seen as a final good (value in itself) or an intermediate good, where privacy allows an individual to receive better offers from other agents. Regarding the value in itself, Farrell (2012) notes the “icky” feeling of being watched and tracked. In fact, at the extreme, most individuals may not find it pleasing if a technology is developed such that the individual is on camera at all hours of the day. However, some literature suggests that this value of privacy per se may not be very large, which has been called “the privacy paradox.” That is, although most individuals state a preference for privacy, they require relatively small incentives to forgo some dimensions of their privacy (Gross and Acquisti 2005, Barnes 2006, Adjerid et al. 2013, Athey et al. 2017, Johnson et al. 2020).
There has been significant work on different dimensions of privacy with a focus on specific market applications. Hui and Png (2006) and Acquisti et al. (2016) provide surveys on the economics of privacy. This literature reveals contrasting findings across different environments: In contracting environments, a loss of privacy may induce inefficient switching (Fudenberg and Tirole 2000, Taylor 2003), increase rent extraction (Calzolari and Pavan 2006), and facilitate adverse selection (Hermalin and Katz 2006), but may also prevent inefficient market exclusion (Hermalin and Katz 2006). In markets with repeat purchases, a loss of privacy may benefit consumers by intensifying competition (Villas-Boas 1999, Zhang 2011) or inducing consumers to strategically delay purchases (Villas-Boas 2004); the effect of privacy can depend on consumer sophistication (Taylor 2004) and the cost of anonymization technologies (Acquisti and Varian 2005, Conitzer et al. 2012). With behavior-based, targeted advertising, a loss of privacy may benefit consumers through improved ad relevance and more informed inferences (Athey and Gans 2010, Goh et al. 2015, Rafieian and Yoganarasimhan 2021, Shin and Yu 2021, Villas-Boas and Yao 2021) but may lead to misuse of data (Evans 2009), negative externalities (Goh et al. 2015), or higher prices (de Cornière and de Nijs 2016, Oery 2016, Ning 2021); the effect may vary with consumer annoyance of receiving advertising (Shen and Villas-Boas 2018, Lin 2020) and consumer flexibility in product preferences (Zhu et al. 2025). In personalized pricing settings, the effect of privacy may depend on market competition (Shaffer and Zhang 1995), fairness concerns (Li and Jain 2016), quality-adjusted costs (Rhee and Thomadsen 2017), consumer valuations (Amaldoss and He 2019), firms’ commitment power (Ichihashi 2020), network effects (Hajihashemi et al. 2022), and approaches to balance personalization with privacy (Chen et al. 2022, Lei et al. 2024). These contrasting results highlight the need for a unifying theoretical framework.
There is also literature on how agents (for example, consumers) react to privacy regulations, incurring costs to preserve privacy, or ceasing to interact with the market (Calzolari and Pavan 2006, Hann et al. 2008, Kummer and Schulte 2019, Montes et al. 2019, Chen et al. 2020, Jullien et al. 2020, Argenziano and Bonatti 2023, Choi et al. 2023, Fainmesser et al. 2023, Miklós-Thal et al. 2024).3
Another strand of literature addresses how the presence of more or less information in a market affects the market interaction between different economic agents (Vives 1984, 1988; Gal-Or 1985; Shapiro 1986; Raith 1996; Casadesus-Masanell and Hervas-Drane 2015; Johnson et al. 2023). In relation to that work, the presentation here does not focus on the interaction between economic agents but instead concentrates on the shape of the derived payoff functions and the market actions as a function of the information in the market. In this vein, the market segmentation analyses in Bergemann et al. (2015), Yang (2022), and Elliott et al. (2022) assess the impact of information design in settings with market power. In particular, Farboodi et al. (2025) use the concavity of the aggregate consumer surplus to sign the welfare effects of information facilitating third-degree price discrimination. Finally, Rhodes and Zhou (2024) explore fully personalized versus fully anonymous pricing in a competitive context.
One potential issue regarding privacy is that an individual may want some agents to have the individual’s information, but for that information not to go to other agents. The analysis of that more complex situation is not considered here, with the focus being solely on the situation of an individual facing a certain type of agent.
Another issue is that if individuals can credibly disclose information about their type (for example, disclosing verifiable information), these problems of privacy can unravel because the “best” types have an incentive to reveal their type to receive better treatment, and the market may be able to infer that the types that do not reveal their information are “less good” types (Posner 1978, Stigler 1980). This yields a situation where individuals do not have privacy. We consider settings where the individual’s type cannot be credibly communicated or where there are laws against such communication.
The remainder of the paper is organized as follows. Section 2 discusses a motivating example and Section 3 presents a general model tying an individual’s indirect utility function to that individual’s preferences for privacy. Section 4 derives conditions on the primitives of the problem, such that the derived payoffs of the individuals satisfy the conditions for privacy to be preferred or not to be preferred. Section 5 presents applications of the framework to several settings, and Section 6 concludes the paper.
2. Motivating Example: Health Insurance
Most privacy regulation involves sensitive personal data. One prime example of such sensitive data is health data. Part of the regulators’ concerns is that such data may be used to price discriminate in health insurance markets. This motivating example illustrates clearly the tradeoffs involved. In Section 5, we present several general market applications, including price discrimination, product and price targeting, and quality pricing and screening.
Consider the following setting, based on the insurance model of Hirshleifer (1971). An individual consumer may or may not have a certain gene. The presence of the gene causes health complications that cost L to address. The individual starts with wealth w and has a strictly concave direct utility function over total wealth. Perfectly competitive, risk-neutral insurers provide insurance to this market.
Each individual consumer has the gene with probability . The risk parameter is distributed in the consumer population with mean . The competitive insurance industry has access to individual-level data, which we model as observing an informative signal s of the type of each individual. Let denote the mean of the firms’ posterior distribution induced by signal s. By the law of total probability, we also have , that is, the posterior averages out to the prior.
Assume that the parameters are such that all consumers participate in the competitive insurance market, so there are no selection effects. This means that the following condition holds:
Every individual who generates signal s is fully reimbursed in case of a loss L and pays a competitive insurance premium of
This results in a fully insured wealth level .
Therefore, the aggregate consumer surplus generated under the signals s is given by
Under the prior distribution only (i.e., without additional signals), we instead obtain
In the rest of this paper, we derive a more general model that applies beyond the insurance setting. In the context of this example, the richer model will enable us to explore the robustness of the above conclusion to assumptions such as a binary distribution of health expenditures for each consumer, competitive insurance markets, full market coverage, and no moral hazard.
3. Model
Consider a representative consumer who interacts with a firm or market. The consumer has a preference type . These types can be discrete or continuous, depending on the application. They can also be seen as private actions taken previously (and nonstrategically) by the individuals. Let be the consumer’s indirect utility function if the consumer has type and the market takes action a.4 This indirect utility function will vary with the application considered and is discussed in the next section.5
The commonly known prior distribution of the consumer’s type is given by The market observes some data about the consumer, which we model as an informative signal of the consumer’s type . The signals s are drawn from a known information structure and allow the market to update its prior beliefs over . The market’s action a is a function of its beliefs about consumer type; that is, the market can tailor its actions to each consumer segment.6 Note that there is asymmetric information between the consumer and the market. The consumer knows whereas the market only observes the signal s.
One extreme is the case in which the signal is fully informative of . Another extreme case is that of an uninformative signal, that is, full consumer privacy. More generally, the signals induce market segmentation
A segmentation is a mixture with weights over distributions that jointly satisfy the law of total probability
In a learning interpretation (Yang 2022), a segmentation is the probability distribution over the market’s posterior beliefs over the consumer’s type. Equivalently, each signal realization s corresponds to a segment of the consumer population.7 The size of each segment s is given by , and its composition in terms of the consumer types is given by .
Under segmentation , each market segment s obtains an expected surplus of8
We define consumer welfare as the aggregate consumer surplus across all market segments. It is then given by
Note that represents the ex ante expected surplus before consumers learn their types. We use consumer welfare to refer to this aggregate (or ex ante) measure W, whereas consumer surplus refers to , the expected surplus for a specific segment s with distribution . In the special case of full privacy, ,9 the market takes a single action , and consumer welfare is given by
We are interested in whether a loss of privacy—a more informative segmentation in the Blackwell order, which we define below—is beneficial or harmful to consumer welfare. We evaluate the question of privacy before individuals learn their types or take their actions. That is, privacy policies are set when the market and individuals have symmetric information about , after which the market observes a signal s and takes action a, and finally, the consumer observes their type and may respond accordingly.
This setup aligns with many real-world environments, such as platform-mediated interactions. In such settings, consumers engage with multiple firms through a common platform, such as an online marketplace or a content ecosystem. When consumers make decisions about their privacy settings, they typically cannot anticipate which specific firms they will eventually interact with or what particular preferences they might have during future transactions. Moreover, the signals that firms receive about consumer types are derived from behaviors across multiple products and interactions, making strategic withholding or distortion of information difficult for any specific anticipated transaction.
Given this structure, we assess the value of privacy from the perspective of the entire consumer population, evaluating expected outcomes across the distribution of consumer types . We recognize that privacy regulation likely has heterogeneous effects on the population, and we take an aggregate welfare perspective.
Because the type space and distribution are unrestricted, the model presented here can be seen as providing a unifying framework to understand the effects of privacy in a multidimensional setting. This unifying framework can be seen as relatively straightforward, but making progress in this direction has been relatively slow.
This framework also offers an interpretation of the monetary value of privacy. When consumer utility is quasi-linear in money, the difference represents the consumer’s willingness to pay to maintain privacy to prevent information disclosure corresponding to segmentation . From the market or an observer’s perspective, this same quantity can be understood as the cost of acquiring the corresponding information.
The model does not formally consider the strategic interaction between the consumer and the market and summarizes that interaction by the market action That is, the point of the paper is just to consider the effects of the market action and the consumer utility properties on whether privacy is beneficial or harmful to the consumer. Considering how the strategic interaction between the consumer and the market affects the market action is beyond the scope of this paper and will depend on the specificities of the market interaction. In the applications considered below, this strategic interaction between the consumer and the market has to be formally considered to understand the properties of the market action in that particular application.
Finally, the relationship between signals and posterior beliefs characterized by mean-preserving spreads of the prior is a core result in the Bayesian persuasion literature (Kamenica and Gentzkow 2011), where signals chosen by the sender induce distributions over the receiver’s posteriors. In that framework, the sender’s benefit from persuasion at every prior depends on the curvature (concavity or convexity) of their utility with respect to the receiver’s posterior beliefs.
3.1. Discussion of Limitations
The setting presented here does not account directly for the possibility of individuals taking actions to protect their privacy. For example, it could be that individuals decide not to participate in the market to protect their privacy (Jullien et al. 2020). In such a case, could represent the equilibrium action of the market if the individual did participate in the market and lost their privacy, and could account for the decision of the individual not to participate in the market.
Similarly, individuals could incur costs to preserve their privacy (Montes et al. 2019). In that case, could represent the equilibrium action of the market if the individual did not incur costs to preserve their privacy, and could account for the decision of incurring costs to protect one’s privacy. One possibility, not accounted for in the model above, is that the individuals could distort their type in response to market behavior (Calzolari and Pavan 2006). This is an interesting avenue for future research.
Another issue to consider is that this framework allows comparisons between privacy regimes in which one captures beliefs that are a mean-preserving spread of the beliefs generated by the other regime, which can be a relevant comparison on the degree of privacy. However, the framework is not informative when comparing privacy regimes that cannot be ordered in terms of the mean-preserving spread.10
The setup considered here assumes that individuals are fully informed about the privacy regime in which they are making decisions. In the real world, individuals may have incomplete information about the privacy regime and only learn through experience about how they are exposed. This is an interesting general issue to consider when modeling the effects of privacy, but it is beyond the scope of this paper.
4. Effects of Privacy
As mentioned, we are interested in conditions under which a loss of privacy is harmful (or beneficial) to consumers as a whole. To formalize the comparison between different degrees of privacy, we use the Blackwell ordering (Blackwell 1951), which provides a precise way to rank information structures according to their informativeness.
(
To build intuition for this definition, let us first examine a simple special case. Consider a scenario where segmentation represents the prior belief, that is, the signal s is completely uninformative and , and segmentation is induced by an informative signal . In this case, (3) holds with transition kernel , which simplifies to the law of total probability (2). This indicates that when market participants update their beliefs in a Bayesian manner with consistent priors, the resulting distribution of posteriors is always a mean-preserving spread of the prior distribution.12 When comparing segmentations induced by different signals, the definition of mean-preserving spread through (3) can be seen as a generalization of the law of total probability.
To get further intuition, consider an example with binary types, in which the consumer’s type is one-dimensional and takes one of two values, with In this case, the distribution over types for signal s is fully characterized by a scalar Denote the prior by Suppose that under the prior, both types are equally likely, The market may receive a high signal or a low signal .13 For a segmentation that is completely uninformative,14 we would have In other words, the posterior distribution over low and high types after receiving a high signal would be and the distribution over types after receiving a low signal would also be
Compare this now with a segmentation that is completely informative. In that case, the posterior beliefs after receiving a low signal would be and the posterior beliefs after receiving a high signal would be That is, the distribution of posteriors in the perfectly informative segmentation case (i.e., perfectly informative signal) is a mean-preserving spread over the prior distribution, which is the same as the distribution over posteriors of the completely uninformative segmentation (i.e., completely uninformative signal).
In the case of binary types, as the realization of posteriors is uni-dimensional, we can also order the signals s, such that a higher s corresponds to a higher posterior belief on the type We can then have that the mean-preserving spread condition (3) is equivalent to the familiar one-dimensional condition where represents the cumulative distribution function of the posterior beliefs under segmentation , and represents the cumulative distribution function of the posterior beliefs under segmentation S.15
Note that the definition of privacy considered in this paper is based on the Blackwell order. Thus, it is equivalent to the statement that the more private segmentation is a garbling of that is, it can be seen as adding noise to the information constructed with segmentation through the transition kernel Using Rothschild and Stiglitz (1970), this is also equivalent to the result that if the expected consumer surplus V is concave (convex) in the posteriors, then consumers prefer more (less) privacy, that is, 16
Now we turn to the central question of this paper: the relationship between information revelation (privacy loss) and consumer welfare. Consider first the case of full privacy versus an informative segmentation. If the consumer retains their privacy, they obtain . If the individual consumer loses their privacy, their welfare is given by . Given that the distribution of the posteriors is a mean-preserving spread of the prior a simple application of Jensen’s inequality establishes that if is a convex (concave) function of the belief distribution F.
The first result uses the definition of a mean-preserving spread to generalize this observation to segmentations induced by different signals. Consider any two segmentations and . We say that consumers prefer more (less) private segmentations if whenever is more private than .
(
Intuitively, the curvature of V determines the consumer’s preferences for privacy. When V is concave, geometrically, this means the consumer surplus function “curves downward” as the market forms more varied beliefs about consumer types (represented by the spread of posteriors). For example, in the earlier binary type example, this means moving from a single point representing the prior belief to posterior beliefs and reduces consumer welfare. However, what determines whether V is concave or convex? In the remainder of this section, we discuss properties of the functions and of the market’s action that determine the shape of V. Using these properties, we provide a decomposition of three sources that explain why consumers prefer more (less) private segmentations when V is concave (convex).
4.1. Binary Types
We first consider the case of binary types to build intuition. Suppose that the consumer’s type is one-dimensional and takes one of two values, , with . Then, let , and compute the expected surplus of a consumer segment with distribution . With a slight change of notation (i.e., defining V to be a function of f rather than the distribution), we obtain
4.1.1. Consumer–Market Alignment.
As a warm-up exercise, suppose that the market acts in the interest of the individual. This is the traditional case of a single decision maker under uncertainty. The market therefore chooses the following action:
In this case, it is a standard result that V is convex in f, and hence convex in the distribution of types. The typical proof of this result is that the right-hand side of (4) is linear in f, and therefore, the maximum V over a has to be convex in f.
It may be nonetheless useful to consider a derivation of this result with differentiation techniques. We will assume that is twice continuously differentiable, that a is chosen from a continuous open set, and that the market’s problem in (4) is concave.
By the envelope theorem, we have
Differentiating the first-order condition for a, we have
Thus, if the high-type benefits from a higher action more than the low type, then a market that acts in the consumer’s interest increases a when it assigns a higher probability to .
4.1.2. General Market Actions.
Without knowledge of the market’s objective function, the model yields a more general expression that highlights additional economic forces. In particular, we differentiate twice and let subscripts denote the partial derivatives of U. We then have
This expression highlights three effects of information: (5) captures that assuming a is increasing, information helps the consumer if higher types also have a higher marginal utility of a17; (6) captures that information helps if U is convex in a on average; and (7) captures that information helps if a is convex in f and U is increasing in a on average.18
4.2. General Types
For more general type spaces, we must use a different approach to highlight these economic forces. By definition, V is convex if and only if, for any and any two distributions of types and , it holds that
Using the definition of , we can rewrite this condition as
We can expand the expression above to obtain the following characterization of convex and concave surplus functions V.19
(
Note that the terms in (8) are nil when consumer indirect utility U is additively separable in type and market action a. Moreover, (9) is zero when U is linear in a, and (10) is zero when a is linear in the belief distribution F. All three conditions hold simultaneously in the knife-edge case where V is both convex and concave, making consumers indifferent to privacy. In the health insurance example, this occurs when consumers’ direct utility function u is linear, leading to an indirect utility function U that is additively separable in and a, and with a linear in F. To gain some intuition for Proposition 2, we consider three particular cases, each isolating one of the effects captured by (8), (9), and (10).
4.2.1. Degree of Consumer-Market Interest Alignment.
The first example is regarding (8), which measures whether consumer’s and market’s interests are aligned. When high-type consumers benefit more from certain market actions than low-type consumers, and markets take those actions more frequently when they believe they face high-type consumers, privacy becomes less valuable.
Suppose that consumer indirect utility U is linear in market action a and that the market action a is linear in belief distribution F. In this case, the effects (9) and (10) are zero. We then obtain the following simpler conditions.
(
When is linear in a and is linear in F, the shape of is determined by the correlation between consumer marginal utility for market actions and the market’s response to consumer types.20 If the high consumer types like increases in a more than the low types and the market chooses a greater a when it has a higher belief distribution, the consumer prefers less privacy. Conversely, V is concave and privacy is preferred if the types that prefer action are also less likely under distribution , that is, if the interests of the market and the agent are misaligned.
As an illustrative example, consider a content platform that customizes the quality level a based on user types. The consumer’s indirect utility function is given by21
This means that the higher types gain more utility from quality improvements. The platform’s revenue comes from advertising, which increases with user engagement time (proportional to ). With quadratic costs of quality provision, the platform’s quality choice becomes proportional to its posterior mean:
In particular, the platform provides higher-quality content when it believes its audience has stronger preferences. Applying Condition (11) yields
This example illustrates that privacy preferences can emerge even when consumers are risk-neutral in market actions (as indicated by the linear utility function). In this case, these preferences stem from the correlation between consumer types and market actions rather than from risk aversion. For more general functions U and a, Expressions (9) and (10) uncover the further effects, including those related to risk attitudes, which we explore in the following examples.
4.2.2. Consumer Preferences for Variability in Market Actions.
The second example shows that the term in (9) captures the effect that with more information, market actions are more varied and that can be beneficial or harmful to consumers depending on whether the indirect utility function is convex or concave in a. Specifically, the convexity of U in a is a force for the consumer to prefer less privacy.
To isolate this effect, suppose that consumer indirect utility U is additively separable in consumer type and market action a, meaning type does not affect how consumers value the action and that the market action a is linear in belief distribution F. This nullifies Effects (8) and (10), leading to the following.
(
When is additively separable in and a and is linear in F, the shape of is determined by the convexity/concavity of in a. If consumers prefer more variable market actions, they prefer less privacy.
As an illustrative example, consider a content platform similar to the previous example, but where consumer utility is additively separable in and a. When the platform chooses quality a, the consumer’s indirect utility function is given by23
That is, the consumer’s indirect utility is independent of types, and the consumer prefers content with higher variability. The platform’s advertising revenue is proportional to the time the consumer spends on the platform (proportional to a) multiplied by the ads’ click-through rate (proportional to ). With quadratic quality costs, the platform’s quality choice is proportional to its posterior mean that is, it provides higher-quality content when it has a higher belief distribution, as in the earlier example. Then, Corollary 2 implies that V is convex and the consumer benefits from less privacy. Intuitively, this is because a loss of privacy leads to higher variability in the platform’s quality choice, and the consumer prefers this variation because they value quality improvements at an increasing rate.24
4.2.3. Market Response Curvature to Information.
Finally, the term in (10) captures the effect of whether the market action is convex or concave in the market’s beliefs. Therefore, if the consumer on average likes a higher market action, then the convexity of a also induces the consumer to dislike privacy.
To isolate this effect, we consider a setting where consumer indirect utility U is additively separable in consumer type and market action a, and is linear in the market action a.
(
When is additively separable in and a and linear in a, the shape of is determined by the convexity/concavity of a in F. Suppose that is convex in F and consumers prefer higher market actions, then, with more information, the market chooses a higher a on average. Thus, the convexity of would be a force for the consumer to prefer less privacy.
As an illustrative example, we again consider the content platform example but now with a linear consumer utility in a. The consumer’s indirect utility function under quality a is given by25
That is, consumer indirect utility is independent of types. Similar to the second setup, the platform’s advertising revenue is proportional to the time a the consumer spends on the platform multiplied by the consumer’s type .
Unlike the earlier examples, we now vary the platform’s cost structure. The platform incurs costs proportional to , where is a constant. The parameter captures different production technologies. A small can capture the economies of scale or learning-by-doing effects, and a large can capture resource constraints. This cost structure leads to the platform’s quality choice
Then, Corollary 3 implies that when , V is convex, and the consumer benefits from less privacy. Intuitively, economies of scale or learning-by-doing effects enable the platform to disproportionately increase quality when it has more precise information about high-value users. In contrast, when , resource constraints make concave, leading consumers to prefer privacy.
5. Applications
We now consider several applications of the general framework that we present.
5.1. Price Discrimination with Linear Demand
Consider the case of price discrimination with linear consumer demand. Let us apply the results in Proposition 2 to a setting with multidimensional consumer types and market actions.
For this application, consider a consumer with a two-dimensional type . The market first observes a signal of the consumer’s type and determines the prices and of two products. Then, the consumer observes their type and chooses the quantities purchased. Let the direct utility function be given by
The consumer’s indirect utility function, when taking into account their optimal quantities purchased, is then given by
We separately consider the case of monopoly (one firm sells both goods) and price competition (two firms, each selling one good). The monopoly prices for Demand System (12) under distribution F are given by, for all ,
This immediately yields an expression for the consumer’s value function in a segment with type distribution F:
We can then apply Proposition 2 and obtain the following proposition. Note that the market actions are linear in distribution F, so the third term (10) is nil.
(
This result mirrors the classic analysis of third-degree price discrimination by Robinson (1933) and Schmalensee (1981). This effect comes from the fact that the first term (8) is negative, the second term (9) is positive, and the first term dominates the second term. Intuitively, the first term, measuring the degree to which the consumer’s and the market’s interests are aligned, is negative because higher types suffer more from higher prices, but with an informed monopolist, the prices will increase more for segments with higher types on average. The second term, measuring consumer preferences for variability in market actions, is positive because of the convexity of the indirect utility function in prices. This convexity arises because consumers optimally choose higher quantities when prices are lower, creating an increasing benefit from price reductions. One could have imagined that this convexity is a force that drives the value of privacy down, but in this quadratic model, this is insufficient to overcome the first term.
For managers and policymakers, this result suggests that privacy regulation protecting consumer data from monopolistic sellers could benefit consumers overall. It may prevent firms from segmenting markets and charging personalized prices that would extract more surplus from high-value customers. Managers of monopolistic firms seeking to implement personalized pricing strategies should recognize that such practices create consumer welfare losses and may trigger regulatory scrutiny or consumer backlash.
We now turn to the case of price competition and show that the result extends to this case. The Nash equilibrium prices for a commonly known distribution of types F and Demand System (12) are given by
Substituting into the utility function and taking expectations over types under distribution F, we obtain
(
The intuition of Proposition 4 is similar to that of Proposition 3: In the duopoly setting, the first term (8) also dominates the second term (9). Interestingly, competition between firms does not change the privacy preferences of consumers in this setting. This suggests that competitive forces alone are insufficient to address consumer privacy concerns regarding price discrimination. Managers in competitive markets should note that access to consumer data does not necessarily benefit consumers. Policymakers should consider that privacy regulations may benefit consumers in seemingly competitive markets.
Finally, we consider the case of a price- and quality-discriminating monopolist introduced by Argenziano and Bonatti (2023). Let the consumer’s direct utility function from consuming q units of good of quality y be given by
A monopolist seller can produce quality y at a quadratic cost
Argenziano and Bonatti (2023) show that the optimal price and quality levels for a monopolist with beliefs F over the consumer’s type are given by
This allows us to write the expected consumer surplus in a segment F as
Note that the term (the expectation of the random variable ) is by definition linear in the probability distribution F. The term (the square of the expectation of ) is convex in F. Also, the assumption implies . Therefore, the sign of determines the convexity/concavity of V.
(
In this example, the third term (10) is also nil, and whether the first term (8) dominates the second (9) depends on the parameter values. Intuitively, higher types suffer more from higher prices and benefit more from higher quality levels. When the cost of investing in quality is high, or consumers’ marginal valuation for quality is low, the prices are sensitive to consumer types (which harms consumers), whereas the quality levels are not sensitive enough to consumer types (which benefits consumers, but insufficiently). This would be a force for the first term to be more negative, resulting in an effect of preference for privacy.
The result implies that managers should carefully assess their cost structure and consumer preferences when determining data collection strategies. In markets with high costs in quality improvements or where quality improvements deliver limited value to consumers, firms should expect resistance to personalization efforts. Policymakers may need to develop context-specific privacy regulations that consider industry cost structures rather than implementing one-size-fits-all approaches. This parameter-dependent result illustrates how the theoretical framework provides nuanced guidance for privacy regulation across different market conditions.
5.2. Targeting with Product and Price
A firm chooses both its location and its price. Consider a unit segment and a consumer who is located at either zero or one. The consumer’s location is their type The type- consumer’s downward-sloping demand function is if the product is located at x and the firm charges a price p. The consumer’s indirect utility function is given by
The firm can only offer one product and has zero production costs. Initially, the firm has a prior belief over the consumer’s location. In addition, the firm receives a signal s of the location of the consumer, which is fully summarized by the posterior probability
The problem of the firm is to choose where to position itself and what price to charge. In this example, the market action has two dimensions: the location of the product, x, and the price charged, p. Let and be the location and price that the firm chooses after receiving a signal, respectively. The firm therefore solves the following problem:
Given that the expected profit is linear in x, the firm will choose to locate at if and only if . Therefore, the firm’s optimal price choice is given by
Finally, the surplus of a consumer in a segment with proportion of types satisfies
Figure 1 illustrates the result for this example.

Let us now consider using Proposition 2 for this application. From the discussion above, we can see that an increase in , from below to above , can induce a discrete change in the firm’s location from to . The direction of this change benefits the consumer on average. Intuitively, the revelation of information can make a firm choose a location that better matches consumer preferences. This effect will tend to dominate in (8). To see that an increase in makes the firm execute a substantial change in location, consider a variation of the model above where there are convex costs of the firm locating farther away from The lower the convexity of these costs is, the greater the effect an increase in on the firm’s change in location, and this application considers the case where this convexity is the lowest, so that the change in location is the highest.
Consider now the effect of the price decision. Holding fixed the firm’s location at , an increase in makes the firm increase its price, so the welfare effect of information through pricing is negative, similar to the intuition in the price discrimination examples. As this effect is small compared with the location decision in this application, the first term (8) is positive.
Furthermore, the second term (9) is positive because consumer indirect utility is convex in both location and price. This convexity arises, similar to the price discrimination examples, because consumers can adjust their demand in response to prices and product locations. Finally, if the revelation of information does not modify the firm’s location choice, the third term (10) is zero at the firm’s optimum, because the pricing decision is linear in We summarize this discussion in the following proposition.
(
This model can also be interpreted as targeted advertising. Consider two types of consumers that are interested in two products respectively, located at . Each product has an ad, and a platform chooses which ad to display (x) and the frequency at which it is displayed (p). When consumers visit the platform, they observe which ad is displayed and spend time on the platform. This engagement time is decreasing in the mismatch between consumer interest and ad content and the ad frequency p. Moreover, the platform’s revenue comes from advertising, which is proportional to user engagement time multiplied by ad frequency.26 In this interpretation, the results show that consumers benefit from the revelation of information only when it causes the platform to significantly improve ad relevance. This illustrates the theoretical framework’s first effect. When information revelation leads to better preference-content matching, consumers may dislike privacy. However, if information merely enables higher ad loads without improving relevance, consumers prefer privacy.
These examples illustrate that the revelation of the same piece of information can be welfare enhancing or welfare reducing, depending on how firms use it.27 The results offer important implications for managers and platforms. For managers, the findings suggest that consumers benefit from personalization when it significantly improves product positioning or ad relevance but are harmed when data are primarily used for price discrimination or personalizing ad load. To enhance consumer welfare, managers should prioritize using consumer data to better match products to consumer preferences or improve ad relevance. From a policy perspective, regulations should distinguish between different uses of consumer data rather than treating all data collection uniformly. Privacy policies that allow data use for improving product matching whereas limiting price discrimination could enhance consumer welfare. Policymakers might consider context-specific regulations that permit targeted content but restrict personalized pricing.
5.3. Price Competition with Targeting
Consider the case of Thisse and Vives (1988). The consumer’s type is uniformly distributed on the segment [0, 1] and two competing firms are located at the extremes of that segment. A consumer buys one unit and gets utility when buying from a firm located at a distance x from the consumer at a price p. Let and be the prices of the firms located at zero and one, respectively. The distance from the firm located at zero is and the distance from the firm located at one is We assume v is large and production costs are nil.
The firms receive a common signal about the location of a consumer. That signal is fully informative and tells the exact location of the consumer with probability and is completely uninformative with probability Part of the information of the signal is also whether the signal is fully informative or fully uninformative. After receiving a fully informative signal the posterior of the firms is that the location of the consumer is at with probability one, which we denote by . That is, after receiving a fully informative signal, the beliefs of the firms are one for type and zero for all other types. After receiving a fully uninformative signal, the posterior of the firms is the prior, which is the uniform distribution on the segment [0, 1], which we denote by
We interpret increasing as decreasing the privacy in the market. When , there is complete privacy; when , there is no privacy. Firms compete on prices and can choose a price contingent on the signal received. For the consumers for whom the firms receive a fully uninformative signal, firms charge a price equal to one, and the consumer buys from the firm that is closer.28
For consumers of type t for whom the firms receive a fully informative signal, the equilibrium has the firm further away from that consumer charging a price of zero, and the firm closer to the consumer charging a price of In either the case of fully uninformative or fully informative case, consumers end up purchasing from the firm that is closer to them. Let m be the market price that is ultimately paid by the consumer, then the consumer’s indirect utility is
Moreover, the above discussion shows that and
From the calculations above, we can obtain the consumer welfare under full privacy as
As represents the fraction of consumers for whom the firms receive a fully informative signal, we have that an increase in , that is, a decrease in privacy, benefits the consumers.
The intuition behind this result is related to the consumer-market interest alignment. When firms receive a signal about consumer locations, they compete more intensely for consumers they can identify, leading to lower prices. In this case, better information leads to more targeted price competition rather than more effective surplus extraction. As information allows firms to compete more aggressively for each consumer segment, prices for identified consumers are lower than when all consumers are served under uniform pricing.
For managers, this finding suggests that in markets with differentiated products, more granular consumer data could be positioned as proconsumer because it intensifies price competition for identified consumers. For policymakers, the result indicates that privacy regulations might need to be more nuanced in competitive settings with differentiated products, as restricting information flows could potentially reduce consumer welfare by softening price competition.
5.4. Quality Pricing and Screening
We now introduce the effects of privacy in a canonical model of second-degree price discrimination. Consider a price-discriminating (Mussa and Rosen 1978) monopolist who faces a consumer with a binary type The consumer’s direct utility for a product with quality q is given by , and the monopolist’s cost function is .
The monopolist first observes a signal of the consumer’s type and then offers a menu of qualities and prices. Afterward, the consumer observes their type and chooses which quality-price combination to purchase. A monopolist with beliefs for the fraction of high types in a segment offers the following menu of qualities:
In this example, the market action consists of and . When and are set, the optimal prices for the two products are fully determined by incentive compatibility and participation constraints and do not depend directly on the belief p. The consumer indirect utility function, taking into account the monopolist’s optimal prices and the consumer’s optimal choice, is given by
Clearly, we have ; that is, the monopolist can extract all the consumer’s surplus under complete information. Therefore, zero privacy is never optimal.
Now, let denote the prior probability of the high type . A straightforward application of concavification techniques yields the following result on the optimality of full privacy.
Zero privacy is never optimal for the consumer. Let be the concave closure of V. Full privacy is optimal if and only if , that is, if and only if
Figure 2 shows V and its concave closure as a function of p.

The intuition for Proposition 7 is straightforward: If the likelihood of a high type is sufficiently high, the monopolist shuts down the low type. Therefore, if the high types are sufficiently likely in the population, consumers benefit on average from revealing some information (to increase their rents after one signal, as indicated by concavification). Otherwise, the strong intuition is that consumers benefit from holding asymmetric (superior) information versus the firm.
Applying Proposition 2 to this application, we can decompose the effects of information. When the market’s belief for the high type increases, the first term (8), measuring the degree of consumer-market interest alignment, is negative because is weakly decreasing in p. With a higher likelihood of a high type, the monopolist reduces the quality offered to low types, decreasing information rents for the high type. This reflects misaligned interests between the monopolist and the consumer, which is a force for consumers to prefer privacy. The second term (9) is nil because the indirect utility function is linear in qualities. The third term (10), measuring the curvature of market actions in beliefs, is negative when p is sufficiently low, because is concave in p in this region. However, when p is sufficiently high, the shutdown of the low type creates a kink , making (10) potentially positive and dominating (8). This convexity of the market action explains why partial information revelation can benefit consumers when the prior probability of high types exceeds .
This example allows for an immediate computation of the consumer-optimal information structure: it is either full privacy or a two-point distribution with support . This feature is difficult to generalize (as a large amount of literature on persuasion has shown) to multidimensional environments or environments where the posterior mean is not a sufficient statistic for the firm’s problem.
Unlike earlier examples, this screening setting shows that neither zero privacy nor full privacy is universally optimal for consumers. When high-value consumers dominate the market, partial information disclosure can benefit consumers on average by inducing firms to offer better quality-price combinations for consumers in low-value segments. Product managers might consider offering consumers options to selectively reveal information about their preferences when the market composition would make such disclosure welfare enhancing. From a policy perspective, regulators should develop market-specific privacy frameworks that account for customer composition rather than applying uniform restrictions. Privacy regulations might be most valuable in markets with fewer high-value customers, where information revelation enables firms to extract more surplus.
5.5. Health Insurance Revisited
We now turn to a richer version of the health insurance example. An important assumption in the motivating example (Section 2) was that the competitive equilibrium in the insurance industry yields full coverage for every consumer type. We now relax that assumption by assuming that, as a reduced form for a model with partial contractibility (e.g., moral hazard), the insurance industry can cover only a fraction of the consumer’s health expenditures. Let denote the fraction of uninsurable expenditures.
5.5.1. Binary Health Expenditures.
In the binary-expenditure example, the indirect utility function of a consumer of type who purchases partial coverage at a premium p is given by
In this example, the insurance premium p is a function only of the average type in each segment. We therefore denote the market action ( in the previous section) as . In particular, the equilibrium premium for a market segment s with is given by
Therefore, the expected surplus of a consumer in market segment s is given by
We know from Proposition 1 that consumers as a whole prefer to maintain privacy vis-à-vis the insurance industry if is concave and to reveal as much information as possible if is convex. Although this example allows for an arbitrary distribution of types, the segment-level surplus is a function of one variable only (). We can then apply the differential approach of (5)–(7) and obtain
The differential approach is useful in this case because both terms can be signed and interpreted. In particular, the strict concavity of the utility function u implies that both terms are negative. Intuitively, the first term corresponds to the first effect (5) in Proposition 1, measuring the alignment between consumer and market interests. This term is negative because, with informed insurance companies, the premium will on average increase more for consumers who are more likely to incur the loss L. These higher-risk consumers have a larger marginal utility of income. The second term captures (6), measuring consumer preferences for variability in market actions (premiums). This term is negative because consumers’ indirect utility function is strictly concave, making them prefer less variability in the premium. This is the same risk-pooling benefit observed in the motivating example. Moreover, the third effect (7) is nil in this case, as the premium is linear in the market belief.
This allows us to reach an intuitive but important conclusion, namely that consumers prefer full privacy in this model. In other words, consumer welfare is maximized in this example by letting the insurance market operate under the prior distribution. This implies that privacy protections in health insurance markets can benefit consumers on average, particularly when insurers charge risk-based premiums. Although precision pricing might seem efficient from a market perspective, it undermines the welfare-enhancing function of insurance by transferring the classification risk to consumers.
The above conclusion, however, relies on the binary nature of the uncertainty facing each consumer type, that is, the health expenditure level , as we show below.
5.5.2. General Health Expenditures.
Now, assume that the health expenditures L of each consumer type follow a more general distribution with cumulative distribution function . Because the market is assumed to be perfectly competitive, the price of partial insurance in any segment s with composition is proportional to the mean expenditure in that segment, that is,
Note that the premium is linear in market beliefs, so the third effect (10) is nil, as in the binary expenditure application.
The segment-level consumer surplus under an arbitrary distribution is then given by
We can apply Proposition 2 in this case to obtain conditions in which consumers are harmed or not. Consider two distributions, and , and let denote the convex combination of the two distributions with weights and , respectively. Finally, we let , and denote the means of these three distributions, respectively. We assume without loss that . We can then write the difference
Regarding the first term, the integrand
(
The last observation reconciles the results for the binary expenditure case and the more general case: with only two health expenditure levels per consumer , every probability shift between the two outcomes is a first-order shift. However, in more general cases, the welfare effects of information can be ambiguous, as illustrated in the following example.
(
In this example, neither distribution first-order stochastically dominates the other, and the first effect (with informed insurance companies, the premium will on average increase more for consumers who are more likely to incur the loss) no longer appears here.
The results in the general case suggest that health insurance privacy regulations might be most important in markets where risks are clearly stratified, such as genetic conditions with high predictive power. In markets with more complex risk distributions, a more nuanced approach to privacy regulation may be warranted.
5.6. Hacking
We consider a minimal model of hacking to provide a microfoundation for privacy preferences under the threat of data leakages. Suppose that an individual can have two types . Hackers receive a signal of the type that leads to posterior beliefs .
The hackers choose which type to target with their attack. Here, we assume that they target type if and only if . A successful attack (i.e., one directed at the true type) causes damage captured by , which is a positive, convex function that is symmetric about . We normalize to be zero. This formulation captures the idea that if the signal is not very accurate (i.e., if the hackers are not very confident in their information), then the signal only captures the less important components of the individual’s identity. Therefore, the hackers inflict limited damage on the individual. As the signal becomes more accurate, the hackers obtain more valuable information about the individual and can inflict increasingly greater damage.
In terms of the framework, let the a represent the type to which the attack is directed and k the extent to which the type is harmed. Let v denote the utility of a type that suffers an unsuccessful (i.e., mismatched) attack. Then, the individual’s indirect utility is given by where the attack action as a function of beliefs is the more likely type. One can denote the market action as a two-dimensional vector
The surplus of a segment with a fraction p of type A individuals is given by
Using the symmetry of , we can therefore write
We now apply the decomposition of effects from Proposition 2 to this application. The first term (8), measuring the degree of consumer-market interest alignment, is negative because is weakly increasing in p, meaning the harms increase as hackers become more confident about user types. This misalignment of interests is a force for individuals to prefer privacy, because more precise targeting of attacks disproportionately harms consumers. The second term (9) is nil as the indirect utility function is linear in the harms and . The third term (10), measuring the curvature of market actions in beliefs, is negative because both and are convex in the posterior beliefs p and individuals prefer lower and . This means that as hackers gain more precise information, the potential harm increases at an increasing rate, an additional force for information disclosure to be harmful to individuals.
5.7. Privacy as a Value in Itself
As discussed above, in some cases, individuals may dislike a lack of privacy per se, because of an “icky” effect. This effect can be seen as the same as the hacking example in the previous section, and we can recast that analysis in terms of the case of privacy as a value in itself.
Suppose that an individual can have two types, A and B, which can be better seen as “behaviors” by the individual that can potentially be observed by “Others.” Others receive a signal of the behavior of the individual that leads to a posterior belief p.
The individual loses utility if Others’ signal is correct, for a convex symmetric function . This captures the idea that if the signal is not very accurate, it only captures the less important components of the individual’s behavior and is therefore less harmful to the individual. As the signal becomes more accurate, Others obtain increasingly valuable information about the individual’s behavior, which is increasingly harmful to the individual. If the signal is incorrect (for example, the signal states that it is behavior A, whereas it is behavior B), the individual obtains utility .
In terms of the notation above, let a represent the effect on the individual’s utility that the signal received by Others indicates behavior , and let k denote the extent to which the individual is harmed. Then, the individual’s indirect utility function is . We can then apply the analysis of the previous section to obtain that the individual’s surplus is concave in the market beliefs, and therefore the individual prefers more privacy.
This formulation shows that seemingly subjective privacy preferences can be understood as rational responses to utility loss from accurate signal detection. For policymakers, these results imply that privacy regulations should account for both concrete economic harms and the disutility of accurate signal detection, particularly in contexts of widespread surveillance or tracking technologies that generate increasingly accurate signals about individual behaviors.
6. Concluding Remarks
This paper develops a theoretical framework to analyze the effects of privacy through the lens of the concavity or convexity of an individual’s derived payoff as a function of market beliefs about their type. We identify environments in which the individual’s derived payoff function with respect to the firm’s posterior belief is globally concave or convex. Under these conditions, either complete or zero privacy is optimal regardless of the initial prior or the market segmentation.
We further decompose the structure of the payoff function into three fundamental determinants: (i) whether market actions informed by better signals about individuals’ types align or conflict with individual welfare; (ii) the curvature—concave or convex—of individuals’ indirect utility from market actions; and (iii) the curvature of the market’s actions as functions of its beliefs about individuals.
The framework yields concrete implications for both managerial decision making and policy formulation. Managers can leverage this analysis to assess when the collection and use of consumer data generates or erodes economic value. Specifically, in contexts characterized by a concave derived payoff—typical in markets featuring price discrimination—firms should anticipate resistance from consumers and regulators to extensive data gathering. Conversely, when the derived payoff is convex—such as in highly targeted product recommendations—firms can enhance consumer value through increased personalization. For policymakers, the findings underscore the importance of tailoring privacy regulations to specific market contexts rather than adopting uniform privacy standards. For example, stringent privacy protections are desirable in markets like health insurance, yet less critical in markets where consumers benefit substantially from personalization.
Several extensions to the baseline model are promising. First, it would be valuable to study scenarios in which information is asymmetrically distributed across agents in the economy. Investigating whether and why individuals may prefer precise information disclosure to certain firms whereas favoring limited disclosure elsewhere could yield insights into preferences concerning not only personal privacy, but the privacy of others as well.
Second, incorporating selective information sharing into the model would enhance realism, as consumers often wish to control precisely which firms access their data. Allowing for heterogeneous segmentation structures, where firms differ systematically in the information they receive, would offer a richer characterization of privacy preferences.
Third, extending the model to incorporate dynamic considerations—such as the persistent consequences of current information revelation for future interactions or the potential for secondary information use by other parties—could explain the endogenous emergence of privacy norms and preferences.
Finally, the current analysis assumes Bayesian rationality and consistent market beliefs, yet real-world markets frequently exhibit deviations because of algorithmic bias, historical discrimination, or incomplete data collection, particularly across demographic groups. Relaxing this assumption by analyzing markets with systematically mis-specified beliefs represents another valuable direction for future research.
This paper benefited from conversations with and comments by Drew Fudenberg, Michael Katz, and Preston McAfee. All errors are our own responsibility.
Appendix A. Consumer Misreporting Types
Consider a model where consumers can misreport their types by incurring a cost and compare it with a benchmark that disallows consumer misreporting types. We show that in both models, in equilibrium, the distribution of a firm’s posterior belief of consumer types is a mean-preserving spread of its prior belief.
Consider the case of monopoly price discrimination with linear consumer demand. Suppose that the consumer’s type takes one of two values, , with . The firm’s prior belief is that , so the prior distribution can be denoted as . The firm observes a signal s, which leads to the segmentation . We will consider two generating processes of s: a benchmark case that disallows consumer misreporting and a case that allows consumer misreporting. After observing s, the firm chooses a price for each segment s, and then consumers choose the quantity purchased.
We first consider a benchmark that disallows misreporting. Let the signal s be a noisy version of consumer types,
Note that the segmentation based on the posterior belief is a mean-preserving spread of the prior. This can be shown using the law of total probability30
Let the direct utility function of a consumer with type be given by
Thus, the consumer’s indirect utility function, when taking into account their optimal quantities purchased, is given by
The expected surplus for a segment with distribution F is then
As we have shown the segmentation based on the signal s is a mean-preserving spread of the segmentation based on the prior, we can apply Proposition 2 and obtain that consumers unambiguously like privacy in this model.31
We now turn to a case that allows consumers to distort their types. Specifically, after a consumer’s type is realized, they can choose to distort their type to or with a cost c, which we assume to be additive. That is, distorting means that the signal for each realization of can decrease or increase by We assume that Note that in this setting both types want to report that they have a lower type, so, if distorting, any of the types will distort to Additionally, note that the higher type has a higher benefit from distorting, as the consumer’s value is convex in Finally, for c close to zero, both types distort, and the firm can fully undo the distorting and that for c large neither type distorts, and we are as in the situation in which distorting is not possible. Therefore, for there to be an effect of distorting we need an intermediate value of c such that type decides not to distort and type distorts, and in what follows, we assume that c is in that intermediate region.32
Let be the so-called reported type after potential distortion. That is, the signal observed by the firm is
Let denote the posterior distribution, and be the probability density of observing signal s, given by
Intuitively, although misreporting distorts the signals, rational firms account for this strategic behavior in equilibrium, resulting in consistent beliefs about the population distribution of types.
As the cost of misreporting is assumed to be additive to the utility of consumption, the consumer demand and the monopoly prices take the same expressions as in the benchmark case. Now we turn to the consumer’s type-reporting decision problem. If consumer reports , their expected utility would be
In the expression above, we use the same notation as in the benchmark model. As discussed earlier, reporting is strictly dominated by reporting , so we only compare the other two cases.
Recall that c is assumed to have an intermediate value, such that consumer is better off misreporting their type as . Also, it is too costly for consumer to misreport their type. This argument then leads to the following proposition.
For an intermediate value of c, there exists a unique equilibrium in the model that allows for consumer misreporting types. In the equilibrium, consumer misreports their type as , and consumer truthfully reports their type. The firm’s posterior belief is given by
Using the same notation of the consumer surplus V as in the benchmark model, consumer welfare can be written as
Note that there is an additional loss f c in consumer welfare, which captures that consumer misreports their type by incurring a cost c, and the probability of being type is f. Thus, we obtain that consumers unambiguously like privacy in this model.
It is noteworthy that the mean-preserving spread result requires the assumption that the firm’s posterior belief is consistent with the consumers’ equilibrium reporting actions. It could be interesting to study models where the firms’ beliefs are mis-specified, but that is beyond the scope of this paper.
Appendix B. Proofs
B.1. Differential Approach to General Results
In some applications, a differential approach allows for an easier characterization of the convexity (concavity) of . This approach leverages Gateaux derivatives, which we introduce as follows. For all distributions , let , and then the Gateaux derivative of V at F in the direction is defined as
Note that here is well defined, as for , is also a distribution.
The second Gateaux derivative of V at F in the direction is defined as
We can then link the convexity of V with its second Gateaux derivatives using the following lemma.
(
We first show is convex if and only if for all and .
: Suppose that is convex. Fix any and let , then is convex in , so
: Suppose that for all and . Fix any and let , we now show is convex in . Note that is homogeneous of degree 2 in , thus, for all , we have
The inequality holds because , and in the direction . Also, for , we have
Thus, is convex in , which implies that for all
Because F, , and are arbitrary, this shows that is convex.
Finally, it is analogous to show is concave if and only if for all and . □
Assuming appropriate regularity conditions, applying the Gateaux derivative operator to V twice leads to the following differential characterization of convex and concave surplus functions V.33 The proposition involves the first and second Gateaux derivatives of a at F in the direction , which can be defined similarly as above.
(
Because
And then applying the Gateaux derivative operator again yields
To see that (5), (6), and (7) are special cases under binary types, we can express . The probability mass distributions over take the form , and thus the only direction of the derivative, up to a scale, is . Then we can evaluate the integrals and obtain (5), (6), and (7).
These expressions are related to (8), (9), and (10), respectively. The first effect in (8) and (19) captures the correlation between a’s derivative and . The second effect in (9) and (B.3) captures the benefits or harms due to the convexity or concavity of U in a. The third effect in (10) and (B.4) captures the convexity or concavity of the market action in F.
Moreover, for exposition purposes, we treat a as one-dimensional in the statement of Proposition B.1. However, it readily extends to multidimensional a, in which case is a vector-valued function, is a matrix-valued function, and the multiplications in (B.2), (B.3), and (B.4) are understood as inner products.
B.2. Proof of Corollary 1 and Discussion
Because U is linear in a,
Now we show that the sign is determined by the correlation between the Riesz representation of a and , with respect to . Because a is linear in F, assuming proper continuity conditions,34 there exists a Riesz representation of a such that
Applying this and the linearity of U in a to (11) leads to
The first expression captures the average change in when the distribution of changes, and the second expression captures the average change in when the distribution of changes. Thus, their product captures the correlation between and with respect to .
B.3. Proof of Proposition 3
We can expand
Here we use as a shortcut for the following integral:
Because and is a negative definite matrix, we have and thus V is concave.
B.4. Proof of Proposition 4
We can expand
Because is linear in F, we obtain
Because for , we have
Because and is a negative definite matrix, we have and thus is concave.
B.5. Proof of Proposition 6
Expanding (14) leads to
First, we show the first part of the proposition. The firm’s optimal location choice when is , whereas the firm’s optimal location choice when is . Thus, the revelation of information does not modify the firm’s location choice if and only if stays on either or . Because is concave on either interval, in either case, full privacy is optimal for the consumer.
Then we show the second part of the proposition. We provide sufficient conditions under which the consumer benefits from two-signal segmentations. Denote the segmentation as ; that is, the first signal indicates that the probability of is , and the second signal indicates that the probability of is , and the weights of the two signals are and , respectively. The consumer benefits from the two-signal segmentations if and only if
The sufficient condition we provide is that and
To show that these conditions imply (B.5), discuss two cases: (i) ; and (ii) . In case (i), because V is concave on , we have
Moreover, expanding (B.6) leads to
Appendix C. Extension: Product Targeting with Price Discrimination
In this section, we discuss an extension to the main model in Section 5.2. This extension illustrates that the effect of privacy can be context dependent. That is, the exposure of information may potentially be detrimental or beneficial to consumers, depending on the specific piece of information.
A firm chooses both its location and its price. There is a unit segment and a consumer with a two-dimensional type , where denotes consumer location and denotes consumer taste, and they are independent given any signal s.
The consumer’s demand function is
Let denote the posterior probability of location, and G denote the distribution of after receiving a signal s.35
Analogous to the derivations in the main model, the firm’s optimal price is
Then, the surplus of a consumer in a segment with proportion of types and distribution G of satisfies
For a fixed , the only term in that is nonlinear in G is , which is a concave function of G. This implies that consumers always benefit from privacy in . This represents a case where the first effect (8) dominates, similar to the price discrimination examples in Section 5.1.
For a fixed G, is different from (14) in a constant and a term , which is linear on either and, respectively. This implies that the shape of is similar to the shape of in Figure 1, leading to a similar conclusion regarding the privacy of as in Proposition 6.
In this extension, the consumer may dislike privacy in but prefer privacy in . The intuition is consistent with the main model: when information leads to better product positioning (location), the first effect (8) can be positive and dominate, benefiting consumers. However, when information only enables more effective price discrimination (through ), (8) becomes negative, and consumers prefer privacy.
This multidimensional extension has important implications. Firms should develop data strategies that distinguish between different types of consumer information rather than treating all data uniformly. Location data that enable better product positioning may be worth collecting even under strict privacy regulations, whereas taste data used primarily for price discrimination might face greater consumer and regulatory resistance. For policymakers, this suggests developing privacy frameworks that differentiate between types of data based on their usage rather than simply restricting data collection broadly.
1 Of course, convexity or concavity does not determine a partition of the space of derived payoff functions, but the mechanisms considered here suggest effects on the evaluation of the benefits of privacy. Indeed, the objective of the paper is to find sufficient conditions under which the value of privacy is unambiguously positive or negative and to highlight the market forces behind these conditions.
2 This can happen when the pass-through rates are nonincreasing in the marginal costs, for example, as discussed in Chen and Schwartz (2015).
3 A related issue is that consumers may use a technology, such as ad blocking, to limit the use of targeting. See Chen and Liu (2022), Dukes et al. (2022), and Gritckevich et al. (2022).
4 In some cases, after the consumer observes the market action a and her type , the consumer can take further actions x. For example, the consumer can choose a quantity to purchase after the firm sets the prices (see an example in Section 5.1). Denote the consumer’s direct utility function as , and then the indirect utility function is defined as . In other words, in such cases, the indirect utility function is the consumer’s best-response payoff if the consumer has type and the market takes action a.
5 We describe the function U as the indirect utility of the individual consumer, but this same analysis also applies if U is the social welfare payoff or the payoff of some other agent in the economy.
6 We treat a as one dimensional in the general presentation, but allowing for multidimensional a can be important in applications. For example, a can represent the equilibrium prices for two products in a monopoly setting or a duopoly setting. (See these examples in Section 5.1.)
7 Note that this framework applies when firms observe signals for only a subset of consumers. In such cases, consumers for whom no signal is observed can be considered as belonging to a specific segment. This segment can be represented using a specific signal denoting no observation, with representing the proportion of consumers for whom no signal is observed. The law of total probability (2) holds when including this segment, as it accounts for the entire population. In this setting, types for which firms do not receive signals are interpreted as types for which firms receive uninformative signals.
8 Note that is defined over cumulative distribution functions rather than individual types. This means that the market does not need to observe signals for all individual types in the entire support.
9 In terms of Definition 1 below, this is denoted as
10 This is the same issue as when evaluating the expected utility of a risk-averse individual from two lotteries that cannot be ordered in terms of the mean-preserving spread.
11 A function is a transition kernel if and only if is measurable for all and is a probability measure for all .
12 Note that the signal need not be unbiased. The signal can be biased as long as the market knows the information structure and can correctly compute posterior beliefs.
13 As another interpretation, the market may observe a signal for only a subset of consumers, which we denote as . For the consumers for whom the market observes no signals, we denote this as .
14 For example, this can happen when both the high types and the low types send the high and low signals with equal probability. In this case, the market observes the high signal and the low signal with equal probability. As another example, this can happen when the high types send the high signal with probability one, and the low type mimics the high type by also sending the high signal with probability one. In this case, the market observes the high signal and cannot distinguish between the high and low types. We include a more general case of consumer strategic misreporting in Appendix A, showing that the mean-preserving property holds assuming consistent beliefs.
15 See, for example, Rothschild and Stiglitz (1970).
16 Note that there may be other definitions of privacy that will not have the garbling property, and the result on the concavity/convexity effects. This Blackwell order is a partial order. However, complete privacy is always a garbling of any (partially) informative signal structure on the consumer’s type.
17 A sufficient condition for this to hold is the Spence–Mirrlees condition, namely is nonnegative for all and a.
18 Information also helps if a is concave in f and U is decreasing in a on average.
19 Appendix B presents a differential approach for these results, which may be useful in some applications.
20 Mathematically, (11) has the same sign as the correlation between and the Riesz representation of a, with respect to . The Riesz representation of a is the derivative of a with respect to F, and can be interpreted as the market’s action when the probability of type is one. We assume proper continuity conditions in a to ensure it exists. See Appendix B.2 for derivations and more discussion.
21 This can be micro-founded by assuming consumer’s direct utility function , where q represents the consumer’s time spent on the platform.
22 In contrast, when there is a negative correlation between consumer marginal utility for market actions and the market’s response to consumer types, ceteris paribus, we have that consumers benefit from privacy. An example is when a denotes prices and the high type suffers more from higher prices (see Section 5.1).
23 This can be micro-founded by assuming consumer’s direct utility function , where q represents the consumer’s time spent on the platform.
24 In contrast, ceteris paribus, when the consumer’s indirect utility function is concave in the market action; that is, the consumer prefers less variation, we have that the consumer prefers privacy.
25 This can be micro-founded by assuming consumer’s direct utility function , where q represents the consumer’s time spent on the platform.
26 One interpretation is that the platform receives a fixed commission each time the ad is displayed. An alternative interpretation is that the products are owned by the platform, and upon each impression, consumers will purchase with a fixed probability.
27 See Appendix C for an extension of the model, in which the revelation of the different pieces of information can be welfare-enhancing or welfare-reducing.
28 For a different modeling of the continuum from full privacy to no privacy see, for example, Chen and Iyer (2002).
29 Alternatively, we can also apply Proposition B.1 (presented in Appendix B, the differential approach of Proposition 2) to show the same results. Specifically, for distribution and direction , Analogous to the reasoning above, regarding the first term, is a decreasing function of L, and the second term is negative. We also have Note that the integrand L is an increasing function of L. Thus, a sufficient condition for the first term to be negative is that the distributions stochastically dominate one another.
30 Consumer types are binary, so the posterior distribution can be summarized by .
31 Moreover, this result can be generalized to compare different signal structures, e.g., when the signals have noises with different standard deviations. Let the be the segmentation when the standard deviation of the noise is , then one can show that is more private than if . We can apply Proposition 2 and obtain that consumers unambiguously like privacy, that is, a segmentation with larger .
32 More details are available upon request. For example, when , , , , and , the intermediate region of c includes [0.015, 0.035].
33 To simplify notation, we present this proposition with unidimensional market actions a. However, the proposition is readily extended to multidimensional market actions, simply replacing scalar multiplications with dot products and matrix multiplications.
34 One example of such conditions is that a is continuous with respect to the weak-* topology in the following sense. Let be a mapping that maps any finite signed measure F and bounded measurable function g to a real number. The weak-* topology is the weakest topology on finite signed measures such that is continuous for all g. Theorem IV.1.2 in Schaefer and Wolff (1971) ensures the existence of the Riesz representation of a given that a is continuous with respect to the weak-* topology.
35 We assume that , that is, is not too low.
References
- (2005) Conditioning prices on purchase history. Marketing Sci. 24(3):367–381.Link, Google Scholar
- (2016) The economics of privacy. J. Econom. Literature 54(2):442–492.Crossref, Google Scholar
- (2013) Sleights of privacy: Framing, disclosures, and the limits of transparency. Proc. 9th Sympos. Usable Privacy Security (Association for Computing Machinery, New York), 1–11.Google Scholar
- (2019) The charm of behavior-based pricing: When consumers’ taste is diverse and the consideration set is limited. J. Marketing Res. 56(5):767–790.Crossref, Google Scholar
- (2023) Data markets with privacy-conscious consumers. AEA Papers Proc. 113:191–196.Crossref, Google Scholar
- (2010) The impact of targeting technology on advertising markets and media competition. Amer. Econom. Rev. 100(2):608–613.Crossref, Google Scholar
- (2017) The digital privacy paradox: Small money, small costs, small talk. NBER Working Paper No. 23488, National Bureau of Economic Research, Cambridge, MA.Google Scholar
- (2006) A privacy paradox: Social networking in the United States. First Monday 11(9).Google Scholar
- (2015) The limits of price discrimination. Amer. Econom. Rev. 105(3):921–957. Crossref, Google Scholar
- (1951) Comparison of experiments. Neyman J, ed. Proc. 2nd Berkeley Sympos. Math. Statist. Probab., vol. 2 (University of California Press, Berkeley), 93–103.Google Scholar
- (2006) On the optimality of privacy in sequential contracting. J. Econom. Theory 130(1):168–204.Crossref, Google Scholar
- (2015) Competing with privacy. Management Sci. 61(1):229–246.Link, Google Scholar
- (2002) Consumer addressability and customized pricing. Marketing Sci. 21(2):197–208.Link, Google Scholar
- (2022) Signaling through advertising when an ad can be blocked. Marketing Sci. 41(1):166–187.Link, Google Scholar
- (2015) Differential pricing when costs differ: A welfare analysis. RAND J. Econom. 46(2):442–460.Crossref, Google Scholar
- (2020) Competitive personalized pricing. Management Sci. 66(9):4003–4023.Link, Google Scholar
- (2022) Privacy-preserving dynamic personalized pricing with demand learning. Management Sci. 68(7):4878–4898.Link, Google Scholar
- (2023) Consumer privacy choices and (un)targeted advertising along the purchase journey. J. Marketing Res. 60(5):889–907.Crossref, Google Scholar
- (2012) Hide and seek: Costly consumer privacy in a market with repeat purchases. Marketing Sci. 31(2):277–292.Link, Google Scholar
- (2016) Online advertising and privacy. RAND J. Econom. 47(1):48–72.Crossref, Google Scholar
- (2022) Skippable ads: Interactive advertising on digital media platforms. Marketing Sci. 41(3):528–547.Link, Google Scholar
- (2021) Bayesian privacy. Theoret. Econom. 16(4):1557–1603. Crossref, Google Scholar
- (2022) Market segmentation through information. Discussion paper, Cambridge University, Cambridge, UK.Google Scholar
- (2009) The online advertising industry: Economics, evolution, and privacy. J. Econom. Perspectives 23(3):37–60.Crossref, Google Scholar
- (2023) Digital privacy. Management Sci. 69(6):3157–3173.Link, Google Scholar
- (2025) Good data and bad data: The welfare effects of price discrimination. Discussion paper, Massachusetts Institute of Technology, Cambridge, MA.Google Scholar
- (2012) Can privacy be just another good? J. Telecomm. High Tech. Law 10(2):251–264.Google Scholar
- (2000) Customer poaching and brand switching. RAND J. Econom. 31(4):634–657.Crossref, Google Scholar
- (1985) Information sharing in oligopoly. Econometrica 53(2):329–343.Crossref, Google Scholar
- (2015) Privacy and marketing externalities: Evidence from do not call. Management Sci. 61(12):2982–3000.Link, Google Scholar
- (2022) Ad blocking. Marketing Sci. 68(6):4703–4724.Google Scholar
- (2005) Information revelation and privacy in online social networks. Sabrina De Capitani DV, Dingledine R, eds. Proc. ACM Workshop Privacy Electronic Soc. (Association for Computing Machinery Location of publisher, New York), 71–80.Google Scholar
- (2022) The perils of personalized pricing with network effects. Marketing Sci. 41(3):477–500.Link, Google Scholar
- (2008) Consumer privacy and marketing avoidance: A static model. Management Sci. 54(6):1094–1103.Link, Google Scholar
- (2006) Privacy, property rights and efficiency: The economics of privacy as secrecy. Quant. Marketing Econom. 4(3):209–239.Crossref, Google Scholar
- (1971) The private and social value of information and the reward to inventive activity. Amer. Econom. Rev. 61(4):561–574.Google Scholar
- (2006)
The economics of privacy . Hendershott T, ed. Handbooks in Information Systems, vol. 1 (Emerald, Bingley, UK), 471–497.Google Scholar - (2020) Online privacy and information disclosure by consumers. Amer. Econom. Rev. 110(2):569–595.Crossref, Google Scholar
- (2020) Consumer privacy choice in online advertising: Who opts out and at what cost to industry? Marketing Sci. 39(1):33–51.Link, Google Scholar
- (2023) Privacy and market concentration: Intended and unintended consequences of the GDPR. Management Sci. 69(10):5695–5721.Link, Google Scholar
- (2020) Privacy protection, security, and consumer retention. Discussion paper, Columbia University and TSE, New York.Google Scholar
- (2011) Bayesian persuasion. Amer. Econom. Rev. 101(6):2590–2615.Crossref, Google Scholar
- (2019) When private information settles the bill: Money and privacy in Google’s market for smartphone applications. Management Sci. 65(8):3470–3494.Link, Google Scholar
- (2024) Privacy-preserving personalized revenue management. Management Sci. 70(7):4875–4892.Link, Google Scholar
- (2016) Behavior-based pricing: An analysis of the impact of peer-induced fairness. Management Sci. 62(9):2705–2721.Link, Google Scholar
- (2020) Two-sided price discrimination by media platforms. Marketing Sci. 39(2):317–338.Link, Google Scholar
- (2024) Frontiers: Digital hermits. Marketing Sci. 43(4):697–708.Link, Google Scholar
- (2019) The value of personal information in online markets with endogenous privacy. Management Sci. 65(3):1342–1362.Link, Google Scholar
- (1978) Monopoly and product quality. J. Econom. Theory 18(2):301–317.Crossref, Google Scholar
- (2021) List price and discount in a stochastic selling process. Marketing Sci. 40(2):366–387.Link, Google Scholar
- (2016) Consumers on a leash: Advertised sales and intertemporal price discrimination. Discussion paper, Cowles Foundation, Yale University, New Haven, CT.Google Scholar
- (1978) Economic theory of privacy. Regulation 2(3):19–26.Google Scholar
- (2021) Targeting and privacy in mobile advertising. Marketing Sci. 40(2):193–218.Link, Google Scholar
- (1996) A general model of information sharing in oligopoly. J. Econom. Theory 71(1):260–288. Crossref, Google Scholar
- (2017) Behavior-based pricing in vertically differentiated industries. Management Sci. 63(8):2729–2740.Link, Google Scholar
- (2024) Personalized pricing and competition. Amer. Econom. Rev. 114(7):2141–2170.Crossref, Google Scholar
- (1933) The Economics of Imperfect Competition (Macmillan, London).Google Scholar
- (1970) Increasing risk: I. A definition. J. Econom. Theory 2(3):225–243.Crossref, Google Scholar
- (1971) Topological Vector Spaces (Springer New York, New York).Google Scholar
- (1981) Output and welfare implications of monopolistic third-degree price discrimination. Amer. Econom. Rev. 71(1):242–247.Google Scholar
- (1992) Privacy and Social Freedom (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (1995) Competitive coupon targeting. Marketing Sci. 14(4):395–416.Link, Google Scholar
- (1986) Exchange of cost information in oligopoly. Rev. Econom. Stud. 53(3):433–446.Crossref, Google Scholar
- (2018) Behavior-based advertising. Management Sci. 64(5):2047–2064.Link, Google Scholar
- (2021) Targeted advertising and consumer inference. Marketing Sci. 40(5):900–922.Link, Google Scholar
- (1980) An introduction to privacy in economics and politics. J. Legal Stud. 9(4):623–644.Crossref, Google Scholar
- (2003) Supplier surfing: Competition and consumer behavior in subscription markets. RAND J. Econom. 34(2):223–246.Crossref, Google Scholar
- (2004) Consumer privacy and the market for customer information. RAND J. Econom. 35(4):631–650.Crossref, Google Scholar
- (1988) On the strategic choice of spatial price policy. Amer. Econom. Rev. 78(1):122–137.Google Scholar
- (1999) Dynamic competition with customer recognition. RAND J. Econom. 30(4):604–631.Crossref, Google Scholar
- (2004) Price cycles in markets with customer recognition. RAND J. Econom. 35(3):486–501. Crossref, Google Scholar
- (2021) A dynamic model of optimal retargeting. Marketing Sci. 40(3):428–458.Link, Google Scholar
- (1984) Duopoly information equilibrium: Cournot and bertrand. J. Econom. Theory 34(1):71–94. Crossref, Google Scholar
- (1988) Aggregation of information in large Cournot markets. Econometrica 56(4):851–876.Crossref, Google Scholar
- (1890) The right to privacy. Harvard Law Rev. 4(5):193–203.Crossref, Google Scholar
- (1967) Privacy and Freedom (Atheneum Publishers, New York).Google Scholar
- (2022) Selling consumer data for profit: Optimal market-segmentation design and its consequences. Amer. Econom. Rev. 112(4):1364–1393.Crossref, Google Scholar
- (2011) The perils of behavior-based personalization. Management Sci. 30(1):170–186.Google Scholar
- (2025) Advertising platforms and privacy. Marketing Sci., ePub ahead of print May 9, https://doi.org/10.1287/mksc.2024.0879.Link, Google Scholar

