• No results found

Point Identification with Individual-Level Data

The previous section established that attention heterogeneity prevents point

identification of deadweight loss using aggregate data, even if we already know the distribution of preferences. To facilitate identification, we require more granular data.35 If we use

(repeated) cross-sectional data, we also require additional structure on tax salience or the consumption set. Alternatively, we can use (long) panel data to identify deadweight loss without any structural assumptions.

34 One can identify a first order approximation trivially; it is zero.

35 Technically, one could simply impose a distribution of ๐‘๐‘ . For instance, one could assume homogeneous perceived tax-inclusive prices. We do not recommend this.

41 4.1 Cross-Sectional Data

We maintain the linear structure on preferences, as well as the assumption that ๐‘š is constant with respect to ๐œ, as in section 3.3. Formally:

๐‘ž = ๐›ผ + ๐›ฝ๐‘ฬ… + ๐›ฝฬƒ๐œ + ๐œ–

Here ๐›ผ is a constant whose identification we generally assume.36 The econometrician observes the distribution of ๐‘ž conditional on (๐‘ฬ…, ๐œ) for all values in ๐‘ ๐‘ข๐‘๐‘(๐‘ฬ…, ๐œ). The data comes from an underlying data-generating process, which we assume yields a well-defined and finite value for deadweight loss. The data identifies expected deadweight loss from a (non-zero) tax ๐œ if any underlying distributions of(๐›ฝ, ๐›ฝฬƒ, ๐œ–) across the population consistent with the observed distribution of(๐‘ฬ…, ๐œ) and conditional distributions of ๐‘ž yield the same value of๐”ผ [๐›ฝฬƒ2

๐›ฝ]. One approach is to identify the joint distribution of (๐›ฝ, ๐›ฝฬƒ) across individuals, then integrate to obtain the desired expected value. To that end, we can use the following lemma from Masten (2017):

Lemma 3: If ๐‘ ๐‘ข๐‘๐‘(๐‘ฬ…, ๐œ) contains an open ball in โ„2, then the joint distribution of (๐›ฝ, ๐›ฝฬƒ, ๐œ–) is identified if and only if:

1. The joint distribution of (๐›ฝ, ๐›ฝฬƒ, ๐œ–) is determined by its moments.

2. All absolute moments of (๐›ฝ, ๐›ฝฬƒ, ๐œ–) are finite.

Proof: Apply lemma 2 from Masten (2017). โˆŽ

36 It is identified if the econometrician observes ๐‘ฬ… = ๐œ = 0 or non-collinear variation in ๐‘ฬ… and ๐œ.

42

It follows as an immediate corollary that the conditions of the lemma also identify deadweight loss, since one can integrate out the marginal distribution of (๐›ฝ, ๐›ฝฬƒ) and then calculate ๐”ผ [๐›ฝฬƒ๐›ฝ2]. The intuition is that one can find the moments of the random coefficients by running increasingly higher-order regressions of the form:

๐”ผ[๐‘ž๐‘–๐‘›|๐‘ฬ…, ๐œ] = โ‹ฏ + ๐”ผ[๐›ฝ๐‘›]๐‘ฬ…๐‘› + โ‹ฏ + ๐”ผ[๐›ฝฬƒ๐‘›]๐œ๐‘›+ โ‹ฏ

The enumerated conditions are satisfied, for instance, when (๐›ฝ, ๐›ฝฬƒ) has finite support.

However, one might want to be able to identify deadweight loss without making assumptions on the distribution of (๐›ฝ, ๐›ฝฬƒ), beyond assuming that ๐”ผ [๐›ฝฬƒ๐›ฝ2] is well-defined and finite. The following theorem does not impose such assumptions, but does require unbounded support in regressors.

Theorem 4: Suppose that for every pair (๐œ†1, ๐œ†2) โˆˆ (โ„ โˆ– {0}) ร— โ„, there is a sequence (๐‘ฬ…๐‘˜, ๐œ๐‘˜)๐‘˜=1โˆž contained within the support of (๐‘ฬ…, ๐œ) such that:

1. lim

๐‘˜โ†’โˆžโ€–(๐‘ฬ…๐‘˜, ๐œ๐‘˜)โ€– = โˆž 2. lim

๐‘˜โ†’โˆž

๐œ๐‘˜ ๐‘ฬ…๐‘˜=๐œ†2

๐œ†1

In addition, suppose that there is another sequence (๐‘ฬ…๐‘˜, ๐œ๐‘˜)๐‘˜=1โˆž contained within the support of (๐‘ฬ…, ๐œ) such that:

1. lim

๐‘˜โ†’โˆž|๐‘ฬ…๐‘˜| < โˆž 2. lim

๐‘˜โ†’โˆž๐œ๐‘˜ = โˆž

Then ๐”ผ[๐‘‘๐‘ค๐‘™๐‘–] is identified.

43

We make note of a special case in which these results apply. Consider a binary choice problem, so that ๐‘‹๐‘‡ = {0,1}. Each agent ๐‘– has quasi-linear utility ๐‘ข๐‘–1 fro consuming the good, so that for tax-inclusive price ๐‘:

๐‘ข๐‘–1 = ๐œ–๐‘–โˆ’ ๐‘

This is an expression of the agent's preference for the taxed good scaled so that an additional dollar yields one unit of utility. We do not assume that ๐”ผ[๐œ–] = 0. However, we normalize utility in the absence of the taxed good to zero:

๐‘ข๐‘–0 = 0

For a price increase from ๐‘ฬ… to ๐‘๐‘ , the change in the expenditure function is:

๐‘’(๐‘๐‘ ) โˆ’ ๐‘’(๐‘ฬ…) = {

0, ๐œ–๐‘– โˆ’ ๐‘ฬ… โ‰ค 0 ๐œ–๐‘–โˆ’ ๐‘ฬ…, ๐‘๐‘  โ‰ฅ ๐œ–๐‘– > ๐‘ฬ…

๐‘๐‘ โˆ’ ๐‘ฬ…, ๐œ–๐‘– โˆ’ ๐‘๐‘  > 0

Each agent ๐‘– has a tax salience ๐‘š๐‘– independent of the tax rate. The agent perceives utility from buying the taxed good:

๐‘ข๐‘–1๐‘  = ๐œ–๐‘–โˆ’ ๐‘ฬ… โˆ’ ๐‘š๐‘–๐œ (8)

They purchase the good if ๐‘ข๐‘–1๐‘  > 0. This yields deadweight loss for that agent of:

๐‘‘๐‘ค๐‘™๐‘– = ๐‘’(๐‘๐‘ ) โˆ’ ๐‘’(๐‘ฬ…) โˆ’ [๐‘๐‘ โˆ’ ๐‘ฬ…]๐•€(๐›ผ โˆ’ ๐‘๐‘ + ๐œ–๐‘– > 0) = {

0, ๐œ–๐‘–โˆ’ ๐‘ฬ… โ‰ค 0 ๐œ–๐‘–โˆ’ ๐‘ฬ…, ๐‘๐‘  โ‰ฅ ๐œ–๐‘– > ๐‘ฬ…

0, ๐œ–๐‘–โˆ’ ๐‘๐‘  > 0 s

Let ๐น๐‘š.๐œ–โˆ— denote the true distribution of (๐‘š, ๐œ–). For simplicity, we assume that โ„™(๐‘ข1๐‘  = 0) and โ„™(๐‘ข1 = 0) are always zero. We assume the econometrician observes only aggregate data, so that the only values observed are triplets (๐‘ฬ…, ๐œ, โ„™(๐‘ข1๐‘  > 0|๐‘ฬ…, ๐œ)). The challenge is to infer deadweight loss, which takes the form:

๐ท๐‘Š๐ฟ = โˆซ ๐•€(๐‘ข1 โ‰ฅ 0 > ๐‘ข1๐‘ )[๐œ–๐‘– โˆ’ ๐‘ฬ…]๐‘‘๐น๐‘š,๐œ–โˆ— (๐‘š๐‘–, ๐œ–๐‘–)

๐‘š๐‘–,๐œ–๐‘– (9)

44

For (2017) shows that, if ๐‘ฬ… has full support, then one can use aggregate demand to infer the CDF of ๐œ– โˆ’ ๐‘š๐œ, allowing one to apply Masten (2017). Thus, we can identify deadweight loss even with aggregate data if we are willing to assume that tax salience does not depend on the tax rate and the choice set is binary.

One might hope that the assumption that ๐‘š โŠฅ (๐‘ฬ…, ๐œ) is not required for identification, as it is generally a rather strong assumption. Unfortunately, one cannot do away entirely with this restriction. For instance, consider a population in which โ„™(๐œ– = 2) = โ„™(๐œ– = 3) = 0.5 and โ„™(๐œ = ๐‘™) = 1. Consider rationalizing the data in two ways. In the first case, ๐‘š = 0.5 with probability one. This yields aggregate demand:

๐ท = {

1, ๐‘ฬ… + 0.5๐œ โ‰ค 2 0.5, 3 โ‰ฅ ๐‘ฬ… + 0.5๐œ > 2ฬ…

0, ๐‘ฬ… + 0.5๐œ > 3 Aggregate deadweight loss takes the form:

๐ท๐‘Š๐ฟ = {

0, ๐‘ฬ… + 0.5๐œ โ‰ค 2 0.5, 3 โ‰ฅ ๐‘ฬ… + 0.5๐œ > 2ฬ…

1.5, ๐‘ฬ… + 0.5๐œ > 3

In the second case, suppose for a moment that ๐‘ฬ… โˆˆ (0,1). Then agents with ๐œ– = 2 have tax salience ๐‘š2(๐‘ฬ…) = 2โˆ’๐‘ฬ…

6โˆ’2๐‘ฬ…, while agents with ๐œ– = 3 have tax salience ๐‘š3(๐‘ฬ…) = 3โˆ’๐‘ฬ…

4โˆ’2๐‘ฬ…. If ๐‘ฬ… โˆ‰ (0,1), then set ๐‘š2(๐‘ฬ…) = ๐‘š3(๐‘ฬ…) = 0.5. One can check that this yields the same aggregate demand, yet aggregate deadweight loss with ๐‘ฬ… โˆˆ (0,1) now takes the form:

๐ท๐‘Š๐ฟ = {

0, ๐‘ฬ… + 0.5๐œ โ‰ค 2 0.75, 3 โ‰ฅ ๐‘ฬ… + 0.5๐œ > 2ฬ…

1.5, ๐‘ฬ… + 0.5๐œ > 3

45 4.2 Panel Data

Now suppose that one can follow specific individual for a long period of time. For every individual ๐‘–, the econometrician observes triplets (๐‘ฬ…๐‘–, ๐œ๐‘–, ๐‘ž๐‘–) with full support for ๐‘ฬ… when ๐œ = 0.

Then the econometrician can identify the demand function ๐‘ž๐‘–(๐‘; 0). Assuming there are no income effects, deadweight loss for an individual ๐‘– with perceived price ๐‘๐‘  takes the form:

๐‘‘๐‘ค๐‘™๐‘– = โˆซ ๐‘ž(๐‘; 0)๐‘‘๐‘

๐‘๐‘ 

๐‘ฬ…

โˆ’ ๐œ๐‘‘(๐‘๐‘ )

If one can observe tax ๐œ, then one can determine deadweight loss using the inferred demand function via:37

๐‘๐‘  = ๐‘‘โˆ’1(๐‘ž๐‘–(๐‘ฬ…, ๐œ))

For instance, with binary choice data, demand for agent ๐‘– is ๐•€(๐›ผ โˆ’ ๐‘๐‘–๐‘ + ๐œ–๐‘–). This yields deadweight loss for agent ๐‘– of:38

๐‘‘๐‘ค๐‘™๐‘– = ๐•€(๐‘๐‘–๐‘  > ๐›ผ + ๐œ–๐‘– > ๐‘ฬ…) [๐›ผ โˆ’ ๐‘ฬ… ๐œ–๐‘–] This yields aggregate deadweight loss:

๐ท๐‘Š๐ฟ = โˆซ ๐•€(๐‘๐‘–๐‘  > ๐›ผ + ๐œ–๐‘– > ๐‘ฬ…) [๐›ผ โˆ’ ๐‘ฬ… + ๐œ–๐‘–] ๐‘‘๐น๐‘โˆ—๐‘ ,๐œ–(๐‘๐‘–๐‘ , ๐œ–๐‘–)

๐‘๐‘–๐‘ ,๐œ–๐‘–