• No results found

3. Non-Identification with Aggregate Continuous Choice Data

3.3 Linear Special Case

We conclude this section with a discussion of the special case in which π‘ž is known to be a linear function of 𝑝̅ and 𝜏 (for fixed 𝑝𝑁𝑇). We focus on this example both because of how frequently economists estimate linear models and because of its relationship to the second

36

order approximation of deadweight loss. As demonstrated in section 2.3, one can express a second order approximation to deadweight loss as a function of derivatives. In the case where the choice function is linear in regressors 𝑝̅ & 𝜏, the second order approximation is an exact calculation of deadweight loss, and our results from the previous subsections apply.

Formally, each preference type πœƒπ‘– takes the form πœƒπ‘– = (𝛽𝑖, πœ–π‘–) ∈ ℝ2.29 To maintain linearity in regressors, we also assume that tax salience π‘š is constant with respect to 𝜏. The choice function π‘ž then takes the form:

π‘žπ‘– = 𝛼 + 𝛽𝑖𝑝𝑖𝑠+ πœ–π‘– = 𝛼 + 𝛽𝑖[𝑝̅ + π‘šπ‘– πœπ‘–] + πœ–π‘–

We are suppressing tie-breaking parameter 𝜁 because πœƒπ‘–, 𝑝̅𝑖, π‘šπ‘–, and πœπ‘– always uniquely determine consumption. We have parameter 𝛼 so that we can assume without loss of generality that 𝔼[πœ–] = 0. Defining 𝛽̃𝑖 ≑ π‘šπ‘–π›½π‘– yields:

π‘žπ‘– = 𝛼 + 𝛽𝑖𝑝̅ + π›½Μƒπ‘–πœπ‘–+ πœ–π‘– with corresponding deadweight loss per agent from equation (3):

𝑑𝑀𝑙𝑖 = ∫ [𝛼 + 𝛽𝑖𝑝 + πœ–π‘–]𝑑𝑝

We assume that the joint distribution of parameters remains unaffected by the specific values of 𝑝̅ and 𝜏. The econometrician observes for various values of regressors:

29 Agents have quasi-linear utility 𝑒𝑖=

π‘žπ‘–2

37

𝔼[π‘ž|𝑝̅, 𝜏] ≑ ∫ [𝛼 + 𝛽𝑖𝑝̅ + π›½Μƒπ‘–πœ + πœ–π‘–]𝑑𝐹𝛽,π›½βˆ—Μƒ,πœ–(𝛽𝑖, 𝛽̃𝑖, πœ–)

𝛽𝑖,𝛽̃𝑖,πœ–π‘–

= 𝛼 + 𝔼[𝛽]𝑝̅ + 𝔼[𝛽̃]𝜏 (7)

where 𝐹𝛽,π›½βˆ—Μƒ,πœ– is the true distribution of (𝛽, π‘šπ›½, πœ–). The challenge is to use the observed values of triplets (𝑝̅, 𝜏, 𝔼[π‘ž|𝑝̅, 𝜏]) to infer aggregate deadweight loss, which is equivalent to its second order approximation around 𝜏 = 0:

π·π‘ŠπΏ = βˆ’1

2∫ π‘šπ‘–2𝛽𝑖𝑑𝐹𝛽,π‘š(𝛽𝑖, π‘šπ‘–)

𝛽𝑖,π‘šπ‘–

𝜏2 = βˆ’1

2𝔼[π‘š2𝛽]𝜏2= βˆ’1

2𝔼[π‘š 𝛽̃]𝜏2

The only restriction that the econometrician imposes on the distribution of tax salience π‘š is that the support of tax salience is contained within the interval [0, π‘šΜ… ].30 The econometrician can also use the CLD as in lemma 1, so that β„™[𝛽 ≀ 0] = 1. In fact, we can permit the

econometrician to know the entire distribution of πœƒ = (𝛽, πœ–). It will not affect our results.

First, we can find a homogeneous perceived price that rationalizes the data for any 𝜏. A linear regression of aggregate demand on sticker prices and taxes may permit identification of 𝛽̂ ≑ 𝔼[𝛽] and 𝛽̃̂ ≑ 𝔼[𝛽̃], respectively.31 We define a measure of central tendency of tax salience:32

π‘šΜ‚ ≑𝛽̃̂

𝛽̂

Then the homogeneous perceived price that rationalizes the data is 𝑝̂𝑠 = 𝑝̅ + π‘šΜ‚ 𝜏. To see this, note that assuming all agents have tax salience π‘šπ‘– = π‘šΜ‚ yields aggregate demand as in equation (7):

30 We assume that π‘šΜ… is sufficiently small so that π‘žπ‘–β‰₯ 0 with probability one, ruling out instances of negative consumption.

31 Such identification requires exogenous & non-collinear variation in sticker prices & taxes. If the econometrician cannot identify these terms, so much the worse for identifying aggregate deadweight loss.

32 If 𝛽̂ = 0, then let π‘šΜ‚ = 0.

38

Thus, the agent cannot rule out all agents perceiving the same price 𝑝̂𝑠, and so cannot rule out π‘šπ‘– = π‘šΜ‚ βˆ€π‘–. For tax 𝜏, this would yield deadweight loss, which by Theorem 1 would be a lower bound:

π·π‘ŠπΏπ‘™π‘œπ‘€ = βˆ’1 2π‘šΜ‚ π›½ΜƒΜ‚πœ2

Alternatively, the econometrician cannot rule out the perceived tax πœπ‘  having support in {0, π‘šΜ… 𝜏}. To see this, consider β„™(𝑝𝑠 = 𝑝̅ + π‘šΜ… 𝜏) =π‘šΜ‚

π‘šΜ… and β„™(𝑝𝑠 = 𝑝̅) = 1 βˆ’π‘šΜ‚

π‘šΜ… independently of other parameters and regressors.33 This will rationalize aggregate demand:

∫ [𝛼 + 𝛽𝑖𝑝̅ +π‘šΜ‚

π‘šΜ… π›½π‘–π‘šΜ… πœπ‘–+ πœ–π‘–] 𝑑𝐹𝛽,πœ–(𝛽𝑖, πœ–π‘–)

𝛽𝑖,πœ–π‘–

= 𝛼 + 𝔼[𝛽]𝑝̅ + π‘šΜ‚ 𝔼[𝛽]𝜏

This yields deadweight loss for tax 𝜏:

π·π‘ŠπΏβ„Žπ‘–π‘”β„Ž = βˆ’1

For instance, if π‘šΜ… = 1, then the value of deadweight loss under a homogeneous perceived price is fraction π‘šΜ‚ of the above calculation of deadweight loss.

Proceeding from Theorem 2 in the previous subsection, we noted that there is a specific distribution of perceived prices on {𝑝̅, 𝑝̅ + π‘šΜ… 𝜏} that maximizes deadweight loss. We describe that distribution in Theorem 3, noting that it involves assigning high or low perceived prices

33 In the true distribution, it must be that π‘šΜ‚ ∈ [0, π‘šΜ… ]. Alternatively, one could check whether π‘šΜ‚ ∈ [0, π‘šΜ…] as a weak test of the null hypothesis that the tax salience is bounded within that interval.

39

based on the ratio of per-person deadweight loss to the change in consumption for that individual. But in this context:

𝑑𝑀𝑙𝑖

π‘žπ‘–(𝑝̅) βˆ’ π‘žπ‘–(𝑝𝑠) =πœπ‘  2

Thus, the distribution of tax salience independent of all other parameters and regressors in which β„™(π‘š = π‘šΜ… ) =π‘šΜ‚

π‘šΜ… and β„™(π‘š = 0) = 1 βˆ’π‘šπ‘šΜ‚Μ… maximizes deadweight loss. More generally, the econometrician cannot rule out this maximal value of deadweight loss so long as they cannot rule out the possibility of some distribution 𝐹 with 𝐹𝛽 = πΉπ›½βˆ— such that 𝑠𝑒𝑝𝑝(π‘š) ∈ {0, π‘šΜ… } with:

ℙ𝐹(π‘š = π‘šΜ… )𝔼𝐹[𝛽̃|π‘š = π‘šΜ… ] = π‘šΜ‚ 𝛽̂ = 𝛽̃̂

Mathematically, one can check that such a distribution rationalizes the data and yields the maximal value of deadweight loss:

𝛼 + 𝔼[𝛽]𝑝̅ + 𝔼𝐹[𝛽̃]𝜏 = 𝛼 + 𝛽̂𝑝̅ + ℙ𝐹(π‘š = π‘šΜ… )𝔼𝐹[𝛽̃|π‘š = π‘šΜ… ] 𝜏 = 𝛼 + 𝛽̂𝑝̅ + π›½ΜƒΜ‚πœ

βˆ’1

2𝔼𝐹[π‘š2𝛽] = βˆ’1

2ℙ𝐹(π‘š = π‘šΜ… )π‘šΜ… 𝔼𝐹[𝛽̃|π‘š = π‘šΜ… ] 𝜏2 = βˆ’1

2π‘šΜ… π›½ΜƒΜ‚πœ2= π·π‘ŠπΏβ„Žπ‘–π‘”β„Ž More intuitively, once one knows 𝛽̂ and 𝛽̃̂, one can rationalize the aggregate data. Since the ratio of deadweight loss to the change in quantity is constant, the relationship between tax salience and preference doesn’t matter upon attaining the observed aggregate demand.

Finally, consider a distribution with π‘š βŠ₯ (𝛽, πœ–) with 𝑠𝑒𝑝𝑝(π‘š) βŠ† {0, π‘šΜ‚ , π‘šΜ… }, β„™(π‘š = π‘šΜ‚ ) = πœ† and β„™(π‘š = π‘šΜ‚ |π‘š β‰  π‘šΜ‚ ) =π‘šΜ‚

π‘šΜ…. Varying πœ† from zero to one yields:

π·π‘ŠπΏ ∈ [βˆ’1

2π‘šΜ‚ π›½ΜƒΜ‚πœ2, βˆ’1

2π‘šΜ… π›½ΜƒΜ‚πœ2]

40

We can conclude from this result that one cannot even identify a second order approximation of deadweight loss with aggregate data alone.34 Imposing structure on preferences to facilitate identification of πΉπœƒβˆ— still only permits interval identification. Nonetheless, we can use aggregate data to obtain bounds, or at least π‘šΜ‚ , which gives us a sense of the uncertainty over the possible values of deadweight loss.

We showcase such an application in section B of the appendix, based on data on aggregate beer consumption from CLK (2009). We replicate the regressions they ran to estimate tax salience, but using a linear (rather than log-log) specification to match this subsection. Taking the ratio of averages across regressions of measures of aggregate demand responses to sales and excise tax variation, we estimate π‘šΜ‚ β‰ˆ 0.27. This estimate suggests that for π‘šΜ… β‰₯ 1, the upper bound of deadweight loss is more than four times the lower bound.