3. Non-Identification with Aggregate Continuous Choice Data
3.3 Linear Special Case
We conclude this section with a discussion of the special case in which π is known to be a linear function of πΜ and π (for fixed πππ). We focus on this example both because of how frequently economists estimate linear models and because of its relationship to the second
36
order approximation of deadweight loss. As demonstrated in section 2.3, one can express a second order approximation to deadweight loss as a function of derivatives. In the case where the choice function is linear in regressors πΜ & π, the second order approximation is an exact calculation of deadweight loss, and our results from the previous subsections apply.
Formally, each preference type ππ takes the form ππ = (π½π, ππ) β β2.29 To maintain linearity in regressors, we also assume that tax salience π is constant with respect to π. The choice function π then takes the form:
ππ = πΌ + π½ππππ + ππ = πΌ + π½π[πΜ + ππ ππ] + ππ
We are suppressing tie-breaking parameter π because ππ, πΜ π, ππ, and ππ always uniquely determine consumption. We have parameter πΌ so that we can assume without loss of generality that πΌ[π] = 0. Defining π½Μπ β‘ πππ½π yields:
ππ = πΌ + π½ππΜ + π½Μπππ+ ππ with corresponding deadweight loss per agent from equation (3):
ππ€ππ = β« [πΌ + π½ππ + ππ]ππ
We assume that the joint distribution of parameters remains unaffected by the specific values of πΜ and π. The econometrician observes for various values of regressors:
29 Agents have quasi-linear utility π’π=
ππ2
37
πΌ[π|πΜ , π] β‘ β« [πΌ + π½ππΜ + π½Μππ + ππ]ππΉπ½,π½βΜ,π(π½π, π½Μπ, π)
π½π,π½Μπ,ππ
= πΌ + πΌ[π½]πΜ + πΌ[π½Μ]π (7)
where πΉπ½,π½βΜ,π is the true distribution of (π½, ππ½, π). The challenge is to use the observed values of triplets (πΜ , π, πΌ[π|πΜ , π]) to infer aggregate deadweight loss, which is equivalent to its second order approximation around π = 0:
π·ππΏ = β1
2β« ππ2π½πππΉπ½,π(π½π, ππ)
π½π,ππ
π2 = β1
2πΌ[π2π½]π2= β1
2πΌ[π π½Μ]π2
The only restriction that the econometrician imposes on the distribution of tax salience π is that the support of tax salience is contained within the interval [0, πΜ ].30 The econometrician can also use the CLD as in lemma 1, so that β[π½ β€ 0] = 1. In fact, we can permit the
econometrician to know the entire distribution of π = (π½, π). It will not affect our results.
First, we can find a homogeneous perceived price that rationalizes the data for any π. A linear regression of aggregate demand on sticker prices and taxes may permit identification of π½Μ β‘ πΌ[π½] and π½ΜΜ β‘ πΌ[π½Μ], respectively.31 We define a measure of central tendency of tax salience:32
πΜ β‘π½ΜΜ
π½Μ
Then the homogeneous perceived price that rationalizes the data is πΜπ = πΜ + πΜ π. To see this, note that assuming all agents have tax salience ππ = πΜ yields aggregate demand as in equation (7):
30 We assume that πΜ is sufficiently small so that ππβ₯ 0 with probability one, ruling out instances of negative consumption.
31 Such identification requires exogenous & non-collinear variation in sticker prices & taxes. If the econometrician cannot identify these terms, so much the worse for identifying aggregate deadweight loss.
32 If π½Μ = 0, then let πΜ = 0.
38
Thus, the agent cannot rule out all agents perceiving the same price πΜπ , and so cannot rule out ππ = πΜ βπ. For tax π, this would yield deadweight loss, which by Theorem 1 would be a lower bound:
π·ππΏπππ€ = β1 2πΜ π½ΜΜπ2
Alternatively, the econometrician cannot rule out the perceived tax ππ having support in {0, πΜ π}. To see this, consider β(ππ = πΜ + πΜ π) =πΜ
πΜ and β(ππ = πΜ ) = 1 βπΜ
πΜ independently of other parameters and regressors.33 This will rationalize aggregate demand:
β« [πΌ + π½ππΜ +πΜ
πΜ π½ππΜ ππ+ ππ] ππΉπ½,π(π½π, ππ)
π½π,ππ
= πΌ + πΌ[π½]πΜ + πΜ πΌ[π½]π
This yields deadweight loss for tax π:
π·ππΏβππβ = β1
For instance, if πΜ = 1, then the value of deadweight loss under a homogeneous perceived price is fraction πΜ of the above calculation of deadweight loss.
Proceeding from Theorem 2 in the previous subsection, we noted that there is a specific distribution of perceived prices on {πΜ , πΜ + πΜ π} that maximizes deadweight loss. We describe that distribution in Theorem 3, noting that it involves assigning high or low perceived prices
33 In the true distribution, it must be that πΜ β [0, πΜ ]. Alternatively, one could check whether πΜ β [0, πΜ ] as a weak test of the null hypothesis that the tax salience is bounded within that interval.
39
based on the ratio of per-person deadweight loss to the change in consumption for that individual. But in this context:
ππ€ππ
ππ(πΜ ) β ππ(ππ ) =ππ 2
Thus, the distribution of tax salience independent of all other parameters and regressors in which β(π = πΜ ) =πΜ
πΜ and β(π = 0) = 1 βππΜΜ maximizes deadweight loss. More generally, the econometrician cannot rule out this maximal value of deadweight loss so long as they cannot rule out the possibility of some distribution πΉ with πΉπ½ = πΉπ½β such that π π’ππ(π) β {0, πΜ } with:
βπΉ(π = πΜ )πΌπΉ[π½Μ|π = πΜ ] = πΜ π½Μ = π½ΜΜ
Mathematically, one can check that such a distribution rationalizes the data and yields the maximal value of deadweight loss:
πΌ + πΌ[π½]πΜ + πΌπΉ[π½Μ]π = πΌ + π½ΜπΜ + βπΉ(π = πΜ )πΌπΉ[π½Μ|π = πΜ ] π = πΌ + π½ΜπΜ + π½ΜΜπ
β1
2πΌπΉ[π2π½] = β1
2βπΉ(π = πΜ )πΜ πΌπΉ[π½Μ|π = πΜ ] π2 = β1
2πΜ π½ΜΜπ2= π·ππΏβππβ More intuitively, once one knows π½Μ and π½ΜΜ, one can rationalize the aggregate data. Since the ratio of deadweight loss to the change in quantity is constant, the relationship between tax salience and preference doesnβt matter upon attaining the observed aggregate demand.
Finally, consider a distribution with π β₯ (π½, π) with π π’ππ(π) β {0, πΜ , πΜ }, β(π = πΜ ) = π and β(π = πΜ |π β πΜ ) =πΜ
πΜ . Varying π from zero to one yields:
π·ππΏ β [β1
2πΜ π½ΜΜπ2, β1
2πΜ π½ΜΜπ2]
40
We can conclude from this result that one cannot even identify a second order approximation of deadweight loss with aggregate data alone.34 Imposing structure on preferences to facilitate identification of πΉπβ still only permits interval identification. Nonetheless, we can use aggregate data to obtain bounds, or at least πΜ , which gives us a sense of the uncertainty over the possible values of deadweight loss.
We showcase such an application in section B of the appendix, based on data on aggregate beer consumption from CLK (2009). We replicate the regressions they ran to estimate tax salience, but using a linear (rather than log-log) specification to match this subsection. Taking the ratio of averages across regressions of measures of aggregate demand responses to sales and excise tax variation, we estimate πΜ β 0.27. This estimate suggests that for πΜ β₯ 1, the upper bound of deadweight loss is more than four times the lower bound.