• No results found

Examine the distribution of effect sizes and the impact of moderating variables

1.4 Meta-analysis

1.4.4 Examine the distribution of effect sizes and the impact of moderating variables

The point estimates that we collect for this meta-analysis come from primary studies that vary in many aspects. The objective of this section is to present the statistical analysis of evidence across empirical studies. The analysis is divided in three sections. Section 1.4.4.1 examines the distribution of effect sizes, and section 1.4.4.2 presents summary statistics for the price differential on property prices for floodplain location and examines the heterogeneity of previous results using a meta-regression analysis to assess the impact of moderating variables. Throughout the analysis we emphasise the theoretical, methodological and contextual differences among primary studies.

1.4.4.1 Examine the distribution of effect sizes

Figure 1.4 shows the distribution of the 349 effect sizes included in the meta-sample. 70% of the observations suggest that properties located in the floodplain are sold at lower prices than comparable properties not in flood risk region; the remaining observations suggest prices of properties in the floodplain are higher. The mean of the distribution is about - 0.06, and the distribution peaks around -0.02. Ninety percent of the observations lie between -0.37 and +0.09. However, the effect sizes correspond to different levels of risk

14

Daniel, Florax and Rietveld (2009a) do not provide a specific explanation for the exclusion of the estimates by Bin and Kruse (2006).

15

Daniel, Florax and Rietveld (2009a) include six estimates by MacDonald, Murdoch and White (1987) and MacDonald et al. (1990) and three by Dei-Tutu and Bin (2002). They compute the effect sizes based on estimated sales prices resulting from assuming hypothetical values of the characteristics of a house, for three different types of houses: below average, average and above average properties.

and come from primary studies which vary in several aspects. Figures 1.5 through 1.7 show the distribution of effect sizes grouping the estimates by different characteristics such as the level of risk, type of flood risk and methodology. The main descriptive statistics of the distributions appear in table 1.4.

Figure 1.4. Effect size: Distribution density plot

Figure 1.5.A shows the distribution of effect sizes by level of risk. Out of the total 349 effect sizes, 256 correspond to estimates for properties in a 100-year floodplain (1% annual probability of flooding) and 93 correspond to estimates for properties in a 500-year floodplain (0.2% annual probability of flooding). As expected, the distribution of estimates from properties in the 100-year floodplain has a larger mean discount, compared to the properties in the 500-year floodplain, and is also the one which shows the most extreme values in the left-tail of the distribution. Nevertheless, it is also the distribution that shows the highest premiums and the most extreme values in the right-tail. As Bin and Kruse (2006) and Bin et al. (2008) suggest, it might be that high positive values for properties in the 100-year floodplain correspond to locations where proximity to water is also associated to important amenity values, such as coastal regions.

To examine this possibility figure 1.5.B shows the effect sizes divided by the type of floods they represent, river flooding (fluvial) or coastal flooding. Both types possess different characteristics and their potential impacts are also different. The former are likely to be a result of heavy rain events whereas the latter are usually a result of storm surges created by storms like hurricanes and tropical cyclones. The distributions are skewed in opposite directions; the one for river flooding shows the highest discounts with a mean around -7%, whereas that for coastal flooding shows the highest premiums with a mean around 3%; both distributions nonetheless have a negative mode. Figures 1.5.C and 1.5.D explore the distribution of effect sizes for river and coastal flood risk by different levels of risk. In both cases largest negative values and fatter left tails correspond to regions with higher flood risk. Coastal flood risk is associated with smaller discounts and higher premiums. It is possible to conclude that extreme values in opposite tails of the distributions of 100-year and 500-year floodplain in figure 1.5.A correspond to flood risk from different sources. This pattern is likely to reflect the difficulties of isolating the value of risk in coastal regions where proximity to water is highly correlated with coastal amenities such as, waterfrontage and proximity to beach.

The two-sample Kolmogorov-Smirnov (K-S) distance statistic is also reported below each pair of distributions. This statistic is commonly used to compare two empirical distributions under the null hypothesis that the samples are drawn from the same distribution. Rejection of the null is regarded as evidence indicating the two distributions are statistically different, i.e. the samples are drawn from two different populations. Values of the K-S statistic in figure 1.5 suggest a significant difference between the distributions of estimates from different levels of risk and different sources of flooding, something which requires further research.

Figure 1.5. Effect size: Distribution density plots for different levels of risk and different types of flooding

Figure 1.5.A. 100 and 500-year Figure 1.5.B. River and Coastal

K-S: 0.25*** K-S: 0.29***

Figure 1.5.C. River: 100 and 500-year Figure 1.5.D. Coastal: 100 and 500-year

K-S: 0.32*** K-S: 0.60**

Notes: K-S represents the two-sample Kolmogorov-Smirnov statistic. H0: the samples are drawn from the same population. *, ** and *** means rejection of the null hypothesis at the 10%, 5% and 1% significance level.

Empirical studies also differ in the theoretical relationship of interest they estimate and, consequently, the econometric approach they use. Figure 1.6.A shows the distribution of effect sizes according to the corresponding methodology. The distribution of estimates using standard hedonic models shows the largest premiums, whereas that of estimates using a DID model shows the largest discounts. As shown in figure 1.6.B, these large discounts correspond to post-flood estimates, in this case the discount is expected to be large as the risk has become salient and homeowners might have experienced flood damages. This evidence is consistent with the idea that recent experience with flooding

raises perception of risk; although, as Hallstrom and Smith (2005) and Atreya and Ferreira (2012a) point out, the large negative values of post-flood estimates might also be due to storm damages when these are not properly accounted for elsewhere. In both cases the K-S statistic suggests that the distributions come from different populations.

Figure 1.6. Effect size: Distribution density plots by methodologies and timing with respect to flood event

Figure 1.6.A Standard and DID hedonic Figure 1.6.B DID: before and after

K-S: 0.18*** K-S: 0.44***

Notes: K-S represents the two-sample Kolmogorov-Smirnov statistic. H0: the samples are drawn from the same population. *, ** and *** means rejection of the null hypothesis at the 10%, 5% and 1% significance level.

In order to explore further the evidence from DID hedonic models, figure 1.7 shows the distribution of effect sizes with respect to the time of the flood event, by different types of flooding and by different levels of risk. Figures 1.7.A and 1.7.B refers to river flood risk for properties in the 100-year and 500-year floodplain, respectively. There seems to be some pre-flood capitalisation of risk for properties in the 100-year floodplain; however, properties in the 500-year floodplain do not seem to be discounted before a flood. In both cases, the occurrence of a flood seems to raise the perception of risk. The mean value of the distribution for properties in the 100-year floodplain goes from a pre-flood discount of 5% to a post-flood discount around 17%, whereas for properties in the 500-year floodplain it goes from a 3% pre-flood premium to a 11% post-flood discount. Notice the change in the mean value is greater for those properties in the 500-year floodplain. Kousky (2010)

suggests that differences in the capitalisation of risk after a flood event might arise due to information issues. Individuals in the 100-year floodplain tend to be more aware about the level of risk they face (this might be due to risk disclosure policies), thus the occurrence of a flood might not imply such an information update as it might be for individuals in the 500-year floodplain, which tend to have none or little information about the risk until it is brought to mind by flooding experiences. As Carbone, Hallstrom, and Smith (2006) suggests, in some cases the occurrence of a flood for individuals in the 100-year floodplain might simply be a realization from a known probability distribution, in which case no flood risk update is expected (see for example Kousky, 2010).

It might also be the case of post-flood estimates being higher than pre-flood ones. As it appears in figure 1.7.A, some post-flood values in the right tail of the distribution imply greater premiums than pre-flood estimates. Montz (1992) and Tobin and Montz (1994) suggest that this pattern could be a result of improvements in housing conditions due to repairs and/or investment to improve house quality after a flood.

The meta-sample includes only three studies applying a DID approach to estimate the implicit price of coastal flood risk, namely Hallstrom and Smith (2005), Morgan (2007) and Samarasinghe and Sharp (2010). Figure 1.7.C shows the distribution of the effect sizes collected from these studies, all of them focus in properties in the 100-year floodplain. These estimates are consistent with Hallstrom and Smith (2005), Bin and Kruse (2006) and Bin et al. (2008), which emphasise the difficulty of identifying the implicit price of flood risk in coastal regions due to the presence of confounder amenity values. Although the evidence suggests important pre-flood and post-flood premiums for location in a floodplain

near to the coast, in all cases the occurrence of a flood decreases the associated premium (in some cases it becomes negative).

Figure 1.7. Effect size: Distribution density plots for estimates using DID models, by different type of flood risk, level of risk and timing with respect to flood event Figure 1.7.A River flooding 100-year Figure 1.7.B River flooding 500-year

K-S: 0.42*** K-S: 0.84***

Figure 1.7.C Coastal flooding 100-year

K-S: 0.57

Notes: K-S represents the two-sample Kolmogorov-Smirnov statistic. H0: the samples are drawn from the same population. *, ** and *** means rejection of the null hypothesis at the 10%, 5% and 1% significance level.

Summarising, estimates for properties located within a 100-year floodplain show a greater discount than those for properties located in a 500-year floodplain. There seems to be a difference in the capitalisation of flood risk in river and coastal properties; the evidence highlights the difficulty of isolating the implicit price of risk in coastal regions where proximity is also associated to important benefits. In general, post-flood estimates imply a

higher discounts, this is the case across regions with different levels of risk and different types of flooding. This evidence supports the widespread idea that recent floods provide new information to homeowners to update their flood risk perception; however, pre-flood information available appears to play a role in determining the extent of the update.

Table 1.4. Effect size: Summary statistic for point estimates grouped by different categories

No.

Grouped by

Num.

Obs. Mean Mode S.D. Min Max 90% Interval

1 0 0 y ea r 5 0 0 y ea r Riv er Co a sta l H ed o n ic D ID H ed o n ic Be fo re Afte r 1  X 256 -0.07 -0.02 0.16 -0.75 0.61 [-0.44; 0.10] 2  X 93 -0.03 -0.02 0.10 -0.37 0.14 [-0.21; 0.09] 3  X 314 -0.07 -0.02 0.14 -0.75 0.14 [-0.38; 0.07] 4  X 35 0.03 -0.06 0.16 -0.20 0.61 [-0.16; 0.32] 5  X  X 226 -0.08 -0.04 0.15 -0.75 0.13 [-0.45; 0.04] 6  X X 88 -0.03 -0.02 0.10 -0.37 0.14 [-0.21; 0.09] 7  X  X 30 0.05 -0.06 0.17 -0.20 0.61 [-0.16; 0.32] 8  X  X 5 -0.07 -0.06 0.02 -0.10 -0.05 [-0.10; -0.05] 9  138 -0.03 -0.01 0.10 -0.42 0.61 [-0.16; 0.08] 10  211 -0.08 -0.02 0.17 -0.75 0.32 [-0.50; 0.08] 11   101 -0.01 0.00 0.09 -0.21 0.32 [-0.17; 0.09] 12   110 -0.14 -0.05 0.20 -0.75 0.16 [-0.57; 0.07] 13     63 -0.05 -0.03 0.07 -0.21 0.05 [-0.19; 0.04] 14     71 -0.17 -0.04 0.23 -0.75 0.13 [-0.68; 0.07] 15     31 0.03 0.00 0.04 -0.05 0.09 [-0.04; 0.08] 16     32 -0.11 -0.02 0.11 -0.37 0.01 [-0.34; 0.01] 17     7 0.12 0.16 0.14 -0.06 0.32 [-0.06; 0.32] 18     7 0.00 -0.11 0.10 -0.11 -0.16 [-0.11; 0.16] Full Sample 349 -0.06 -0.02 0.15 -0.75 0.61 [-0.37; 0.09]

Source: Own elaboration based on results from primary studies.

Finally, it is important to be cautious when interpreting this evidence. As we subdivided the meta-sample for the analysis, the number of observations of the different groups markedly decreases; this is especially true for estimates from coastal regions. Furthermore, although analysing the distribution of effect sizes is useful to have a general overview of the meta-sample and to explore its variability, this analysis is rather limited. When comparing distributions it is only possible to control for one single aspect, for instance the level of risk, the type of flooding or the timing with respect to a flood event; however all

other characteristics of the studies are allowed to vary and therefore no conclusions can be drawn. Section 1.4.4.2 presents the results of a meta-regression analysis, which allow us to explore the drivers of heterogeneity in the results from primary studies, while controlling for other multiple attributes that might cause the effect sizes to vary.

1.4.4.2 Meta-analysis

The objective of this meta-analysis is to gain more insight from information of multiple studies using a weighted average to summarise and combine the results. There are two popular models used for this purpose the fixed-effect model and the random-effects model. In both cases weights are usually assigned using the inverse of the error variance so that more weight is assigned to more precise studies, i.e. those which carry more information. The crucial difference between these two models lies in the assumptions they use to define the error variance.

In the fixed-effect model it is assumed that all studies included in the meta-analysis share a common true effect size, differences in observed effects arise only due to sampling error, i.e. if the sample size for each of the studies were infinite the observed effect for all cases would be the same as the true effect size. However because studies commonly differ in implementation and underlying population, among others, the assumption of the fixed- effect model is often implausible. The random-effects model allows the true effect size to differ from study to study. It assumes a distribution of true effect sizes, and the goal is to estimate the mean of the distribution, i.e. if it were possible to implement an infinite number of studies the true effect size of these studies would be distributed around some mean. Equations (40.a) and (40.b) shows the observed effect size 𝑇, as defined above, for any study 𝑖 under the fixed-effect and random-effects model, respectively:

Fixed-effect Random-effects

𝑇𝑖 = 𝜂 + 𝜉𝑖 (40.a) 𝑇𝑖 = 𝜇 + 𝜑𝑖+ 𝜉𝑖 (40.b)

where 𝜂 represents the common true effect size that all studies share in the fixed-effect model; 𝜇 is the mean of the distribution of true effect sizes that all studies share in the random-effects model (grand mean); 𝜉𝑖 is the difference between the true mean of study 𝑖 (𝜂𝑖) and its observed mean (𝑇𝑖), i.e. 𝜉𝑖 = 𝑇𝑖− 𝜂𝑖 for both models, and 𝜑𝑖 is the difference

between the grand mean of the random-effects model (𝜇) and true mean (𝜂𝑖) for study 𝑖,

i.e. 𝜑𝑖 = 𝜂𝑖 − 𝜇. Thus in the case of the random-effects model there are two sources of variation: true variation in effect sizes (𝜑𝑖) and sampling error (𝜉𝑖).

At this point the difference between the fixed-effect and random-effects model when using the inverse of the error variance as a weighting scheme should be evident. Under the fixed- effect model the overall study variance is simply the within-study variance (𝜎𝑖2), whereas

for the random-effects model it has two components: the within study error variance (𝜎𝑖2)

and the between-study variance (𝜏2), that is the variance of the distribution of true effect

sizes. Therefore, the weights (𝑊𝑖) assigned to each study under the fixed-effect model and

random-effects model are given by:

Fixed-effect Random-effects

𝑊𝑖 = 𝜎1

𝑖2

(41.a) 𝑊𝑖 = 1

𝜎𝑖2+𝜏2 (41.b)

The implication is that, whenever 𝜏2 ≠ 0, relative weights assigned to each study are more

balanced under the random-effects model. However, it is important to bear in mind that we are estimating two different parameters. In the fixed-effect model we estimate the common effect size (𝜂) in the studies that are observed, whereas in the random-effects model is the

mean of a hypothetical population of studies (𝜇), which includes studies that are not observed.

A common practice in meta-analysis literature is to use the results of the heterogeneity test to select the more appropriate model: random-effects or fixed-effect. Here the term heterogeneity is understood as the variation in the true effect sizes. The main statistic for this purpose is the Q-statistic (Cochran, 1954), defined as:

𝑄 = ∑ 𝑊𝑖(𝑇𝑖− 𝜂)2 𝑘

𝑖=1

(42)

where 𝑊𝑖 is the weighting factor for the 𝑖th study assuming a fixed-effect model (1 𝜎⁄ ), 𝑇𝑖2 𝑖

is the study effect size, 𝜂 is the combined effect under the fixed-effect model and 𝑘 is the number of studies. Thus the Q-statistic represents the observed weighted sum of squares (WSS) which reflects the total dispersion. Notice that the expected value of 𝑄 under the assumptions of the fixed-effect model (all the variation is due to sampling error) is simply the degrees of freedom (𝑑𝑓 = 𝑘 − 1, where 𝑘 represents the number of studies), since 𝑄 is a standardized measure and the expected value does not depend on the metric of the effect size. The difference:

𝑄 − 𝑑𝑓 (43)

represents the excess variation, i.e. that attributed to differences in the true effects from study to study.

The heterogeneity test consists of testing the assumption of heterogeneity in effects using the statistic 𝑄 and 𝑑𝑓. The null hypothesis is that all studies share a common effect size.

Under the null 𝑄 follows a central 𝜒2 distribution with 𝑘 − 1 degrees of freedom. Thus, a

rejection of the null is usually interpreted as evidence for the random-effects model. However, Borenstein et al. (2009) emphasises that the selection of the appropriate model should be based on our understanding of whether or not all studies share a common effect size and not on the outcome of the Q-statistic as it often has poor power, especially when dealing with small samples.

Another common measure of dispersion is the 𝐼2 statistic. This was proposed by Higgins et

al. (2003) and describes the ratio of excess dispersion to total dispersion, i.e. what proportion of the observed variance is real. The 𝐼2 statistic is given by:

𝐼2 = (𝑄 − 𝑑𝑓

𝑄 ) × 100 (44)

where 𝑄 and 𝑑𝑓 are defined as in equations (42) and (43). A value of 𝐼2 close to zero suggests that almost all observed variance is due to sampling error, which means that there is little, or nothing, to explain. On the other hand, a large value of 𝐼2 is interpreted as evidence of real variation and supports the use of meta-regression analysis to identify possible causes. Higgins et al. (2003) suggest that values on the order of 25%, 50% and 75% can be considered as an indicator of low, moderate and high variation, respectively; no critical value is proposed.

In our case, as in most meta-analysis in economics, the assumptions of the fixed-effect model seems implausible. Table 1.5 reports the summary statistics obtained with the random-effects meta-analysis over the 349 effect sizes and different subgroups according to the level of risk, the type of risk and the econometric technique for estimation. The table also reports the 90% confidence interval, the between study variance 𝜏2, the Q-statistic and

the 𝐼2. However, this approach treats each observation as a separate study, which results in

more weight assigned to studies reporting more than one outcome. To address this issue, table 1.5 also shows the meta-analysis summary statistics using another popular weighting scheme; assigning weights according to the sample size considered for each study. The weight assigned to each observation corresponds to the average sample size of each study, divided by the number of estimates that the study provides to the final meta-sample. In this way, studies are assigned more weight based on the total information that they contain and not because of providing a higher number of estimates. Thus, we prefer the interpretation of results using the later approach.

Notice that the summary effect sizes for the sub-sample of estimates in the 100-year floodplain suggests a price premium of 3%. However, it is possible to see that this average premium is driven by properties in coastal regions. If we subdivide the sample according to different type of flooding, for properties subject to river flooding in the 100-year floodplain the summary statistic suggests a discount of around 5%, whereas for properties in coastal regions at the same level of risk the results suggest a premium around 14%.

For properties in river regions in the 100-year floodplain there appears to be a significant discount of around 3%, even if we focus only on evidence from DID models before a