INFERENCE IN MULTIPLE REGRESSION

We examine five inferential methods in this chapter:

1. The t-test for the relationship between the response variable $y$ and a particular predictor variable $x_i$, in the presence of the other predictor variables, $x_{(i)}$, where $x_{(i)} = x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_m$ denotes the set of all predictors not including $x_i$.

2. The F-test for the significance of the regression as a whole.

3. The confidence interval for $\beta_i$, the slope of the $i$th predictor variable.

4. The confidence interval for the mean of the response variable $y$ given a set of particular values for the predictor variables $x_1, x_2, \ldots, x_m$.

5. The prediction interval for a random value of the response variable $y$ given a set of particular values for the predictor variables $x_1, x_2, \ldots, x_m$.


t-Test for the Relationship Between $y$ and $x_i$

The hypotheses for a t-test between $y$ and $x_i$ are given by

• $H_0$: $\beta_i = 0$

• $H_a$: $\beta_i \neq 0$

The models implied by these hypotheses are given by:

• Under $H_0$: $y = \beta_0 + \beta_1 x_1 + \cdots + \beta_{i-1} x_{i-1} + \beta_{i+1} x_{i+1} + \cdots + \beta_m x_m + \varepsilon$

• Under $H_a$: $y = \beta_0 + \beta_1 x_1 + \cdots + \beta_{i-1} x_{i-1} + \beta_i x_i + \beta_{i+1} x_{i+1} + \cdots + \beta_m x_m + \varepsilon$

Note that the only difference between the two models is the presence or absence of the $i$th term. All other terms are the same in both models. Therefore, interpretations of the results for this t-test must include some reference to the other predictor variables being held constant.

Under the null hypothesis, the test statistic $t = b_i / s_{b_i}$ follows a t-distribution with $n - m - 1$ degrees of freedom, where $s_{b_i}$ refers to the standard error of the slope for the $i$th predictor variable. We proceed to perform the t-test for each of the predictor variables in turn, using the results displayed in Table 3.1.

t-Test for the Relationship Between Nutritional Rating and Sugars

• $H_0$: $\beta_1 = 0$; model: $y = \beta_0 + \beta_2(\text{fiber}) + \varepsilon$.

• $H_a$: $\beta_1 \neq 0$; model: $y = \beta_0 + \beta_1(\text{sugars}) + \beta_2(\text{fiber}) + \varepsilon$.

• In Table 3.1, under “Coef” in the “Sugars” row is found the value of $b_1$, $-2.2090$.

• Under “SE Coef” in the “Sugars” row is found the value of $s_{b_1}$, the standard error of the slope for sugar content. Here $s_{b_1} = 0.1633$.

• Under “T” is found the value of the t-statistic, that is, the test statistic for the t-test,

$$t = \frac{b_1}{s_{b_1}} = \frac{-2.2090}{0.1633} = -13.53$$

• Under “P” is found the p-value of the t-statistic. Since this is a two-tailed test, this p-value takes the form p-value $= P(|t| > |t_{\text{obs}}|)$, where $t_{\text{obs}}$ represents the value of the t-statistic observed from the regression results. Here p-value $= P(|t| > |t_{\text{obs}}|) = P(|t| > 13.53) \approx 0.000$, although of course no continuous p-value ever equals precisely zero.

The p-value method is used, whereby the null hypothesis is rejected when the p-value of the test statistic is small. Here we have p-value $\approx 0.000$, which is smaller than any reasonable threshold of significance. Our conclusion is therefore to reject the null hypothesis. The interpretation of this conclusion is that there is evidence for a linear relationship between nutritional rating and sugar content in the presence of fiber content.


JWDD006-03 JWDD006-Larose November 25, 2005 17:26 Char Count= 0

102 CHAPTER 3 MULTIPLE REGRESSION AND MODEL BUILDING

t-Test for the Relationship Between Nutritional Rating and Fiber Content

• $H_0$: $\beta_2 = 0$; model: $y = \beta_0 + \beta_1(\text{sugars}) + \varepsilon$.

• $H_a$: $\beta_2 \neq 0$; model: $y = \beta_0 + \beta_1(\text{sugars}) + \beta_2(\text{fiber}) + \varepsilon$.

• In Table 3.1, under “Coef” in the “Fiber” row is found $b_2 = 2.8408$.

• Under “SE Coef” in the “Fiber” row is found the standard error of the slope for fiber content, $s_{b_2} = 0.3032$.

• Under “T” is found the test statistic for the t-test,

$$t = \frac{b_2}{s_{b_2}} = \frac{2.8408}{0.3032} = 9.37$$

• Under “P” is found the p-value of the t-statistic. Again, p-value $\approx 0.000$. Thus, our conclusion is again to reject the null hypothesis. We interpret this to mean that there is evidence for a linear relationship between nutritional rating and fiber content in the presence of sugar content.
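The two t-tests above can be checked numerically. The following is a minimal sketch in standard-library Python, not the software that produced Table 3.1. The function names `t_density` and `two_tailed_p_value` are hypothetical helpers introduced here, the tail area is approximated by midpoint-rule integration rather than read from a table, and the 74 error degrees of freedom are taken to be the $n - m - 1$ listed in the “DF” column of the Analysis of Variance table.

```python
import math


def t_density(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(x * x / df))


def two_tailed_p_value(t_obs, df, steps=20000):
    """Approximate P(|T| > |t_obs|) by midpoint-rule integration of one tail.

    The substitution u = 1/x maps the infinite tail (|t_obs|, inf) onto the
    finite interval (0, 1/|t_obs|), which is then integrated and doubled.
    """
    a = abs(t_obs)
    if a == 0.0:
        return 1.0
    h = (1.0 / a) / steps
    tail = 0.0
    for k in range(steps):
        u = (k + 0.5) * h  # midpoint of the k-th subinterval
        tail += t_density(1.0 / u, df) / (u * u) * h
    return min(1.0, 2.0 * tail)


DF_ERROR = 74  # n - m - 1, assumed from the DF column of the ANOVA table

# Coefficients and standard errors quoted from Table 3.1
t_sugars = -2.2090 / 0.1633
t_fiber = 2.8408 / 0.3032

print(round(t_sugars, 2))  # -13.53
print(round(t_fiber, 2))   # 9.37
print(two_tailed_p_value(t_sugars, DF_ERROR))  # tiny but strictly positive
print(two_tailed_p_value(t_fiber, DF_ERROR))   # likewise far below 0.001
```

Integrating the tail directly, instead of computing one minus the central area, keeps a very small p-value from being lost to rounding. This is why the result is tiny yet not exactly zero, in line with the remark that no continuous p-value equals precisely zero.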

F-Test for the Significance of the Overall Regression Model

Next we introduce the F-test for the significance of the overall regression model. Figure 3.4 illustrates the difference between the t-test and the F-test. One may apply a separate t-test for each predictor $x_1$, $x_2$, or $x_3$, examining whether a linear relationship exists between the target variable $y$ and that particular predictor. On the other hand, the F-test considers the linear relationship between the target variable $y$ and the set of predictors (e.g., $\{x_1, x_2, x_3\}$) taken as a whole.

Figure 3.4 The F-test considers the relationship between the target and the set of predictors, taken as a whole. (Diagram: a separate t-test links each of $x_1$, $x_2$, $x_3$ to $y$; a single F-test links the set $\{x_1, x_2, x_3\}$ to $y$.)

The hypotheses for the F-test are given by

• $H_0$: $\beta_1 = \beta_2 = \cdots = \beta_m = 0$.

• $H_a$: At least one of the $\beta_i$ does not equal 0.

The null hypothesis asserts that there is no linear relationship between the target variable $y$ and the set of predictors, $x_1, x_2, \ldots, x_m$. Thus, the null hypothesis states that the coefficient $\beta_i$ for each predictor $x_i$ exactly equals zero, leaving the null model to be

• Model under $H_0$: $y = \beta_0 + \varepsilon$

The alternative hypothesis does not assert that the regression coefficients all differ from zero. For the alternative hypothesis to be true, it is sufficient for a single, unspecified regression coefficient to differ from zero. Hence, the alternative hypothesis for the F-test does not specify a particular model, since it would be true if any, some, or all of the coefficients differed from zero.

As shown in Table 3.2, the F-statistic consists of a ratio of two mean squares: the mean square regression (MSR) and the mean square error (MSE). A mean square represents a sum of squares divided by the degrees of freedom associated with that sum of squares statistic. Since the sums of squares are always nonnegative, so are the mean squares. To understand how the F-test works, we should consider the following. The MSE is always a good estimate of the overall variance $\sigma^2$ (see model assumption 2), regardless of whether or not the null hypothesis is true. (In fact, recall that we use the standard error of the estimate, $s = \sqrt{\text{MSE}}$, as a measure of the usefulness of the regression, without reference to an inferential model.) Now, the MSR is also a good estimate of $\sigma^2$, but only on the condition that the null hypothesis is true. If the null hypothesis is false, MSR overestimates $\sigma^2$.

So consider the value of $F = \text{MSR}/\text{MSE}$ with respect to the null hypothesis. Suppose that MSR and MSE are close to each other, so that the value of $F$ is small (near 1.0). Since MSE is always a good estimate of $\sigma^2$, and MSR is only a good estimate of $\sigma^2$ when the null hypothesis is true, the circumstance that MSR and MSE are close to each other will occur only when the null hypothesis is true. Therefore, when the value of $F$ is small, this is evidence that the null hypothesis is true.

However, suppose that MSR is much greater than MSE, so that the value of $F$ is large. MSR is large (overestimates $\sigma^2$) when the null hypothesis is false. Therefore, when the value of $F$ is large, this is evidence that the null hypothesis is false. Therefore, for the F-test, we shall reject the null hypothesis when the value of the test statistic $F$ is large.
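The behavior of MSR and MSE described above can be illustrated with a small simulation. This is a sketch under simplifying assumptions, not an analysis of the cereals data: it uses a single predictor ($m = 1$) so that the least-squares fit needs no matrix algebra, and the intercept, slope, error standard deviation, sample size, and helper names (`mean_squares`, `average_mean_squares`) are all hypothetical choices made for illustration.

```python
import random


def mean_squares(x, y):
    """Return (MSR, MSE) for a least-squares line fitted to (x, y), so m = 1."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    ssr = sum((b0 + b1 * xi - y_bar) ** 2 for xi in x)
    sse = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))
    return ssr / 1, sse / (n - 2)  # degrees of freedom: m = 1, n - m - 1 = n - 2


def average_mean_squares(beta1, sigma=2.0, n=30, reps=2000, seed=0):
    """Average MSR and MSE over many simulated samples with true slope beta1."""
    rng = random.Random(seed)
    x = [10.0 * i / (n - 1) for i in range(n)]  # fixed design on [0, 10]
    msr_total = mse_total = 0.0
    for _ in range(reps):
        y = [5.0 + beta1 * xi + rng.gauss(0.0, sigma) for xi in x]
        msr, mse = mean_squares(x, y)
        msr_total += msr
        mse_total += mse
    return msr_total / reps, mse_total / reps


# True error variance is sigma**2 = 4.0 in both scenarios.
msr_null, mse_null = average_mean_squares(beta1=0.0)  # null hypothesis true
msr_alt, mse_alt = average_mean_squares(beta1=1.0)    # null hypothesis false

print("H0 true : MSR ~", round(msr_null, 2), " MSE ~", round(mse_null, 2))
print("H0 false: MSR ~", round(msr_alt, 2), " MSE ~", round(mse_alt, 2))
```

In runs of this sketch, the average MSE should stay near the true variance of 4.0 in both scenarios, while the average MSR should be near 4.0 only when the true slope is zero and far larger otherwise, which is the pattern that makes a large $F$ evidence against the null hypothesis.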

Under the null hypothesis, the F-statistic observed, $F = F_{\text{obs}} = \text{MSR}/\text{MSE}$, follows an $F_{m,\,n-m-1}$ distribution. Since all F-values are nonnegative, the F-test is a right-tailed test. Thus, we will reject the null hypothesis when the p-value is small, where the p-value is the area in the tail to the right of the observed F-statistic. That is, p-value $= P(F_{m,\,n-m-1} > F_{\text{obs}})$, and we reject the null hypothesis when $P(F_{m,\,n-m-1} > F_{\text{obs}})$ is small.


F-Test for the Relationship Between Nutritional Rating and {Sugar and Fiber} Taken Together

• $H_0$: $\beta_1 = \beta_2 = 0$; model: $y = \beta_0 + \varepsilon$.

• $H_a$: At least one of $\beta_1$ and $\beta_2$ does not equal zero.

• The model implied by $H_a$ is not specified, and may be any one of the following:

$$y = \beta_0 + \beta_1(\text{sugars}) + \varepsilon$$

$$y = \beta_0 + \beta_2(\text{fiber}) + \varepsilon$$

$$y = \beta_0 + \beta_1(\text{sugars}) + \beta_2(\text{fiber}) + \varepsilon$$

• In Table 3.1, under “MS” in the “Regression” row of the “Analysis of Variance” table, is found the value of MSR, the mean square regression, MSR $= 6058.9$.

• Under “MS” in the “Residual Error” row of the “Analysis of Variance” table is found the value of MSE, the mean square error, MSE $= 38.9$.

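• Under “F” in the “Regression” row of the “Analysis of Variance” table is found the value of the test statistic,

$$F = \frac{\text{MSR}}{\text{MSE}} = \frac{6058.9}{38.9} = 155.73$$

Here 155.73 is the value reported in the table. Dividing the rounded mean squares shown gives approximately 155.76, so the reported value presumably reflects mean squares carried to more decimal places.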

• The degrees of freedom for the F-statistic are given in the column marked “DF,” so that we have $m = 2$ and $n - m - 1 = 74$.

• Under “P” in the “Regression” row of the “Analysis of Variance” table is found the p-value of the F-statistic. Here, the p-value is $P(F_{m,\,n-m-1} > F_{\text{obs}}) = P(F_{2,74} > 155.73) \approx 0.000$, although again no continuous p-value ever equals precisely zero.

This p-value of approximately zero is less than any reasonable threshold of significance. Our conclusion is therefore to reject the null hypothesis. The interpretation of this conclusion is the following. There is evidence for a linear relationship between nutritional rating on the one hand, and the set of predictors, sugar content and fiber content, on the other. More succinctly, we may simply say that the overall regression model is significant.
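The overall F-test for the cereals example can be reproduced in a few lines. The sketch below uses only standard-library Python and is an illustration rather than the procedure behind Table 3.1. It relies on the fact that, when the numerator has exactly 2 degrees of freedom, the right-tail area has the closed form $P(F_{2,d} > f) = (1 + 2f/d)^{-d/2}$. That shortcut does not apply for other values of $m$, where a statistics library would be the usual choice. The function names are hypothetical.

```python
def f_statistic(msr, mse):
    """F = MSR / MSE, the ratio of the two mean squares."""
    return msr / mse


def p_value_f_numerator_df_2(f_obs, df_error):
    """Right-tail area P(F_{2, df_error} > f_obs).

    Closed form valid only when the numerator degrees of freedom equal 2
    (here m = 2 predictors): (1 + 2 * f / d) ** (-d / 2).
    """
    return (1.0 + 2.0 * f_obs / df_error) ** (-df_error / 2.0)


# Mean squares and error degrees of freedom quoted from Table 3.1
MSR = 6058.9
MSE = 38.9
DF_ERROR = 74  # n - m - 1 with m = 2

f_obs = f_statistic(MSR, MSE)
p_value = p_value_f_numerator_df_2(f_obs, DF_ERROR)

print(round(f_obs, 2))  # 155.76 from the rounded mean squares (table: 155.73)
print(p_value)          # extremely small, but strictly greater than zero
```

Because the p-value is computed in closed form, it comes out as a tiny positive number rather than the rounded 0.000 shown in the table, which is consistent with rejecting the null hypothesis at any reasonable threshold.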

In document Data Mining Methods And Models Larose DT (2006) pdf (Page 118-122)