Measurement Errors

(1)

Measurement Errors

Introduction to Study Skills & Research Methods (HL10040)

Professor James Betts

(2)

Lecture Outline:

•Measurement Errors Continued

•Types of Errors

•Assessment of Error

•Introduction to Inferential Statistics

•Chi-Squared tests

•Assessment Details.

(3)

Measurement Errors

• Virtually all measurements have errors

– i.e.

Measured Score = ‘True’ Score  Error

Therefore inherently linked to SD

• Reliability and Measurement Error are not the same, rather Reliability infers an acceptable degree of Measurement Error.

(4)

Energy Intake (calories per day)

1500 2500 3500 4500 5500

Number of People

0 20 40 60 80 100 120 140

160 This variability

between methods is caused by both

systematic and error factors

Direct Record

Retrospective Recall

SD

(5)

Total Variance

(SD²)

This total variance can then be

‘partitioned’

Systematic Variance

Error Variance

(6)

Types of Errors

• Systematic Error

– Any variable causing a consistent shift in the mean in a given direction

e.g. Retrospective diet records tend to omit the snacks between meals

• Random Error

– The fluctuation of scores due to chance

e.g. Innaccurate descriptions of the food consumed

(7)

Systematic Error

Skin-Fold Callipers

Hydrostatic Weighing

% Body-fat

Subject 1 Subject 2 Subject 3 Subject 4

10 12 8 11

17 22 14 12

(8)

Random Error

Skin-Fold Callipers

Hydrostatic Weighing

% Body-fat

Subject 1 Subject 2 Subject 3 Subject 4

14 18 10 9

11 15 21 17

(9)

Assessment of Error

• Systematic Error

Descriptive Statistics

4 12.00 22.00 16.2500 4.34933

4 8.00 12.00 10.2500 1.70783

4 Hydrostat

Callipers

Valid N (listwise)

N Minimum Maximum Mean Std. Deviation

(10)

Assessment of Error

• Random Error

8. 009. 0010. 0011. 0012. 00Callipers

Correlations

1 .527

. .473

4 4

.527 1

.473 .

4 4

Pearson Correlation Sig. (2-tailed)

N

Pearson Correlation Sig. (2-tailed)

N Callipers

Hydrostat

Callipers Hydrostat r²= 0.278

r = 0 infers lots of error r = 1 infers no error

(11)

Assessment of Error

• Systematic &

Random Error

Callipers HydroStat. Difference Mean

10.00 17.00 7.00 13.50

12.00 22.00 10.00 17.00

8.00 14.00 6.00 11.00

11.00 12.00 1.00 11.50

14.00 11.00 -3.00 12.50

18.00 15.00 -3.00 16.50

10.00 21.00 11.00 15.50

9.00 17.00 8.00 13.00

(12)

Assessment of Error

• Systematic &

Random Error

12.00 14.00 16.00

Mean

0.00 5.00 10.00

differences



 



Mean = 4.63

The “Bland-Altman” Plot 3 points of visual assessment:

-Systematic Error: are points evenly distributed about the zero line?

-Random Error: do points deviate greatly from the mean line?

-Nature of error: is the error consistent left-right?

(13)

Examples of Bland-Altman Plots

12.00 13.00 14.00 15.00 16.00

Mean

0.00 5.00 10.00

Mean difference Zero

(14)

Examples of Bland-Altman Plots

12.00 13.00 14.00 15.00 16.00

Mean

0.00 5.00 10.00

(15)

Examples of Bland-Altman Plots

12.00 13.00 14.00 15.00 16.00

Mean

0.00 5.00 10.00

(16)

Examples of Bland-Altman Plots

12.00 13.00 14.00 15.00 16.00

Mean

0.00 5.00 10.00

Mean difference

Zero

(17)

Examples of Bland-Altman Plots

12.00 13.00 14.00 15.00 16.00

Mean

0.00 5.00 10.00

Zero

(18)

Why is Error Important

• Measurement Error is clearly of importance when

evaluating the agreement between two measurement tools

• A consideration of error is also relevant when attempting to establish intervention effects/treatment differences

i.e. where some of the variance between trials is due to the independent variable...

(19)

Systematic Variance Total Variance

between trial 1

& trial 2

Systematic Variance

Error Variance Dependent Variable

Extraneous/

Confounding (Error) Variables

Independent

Variable Primary

Variance

So researchers strive to increase the proportion of variance due to IV.

(20)

Smallest Worthwhile Effect

It would appear that even a small amount of primary

variance from an ergogenic aid would guarantee victory to either competitor…

…however, the error variance is such that a re-run could produce entirely different results…

(21)

Total Variance between trial 1

& trial 2

Systematic Variance

Error Variance Dependent Variable

Extraneous/

Confounding (Error) Variables

So researchers strive to increase the proportion of variance due to IV.

(22)

Scientific Reasoning (Logic)

General Theory

Specific Observation

Inductive Reasoning

Formation of a theory grounded in your own observations

Deductive Reasoning

Confirmation of a theory from your own observations

p-values give the probability of seeing this evidence assuming this general rule is true

(23)

Introduction to Inferential Statistics

• Before our next lecture you will be conducting some inferential statistics in your lab classes…

• All you need to be aware of at this stage is that the

‘p-value’ represents the probability of the observed variance occurring if the null hypothesis is true

i.e. p = 0.05 infers a 5 % probability of making your observation if in fact the IV has no effect

n.b. this DOES NOT mean that you will find this result in 95/100 test-retests or that your false positive rate is 5 %

(24)

(25)

Quantitative Analysis of Nominal Data

• Recall that nominal data infers that variables are dichotomous, i.e. belong to distinct categories

e.g. Athlete/Non-Athlete, Male/Female, etc.

• We know that such qualitative data can be coded quantitatively to allow a more objective analysis

• Nominal data does not require any consideration of normality and is analysed used a Chi² test.

(26)

The Chi-Squared Test

• Goodness of fit χ² test

– A comparison of your observed frequency counts against what would be expected according to the null hypothesis

i.e. null hypothesis infers equal dispersion (50:50)

• Contingency χ² test

– A comparison of two observed frequency counts

(27)

Goodness of fit χ

²

test

• Is a leisure centre used more by males than by females?

– n =150

Observed Frequency

Expected Frequency

Male 62 75

Female 88 75

(28)

Gender

62 75.0 -13.0

88 75.0 13.0

150 Male

Female Total

Observed N Expected N Residual

Goodness of fit χ

²

test

SPSS Output

Test Statistics

4.507 1 .034 Chi-Square^a

df

Asymp. Sig.

Gender

0 cells (.0%) have expected frequencies less than 5. The minimum expected cell frequency is 75.0.

p-value AKA a.

significance level

(29)

Contingency χ

²

test

• Are elite athletes more likely to take nutritional supplements than non-athletes

– n =60

Do take supplements

Do not take supplements

Athletes 18 12

Non-athletes 11 19

(30)

Chi-Square Tests

3.270^b 1 .071

2.403 1 .121

3.301 1 .069

.120 .060

3.216 1 .073

60 Pearson Chi-Square

Continuity Correction^a Likelihood Ratio Fisher's Exact Test Linear-by-Linear Association N of Valid Cases

Value df

Asymp. Sig.

(2-sided)

Exact Sig.

(2-sided)

Exact Sig.

(1-sided)

Computed only for a 2x2 table a.

0 cells (.0%) have expected count less than 5. The minimum expected count is 14.

50.

b.

Group * Response Crosstabulation Count

18 12 30

11 19 30

29 31 60

Athletes Non-Athletes Group

Total

Do take supplements

Dont take supplements Response

Total

Contingency χ

²

test

SPSS Output

(31)

Assumptions for Chi-Squared

• Although ND not required…

• Cells in the table should all be independent

i.e. one person could have visited the leisure centre twice

• 80 % of the cells must have expected frequencies greater than 5 and all must be above 1

i.e. the more categories available, the more subjects needed

• Cannot use percentages

i.e. a 15:45 split cannot be expressed as 25%:75%

(32)

Selected Reading

• I know error and variance can be confusing topics, try these:

• Atkinson, G. and A. M. Nevill. Statistical methods for assessing

measurement error (Reliability) in variables relevant to sports medicine.

Sports Medicine. 26:217-238, 1998.

• Hopkins, W. G. et al. Design and analysis of research on sport performance enhancement. Med. Sci. Sport and Exerc. 31:472-485, 1999.

• Hopkins, W. G. et al. Reliability of power in physical performance tests.

Sports Medicine. 31:211-234, 2001.

• Atkinson, G., ''What is this thing called measurement error?'' , in

Kinanthropometry VIII: Proceedings of the 8th International Conference of the International Society for the Advancement of Kinanthropometry (ISAK) , Reilly, T. and Marfell-Jones, M. (Eds.), Taylor and Francis, London , 2003.

(33)

Coursework (60% overall grade)

• Your coursework will require you to address

ONE

of the following research scenarios:

– 1) Effect of Plyometric Training on Vertical Jump – 2) Effect of Ice Baths on Recovery of Strength

– 3) Effect of Diet on the Incidence of Muscle Injury – 4) Effect of Footwear on Sprint Acceleration

– 5) Effect of PMR on Competitive Anxiety.

(34)

Coursework Outline

• For the selected scenario you will need to:

– Perform a literature search in order to provide a comprehensive introduction to the research area – Identify the variables of interest and evaluate the

research design which was adopted

– Formulate and state appropriate hypotheses

– Summarise descriptive statistics in an appropriate and well presented manner…

(35)

Coursework Outline

• Cont’d…

– Select the most appropriate statistical test with justification for your decision

– Transfer the output of your inferential statistics into your word document

– Interpret your results and discuss the validity and reliability of the study

– Draw a meaningful conclusion (state whether hypotheses are accepted or rejected).

(36)

Coursework Details (see unit outline)

 1000 words (2000 absolute maximum)

• Any supporting SPSS data/outputs to be appended

• To be submitted on Thursday 12^th December Assessment Weighting

Evaluation & Analysis (30 %) Reading & Research (20 %) Communication & Presentation (20 %)

Knowledge (30 %)

(37)

Coursework Details

• All information relating to your coursework

(including the relevant data files) are accessible via the unit web page:

www.bath.ac.uk/~jb335/Y1%20Research%20Skills

%20(FH10040).html

Web address also referenced on shared area

(38)

Mid-Term Test (40% overall grade)

• NEXT WEEK

• This test will involve short answer questions covering all the information covered so far

• Mostly knowledge recall but will require

understanding and possibly some calculations

• Duration = 50 min

So…

(39)

Mid-Term Test (40% overall grade)

• Surnames: A-K

– Arrive promptly at 11.10 am for start of test at 11.15 am – Exit in silence afterwards

• Surnames: L-Z

– Arrive promptly at 12.10 am for start of test at 12.15 am – Exit however you like!

(40)

[email protected]