• No results found

analyzing small data sets

N/A
N/A
Protected

Academic year: 2021

Share "analyzing small data sets"

Copied!
45
0
0

Loading.... (view fulltext now)

Full text

(1)

Review of Lognormal Statistics and

Review of Lognormal Statistics and

analyzing small data sets

(2)

Review of IH Statistics

I.

Lognormal distribution

g

II.

Sample 95

th

percentile

III.

UCL for the sample 95

th

percentile

(3)

I. Lognormal Distribution – Example

g

p

Airborne exposures to inorganic lead

3

(4)
(5)

Parameters vs. Statistics

Parameters

Statistics

-

calculated using all elements of the population

-

log transform each element

-

calculated from a sample of n elements randomly selected

-

log transform each element

-

log transform each element

-

log transform each element

Population Mean

μ

Sample Mean

y_

μ

y

Population

Standard Deviation Sample Standard Deviation

y

Standard Deviation

σ

Deviation

s

y

y

Th

t

t d t

t

l l

l ( )

5

(6)

Parameters vs. Statistics

Parameters

Statistics

-

calculated using all elements of the

population

-

randomly selectedcalculated from a sample of n elements

Population Geometric Mean

GM

Sample Geometric Mean

gm

Population Geometric Standard Deviation

GSD

Sample Geometric Standard Deviation

gsd

Standard Deviation

g

(7)

Lognormal distribution PDF

GM

(8)
(9)

Sample geometric mean (gm) &

p g

(g )

geometric standard deviation (gsd)

(10)

Example: Welding fume data

-estimate GM and GSD

Case

x

i

(mg/m

3

)

y

i

=ln(x

i

)

(y

i

-y)

2 1 0.84 -0.1744 0.055877 2 0 98 -0 0202 0 006762

_

2 0.98 -0.0202 0.006762 3 0.42 -0.8675 0.864025 4 1.16 0.1484 0.007463 5 1.36 0.3075 0.060248 6 2.66 0.9783 0.839600 Sum = 0.3722 1.833976 y = 0.0620 gm = 1 06

_

gm = 1.06 gsd = 1.83

(11)

Example: Welding fume data

-estimate GM and GSD

(12)

Example: Welding fume data

-estimate μ and σ

Case

x

i

(mg/m

3

)

( x

i

-x )

2 1 0.84 0.157344

_

2 0.98 0.065878 3 0.42 0.666944 1 16 4 1.16 0.005878 5 1.36 0.015211 6 2 66 2 025878 6 2.66 2.025878 Sum = 7.42 2.937133 1 24

_

x

= 1.24 sd = 0.77

(13)

Example: Welding fume data

-estimate μ and σ

(14)
(15)

1.2 3.1

GSD = X84/X50 = 3 1/1 2 = 2 6

GSD X84/X50 3.1/1.2 2.6

(16)

II. Sample 95

th

Percentile Exposure

The focus is on the upper tail of the exposure profile.

The sample 95

th

percentile can be considered a “

decision

statistic

”.

The (usual) goal is to determine which category the 95

th

P

til

t lik l f ll

Percentile most likely falls.

It is used to assist in reaching a decision that the exposure

profile is

„

“Controlled” or “Acceptable”

„

Controlled or Acceptable

„

“Unacceptable”

(17)

95

th

Percentile interpretation of TWA

OELs

ACGIH

„ Roach, S.A., Baier, E.J., Ayer, H.E., and Harris, R.L.: Testing compliance

with Threshold Limit Values for respirable dusts. American Industrial Hygiene Association Journal 28:543-553 (1967).

„ Stokinger, H.E.: Industrial air standards - theory and practice. Journal of „ Stokinger, H.E.: Industrial air standards theory and practice. Journal of

Occupational Medicine 15:429-431 (1973).

„ Still, K.R. and Wells, B.: Quantitative Industrial Hygiene Programs:

Workplace Monitoring. (Industrial Hygiene Program Management series, part VIII) Applied Industrial Hygiene 4:F14-F17 (1989)

part VIII). Applied Industrial Hygiene 4:F14-F17 (1989).

(18)

95

th

Percentile interpretation of TWA

OELs

AIHA 1991 and 1998 guidance

„ Employer should maintain true group or individual upper percentile

exposure < TWA OEL

„ “Similar Exposure Group” 95th percentile exposure < TWA OEL

„ Corn, M. and Esmen, N.A.: Workplace exposure zones for classification of

employee exposures to physical and chemical agents. American Industrial Hygiene Association Journal 40:47-57 (1979).

(19)

95

th

Percentile interpretation of TWA

OELs

NIOSH guidance

„ Employer should 95% confident that 95% of the exposures are < the TWA

PEL

„ Leidel, N.A., Busch, K.A., Lynch, J.R.: Occupational Exposure Sampling

Strategy Manual. National Institute for Occupational Safety and Health

Strategy Manual. National Institute for Occupational Safety and Health (NIOSH) Publication No. 77-173 (available as a pdf file from NIOSH website) (1977).

OSHA

M d TWA h ld “ l ” d h TWA PEL ( bl

„ Measured TWA exposures should “rarely” exceed the TWA PEL (preamble to

the benzene PEL, 1987)

(20)

95

th

Percentile interpretation of TWA

OELs

EU

„ CEN (Comité Européen de Normalisation): Workplace atmospheres

-Guidance for the assessment of exposure by inhalation of chemical agents for comparison with limit values and measurement strategy. European Standard EN 689, effective no later than Aug 1995 (English version) (Feb , g ( g ) ( 1995).

(21)

Example

A sample of six full-shift TWA welding fume

p

g

measurements resulted in the following statistics:

„

(sample) geometric mean is 1.06 mg/m

3

(sample) geometric standard deviation is 1 83

„

(sample) geometric standard deviation is 1.83

What is the point estimate (i.e., best estimate) of the

true 95

th

percentile?

(22)
(23)

95

th

Percentile

(24)

Alternative upper percentile

formula

(25)

Focus on Upper Tail

(26)

III. Upper Confidence Limit (UCL) for the

Sample 95

th

Percentile

Calculate confidence intervals around estimates of …

„

upper percentile (normal & lognormal)

Confidence intervals are used to …

„

express uncertainty

p

y

„

test hypotheses:

Š

to determine our confidence level that the SEG is in compliance

with an OEL

Š

to determine our confidence level that the true 95

th

percentile

(27)

For single shift, TWA exposure limits (TWA OELs) …

g

,

p

(

)

„

focus on the upper tail of the distribution

„

e.g., 95

th

percentile exposure

(28)

Upper Percentile (e.g., 95

th

percentile)

Concept

„

Calculate the 95% upper confidence interval for the 95th

percentile statistic (upper tolerance limit)

Application

95%UCL can be used to test the following hypotheses:

„

95%UCL can be used to test the following hypotheses:

Š

H

o

: 95th percentile > OEL

Š

H

a

: 95th percentile < OEL

I t

t ti

Interpretation

„

If the 95%UCL is less than the OEL, then we can say that we are

at least 95% confident that the true 95th percentile is less than the

OEL

(29)

95%UCL for the 95

th

Percentile

Procedure:

„

Calculate the gm and gsd

„

Using n, read the UCL K-value from the appropriate table

Š

γ = confidence level, e.g., 0.95

γ

Š

p = proportion, e.g., 0.95

Š

n = sample size

„

Using gm, gsd, and k, calculate the 95%UCL

g g , g ,

,

Š

y = ln( gm )

Š

s

y

= ln( gsd )

_

(30)
(31)
(32)
(33)

IV. Rule-of-thumb for “Eyeballing”

Exposure Data

Given:

„

G = median

„

X

p

= G x D

Zp

(e.g., X

0.95

=G x D

1.645

)

R l

f th

b

id li

b d i d f

… a Rule-of-thumb, or guideline, can be devised for

quickly estimating from limited data the

range

in

which the true 95

th

percentile might lie.

(34)

Multiple of GM (median)

GSD

X

p

= 95

th

percentile

Z

p

= 1.645

1.5

1.95

2.0

3.13

2.5

4.51

3.0

6.09

(35)

R.O.T. for Estimating the 95

th

Percentile

1.

If n is small (i.e., <6) and one or more measurements > OEL, then

d i i

C t

4

decision = Category 4

.

2.

Estimate the median and use it as a surrogate of the sample GM:

-

Sort the data

If n is odd the median is the middle value

-

If n is odd the median is the middle value.

-

If n is even the median is the average of two middle values.

3.

Multiply the median by 2, 4, and 6

-

The results comprise an

The results comprise an

approximate

approximate

low, middle, and high

low, middle, and high

estimate of X

0.95

.

(36)

Rule-of-thumb Workshop

(assume OEL=100)

a.

X = {5}

bb.

X = {68}

c.

X = {7, 34, 57}

d.

X = {1, 1, 2, 5}

e

X

{4 5 8 23}

e.

X = {4, 5, 8, 23}

f.

X = {0.3, 1, 2, 3, 4, 22}

g.

X = {10, 10, 10, 20, 50, 105}

h

X = {7 10 16 21 45 53}

h.

X = {7, 10, 16, 21, 45, 53}

For each dataset, determine the appropriate Exposure Category – 1, 2, 3,

or 4 – using the above Rule-of-thumb.

(37)

Available Data Analysis Tools

IHStats.xls

„

Comes with the AIHA 3

rd

Edition “Exposure Assessment and

Management …”

„

handles n<50

„

handles n<50

EASC-IHStats.xls

„

www.aiha.org/1documents/committees/EASC-IHSTAT.xls

„

An update of the IHStats.xls spreadsheet

„

handles n<200

„

multiple languages

multiple languages

(38)
(39)
(40)
(41)
(42)
(43)
(44)
(45)

References

Related documents

[r]

This study reveals some significant points: (1) The pattern of interaction between the family of former terrorists convict and surrounding communities can be

All the figures and tables should be labeled (Times New Roman 11) and included in list of figures and list of tables respectively.

Quality: We measure quality (Q in our formal model) by observing the average number of citations received by a scientist for all the papers he or she published in a given

hypergeometric functions by employing the generalized Gauss’s second summation the- orem, Bailey’s summation theorem and Kummer’s summation theorem obtained earlier by Lavoie,

Generally the longer the roof slope and the lower the building height the greater the cost reduction from minimising roof slope.. These savings are in the wall structure and

The story of the Estonian journal- ists’ organization throughout various periods of the coun- try’s history clearly brings forth the political and historical contingency of

In this study, it is aimed to develop the Science Education Peer Comparison Scale (SEPCS) in order to measure the comparison of Science Education students'