The normal distribution

(1)

13

The normal distribution

(2)

MathsWorld Mathematical Methods Units 3 & 4

The normal distribution

Continuous random variables, such as height, weight and time, involve characteristics that can be measured and can take any value in a given range. As there are infinitely many values that a characteristic such as height can take, we group the values into class intervals to enable us to construct frequency tables and histograms. For example, the frequency table and histogram for the heights of 100 adult males are shown below.

(Note that [165, 170) means 165 ≤ height < 170.)

Clearly we are unable to determine any information about the heights of individuals from this data. Nor can we determine how many people are between 183 cm and 188 cm, as these values are not at the ends of the given class intervals. We can, however, deduce that there are 17 people who are at least 185 cm tall by adding the 15 people in the [185, 190) interval to the two in the [190, 195) interval.

If the number of males whose heights were measured was increased to a very large number and the class intervals were made very small, the histogram would be transformed into a smooth frequency curve as shown below. We use μ to represent the mean. The smooth frequency curve can be scaled so that it has the properties of a probability density function or pdf for some random variable X. In cases such as this, the pdf of the variable can be modelled by the normal distribution.

The graph of the normal pdf is the familiar bell-shaped curve and is referred to as the normal curve. It is symmetrical about its mean.

If X is a normally distributed random variable with mean μ and variance σ2, we write X~ N(μ, σ2). A graph representing Pr(a≤X≤b) as the area under the curve

Height (cm) [165, 170) [170, 175) [175, 180) [180, 185) [185, 190) [190, 195)

Frequency (f) 4 18 32 29 15 2

Height (cm)

165 170 175 180 185 190 195 Frequency (f)

20 25

15

10

5

0 30 35

165 180 195

f

0

a probability density function

h

μ y

0

Area = Pr(a≤X≤b)

(3)

The normal distribution

chapter

13

Properties of the normal distribution

The normal distribution pdf satisfies the usual properties of any probability density function:

f(x) ≥ 0 and f(x)dx= 1. In addition, it has the following properties.

Confidence intervals of the normal distribution

We are often concerned with questions that require us to find, for example, the proportion of the population in a particular interval. Due to the symmetry of the normal distribution about the mean, and the characteristics of the normal curve, we are able to establish the following confidence intervals relating to the normal distribution:

.Approximately 68% of the population will fall within one standard deviation of the mean. So Pr(μ− σ≤X≤μ+σ) 0.68. This is sometimes referred to as the one sigma limits, as the values in the interval are within one standard deviation (i.e. σ) of the mean.

Warning

Don’t forget the square!

A common mistake is, given the mean μ and standard deviation σ of a distribution, to incorrectly write X ~ N(μ, σ), forgetting to square the standard deviation to get the variance. The standard deviation is often needed to answer questions and in these cases we must remember to take the square root of the variance.

⌠

⌡_−∞

∞

Properties of the normal distribution

._{The equation of the probability}

density function or pdf for the normal curve is given by

.

._{The normal curve is bell-shaped}

and is symmetrical.

._x=μ _{is not only the mean of the}

distribution, but also the median and the mode.

._{The maximum value of} _{, occurs when}_x=μ_.

._{The curve continues infinitely in both directions with the}_x_{-axis as an asymptote of}

the curve.

μ y

y = f(x)

x 0 σ π 1 2 μ ( , ₎ σ π 1 2

f x( ) 1

σ 2π

---e

1 2

--- x–μ σ ---⎝ ⎠ ⎛ ⎞2

–

x∈R

,

=

f x( ) 1

σ 2π

---,

Number

sense with the spence

97

From tip to tail, the longest recorded domestic cat was 97 centimetres long.

So, if a room is 1 metre 94 centimetres or wider, strictly speaking there is enough room to swing a cat.

~

(4)

.Approximately 95% of the population will fall within two standard deviations of the mean. So Pr(μ− 2σ≤ X ≤μ+ 2σ) 0.95. This is sometimes referred to as two sigma limits. It can be stated that a randomly chosen member of the population will most probably be, or is highly likely to be, within two standard deviations of the mean.

.Approximately 99.7% of the population will fall within three standard deviations of the mean. So Pr(μ− 3σ≤ X ≤ μ+ 3σ) 0.997. This is sometimes referred to as three sigma limits. It can be stated that a randomly chosen member of the population will almost certainly be within three standard deviations of the mean.

E x a m p l e

1

If X is a normally distributed variable with μ= 80 and σ= 10, sketch the normal curve for this variable and shade three standard deviations above and below μ.

S o l u t i o n

It is useful to first get an idea about the spread. We know that Pr(μ− 3σ≤X≤μ+ 3σ) 0.997 and so we deduce that 99.7% of values will be between 80 − 3 × 10 = 50 and 80 + 3 × 10 = 110.

~

μ

μ σ– μ σ+ x

‘about two-thirds’

μ − σ ≤X≤μ + σ) ≈ 0.68 Pr(

y

0

μ

μ– 2σ μ+ 2σ xx

‘most probably’

μ − 2σ ≤X≤μ + 2σ) ≈ 0.95 Pr(

y

0

μ

μ– 3σ μ+3σ x

‘almost certainly’

μ − 3σ ≤X≤μ + 3σ) ≈ 0.997 Pr(

y

0

50 60 70 80 90 100 110 x

μ– 3σ μ– 2σ μ σ– μ μ σ+ μ+2σ μ+3σ y

0

(5)

E x a m p l e

2

The heights of adult males in a certain city are normally distributed with mean 178 cm and standard deviation 5 cm. Find the approximate percentage of the adult males who are:

a between 168 cm and 188 cm. b taller than 193 cm.

c taller than 173 cm. d shorter than 183 cm.

S o l u t i o n

a 168 cm is two standard deviations below the mean and 188 cm is two standard deviations above the mean. So approximately 95% of the adult males are between 168 cm and 188 cm.

b 193 cm is three standard deviations above the mean. We know that approximately 99.7% of values are in the interval (μ− 3σ, μ+ 3σ). It follows that 0.3% of values are not in that interval. As the curve is symmetrical, 0.15% of values must lie in each tail and so approximately 0.15% of the adult males are taller than 193 cm.

c 173 cm is one standard deviation below the mean. We know that approximately 68% of values are in the interval (μ−σ, μ+σ). It follows that 32% of values are not in that interval. As the curve is symmetrical, 16% of values must lie in each tail and so approximately 68% + 16% = 84% of the adult males are taller than 173 cm.

d 183 cm is one standard deviation above the mean. We know that approximately 68% of values are in the interval (μ−σ, μ+σ). It follows that 32% of values are not in that interval. As the curve is symmetrical, 16% of values must lie in each tail and so approximately 68% + 16% = 84% of the adult males are shorter than 183 cm. Note that using symmetry, this

area is the same as the area considered in part c.

t i p

A rough sketch of a normal curve can be extremely useful, especially in examples that are not straightforward and in which the symmetry of the normal distribution is to be used to simplify, clarify, or visualise the characteristics of the question.

μ– 3σ μ μ+3σ

178 99.7%

167 193 x

y

0

μ

μ σ– μ σ+

178 183

68%

16% 16%

173 x

y

0

μ

μ σ– μ σ+

178 183

68%

16% 16%

173 x

y

0

(6)

E x a m p l e

3

A continuous random variable X is normally distributed with μ= 54 and σ= 3. Find an interval which will contain approximately:

a 95% of all possible values. b 99.7% of all possible values.

c the highest 16% of all possible values.

S o l u t i o n

a We know that approximately 95% of values are in the interval μ− 2σ≤X≤ μ+ 2σ, so 54 − 2 × 3 ≤X ≤ 54 + 2 × 3, which gives 48 ≤X≤ 60.

b We know that approximately 99.7% of values are in the interval μ− 3σ≤X≤μ+ 3σ, so 54 − 3 × 3 ≤X ≤ 54 + 3 × 3, which gives 45 ≤X≤ 63.

c We know that approximately 68% of values are in the interval μ−σ≤X≤μ+σ.

It follows that 32% of values are not in that interval. As the curve is symmetrical, 16% of values must lie in each tail, and so approximately 16% of values must be at least μ+σ. So, X≥ 54 + 3, which gives X≥ 57.

E x a m p l e

4

Find the value of μ and the (approximate) value of σ if the graph of the pdf of a normal random variable X has a symmetrical region that contains:

a 68% of its values between 12 and 22. b 95% of its values between 48 and 72.

c 99.7% of its values between 23 and 35.

S o l u t i o n

a We know that the region is symmetrical and so the mean must be at its midpoint. So . We also know that approximately 68% of values are in the interval

μ−σ≤X≤μ+σ, so 12 and 22 are each one standard deviation from the mean. Therefore

σ= 5. We could also say that there are two standard deviations between 12 and 22 and so .

b We have . We know that approximately 95% of values are in the interval μ− 2σ≤X≤μ+ 2σ, so 48 and 72 are each two standard deviations from the mean. Therefore . We could also say that there are four standard deviations between 48 and 72 and so .

c We have . We know that approximately 99.7% of values are in the interval μ− 3σ≤X≤μ+ 3σ so 23 and 35 are each three standard deviations from the mean. Therefore . We could also say that there are six standard deviations between 23 and 35 and so .

μ 12+22

2

--- 17

= =

σ 22 12–

2

--- 5

= =

μ 48+72

2

--- 60

= =

σ 72 60–

2

--- 6

= =

σ 72 48–

4

--- 6

= =

μ 23+35

2

--- 29

= =

σ 35 29–

3

--- 2

= =

σ 35 23–

6

--- 2

(7)

exercise

13.1

1 Sketch a normal curve for which:

a μ= 12 and σ= 2. b μ= 0 and σ= 1.

2 For each of the graphs at the right:

i find the mean.

ii estimate the standard deviation.

3 a Sketch, using the same scale in each case, a normal curve for which:

i μ= 16 and σ = 3. ii μ= 16 and σ = 1.

b What effect does decreasing σ have?

4 a Sketch, using the same scale in each case, a normal curve for which:

i μ= 20 and σ= 2. ii μ= 24 and σ = 2.

b What effect does increasing μ have?

5 Sketch a graph for each of the following distributions.

a X~ N(14, 4) b X~ N(80, 100)

6 For each of the three graphs:

i find the mean. ii estimate the standard deviation.

y

0 2

–2 –4

–6 4 6 x

19.5 14.5 17 12

9.5 0

22 24.5 x

y

a

b

a

b

c y

2 4 6 8 10 12 14 16 18 20 22 24 26 28

0

(8)

7 Which of the graphs have the same:

a means?

b standard deviations?

8 A random variable X is normally distributed and has a mean of 16 and a standard deviation of 3.

a Find the percentage of values that are:

i between 10 and 22. ii greater than 19.

b Find the probability that X is:

i between 13 and 19. ii less than 10.

9 A normally distributed variable X has μ= 28 and σ= 4.

a Find the percentage of values that are:

b Find the probability that X is

10 A normally distributed variable X has μ= 63 and σ= 5.

a Find an interval within which:

i 68% of values lie. ii 95% of values lie. iii 99.7% of values lie.

b Find the number x such that:

i 16% of values are greater than x. ii 2.5% of values are less than x.

iii 84% of values are greater than x.

11 If X~ N(86, 81), find:

a an interval within which the following percentages lie.

i 68% ii 95% iii 99.7%

b the number x such that

i 0.15% of values are less than x. ii 16% of values are greater than x.

iii 97.5% of values are less than x.

12 Find the values of μ and σ, if a normal distribution has a symmetrical region with:

a 68% of values between 28 and 62. b 95% of values between 11 and 47.

c 99.7% of values between −3 and 25.

13 Find the values of μ and σ, if a normal distribution has a symmetrical region with:

a 68% of values between −7 and 5. b 95% of values between 57 and 86.

c 99.7% of values between 86 and 99.

A

B E

C

0

D

(9)

14 The number of apples harvested from each tree in a large orchard is normally distributed with a mean of 220 and a standard deviation of 15.

a Find the percentage of trees for which the number of apples harvested is expected to be:

b Find the probability that the number of apples harvested from a randomly selected tree is:

15 The marks on a Maths Methods test are normally distributed and have a mean of 67 and a standard deviation of 8.5. To pass the test a student must achieve a score of 50 or more.

a What percentage of students who sat the test are expected to achieve a score:

i between 41.5 and 92.5? ii less than 58.5?

b If a student was randomly selected from those who sat the test, what is the probability that the student achieved a score:

i between 58.5 and 75.5? ii greater than 41.5?

c What percentage of the students who sat the test are expected to pass?

16 A manufacturer makes washers to be used in the production of a large machine. The diameters of the washers are normally distributed and have a mean of 12 mm and a standard deviation of 0.4 mm.

a If a washer is randomly selected, find the probability that the washer has a diameter that is:

i between 11.2 mm and 12.8 mm. ii greater than 11.2 mm.

b What percentage of washers are expected to have a diameter

i between 11.6 mm and 12.4 mm? ii less than 13.2 mm?

c The washers must have a diameter of at least 11.6 mm to be suitable for use in the machine. What percentage of the washers are expected to be suitable?

E x a m p l e

5

The heights of adult males in a certain city are normally distributed with mean 178 cm and standard deviation 5 cm. Find the approximate percentage of adult males who are

a between 168 cm and 183 cm. b between 183 cm and 188 cm.

S o l u t i o n

a 168 cm is two standard deviations below the mean, 183 cm is one standard deviation above the mean. We know that approximately 95% of values are in the interval μ− 2σ ≤X≤μ+ 2σ and that approximately 68% of values are in the interval μ−σ≤X≤μ+σ. It follows that 27% (95% − 68%) of

values are between these intervals. As the curve is symmetrical, 13.5% of values must lie in each of the left and right regions between the two intervals and so approximately 68% + 13.5% = 81.5% of these adult males are between 168 cm and 183 cm.

μ– 2σ 178 183 μ188+ 2σ 68%

13.5% 13.5%

173

168 x

0 y

(10)

b From the answer to part a, the graph on the right has already been deduced. So approximately 13.5% of these adult males are between 183 cm and 188 cm.

exercise

13.1

17 There is a flea infestation at K9 Kennels. The number of fleas per dog can be modelled by a normal distribution with mean 240 and standard deviation 20. Find the approximate percentage of dogs that have:

a between 200 and 300 fleas.

b between 180 and 220 fleas.

18 The mass of fully grown fish found in Lake Pescatore is normally distributed, with mean 500 g and standard deviation 25 g.

a Find the approximate percentage of these fish that have a mass between 425 g and 525 g.

b What is the probability that a fully grown fish caught from the lake has a mass between 475 g and 575 g?

c Comment on your answers to parts a and b.

19 The wingspan of the blue emperor butterfly is normally distributed with mean 77 mm and variance 9 mm.

a Find the approximate percentage of butterflies that have a wingspan between 77 mm and 83 mm.

b A butterfly is selected at random. What is the probability that it has a wingspan between 83 mm and 86 mm?

20 A professional tennis player took part in a study to measure the speed of his serve. He made numerous serves and the study found that the speeds of the serves could be modelled by a normally distributed random variable with mean 190 km/h and variance 81 km/h. Find the approximate percentage of serves that have a speed between:

a 181 km/h and 208 km/h.

b 172 km/h and 217 km/h.

μ– 2σ 178 183 μ188+2σ 68%

13.5% 13.5%

173

168 x

0 y

(11)

The standard normal

distribution

One of the difficulties associated with the normal distribution, particularly before the advent of calculators, was the fact that a normal distribution and therefore its related probabilities are dependent on both μ and σ. A consequence of this is that if we were to attempt to construct a normal table to calculate probabilities of various intervals for every possible normal distribution, we would need infinitely many tables. Clearly this is absurd and makes the extensive use of tables impractical. This problem is overcome by the use of a transformed or ‘standardised’ form of normal distribution called the standard normal distribution.

The variable in a standard normal distribution is denoted by Z, to distinguish it from X as used in other normal distributions. The standard normal distribution has μ= 0 and σ= 1, and the z value indicates how many standard deviations the corresponding x value is from the mean. To find the value of z we first find the difference between the x value and the mean by finding x − μ. To find how many standard deviations this equals, we divide by σ. This gives

us , which is an important result that we will use many times.

For the standard normal distribution, we write Z ~ N(0, 1). After we have standardised a normal distribution in this way, we require only one table and have only one curve, the curve opposite, to deal with.

All of the properties that applied to the normal distribution also apply to the standard normal distribution. Some statements, while equivalent, look somewhat different. For example,

we know that Pr(μ− σ ≤ X ≤ μ+ σ) 0.68. To convert this to an equivalent statement for a standard normal distribution, we substitute μ = 0 and σ= 1 to get Pr(−1 ≤ Z ≤ 1) 0.68. Similarly Pr(μ− 2σ ≤ X ≤μ+ 2σ) 0.95 can be converted to Pr(−2 ≤ Z ≤ 2) 0.95 and Pr(μ− 3σ ≤ X ≤ μ+ 3σ) 0.997 can be converted to Pr(−3 ≤ Z ≤ 3) 0.997.

The connection between X ~ N(μ, σ2) and Z ~ N(0, 1) can be symbolised by . This is known as standardising.

z x–μ σ

---=

The standard normal distribution

If we substitute μ= 0, σ= 1 and into the equation for a normal curve, that is

, we have the equation for a standard normal curve:

. Note that g(z) is a transformed version of f(x).

z x–μ

σ

---=

f x( ) 1

σ 2π

---e

1 2

--- x–μ σ ---⎝ ⎠ ⎛ ⎞2

–

x∈R

,

=

g z( ) 1

2π

---e

1 2

---z2

–

z∈R

,

=

y = g(z) y

0 ₁

–1 –2

–3 2 3 z

μ– 3σ μ– 2σ μ σ– μ μ σ+ μ+2σ μ+3σ

~

~ ~

Z X–μ σ

---=

13.2

(12)

There are other features of the standard normal distribution that we will use extensively.

.The probability that Z is less than some value z plus the probability that Z is greater than the same value z is equal to 1, i.e. Pr(Z < z) + Pr(Z > z) = 1.

This equation can be transformed to give Pr(Z > z) = 1 − Pr(Z < z). This is an important result, especially if a standard normal table is to be used, as the table is constructed to give Pr(Z < z). That is, it gives the probability that Z < z.

.The symmetry of the distribution shows that Pr(Z <−z) = Pr(Z > z).

By considering the unshaded areas in the above graphs, we can also see that Pr(Z >−z) = Pr(Z < z).

E x a m p l e

1

If Z ~ N(0, 1) and Pr(Z <a) = 0.6, find:

a Pr(Z> a) b Pr(Z<−a) c Pr(Z>−a)

S o l u t i o n

a We know that Pr(Z<a) + Pr(Z>a) = 1 so Pr(Z >a) = 1 − Pr(Z<a) = 0.4.

b We know that Pr(Z< −a) = Pr(Z >a) so Pr(Z < −a) = 0.4.

c We know that Pr(Z> −a) = Pr(Z <a) so Pr(Z > −a) = 0.6.

y

z

0 ₁

–1 –2

–3 2 3 z

y

z

0 ₁

–1 –2

–3 2 3 z

y

–z–1 0 1

–2

–3 2 3 z

y

z

0 1

–1 –2

–3 2 3 z

Symmetry properties of the standard normal distribution

._Pr(_Z>_z₎=₁−_Pr(_Z<_z₎ ._Pr(_Z< −_z₎=_Pr(_Z>_z₎ ._Pr(_Z> −_z₎=_Pr(_Z<_z₎

t i p

(13)

E x a m p l e

2

Convert the variable in each of the following expressions to a standard normal variable Z

then use it to write an equivalent expression involving Z. Do not attempt to evaluate the expression.

a Pr(X < 45); μ = 37, σ= 4 b Pr(X ≥ 12); μ = 15, σ= 2 c Pr(23 <X≤ 29); μ = 27, σ= 5

S o l u t i o n

a We have , so . The equivalent expression is Pr(Z< 2).

b . The equivalent expression is Pr(Z ≥−1.5).

c If x= 23, . If x= 29, . The equivalent expression is Pr(−0.8 <Z≤ 0.4).

exercise

13.2

1 For a standard normal variable Z, find:

a the percentage of values for which Z is

i between −3 and 3. ii less than −1.

b the probability that Z is

i between −2 and 2. ii greater than −2. iii less than zero.

2 For a standard normal variable, find:

a an interval within which the following percentages of values lie.

i 68% ii 95% iii 99.7%

b the number such z that

i 0.15% of values are greater than z.

ii 84% of values are less than z.

iii 97.5% of values are greater than z.

3 If Z~ N(0, 1) and Pr(Z <a) = 0.2, find:

a Pr(Z>a). b Pr(Z< −a). c Pr(Z>−a).

4 If Z~ N(0, 1) and Pr(Z >a) = 0.75, find:

a Pr(Z>−a). b Pr(Z< −a). c Pr(Z<a).

5 Convert the variable in each of the following expressions to a standard normal variable Z

and use it to write an equivalent expression. Do not attempt to evaluate the expression.

a Pr(X< 63); μ= 66, σ= 2

b Pr(X≥ 117); μ= 95, σ= 11

c Pr(−15 < X≤ 0); μ=−6, σ= 3

z x–μ

σ

---= z 45 37–

4

--- 2

= =

z x–μ

σ

--- 12 15–

2

--- –1.5

= = =

z 23 27–

5

--- –0.8

= = z 29 27–

5

--- 0.4

= =

(14)

6 Write an equivalent expression to the one given by converting the variables in each of the following questions to a standard normal variable Z. Do not attempt to evaluate the expression.

a Pr(X> 47); μ= 53, σ= 12

b Pr(X≤ 4.26); μ= 5, σ= 0.37

c Pr(−32 < X≤ 28); μ=−4, σ= 20

7 Give an approximate value for Pr(X< 72), if μ= 69, σ= 3, by first converting the variable to a standard normal variable Z.

8 Give an approximate value for Pr(X≥ 45), if μ= 37, σ= 4, by first converting the variable to a standard normal variable Z.

9 If X~ N(114, 121), find an approximate value for Pr(X≤ 81) by first converting the variable to a standard normal variable Z.

10 If X~ N(4.50, 0.0036), find an approximate value for Pr(4.32 ≤X< 4.68) by first converting the variable to a standard normal variable Z.

13.1_{Have a r}

est

CD

SAC analysis task

A study of a large number of adults finds that their resting heart rate, X beats per minute, is normally distributed such that X ~ N(68, 25).

a Find the probability that a randomly selected person from this group will have a heart rate i between 58 and 78. ii less than 63.

b Find an interval within which the following percentages of values for the heart rate lie. i 68%

ii 95% iii99.7% c Find k such that:

i 16% of heart rates are greater than k. ii 2.5% of heart rates are lower than k. iii 84% of heart rates are greater than k. d Given that when the variable is converted to a

standard normal variable Z, Pr(Z > a) = 0.3, find: i Pr(Z >−a).

ii Pr(Z <−a). iiiPr(Z < a).

e Find the probability that a randomly selected person from this group will have a heart rate greater than 78, given that it is greater than 73.

analysis task 1—

(15)

Calculation of probabilities

A wide range of random variables follow a normal distribution. Some of these are found in nature and include the weights, lengths, areas, volumes, and times taken, for a huge variety of natural phenomena, and will be used in the examples in this chapter. A random variable can also be designed to have a normal distribution. An example of this that is of great interest to students studying for their VCE is the way in which study scores are calculated.

In all VCE subjects, the study scores are normally distributed, with a mean of 30 and a standard deviation of 7. From what we have learned about confidence intervals, we can determine that approximately 68% of study scores are between 23 and 37, 95% of study scores are between 16 and 44, approximately 2.5% of study scores are above 44, and so on.

In this section we will calculate probabilities associated with normal distributions, first, by using some of the properties of normal distributions and then by using the calculator as an aid.

Using properties of normal distributions

We have previously considered probabilities that can be calculated using some of the symmetry properties of normal distributions, as well as using confidence intervals. Now we examine other probabilities that are associated with more general regions.

Often we wish to find the probability that an object, sampled randomly from a variable that is normally distributed, lies between two specified values. That is, we are required to find the probability that a < X < b. This probability is represented by the shaded area in the graph shown below left. Its area is equal to the shaded area in the middle graph minus the shaded area in the right graph. That is Pr(a < X < b) = Pr(X < b) − Pr(X < a).

E x a m p l e

1

If X is normally distributed with Pr(X<a) = 0.2 and Pr(X<b) = 0.7, find:

a Pr(a<X<b). b Pr(X <a|X<b).

S o l u t i o n

a By constructing graphs similar to those in the example above, or by simply using the result above, Pr(a<X<b) = Pr(X<b) − Pr(X <a) = 0.7 − 0.2 = 0.5.

Pr(a<X<b) = Pr(X<b) − Pr(X<a)

Calculating and visualising normal probabilities

μ

0 b x

y

Pr(X < b)

x μ

y

0 a

Pr(X < a)

μ

0 a b x

y

Pr(a < X < b)

=

−

13.3

(16)

b Using the conditional probability formula we have: Pr(A|B) =

Pr(X<a|X<b) =

To find where X<a and X<b on a graph, shade both regions. The area where both are true will be shaded twice. Thus:

so:

E x a m p l e

2

If X is normally distributed with Pr(X>x1) = 0.17 and Pr(X>x2) = 0.85, find:

a Pr(x2<X<x1).

b an approximate value for c if Pr(x2<X<x1) = Pr(−c<Z<c) where Z ~ N(0, 1). S o l u t i o n

a If Pr(X >x1) = 0.17 then Pr(X <x1) = 0.83. If Pr(X>x2) = 0.85 then Pr(X<x2) = 0.15. Using

the equation, we have Pr(x2<X<x1) = Pr(X <x1) − Pr(X<x2) = 0.83 − 0.15 = 0.68.

b Pr(x2<X<x1) = 0.68 and we know that Pr(−1 < Z< 1) 0.68 so c= 1.

E x a m p l e

3

If X is normally distributed with Pr(X>d) = 0.16 and Pr(c <X<d)= 0.59, draw diagrams to represent these probabilities and find:

a Pr(X<c).

b an approximate value for z if Pr(X >d) = Pr(Z>z) where Z ~ N(0, 1).

S o l u t i o n

a If Pr(X >d) = 0.16 then Pr(X<d) = 0.84. We know that

Pr(c<X<d) = Pr(X<d) − Pr(X<c) so Pr(X<c) = Pr(X<d) − Pr(c<X<d) which gives Pr(X<c) = 0.84 − 0.59 = 0.25.

b We know that Pr(−1 <Z < 1) 0.68 so Pr(Z> 1) 0.16. As Pr(X>d) = 0.16, z 1.

Pr(A∩B)

Pr( )B

---Pr[(X<a)∩(X<b)]

Pr(X<b)

---μ y

0 a b x

Pr[(X<a)∩(X<b)] = Pr(X<a)

Pr(X<a | X<b) _Pr---Pr(₍X_X_<<a_b)₎ 2

7

---= =

~

y

c

0 d

Pr(c < X < d) = 0.59

Pr(X > d) = 0.16

~

(17)

Using technology

In most questions relating to a normal distribution we do not have the convenience of being asked about a probability or interval that involves an exact whole number of standard deviations, nor are we always given a probability that is related to the one being sought. If a question relates to finding a probability involving a standard normal distribution, we have a choice of using a normal probability table or a calculator. The normal probability table is a little tedious to use, and is limited by the fact that the probabilities are generally only shown for positive z values. To find other probabilities symmetry must be used.

To find a probability involving a normal distribution that is not a standard distribution, a normal probability table could only be used if any x values were first converted to z values.

The calculator provides us with an easy and convenient way of calculating probabilities whether or not the normal distribution is a standard normal distribution.

The syntax for calculating normal probabilities on the TI-83/84 is as follows:

normalcdf(lower bound, upper bound, μ, σ)

(On the TI-89, the function is named normCdf.) When calculating standard normal probabilities, the values of μ and σ can be omitted:

normalcdf(lower bound, upper bound)

In general, the process for calculating the probability for a standard normal distribution is to type the instructions given above and then enter the interval inside the brackets, putting the lower value of z first and then the higher value. If the question does not specify an interval, that is, it is of the form Z ≥ a, Z < a, or similar, we create an interval by putting a very large number at the unbounded end, as shown in the screenshot above. (On the TI-89, −∞ or ∞ can be used.)

GC 10.4 CAS 11.4

t i p

The processes use the functions built into the calculator. There are several programs available which are designed to perform the same calculations. The program

NORMPROB for the TI-83/84 and TI-89 is a good example.

13.2NORMPROB

CD

GC/CAS pr

ogram

CAS 11.4 _usingThe TI-89 has two methods for calculating normal probabilities. The first is accessible _{CATALOG F3 FlashApps}_{, and the second is accessible via a dialog box in the}

Stats/List Editor App.

t i p

(18)

E x a m p l e

4

In the following Z ~ N(0, 1), find:

a Pr(X< 32) if X is normally distributed with μ= 28, σ= 1.7.

b Pr(Z≥ 0.56). c Pr(X≥ 14) if X ~ N(12, 9). d Pr(−1.25 <Z<−0.12).

S o l u t i o n

The screenshots on the right show the calculations and syntax for each part.

a Pr(X< 32) = 0.9907.

b Pr(Z≥ 0.56) = 0.2877.

c Note that X ~ N(12, 9) means that μ= 12 and

σ2

= 9, so σ= 3. Pr(X ≥ 14) = 0.2525.

d Pr(−1.25 <Z<−0.12) = 0.3466.

Representing normal probabilities graphically

Sometimes it is useful to be able to see the region being considered. This may be to aid understanding or to check that the correct region is being used. A sketch of the curve is one possibility, but the calculator can also be used. For example 4

part d above, we would proceed as follows. First set up an appropriate window. For a standard normal distribution, use −4 to 4 on the horizontal axis and −0.1 to 0.4 on the vertical axis. Enter the command ShadeNorm(-1.25, -0.12). The result is the screenshot shown on the right. The required area is shaded and the probability reported.

For a general normal distribution, the same command is used with the addition of two arguments representing μ and σ. For instance, in example 4 part c the command would be

ShadeNorm(14, 999999, 12, 3). The result is shown on the right.

GC 10.4 CAS 11.4

13.2_NORMPROB

CD

GC/CAS pr

ogram

Care must be taken to set appropriate window dimensions. The horizontal axis dimensions should be chosen to be about four standard deviations either side of the mean. Hence in the second example above, these were 12 − 4 × 3 = 0 and 12 + 4 × 3 = 24, respectively. The vertical axis also needs adjustment. Alternatively, the program NORMPROB can be used to automate the process of finding normal probabilities as shown in the following screenshots.

(19)

exercise

13.3

In questions 1 to 10, X is normally distributed and Z ~ N(0, 1).

1 Given that Pr(X<a) = 0.3 and Pr(X<b) = 0.5, find:

a Pr(X> a). b Pr(a< X<b).

2 If Pr(X> x1) = 0.2 and Pr(X< x2) = 0.4, find:

a Pr(X< x1). b Pr(x1<X< x2).

3 If Pr(X≤ x1) = 0.3 and Pr(X≥ x2) = 0.1, find:

a Pr(x1 ≤X≤x2). b Pr(X<x1|X< x2).

4 Given that Pr(X<c) = 0.42 and Pr(X<d) = 0.84, find:

a Pr(c<X<d). b Pr(X<c |X<d).

5 Given that Pr(X>x1) = 0.01 and Pr(X≥ x2) = 0.96, find:

a Pr(x2 <X<x1).

b an approximate value for c if Pr(x2<X< x1) = Pr(−c< Z<c).

6 Given that Pr(X>c) = 0.001 and Pr(X> d) = 0.998, find:

a Pr(d<X< c).

b an approximate value for z if Pr(d≤ X≤c) = Pr(−z≤Z≤ z).

7 Given that Pr(X<a) = 0.025 and Pr(a< X<b) = 0.345, find:

a Pr(X< b).

b an appropriate value for c if Pr(X>a) = Pr(Z >c).

8 Given that Pr(X<c) = 0.0015 and Pr(c< X<d) = 0.4761, find:

a Pr(X≤ d).

b an approximate value for z if Pr(X>c) = Pr(Z>z).

9 Given that Pr(X<a) = 0.32 and Pr(b<X<a) = 0.18, find:

a Pr(X< b). b Pr(X<a |X< b).

10 Given that Pr(X>c) = 0.45 and Pr(d≤ X≤ c) = 0.09, find:

a Pr(X≥ d). b Pr(X<d |X≤c).

11 Find:

a Pr(X≤ 17) if μ= 19, σ = 1.2 and X is normally distributed.

b Pr(Z>−0.43) where Z ~ N(0, 1).

c Pr(X< 37) if X~ N(34, 16).

d Pr(−0.67 <Z< 1.23) where Z~ N(0, 1).

12 Find:

a Pr(X≥ 61) if X~ N(47, 121).

b Pr(Z<−1.05) where Z~ N(0, 1).

c Pr(0.73 < Z≤ 1.41) where Z~ N(0, 1).

d Pr(X> 13.7) if μ= 9.8, σ= 2.3 and X is normally distributed.

(20)

13 In the following, Z~ N(0, 1). Find:

a Pr(Z≥ 2.11).

b Pr(X< 231.8) if μ= 193.4, σ= 37.4 and X is normally distributed.

c Pr(−1.26 ≤Z≤ 0.16).

d Pr(X≤ 2.97) if X~ N(2.99, 0.04). E x a m p l e

5

A manufacturer produces screws, the lengths of which are normally distributed with a mean of 15.7 mm and a standard deviation of 0.4 mm. If a screw is randomly selected from the production line, find the probability that it is:

a no longer than 16 mm. b at least 15 mm long.

c rejected, if only screws that are between 14.6 mm and 16.6 mm long are accepted.

S o l u t i o n

a Let X represent the length of the screws. We want Pr(X≤ 16) as ’no longer than’ is equivalent to ’less than or equal to’. The calculation is shown in the screenshot on the right.

Hence Pr(no longer than 16 mm) = 0.7734.

b We want Pr(X≥ 15) as ’at least’ is equivalent to ‘greater than or equal to’. The screenshot on the right shows the result using the ShadeNormal

command.

c The screw is accepted if 14.6 ≤X≤ 16.6. We calculate Pr(14.6 ≤X≤ 16.6) to be 0.9848. So the probability that the screw is rejected is 1 − 0.9848 = 0.0152.

exercise

13.3

14 The labels on bags of flour say that the bags have a weight of 1 kg. The actual mean weight of the bags is 1.02 kg, in order to minimise the number of bags that are

underweight. If the weight of the bags is normally distributed with a standard deviation of 12 g, find the percentage of bags, correct to 1 decimal place, that would be expected to:

a weigh more than 1.035 kg.

b be legally underweight, as they weigh less than 990 g.

15 The life of Bright Ideas light globes is normally distributed with a mean of 960 hours and a standard deviation of 24 hours. If a light globe is purchased, what is the probability that its life is:

a greater than 1000 hours? b less than 900 hours?

GC 10.4 CAS 11.4

(21)

16 The Sugary Syrup Company sell bottles of maple syrup. The volume of syrup in each bottle follows a normal distribution with a mean of 500 millilitres and a standard deviation of 3.4 millilitres. If a person buys one bottle of maple syrup, find the probability that the volume of maple syrup is:

a less than 495 millilitres. b greater than 507 millilitres.

c between 496 and 505 millilitres.

17 The Choof’s Pies Company makes an exotic chicken pie that is packed full of chicken and vegetables. The chicken pies have an average weight of 750 g. Their weight is normally distributed with a standard deviation of 8 g.

a What is the probability that a randomly selected chicken pie weighs more than 762 g?

b The company offers a money-back guarantee if any chicken pie purchased weighs less than 735 g. What percentage of purchases, to the nearest per cent, are expected to be eligible for a refund?

E x a m p l e

6

The intelligence quotient or IQ as measured by IQ tests is a normally distributed random variable with mean 100 and standard deviation 15.

a If a person is randomly selected from the population, what is the probability that their IQ is:

i over 120? ii less than 90?

b If four people are randomly selected from the population, what is the probability that exactly one of them has an IQ over 120?

c If a person is randomly selected from the people who have an IQ that is greater than 90, what is the probability that the person selected has an IQ over 120?

S o l u t i o n

a The screenshot on the right shows the results.

i Pr(IQ over 120) = 0.0912

ii Pr(IQ less than 90) = 0.2525

b From above, the probability that a person has an IQ over 120 is 0.0912. We now need to determine the probability that exactly one of the four people who were randomly selected has an IQ that is over 120. The probability of randomly selecting one person from the population with an IQ that is over 120 does not change with successive selections—we are now dealing with a binomial distribution with four trials and the probability of success on one trial is 0.0912.

Let S= number of successes (in this case having an IQ that is over 120). Then:

=

= 0.2738

GC 10.4 CAS 11.4

Pr(S =1) 4 1

⎝ ⎠

⎛ ⎞₍_0.0912₎1

1 0.0912–

( )3

(22)

c This is a conditional probability question. The sample space has been restricted by the fact that the person being selected cannot have an IQ less than 90. Thus:

Pr(IQ > 120 | IQ > 90) =

= (since (IQ > 120) ⊂ (IQ > 90))

=

= 0.1220

exercise

13.3

18 Perfect Pictures develops photographic film and print the photographs on paper that is advertised to be 18 cm long and 13 cm wide. The area of the developed photographs is normally distributed with a mean of 234 cm2 and a standard deviation of 1.1 cm2.

a If a photograph is randomly selected from those that have been printed, what is the probability that it has an area greater than 235.5 cm2?

b If six photographs are randomly selected from those that have been printed, what is the probability that two of them have an area greater than 235.5 cm2?

c If all photographs with an area of less than 232 cm2 are discarded and one photograph is randomly selected from those that remain, what is the probability that it has an area greater than 235.5 cm2?

19 Ball bearings are manufactured for use in large machines. The diameters of the ball bearings are normally distributed with a mean of 2.8 mm and a standard deviation of 0.1 mm.

a To the nearest tenth of a per cent, what is the percentage of ball bearings with a diameter greater than 3 mm?

b The size of the ball bearings is critical. The diameters must be between 2.72 mm and 2.88 mm or the ball bearings can not be used in the machine.

i If one ball bearing is chosen randomly, what is the probability that it can be used in the machine?

ii If three ball bearings are chosen randomly, what is the probability that they can all be used in the machine?

20 A national study was commissioned to learn how far elite Australian Rules footballers could kick a football. Each player was given several kicks in an attempt to produce their longest kick. It was found that the lengths of their best kicks were normally distributed with a mean of 55.3 m and a standard deviation of 3.2 m.

a To the nearest per cent, what percentage of footballers were able to kick a football further than 55 m?

b If five footballers were randomly chosen, what was the probability that at least four of them were able to kick a football further than 55 m?

c If a footballer was randomly chosen from those who were able to kick a football further

Pr IQ[( >120)∩(IQ>90)] Pr IQ( >90)

---Pr IQ( >120) Pr IQ( >90)

---0.0912 1 0.2525–

(23)

Ahab is going on a fishing trip to Lake Lotsafish. The lake is well stocked with large edible fish. The lengths of the fish are normally distributed, with a mean of 45.6 cm and a standard deviation of 4.2 cm.

a If Ahab catches one fish, find the probability that it is:

i less than 43 cm long. ii longer than 48 cm.

iii between 46 cm and 50 cm long.

b If Ahab fishes each day for seven consecutive days, what percentage of the fish he catches, to the nearest tenth of a per cent, are likely to be longer than 52 cm?

c If Ahab catches 50 fish, how many would be expected to be longer than 52.9 cm?

The law states that fish less than 40 cm long must be thrown back in the water and only fish over 40 cm long may be taken to be eaten. d What is the probability that a fish must be

thrown back?

e If Ahab catches three fish, what is the probability that he is allowed to keep all three? f If Ahab only catches one fish that he is allowed to keep, what is the probability that it

is over 50 cm long?

analysis task 2—

a fishy story

SAC

13.3_{A f}

ishy story

CD

SAC analysis task

Number

98

Humans seem to share pretty much 98% of their DNA with—wait for it—chimpanzees. Hi Dad, how’s that chest hair going?

(24)

The inverse normal

distribution

We have calculated probabilities for a standard normal distribution given an interval of Z values, and for a non-standard normal distribution given an interval of X values. Sometimes we are given a probability and wish to determine a value c such that Pr(Z < c) is equal to the given probability. Alternatively, we may need to find an x value for given μ and σ; that is, to find the value b such that Pr(X < b) is equal to the given probability, by first finding c such

that Pr(Z < c) is equal to the given probability and then using to find the value of b.

Finding z or x values given a probability

To find a z value, given a probability for a standard normal distribution, we would have the choice of using a normal probability table (in reverse), an inverse normal probability table, or a calculator. The normal probability tables are a little tedious to use and limited by the fact that the probabilities are generally only shown for positive z values and vice versa. To find an x value given a probability for a non-standard normal distribution, either of the normal probability tables referred to could be used, but only by first finding z values and then converting

them to x values using .

Whether or not the normal distribution is a standard normal distribution, the calculator provides an easy and convenient way of calculating such values for a given probability.

For example, suppose we want to find a person’s IQ so that only 5% of the population had a higher IQ. Recall that IQ is a normally distributed variable that has a mean of 100 and a standard deviation of 15. So we are required to find the value of x given Pr(X > x) = 0.05. This is equivalent to finding the value of x such that Pr(X < x) = 0.95. It is this latter form we must use, because the inverse normal function on the calculator (and in the normal probability tables) is cumulative.

c b–μ σ

---=

z x–μ σ

---=

GC 10.4 CAS 11.4

Calculating values of the inverse normal function

To find the value of x if Pr(X<x) = 0.95 the syntax on the TI-83/84 and the TI-89 is as follows:

.invNorm(probability, μ, σ), or simply

.invNorm(probability) if μ= 0 and σ= 1.

The screenshot on the right shows the result, 124.67, in the IQ example above.

Alternatively, use the NORMPROB program as shown.

13.2NORMPROB

CD

GC/CAS pr

ogram

(25)

E x a m p l e

1

If Z ~ N(0, 1), find the value of c given that:

a Pr(Z<c) = 0.29 b Pr(Z≥c) = 0.19 c Pr(−c<Z≤c) = 0.86

S o l u t i o n

The screenshot on the right shows the results in each case. Note that in parts b and c, the first step is to rewrite the statement in a form where invNorm can be used.

a c=−0.5534

b Pr(Z≥c) = 0.19 so Pr(Z≤c) = 0.81

c= 0.8779

c Pr(−c<Z≤c) = 0.86 so 0.14 of the area under the curve is found in the two tails. Therefore, 0.07 of the area under the curve is found in each tail. It follows that Pr(Z<c) = 0.93.

c= 1.4758

E x a m p l e

2

If X is normally distributed, find:

a x given that Pr(X ≤x) = 0.37, μ= 13.6, σ= 1.4.

b x given that Pr(X >x) = 0.44, μ= 421.2, σ= 15.6.

c a and b given that Pr(a≤X≤b) = 0.53, μ= 41.2, σ= 4.2 and the specified interval is symmetrical about the mean.

S o l u t i o n

The screenshot on the right shows the values for parts a and b.

a x= 13.14

b Pr(X>x) = 0.44 so Pr(X≤x) = 0.56

x= 423.56

c Pr(a≤X≤b) = 0.53 so 0.47 of the area under the curve is found in the two tails. Therefore 0.235 of the area under the curve is found in each tail. It follows that Pr(X≤b) = 0.765. Similarly Pr(X ≤a) = 0.235. We find that a= 38.17 and b= 44.23.

GC 10.4 CAS 11.4

A graph shows the symmetry properties clearly and it is often helpful to sketch a graph before attempting a calculation like that in part c.

y

z c

–c –1 0 1

–2

–3 2 3

0.86

0.07 _0.07

t i p

GC 10.4 CAS 11.4

(26)

Quantiles and percentiles

Quantiles and percentiles are terms that are similar in meaning and are used to convey information about a distribution. The term quantile refers to the value below which there is a specified probability (given as a decimal) that a randomly selected element will fall. So the 0.5 quantile of a standard normal distribution is equal to a where Pr(Z < a) = 0.5. Of course we know that a = 0 in this case.

The term percentile refers to the value below which there is a specified percentage that a randomly selected element will fall. So the 50th percentile of a standard normal distribution is equal to a for which 50% of values are below a. Again, we know that a = 0 in this case. It follows that the 0.5 quantile has the same meaning as the 50th percentile, and both are the same as the median. Note that the mean and the median have the same value for normal distribution.

Percentile is a term commonly used with the lengths and weights of newborn babies. Parents may be told that their child is at the 70th percentile for length and the 60th percentile for weight. This means that the baby is longer than 70% of babies and weighs more than 60% of babies.

E x a m p l e

3

a Find the 0.45 quantile for a standard normal distribution.

b Find the 80th percentile if X ~ N(20.6, 4).

S o l u t i o n

a We require the value of z such that Pr(Z<z) = 0.45.

z=−0.1257

b First convert to a probability. We require the value of x such that Pr(X<x) = 0.80.

x= 22.28

Again a sketch of the graph is helpful, particularly in part c. Try to include a sketch like this in all of your solutions.

t i p

41.2 0.235 0.53 0.235

x b

a

GC 10.4 CAS 11.4

Warning

Variance or standard deviation?

(27)

exercise

13.4

Give answers correct to 4 significant figures.

1 If Z~ N(0, 1), find the value of z given that:

a Pr(Z<z) = 0.36. b Pr(Z≥ z) = 0.28. c Pr(−z<Z ≤z) = 0.38.

2 If Z~ N(0, 1), find the value of z given that:

a Pr(Z<z) = 0.94. b Pr(Z≥ z) = 0.68. c Pr(−z<Z ≤z) = 0.75.

3 Find the value of x if X is normally distributed and:

a Pr(X≤ x) = 0.52, μ= 21, σ= 2.6. b Pr(X> x) = 0.73, μ= 132, σ= 9.8.

c Pr(a≤X≤x) = 0.67, μ= 31.2, σ= 5.6 and the specified interval is symmetrical about the mean.

4 Find the value of x if X is normally distributed and:

a Pr(X≤ x) = 0.21, μ= 44.2, σ= 7.6. b Pr(X> x) = 0.78, μ= 61.7, σ= 6.9.

c Pr(d≤X≤x) = 0.87, μ= 0.94, σ= 0.12 and the specified interval is symmetrical about the mean.

5 If X~ N(87, 169), Pr(c≤X≤a) = 0.92 and the specified interval is symmetrical about the mean, find the value of a.

6 X~ N(10.4, 1.44), Pr(b≤X≤a) = 0.55 and the specified interval is symmetrical about the mean. Find the value of a.

7 For a standard normal distribution find:

a the 0.45 quantile. b the 0.93 quantile.

8 For a standard normal distribution find:

a the 50th percentile. b the 20th percentile.

9 If X is normally distributed and μ= 21, σ= 2.6, find:

a the 0.37 quantile. b the 86th percentile.

10 If X~ N(100, 225), find:

a the 95th percentile. b the 0.4 quantile.

11 The average length of female babies at birth is 48.8 cm with a standard deviation of 3.3 cm. The lengths are normally distributed. How many centimetres long is a female baby at birth if her length is at:

a the 45th percentile? b the 90th percentile?

12 The Perfect Parmesan Cheese Company produces containers of parmesan cheese labelled to have contents of 250 g. The contents are normally distributed with a mean of 251.6 g and a standard deviation of 0.8 g. What is the minimum content, in grams, of 95% of all containers produced?

13 The time taken to drive from Carnegie to Torquay is a normally distributed random variable with a mean of 83.2 minutes and a standard deviation of 3.5 minutes.

a What is the least time taken for the slowest 5% of drives? Give your answer correct to the nearest second.

b What is the greatest time taken for the fastest 5% of drives? Give your answer correct to the nearest second.

(28)

E x a m p l e

4

a If X is normally distributed with mean 28 and Pr(X≤ 31) = 0.64, find the standard deviation

σ correct to 3 decimal places.

b If X ~ N(μ, 16) and Pr(X> 92.3) = 0.23, find the mean μ correct to 2 decimal places.

S o l u t i o n

For a non-standard normal distribution, we need both a mean and a standard deviation to use invNorm directly. For these questions, we ‘translate’ the problem so that we can use

invNorm for a standard normal distribution.

If X ~ N(μ, σ2) and Z ~ N(0, 1), then . We use this as follows.

a We are given that Pr(X≤ 31) = 0.64 where X ~ N(28, σ2).

Pr(X ≤ 31) =

= (since μ = 28)

=

Using the invNorm command on the calculator, we find that if Pr(Z≤z) = 0.64, then z = 0.3585. So:

= 0.3585

σ= = 8.369

b We are given Pr(X> 92.3) = 0.23, which is equivalent to Pr(X≤ 92.3) = 0.77, where

X ~ N(μ, 16). Pr(X≤ 92.3) =

= (since σ = 4)

Using the invNorm command on the calculator we find that if Pr(Z≤z) = 0.77, then

z= 0.7388. So:

= 0.7388 92.3 −μ= 4 × 0.7388

μ= 92.3 − 4 × 0.7388

= 89.34

Z X–μ

σ

---=

t i p

Note that the final answer was obtained using

z = 0.3584587992, the answer shown on the calculator screen below. If z = 0.3585 is used, we get an answer that is incorrect in the third decimal place. Remember that even when asked to give an answer to a certain number of decimal places, you should still use as many decimal places as possible in your working to ensure that subsequent answers are not compromised.

Pr X---_σ–μ 31–μ

σ

---≤

⎝ ⎠

⎛ ⎞

Pr Z≤31 28---_σ–

⎝ ⎠

⎛ ⎞

Pr Z≤_σ---3

⎝ ⎠ ⎛ ⎞ GC 10.4 CAS 11.4 3 σ ---3 0.3585

---Pr X–μ

σ

--- 92.3–μ

σ

---≤

⎝ ⎠

⎛ ⎞

Pr Z 92.3–μ

4

---≤

⎝ ⎠

⎛ ⎞

92.3–μ

4

(29)

---exercise

13.4

14 If X is normally distributed with mean 37 and Pr(X> 41) = 0.27, find the standard deviation, σ.

15 If X is normally distributed with mean 108 and Pr(X≤ 103) = 0.43, find the standard deviation, σ.

16 If X~ N(μ, 121) and Pr(X< 1243) = 0.69, find the mean, μ.

17 If X~ N(μ, 64) and Pr(X≥ 43.7) = 0.73, find the mean, μ.

18 Studies have shown that the IQs of students studying at a tertiary institution are normally distributed with a standard deviation of 13.6. Find the mean, μ, to the nearest whole number, given that 23% of these students have an IQ of at least 120.

19 A small town in Northern Europe has, on average, 8 hours and 30 minutes of daylight per day in December, with the amount of daylight depending partly on weather conditions. Find the standard deviation, σ, given that one in every four days has fewer than 8 hours of daylight and the number of hours of daylight is normally distributed. Give your answer correct to the nearest minute.

20 A random variable, X, is normally distributed. It is known that 40% of the values are at least 49.51 and three-quarters of the values are less than 51.03. Find the mean, μ, and the standard deviation, σ, of the distribution. Give your answers correct to 1 decimal place.

continued

GC 7.1

The TI-83/84 Solver can be used efficiently in questions like those in example 4. For example in part a, the problem was essentially to find σ given that invNorm(0.64, 28, σ) = 31. In part b, it was to find μ given that invNorm(0.77, μ, 4) = 92.3. Set up a general equation in the Solver as shown below left. Enter the given values and solve for the unknown. The second and third screenshots show the results for example 4 parts a and b.

(30)

In all VCE subjects, the study scores are normally distributed with a mean of 30 and a standard deviation of 7.

Part 1

a What percentage, to the nearest tenth of a per cent, of Maths Methods students achieve a study score of 40 or more?

b What is the probability that a randomly selected student achieves a study score of 25 or less?

c What is the probability that a randomly selected student achieves a study score between 28 and 38?

Part 2

d If three students were randomly selected from all of those studying Maths Methods, what is the probability that they all achieved a study score between 28 and 38?

e If five students were randomly selected from all of those studying Maths Methods, what is the probability that three of them achieved a study score between 28 and 38?

Part 3

f If a student performs better than 90% of those studying Maths Methods, what is the least study score to the nearest whole number the student achieves?

g If 60% of those studying Maths Methods perform better than a particular student, what study score, to the nearest whole number, does the student achieve?

Extension

A large research project involving Maths Methods students demonstrated a strong relationship between the amount of effective study done by a student and the study score achieved by that student. The research showed that students who had studied effectively throughout the year never achieved a study score below 25.

h What is the probability that a randomly selected student who has studied effectively achieves a study score of 40 or more?

i What is the probability that a randomly selected student who has studied effectively achieves a study score of 30 or less?

For students who do virtually no work and do not listen in class, the study scores are still normally distributed but have a standard deviation of 3.1.

j If 20% of these students achieve a study score of less than 16, find the mean study score, μ, for this group of students. Give your answer correct to 1 decimal place.

analysis task 3—

the more you STUDY, the more you SCORE

SAC

13.4_{The mor}

e

you STUDY

, the mor

e

you SCORE

CD

SAC analysis task

Number

(31)

Chapter review

Summary

The normal distribution

.We write X ~ N(μ, σ2)to represent the normally distributed random variable X with mean, μ, and variance σ2, i.e. standard deviation, σ.

.The normal distribution has the following properties:

aThe equation of the probability density function or pdf for the normal curve is given by .

aThe normal curve is bell-shaped and symmetrical.

ax = μ is not only the mean of the distribution, it is also the median and the mode.

aThe maximum value of f(x), occurs when x =μ.

aThe curve continues infinitely in both directions with the x-axis as an asymptote of the curve.

.The confidence intervals of the normal distribution are:

aPr(μ− σ≤ X ≤μ + σ) 0.68

aPr(μ− 2σ ≤ X ≤μ + 2σ) 0.95

aPr(μ− 3σ ≤ X ≤μ + 3σ) 0.997

.The standard normal distribution has μ= 0 and σ= 1.

.A non-standard normal distribution, X ~ N(μ, σ2), can be standardised using . A value z of Z indicates how many standard deviations from the corresponding value x of X is from its mean.

.The confidence intervals (transformed from those involving μ and σ) and some of the symmetries of the standard normal random variable Z are:

aPr(−1 ≤ Z ≤ 1) 0.68 aPr(−2 ≤ Z ≤ 2) 0.95 aPr(−3 ≤ Z ≤ 3) 0.997

aPr(Z > z) = 1 – Pr(Z < z) aPr(Z < −z) = Pr(Z > z) aPr(Z >−z) = Pr(Z < z)

Calculation of probabilities

Using properties of normal distributions

We can use known confidence intervals and symmetries to calculate probabilities in special cases.

Using the calculator

.For a standard normal distribution, for which a and b are the lower and upper z values of the interval respectively, to calculate Pr(a < Z < b): normalcdf(a, b).

.For a non-standard normal distribution with mean μ and standard deviation σ, to calculate Pr(a < X < b): normalcdf(a, b, μ, σ).

The inverse normal distribution

.For a standard normal distribution, to find the value of z if Pr(Z < z) = p: invNorm(p).

.For a non-standard normal distribution with mean μ and standard deviation σ, to find the value of x if Pr(X < x) = p: invNorm(p, μ, σ).

f x( ) 1 σ 2π

---e

1 2

---x–μ σ ---⎝ ⎠ ⎛ ⎞2

–

x∈R ,

=

1 σ 2π

---~ ~ ~

Z X–μ σ

---=

(32)

Quantiles and percentiles

.The term quantile refers to the value below which there is a specified probability (given as a decimal) that a randomly selected element will fall. For example, the 0.6 quantile of a standard normal distribution is equal to a such that Pr(Z < a) = 0.6.

.The term percentile refers to the value below which a specified percentage of the values will fall. The 60th percentile of a standard normal distribution is equal to a such that 60% of values are below a.

.The 0.6 quantile has the same meaning as the 60th percentile.

Revision questions

Short answer

1 A normally distributed random variable has μ= 24 and σ= 5. Without using a calculator find:

a the approximate percentage of values that are

b the approximate probability that a randomly selected object from this distribution will be

2 A normally distributed random variable has μ = 47 and σ= 6. Without using a calculator find:

a an interval within which the following percentages of values lie.

i 95% ii 99.7%

b the value such that

i 16% of values are greater than it. ii 84% of values are greater than it.

3 Without using a calculator find the values of μ and σ of a normal distribution that has a symmetrical region which contains:

a 68% of its values between 14 and 26. b 95% of its values between 36 and 56.

c 99.7% of its values between −5 and 22.

4 X ~ N(15, 25). Without using a calculator find:

a the approximate probability that X will be

b the approximate percentage of values that are

5 The mass of tins of sweet corn is normally distributed with mean 250 g and standard deviation 15 g. Answer the following without using a calculator.

a Find the approximate percentage of tins with a mass between 205 g and 265 g.

b A tin is chosen randomly from the production line. Determine the approximate probability that the tin has a mass between 235 g and 295 g.

6 For a standard normal random variable Z, find the following, without using a calculator:

a the approximate percentage of values for which Z is

i between −1 and 1. ii less than −2.

b the approximate probability that Z is

i between −3 and 3. ii greater than −2.

7 A normal random variable X has mean 70. If Pr(X < 80) = p, find in terms of p:

(33)

8 For a standard normal random variable, without using a calculator find:

a an interval symmetric about the mean within which

i 68% of values lie. ii 99.7% of values lie.

b the z value such that

i 0.15% of values are greater than it. ii 97.5% of values are greater than it.

9 Given that Pr(Z < a) = 0.37 where Z ~ N(0, 1), find the following:

a Pr(Z > a) b Pr(Z <−a) c Pr(Z >−a)

10 Write an equivalent statement to the one given by converting the variable in each of the following to a standard normal variable. Do not attempt to evaluate the statement.

a Pr(X ≥ 11); μ= 13, σ = 4 b Pr(57 < X ≤ 63); μ= 59, σ= 5

11 Given that Pr(X < a) = 0.4 and Pr(X > b) = 0.2, find:

a Pr(X > a). b Pr(a < X < b). c Pr(X < a | X < b).

12 Find:

a Pr(Z ≥ 1.1) where Z ~ N(0, 1).

b Pr(X < 41.8) if μ= 37.4, σ = 3.4 and X is normally distributed.

c Pr(−1.6 ≤ Z ≤ 0.3) where Z ~ N(0, 1).

d Pr(X ≤ 5.3) if X ~ N(4.2, 9).

13 If Z ~ N(0, 1), find the value of z, given that:

a Pr(Z < z) = 0.36. b Pr(Z ≥ z) = 0.29. c Pr(−z < Z ≤ z) = 0.44.

14 Find the value of x if X is normally distributed and

a Pr(X ≤ x) = 0.43, μ= 27, σ = 3.3.

b Pr(X > x) = 0.81, μ= 352, σ= 17.5.

c Pr(c ≤ X ≤ x) = 0.74, μ= 47.3, σ= 6.7 and the specified interval is symmetrical about the mean.

15 If X ~ N(137, 196), find:

a the 75th percentile. b the 0.3 quantile.

16 The lengths of the leaves of a particular tree are normally distributed with mean 12.4 cm and standard deviation 2.3 cm. Find the length, in centimetres, of a leaf if its length is at:

a the 35th percentile. b the 60th quantile.

Extended response

1 The times taken for a large group of swimmers to swim 50 metres freestyle are normally distributed with mean 23.1 seconds and variance 0.16 seconds.

a Without a calculator find the approximate percentage of these swimmers who can swim 50 metres freestyle in a time that is between 22.7 seconds and 23.5 seconds.

b If one of these swimmers is randomly chosen, what is the probability that they can swim 50 metres freestyle in less than 22.3 seconds?

(34)

d If two swimmers are chosen randomly, what is the probability that both can swim 50 metres freestyle in less than 23.7 seconds?

e What is the probability that one of these swimmers can swim 50 metres freestyle in less than 23 seconds, given that the swimmer is randomly chosen from those who can swim 50 metres freestyle in less than 23.7 seconds.

2 Dracula’s Stakes sells garden stakes that are advertised as being 1 m long. The lengths of the stakes are normally distributed with a mean of 1.015 m and a st