• No results found

Sampling Distributions

N/A
N/A
Protected

Academic year: 2021

Share "Sampling Distributions"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

Sampling Distributions

You have seen probability distributions of various types. The normal distribution is an example of a continuous distribution that is often used for quantitative measures such as weights, heights, etc. The binomial distribution is an example of a discrete

distribution because the possible outcomes are only a “discrete” set of values, 0, 1, … , n. The value of the binomial random variable is the number of “successes” out of a random sample of n trials, in which the probability of success on a particular trail is π.

(2)

The Binomial Distribution

as a Sampling Distributions

The nature of the binomial distribution makes it a sampling distribution. In other words, the value of the binomial random variable is derived from a sample of size n from a population. You can think of the population as being a set containing 0’s and 1’s, with a 1 representing a “success” and 0 representing a “failure.” (The term “success” does not necessarily imply something good, nor does “failure” imply something bad.) The sample is obtained as a sequence of n trials, with each trial being a draw from the population of 0’s and 1’s. (Such trials are called “Bernoulli” trials.)

On the first trial, you draw a value from {0, 1}, with P(1)=π. Then you draw again in the same way, and do this repeatedly a total of n times. So your sample will be a set such as {1, 1, 0, 1, 0, 1} in the case for n=6. This set has 4 1’s and 2 0’s, so the value of the

binomial random variable is y=4. The value of the binomial distribution is the sum of the outcomes from the trials; that is, the number of 1’s.

(3)

The Binomial Distribution

as a Sampling Distribution

Recall that the binomial probability formula is

P(y successes in n trials) = y n!( n! y)!

π

y(1−

π

)n y− where

π

= probability of success on single trial and

n = number or trials.

Mean of the binomial distribution:

n

π

Variance of the binomial distribution:

n

π

(1

π

)

This probability is derived from the sampling distribution of the number of successes that would result from a very large number of samples.

(4)

The Binomial distribution

as a Sampling Distribution

Consider a population that consists 60% of 1’s and 40% of 0’s. If you draw a value at random, you get a 1 with probability .6 and a 0 with probability .4. Such a draw would constitute a Bernoulli trial with P(1)=.6.

Suppose you draw a sample of size n and add up the value you obtain. This would give you a binomial random variable. Now suppose you do this again and again for a very large number of samples. The conceptual results make up a conceptual population. Here are the histograms that correspond to the population for

values of n equal to 1, 2, 3, 6, 10, and 20.

Notice how the shapes of the histograms change as n increases. The distributions become more “mound-shaped” and symmetric.

(5)

Histotram of B(1, .6) Distribution 0 1 2 3 4 5 6 7 0 1 Successes Re la ti v e F re que nc y Histogram of B(2, .6) Distribution 0 0.1 0.2 0.3 0.4 0.5 0.6 0 1 2 Successes Re la ti v e F re que nc y Histogram of B(3, .6) Distribution 0 0.1 0.2 0.3 0.4 0.5 0 1 2 3 Successes Re la ti v e F re que nc y

(6)

Histogram of B(6, .6) Distribution 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0 1 2 3 4 5 6 Successes Re la ti v e F re que nc y Histogram of B(10, .6) Distribution 0 0.05 0.1 0.15 0.2 0.25 0.3 0 1 2 3 4 5 6 7 8 9 10 Successes R e la ti v e F re que nc y Histogram of Successes 0 0.05 0.1 0.15 0.2 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Successes R e la ti v e F re que nc y

(7)

The Binomial Sampling Distribution

and Statistical Inference

Here are the values of P(y=k) and P(y<=k) for n=20 and k=6-17 (P(y=k)<.005 for n<6 or n>17): 6 7 8 9 10 11 12 13 14 15 16 17 P(y=k) .005 .015 .035 .071 .117 .160 .180 .165 .124 .075 .035 .012 P(y≤ k) .006 .021 .057 .128 .245 .404 .584 .750 .874 .949 .984 .996 Remember that these are the probabilities for π=.6.

Suppose you are sampling from a distribution with unknown π. If you drew sample with y=12, would you have reason to doubt that π=.6? (Note: y/n=.6)

If you drew sample with y=10, would you have reason to doubt that π=.6? (Note: y/n=.5)

If you drew sample with y=6, would you have reason to doubt that

(8)

The Binomial Sampling Distribution

and Statistical Inference

Here are the numeric values of P(y=k) for n=10, 3 decimal places: 0 1 2 3 4 5 6 7 8 9 10 P(y= k)

.000 .002 .011 .042 .111 .201 .251 .215 .121 .040 .006 P(y≤ k)

.000 .002 .012 .055 .166 .367 .618 .833 .954 .994 1.00 Remember that these are the probabilities for π=.6.

Suppose you are sampling from a distribution with unknown π. If you drew sample with y=6, would you have reason to doubt that π=.6? (Note: y/n=.6)

If you drew sample with y=5, would you have reason to doubt that π=.6? (Note: y/n=.5)

If you drew sample with y=3, would you have reason to doubt that π=.6? (Note: y/n=.3)

(9)

Normal Distributions and

Sampling Distributions

One of the most useful statistics is the sample mean,

y

. Statistical inference based on

y

is derived from its sampling distribution. If the population from which the sample was obtained is normally distributed with mean µ and variance σ2, then the sampling

distribution of

y

is normal. Moreover, the mean of the sampling has mean µ and variance σ2/n.

If yi ~ ( ,N

µ σ

2), i=1,…,n, then

2

~ ( , / )

y N

µ σ

n .

(10)

Normal Distributions and

Sampling Distributions

If the distribution from which the samples were obtained is not normal, then the sampling distribution of

y

is only

approximately normal, but the distribution becomes more nearly normal as n increases. This is the Central limit Theorem.

Central Limit Theorem: If y is a random variable with mean µ

and variance σ2, then the sampling distribution of

y

is

References

Related documents

This is the recurring motto of the unedited treatise Diez privilegios para mujeres preñadas 4 (Ten Privileges for Pregnant Women), written in 1606 by the Spanish physician

The calculated values of V can now be compared to the measured values defined in Table 1 at the specific time the measurements are available, considering the calculated values

[3] NEST’s primary goals are to read in ESA and third part SAR data products, provide tools for calibration, orthorectification, co-registration, interferometry,

This paper proposes a logical model to examine the effect of the EDoS attack in cloud environment using finite queuing model and enhanced with experimental model. Due to

Make measurements on timeslot 0 [FREQUENCY] {Timeslot Off} [Enter] Activate the ORFS measurement (figure 25) [MEASURE] {GMSK Output RF The default setting measures spectrum

toda: Dorylaimida ) from the rhizosphere of the wild growing grape ( Vitis vinifera ssp. silvestris ) in the riparian woods of the rivers Danube and March

de Klerk, South Africa’s last leader under the apartheid regime, Mandela found a negotiation partner who shared his vision of a peaceful transition and showed the courage to