4. (from last time, we will try again) A special case of means: The proportion
Suppose there are n independent trials that constitute a sample. Each trial results in either "success" or
"failure", and the chance of a success each time is π.
The proportion π could be thought of as the mean of a special kind of population. The population only has values of 1 or 0. If a population has that feature, the population mean E(X) or µ is
π -- the proportion of 1’s in your population and the population standard deviation is
) 1 (
* π
π
σ = −
The binomial can also be used for PROPORTIONS or percentages. For a binomial distribution, if n is large (>20), then the distribution of the sample proportion P is approximately normal with µ = π, and the standard error of the distribution of sample proportions is
P
n
) 1 ( π σ = π −
For example, this is the population of college students in Los Angeles in 2000:
SEX | Freq. Percent Cum.
---+--- Male | 205,534 52.85 52.85 Female | 183,393 47.15 100.00 ---+--- Total | 388,927 100.00
Let’s arbitrarily assign the value 1 to males and the value 0 to females. Look what happens when I compute some summary statistics for sex. Note the proportion of males is the same as the mean and it’s just the percentage or proportion of 1’s in the data.
SEX
--- Percentiles Smallest
1% 0 0 5% 0 0
10% 0 0 Obs 388927 Å a population of 388,927 25% 0 0 Sum of Wgt. 388927
50% 1 Mean .5284642 Å this is π
Largest Std. Dev. .4991898 75% 1 1
90% 1 1 Variance .2491904
95% 1 1 Skewness -.1140418
99% 1 1 Kurtosis 1.013006
We can take samples from a population of 1’s and 0’s. For example, one single sample of size 25 could look like this
0 1 1 1 0 0 1 0 1 0 1 1 1 1 0 0 0 1 1 1 1 0 0 1 0
This would have a mean of 14/25 = .56, it would have a standard deviation of . 4964 = . 56 * ( 1 − . 56 ) This sample came from a sampling distribution of all possible samples of size 25 from this population.
We can call the sample proportion for a single sample P. A collection of sample statistics (or the
population of all possible sample proportions from samples of the same size) will have a mean of π and a standard error of
P