• No results found

STATISTICAL METHODS FOR BUSINESS

N/A
N/A
Protected

Academic year: 2021

Share "STATISTICAL METHODS FOR BUSINESS"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

7.1.- Distributions associated with the sampling

process.

7.2.- Inferential processes and relevant distributions.

7.1.- Distributions associated with the sampling

process.

7.2.- Inferential processes and relevant distributions.

UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS

ASSOCIATED WITH SAMPLING

(2)

UNIT 7. GOALS

To describe the chi-square and Student’s t

distributions.

To calculate probabilities and percentiles (quantiles).

To apply the main pivotal statistics used in

inferential processes on the mean, the proportion

and the variance of a population.

 

To describe the chi-square and Student’s t

distributions.

To calculate probabilities and percentiles (quantiles).

To apply the main pivotal statistics used in

inferential processes on the mean, the proportion

and the variance of a population.

(3)

7.1.- Distributions associated with the sampling process.

7.1.- Distributions associated with the sampling process.

UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS

ASSOCIATED WITH SAMPLING

(4)

χ

n

2

t

n

N

(

μ, σ

)

Probability models associated with sampling

Chi-squared distribution

Student's

t

distribution

Normal distribución

(5)

Chi-squared distribution

Chi-squared distribution

X

1

, ..., X

n

INDEPENDIENT RVs with distribution N(0,1):

Z has a chi-squared distribution with n degrees of freedom

X

1

, ..., X

n

2

n

n

1

i

2

i

χ

X

Z

0 2 4 6 8 10 12 14 16 18
(6)

Expression

RANDOM VARIABLES

CONSTRAINTS

Degrees of

freedom

None

X

1

, X

2

, ..., X

n

X

1

− ̄

X

, X

2

− ̄

X

, ..., X

n

− ̄

X

i=

1

n

X

i

2

i=

1

n

(

X

i

− ̄

X

)

2

X

1

, X

2

, ..., X

n

or

i=1 n

X

i

n

= ̄

X

i=1 n

(

X

i

− ̄

X

)

=

0

n

n-1

Degrees of freedom

The number of degrees of freedom in an expression may be

interpreted as the number of values that may be arbitrarily chosen.

(7)

Chi-squared distribution

Chi-squared distribution

The chi-squared model is REPRODUCTIVE with respect

to the number of degrees of freedom.

Given two independent RVs

X

 

2

n

Y

 

2

m

It holds:

2

n m

X Y

  

Reproductivity

Reproductivity

(8)

Fisher’s Theorem

Given a simple random sample (X

1

, ..., X

n

) from a N(



)

population, it holds:

The distribution of the random variable

is chi-squared with n-1 degrees of freedom.

(

n

1

)

S

2

σ

2

The sample mean and the sample variance S

2

are

independent random variables.

̄

X

S

2

=

i=

1

n

(

X

i

− ̄

X

)

2

n

1

SAMPLE VARIANCE

(9)

Student's t distribution

Student's t distribution

n

t

n

Y

X

=

t

N(0,1)

X

Y

n

2

n

t

n

Y

X

=

t

X and Y INDEPENDIENT RVs

Then

RV distributed as a Student’s t with n freedom degrees.

(10)

7.2.- Inferential processes and relevant distributions.

7.2.- Inferential processes and relevant distributions.

UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS

ASSOCIATED WITH SAMPLING

(11)

Pivotal statistics associated with inferential

processes

POPULATION X

Parameter

Simple r.s.

(X

1

, X

2

, ...,X

n

)

Estimator

T(X

1

, X

2

, ...,X

n

)

Pivot

d

T

Pivot

d

T

STANDARDIZATION

(12)

Pivotal statistics

The estimator T summarizes the information that the

sample contains on the unknown parameter

.

The pivot is obtained by standardizing the

estimator. A pivot should have a known probability

distribution, not depending on unknown parameters.

(13)

Estimator for the population mean

Estimator for the population mean

E

(

X

̄

)

Var

(

X

̄

)

=

σ

2

n

̄

X

=

i=

1

n

X

i

n

SAMPLE MEAN

If X

N(

,

) then

If the distribution of X is unknown and n large

̄

X

N

(

μ,

σ

n

)

̄

X

N

(

μ,

σ

n

)

Unbiased estimator

σ

X

̄

=

σ

n

Standard error

(14)

Inference on the population mean

Inference on the population mean

E

(

X

̄

)

Var

(

X

̄

)

=

σ

2

n

̄

X

ESTIMATOR

PIVOT

d

̄

X

=

̄

X

E

( ̄

X

)

Var

( ̄

X

)

=

̄

X

μ

σ

n

(15)

Inference on the population mean

Inference on the population mean

d

X

̄

=

X

̄

μ

σ

n

Unknown

d

X

̄

=

X

̄

μ

S

n

t

n-1

t

n-1

NORMAL POPULATION

unknown

What is the probability distribution of the pivot?

Reproductivity of

normal model

N(0,1)

N(0,1)

Central Limit

Theorem

NORMAL POPULATION

known

NON-NORMAL POPULATION

(16)

Estimator for the population proportion

Estimator for the population proportion

̂

p

=

X

n

SAMPLE PROPORTION

X: “Number of elements in the sample that have

the characteristic of interest”.

X: “Number of elements in the sample that have

the characteristic of interest”.

X

B

(

n,p

)

E

( ̂

p

)=

E

(

X

)

n

=

np

n

=p

Var

( ̂

p

)=

Var

(

X

)

n

2

=

np

(

1

p

)

n

2

=

p

(

1

p

)

n

(X

1

, …, X

n

) a s.r.s. with X

i

being Bernoulli distributed:

X

i

=1

If the characteristic under study is present.

P(X

i

=1)=p

X

i

=0

Otherwise.

P(X

i

=0)=1-p

(17)

Inference on the population proportion

Inference on the population proportion

PIVOT

d

̂

p

=

̂

p

p

p

(

1

p

)

n

n large

N(0,1)

N(0,1)

Central

Limit Theorem

p unknown

p unknown

d

̂

p

=

̂

p

p

̂

p

(

1

− ̂

p

)

n

1

(18)

Estimator for the population variance

Estimator for the population variance

S

2

=

i=

1

n

(

X

i

− ̄

X

)

2

n

1

SAMPLE VARIANCE

Biased

estimator

E S

E

n

n

S

n

n

E S

n

n

( )

(

)

(

)

( )

2

2

2

2

1

1





 

E

(

S

n

2

)

=

n

1

n

σ

2

Plug-in estimator

S

n

2

=

i=

1

n

(

X

i

− ̄

X

)

2

n

Unbiased estimator

(19)

Inference on the population variance

Inference on the population variance

PIVOT

d

S

2

=

(

n

1

)

S

2

σ

2

NORMAL

POPULATION

Fisher Theorem

χ

n

2

1

(20)

What parameters are we interested in?

Mean

Proportion Variance



p



What parameters are we interested in?

Mean

Proportion Variance



p



What estimators to use?

Sample mean Sample proportion Sample variance



S



What estimators to use?

Sample mean Sample proportion Sample variance



X

ˆp

S



What probability distributions are relevant?

What probability distributions are relevant?

Chi-squared

Normal

Student's t

References

Related documents

Minnesota State Community and Technical College (MSCTC) generates an annual economic impact of $197 million from its operations in the West Central region.. Based on the West

The Workcover treatment plans are generated directly from the system, with the relevant practice and patient details stored in the system being automatically filled in. You then

The observation that PHC facilities and private pharmacies, which tended to offer the poorest services, were more commonly utilized as the first option for child care by women in

[r]

Whenever there is an Earth fault,the relay will pick up & the NO contact of the relay will connect Ground to Terminal No.5 The PCC will now display Ground Fault & will trip

Coverage Plus is an optional feature of priority listing based products – Enhanced Listing; Enhanced Listing Info Page; Web Link; Site Builder – which enables advertisers to appear

However, the ten-year prescriptive period for an action for reconveyance is not applicable where the complainant is in possession of the land to be reconveyed and the registered

| ANNUAL REPORT 2020 136,000 m 3 of water annually, contributing to the long term sustainability of water supply to Sungai Selangor, which is the main source of raw water to