• No results found

Descriptive Analysis

N/A
N/A
Protected

Academic year: 2021

Share "Descriptive Analysis"

Copied!
43
0
0

Loading.... (view fulltext now)

Full text

(1)

Research Methods

William G. Zikmund

Basic Data Analysis: Descriptive

Statistics

Descriptive Analysis

• The transformation of raw data into a form

that will make them easy to understand and

interpret; rearranging, ordering, and

manipulating data to generate descriptive

information

(2)

Type of

Measurement

Nominal

Two

categories

More than

two categories

Frequency table

Proportion (percentage)

Frequency table

Category proportions

(percentages)

Mode

Type of

descriptive analysis

Type of

Measurement

Type of

descriptive analysis

Ordinal

Rank order

Median

(3)

Type of

Measurement

Type of

descriptive analysis

Interval

Arithmetic mean

Type of

Measurement

Type of

descriptive analysis

(4)

Health Economics Research Method 2003/2

Tabulation

• Tabulation - Orderly arrangement of data in

a table or other summary format

• Frequency table

• Percentages

Health Economics Research Method 2003/2

Frequency Table

• The arrangement of statistical data in a

row-and-column format that exhibits the count

of responses or observations for each

(5)

Measure of

Central

Measure of

Type of Scale

Tendency

Dispersion

Nominal

Mode

None

Ordinal

Median

Percentile

Interval or ratio

Mean

Standard

deviation

Central Tendency

Cross-Tabulation

• A technique for organizing data by groups,

categories, or classes, thus facilitating

comparisons; a joint frequency distribution

of observations on two or more sets of

variables

(6)

cross-Health Economics Research Method 2003/2

Cross-Tabulation

• Analyze data by groups or categories

• Compare differences

• Contingency table

• Percentage cross-tabulations

Health Economics Research Method 2003/2

Base

• The number of respondents or observations

(in a row or column) used as a basis for

computing percentages

(7)

Health Economics Research Method 2003/2

Elaboration and Refinement

• Moderator variable

– A third variable that, when introduced into an

analysis, alters or has a contingent effect on the

relationship between an independent variable

and a dependent variable.

– Spurious relationship

• An apparent relationship between two variables that

is not authentic.

Two

Two

rating

rating

scales

scales

4 quadrants

4 quadrants

two

two

-

-

dimensional

dimensional

table

table

Importance

Importance

-

(8)

Health Economics Research Method 2003/2

Data Transformation

• Data conversion

• Changing the original form of the data to a

new format

• More appropriate data analysis

• New variables

Data Transformation

Summative Score =

VAR1 + VAR2 + VAR 3

(9)

Collapsing a Five-Point Scale

• Strongly Agree

• Agree

• Neither Agree nor

Disagree

• Disagree

• Strongly Disagree

• Strongly Agree/Agree

• Neither Agree nor

Disagree

• Disagree/Strongly

Disagree

Health Economics Research Method 2003/2

Index Numbers

• Score or observation recalibrated to indicate

how it relates to a base number

(10)

Health Economics Research Method 2003/2

Calculating Rank Order

• Ordinal data

• Brand preferences

Health Economics Research Method 2003/2

Tables

• Bannerheads for columns

• Studheads for rows

(11)

Health Economics Research Method 2003/2

Charts and Graphs

• Pie charts

• Line graphs

• Bar charts

– Vertical

– Horizontal

(12)

Health Economics Research Method 2003/2

Line Graph

0

10

20

30

40

50

60

70

80

90

1st Qtr

2nd Qtr

3rd Qtr

4th Qtr

East

West

North

Health Economics Research Method 2003/2

(13)

643 Networking

213 print ad

179 Online recruitment site

112 Placement firm

18 Temporary agency

How did you find your last job?

700

600

500

400

300

200

100

0

Networking

print ad

Online recruitment site

Placement firm

Temporary agency

55.2 %

18.3 %

15.4 %

9.6 %

1.5 %

WebSurveyor Bar Chart

Computer Programs

• SPSS

• SAS

• SYSTAT

• Microsoft Excel

• WebSurveyor

(14)

Microsoft Excel -Data Analysis

The

Paste Function

Provides

Numerous Statistical Operations

(15)

Computer Programs

• Box and whisker plots

• Interquartile range - midspread

• Outlier

(16)

Health Economics Research Method 2003/2

Interpretation

• The process of making pertinent inferences

and drawing conclusions

• concerning the meaning and implications of

a research investigation

Research Methods

William G. Zikmund

(17)

Health Economics Research Method 2003/2

Univariate Statistics

• Test of statistical significance

• Hypothesis testing one variable at a time

Hypothesis

• Unproven proposition

• Supposition that tentatively explains certain

facts or phenomena

(18)

Health Economics Research Method 2003/2

Hypothesis

• An unproven proposition or supposition that

tentatively explains certain facts or

phenomena

– Null hypothesis

– Alternative hypothesis

Health Economics Research Method 2003/2

Null Hypothesis

• Statement about the status quo

• No difference

(19)

Health Economics Research Method 2003/2

Alternative Hypothesis

• Statement that indicates the opposite of the

null hypothesis

Significance Level

• Critical probability in choosing between the

null hypothesis and the alternative

(20)

Health Economics Research Method 2003/2

Significance Level

• Critical Probability

• Confidence Level

• Alpha

• Probability Level selected is typically .05 or

.01

• Too low to warrant support for the null

hypothesis

0

.

3

:

µ

=

o

H

The null hypothesis that the mean is

equal to 3.0:

(21)

0

.

3

:

1

µ

H

The alternative hypothesis that the

mean does not equal to 3.0:

Health Economics Research Method 2003/2

(22)

µ=3.0

x

α=.025

α=.025

Health Economics Research Method 2003/2

A Sampling Distribution

LOWER

LIMIT

UPPER

LIMIT

µ=3.0

Health Economics Research Method 2003/2

(23)

Critical values of µ

µµ

µ

Critical value - upper limit

n

S

Z

ZS

X

+

+

=

µ

or

µ

+

=

225

5

.

1

96

.

1

0

.

3

Health Economics Research Method 2003/2

( )

0

.

1

96

.

1

0

.

3

+

=

196

.

0

.

3

+

=

196

.

3

=

Critical values of µ

µµ

µ

(24)

Critical value - lower limit

n

S

Z

ZS

X

or

--

µ

µ

=





=

225

5

.

1

96

.

1

-0

.

3

Health Economics Research Method 2003/2

Critical values of µ

µµ

µ

( )

0

.

1

96

.

1

0

.

3

=

196

.

0

.

3

=

804

.

2

=

Health Economics Research Method 2003/2

(25)

Region of Rejection

LOWER

LIMIT

UPPER

LIMIT

µ=3.0

Health Economics Research Method 2003/2

(26)

Accept null

Reject null

Null is true

Null is false

Correct

Correct

-

-no error

no error

Type I

Type I

error

error

Type II

Type II

error

error

Correct

Correct

-

-no error

no error

Health Economics Research Method 2003/2

Type I and Type II Errors

Type I and Type II Errors

in Hypothesis Testing

State of Null Hypothesis

Decision

in the Population

Accept Ho

Reject Ho

Ho is true

Correct--no error

Type I error

Ho is false

Type II error

Correct--no error

(27)

Calculating Z

obs

x

s

x

z

obs

Health Economics Research Method 2003/2

obs

S

X

Z

Alternate Way of Testing the

Hypothesis

(28)

X

obs

S

Z

=

3

.

78

µ

1

.

0

.

3

78

.

3

=

1

.

78

.

0

=

=

7

.

8

Health Economics Research Method 2003/2

Alternate Way of Testing the

Hypothesis

Health Economics Research Method 2003/2

Choosing the Appropriate

Statistical Technique

• Type of question to be answered

• Number of variables

– Univariate

– Bivariate

– Multivariate

(29)

PARAMETRIC

STATISTICS

NONPARAMETRIC

STATISTICS

Health Economics Research Method 2003/2

t-Distribution

• Symmetrical, bell-shaped distribution

• Mean of zero and a unit standard deviation

• Shape influenced by degrees of freedom

(30)

Health Economics Research Method 2003/2

Degrees of Freedom

• Abbreviated

d.f.

• Number of observations

• Number of constraints

or

X

l

c

S

t

X

±

.

.

=

µ

n

S

t

X

c

.

l

.

limit

Upper

=

+

n

S

t

X

c

.

l

.

limit

Lower

=

Health Economics Research Method 2003/2

Confidence Interval Estimate

Using the t-distribution

(31)

= population mean

= sample mean

= critical value of t at a specified confidence

level

= standard error of the mean

= sample standard deviation

= sample size

.

.l

c

t

µ

X

X

S

S

n

Health Economics Research Method 2003/2

Confidence Interval Estimate Using

the t-distribution

x

cl

s

t

X

±

=

µ

66

.

2

7

.

3

=

=

S

X

Confidence Interval Estimate Using

the t-distribution

(32)

07

.

5

)

17

66

.

2

(

12

.

2

7

.

3

limit

upper

=

+

=

Health Economics Research Method 2003/2

33

.

2

)

17

66

.

2

(

12

.

2

7

.

3

limit

Lower

=

=

(33)

Health Economics Research Method 2003/2

Hypothesis Test Using the

t-Distribution

Suppose that a production manager believes

the average number of defective assemblies

each day to be 20. The factory records the

number of defective assemblies for each of the

25 days it was opened in a given month. The

Univariate Hypothesis Test

Utilizing the t-Distribution

(34)

20

:

20

:

1

0

=

µ

µ

H

H

Health Economics Research Method 2003/2

n

S

S

X

(35)

The researcher desired a 95 percent

confidence, and the significance level becomes

.05.The researcher must then find the upper

and lower limits of the confidence interval to

determine the region of rejection. Thus, the

value of t is needed. For 24 degrees of

freedom (n-1, 25-1), the t-value is 2.064.

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Utilizing the t-Distribution

:

limit

Lower

25

/

5

064

.

2

20

.

.

=

t

c

l

S

X

µ

( )

1

064

.

2

20 −

=

(36)

:

limit

Upper

25

/

5

064

.

2

20

.

.

=

+

+

t

c

l

S

X

µ

( )

1

064

.

2

20+

=

064

.

20

=

Health Economics Research Method 2003/2

X

obs

S

X

t

=

µ

1

20

22 −

=

1

2

=

2

=

Health Economics Research Method 2003/2

Univariate Hypothesis Test

t-Test

(37)

Health Economics Research Method 2003/2

Testing a Hypothesis about a

Distribution

• Chi-Square test

• Test for significance in the analysis of

frequency distributions

• Compare observed frequencies with

expected frequencies

• “Goodness of Fit”

i

i

i

E

E

O

x

Chi-Square Test

(38)

x

² = chi-square statistics

O

i

= observed frequency in the i

th

cell

E

i

= expected frequency on the i

th

cell

Health Economics Research Method 2003/2

Chi-Square Test

j

i

ij

Chi-Square Test

Estimation for Expected Number

for Each Cell

(39)

Chi-Square Test

Estimation for Expected Number

for Each Cell

R

i

= total observed frequency in the i

th

row

C

j

= total observed frequency in the j

th

column

n

= sample size

Health Economics Research Method 2003/2

(

) (

)

2

2

2

2

1

2

1

1

2

E

E

O

E

E

O

X

=

+

Univariate Hypothesis Test

Chi-square Example

(40)

(

) (

)

50

50

40

50

50

60

2

2

2

=

+

X

4

=

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Chi-square Example

Hypothesis Test of a Proportion

π is the population proportion

p is the sample proportion

π is estimated with p

(41)

5

.

:

H

5

.

:

H

1

0

¹

π

=

π

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion

( )( )

100

4

.

0

6

.

0

=

p

S

100

24

.

=

=

=

(42)

p

S

p

Zobs

=

π

04899

.

5

.

6

. −

=

04899

.

1

.

=

=

2

.

04

Health Economics Research Method 2003/2

0115

.

S

p

=

000133

.

S

p

=

1200

16

.

S

p

=

1200

)

8

)(.

2

(.

S

p

=

n

pq

S

p

=

20

.

p =

200

,

1

n =

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion:

Another Example

(43)

0115

.

S

p

=

000133

.

S

p

=

1200

16

.

S

p

=

1200

)

8

)(.

2

(.

S

p

=

n

pq

S

p

=

20

.

p =

200

,

1

n =

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion:

Another Example

level.

.05

the

at

rejected

be

should

hypothesis

null

the

so

1.96,

exceeds

value

Z

The

348

.

4

Z

0115

.

05

.

Z

0115

.

15

.

20

.

Z

S

p

Z

p

=

=

=

π

=

Hypothesis Test of a Proportion:

Another Example

References

Related documents

We want to see how much a deictic gesture such a pointing to text can help the child’s reading while listening (RWL) proficiency. The effect of pointing on the child’s

By enabling the Connector and using only one Cassandra node, the output step took 160.78 seconds, while increasing the number of Cassandra nodes reduced the response time to 65

All West TV can be an effective way to keep customers occupied and happy while waiting for a service to be completed at a vehicle repair shop. or

Support workers (SWs) employe d and working in health care settings SW Undertake Care Certificate programme (two taught days plus access to other learning

Free-piston engines are under investigation by a number of research groups due to potential fuel efficiency and exhaust emissions advantages over conventional technology. The

Conclusions: The performance of National Health Emergency Response Command Center can be improved by learning from the good practice of the selected EOCs, including setting

Moreover, auditors are required by ISA 240 The auditor’s responsibilities relating to fraud in an audit of financial statements to identify and assess the risks of material

Methods: We registered the following data for 106 50–70-year-old men: age, weight, blood lipid, including total cholesterol, low density lipoprotein cholesterol, high