• No results found

Unit 1 Notes

N/A
N/A
Protected

Academic year: 2019

Share "Unit 1 Notes"

Copied!
31
0
0

Loading.... (view fulltext now)

Full text

(1)

Chapter 1

The Role of Statistics

and the Data Analysis

(2)

What is statistics?

• the science of collecting,

analyzing, and drawing

(3)

Why should one study

statistics?

1. To be informed . . .

a) Extract information from tables, charts and graphs

b) Follow numerical arguments

c) Understand the basics of how data should be gathered, summarized, and analyzed to draw statistical conclusions

Can dogs help

patients with heart failure by reducing

stress and anxiety?

When people take a vacation do they

(4)

Why should one study

statistics? (continued)

2. To make informed judgments

3. To evaluate decisions that affect your life

If you choose a particular major, what are your chances of finding a job when

you graduate?

Many companies now require drug

screening as a condition of employment. With these screening tests there is a risk of a false-positive reading. Is the

(5)

What is variability?

Suppose you went into a convenience store to purchase a soft drink. Does every can on the shelf contain exactly 12 ounces?

NO – there may be a little more or less in the various cans due to the variability

that is inherent in the filling process.

In fact, variability is almost universal!

(6)

If the Shoe Fits ...

The two histograms to the right display the distribution of heights of gymnasts and the

distribution of heights of female basketball players. Which is

which? Why?

Heights – Figure A

(7)

If the Shoe Fits ...

Suppose you found a pair of size 6 shoes left outside the locker room. Which team would you go to first to find the owner of the shoes? Why?

Suppose a tall woman (5 ft 11 in) tells you see is looking for her sister who is

(8)

The Data Analysis Process

1. Understand the nature of the problem

2. Decide what to measure and how to measure it

3. Collect data

4. Summarize data and perform preliminary analysis

5. Perform formal analysis

6. Interpret results

It is important to have a clear direction before gathering data.

It is important to carefully define the variables to be studied and to develop

appropriate methods for determining their values.

It is important to understand how data is collected because the type of

analysis that is appropriate depends

on how the data was collected!

This initial analysis provides insight into important characteristics of the

data.

It is important to select and apply the appropriate inferential statistical

methods

This step often leads to the

(9)

Suppose we wanted to know the

average GPA of high school

graduates in the nation this year.

We could collect data from all

high schools in the nation.

(10)

Population

• The entire collection of

individuals or objects about which

information is desired

• A

census

is performed to gather

about the entire population

What do you call it when you

collect data about the entire

(11)

GPA Continued:

Suppose we wanted to know the

average GPA of high school

graduates in the nation this year.

We could collect data from all

high schools in the nation.

Why might we not want to use a census here?

(12)

Sample

• A subset of the population, selected for study in some prescribed manner

What would a sample of all high school graduates across the nation look like?

(13)

GPA Continued:

Suppose we wanted to know the

average GPA of high school

graduates in the nation this year.

We could collect data from a sample of high schools in the nation.

(14)

Descriptive statistics

• the methods of organizing & summarizing data

• Create a graph

If the sample of high school GPAs contained

1,000 numbers, how could the data be organized or summarized?

(15)

GPA Continued:

Suppose we wanted to know the

average GPA of high school graduates in the nation this year.

We could collect data from a sample of high schools in the nation.Could we use the data from our

(16)

Inferential statistics

• involves making generalizations from a sample to a population

Based on the sample, if the average GPA for high

school graduates was 3.0, what generalization could be made?

The average national GPA for this year’s high school graduate is approximately 3.0.

Could someone claim that the average GPA for graduates in your local school district is 3.0?

No. Generalizations based on the results of a sample can only be made back to the population from which the sample came from.

(17)

Variable

• any characteristic whose value may change from one individual to

another

• Suppose we wanted to know the average GPA of high school

graduates in the nation this year. Define the variable of interest.

The variable of interest is the GPA of high school graduates

Is this a variable . . .

The number of wrecks per week at the intersection outside

(18)

Data

• The values for a variable from individual observations

For this variable . . .

The number of wrecks per week at the intersection outside . . . What could observations be?

(19)

Two types of variables

categorical

numerical

(20)

Categorical variables

• Qualitative

• Identifies basic differentiating characteristics of the population

(21)

Numerical variables

• quantitative

• observations or measurements take on numerical values

• makes sense to average these values

• two types - discrete & continuous

(22)

Discrete (numerical)

• Isolated points along a number line

(23)

Continuous (numerical)

• Variable that can be any value in a given interval

(24)

Identify the following variables:

1. the color of cars in the teacher’s lot

2. the number of calculators owned by

students at your school

3. the zip code of an individual

4. the amount of time it takes students to

drive to school

5. the appraised value of homes in your city

Categorical

Categorical

discrete numerical

Discrete numerical

Continuous numerical

(25)

Classifying variables by the

number of variables in a data set

Suppose that the PE coach records the

height of each student in his class.

Univariate - data that describes a single characteristic of the population

This is an example of a

(26)

Classifying variables by the

number of variables in a data set

Suppose that the PE coach records the

height and weight of each student in his

class.

Bivariate - data that describes two characteristics of the population

This is an example of a

(27)

Classifying variables by the

number of variables in a data set

Suppose that the PE coach records the

height, weight, number of sit-ups, and number of push-ups for each student in

his class.

Multivariate - data that describes more than two characteristics (beyond the scope of this course)

This is an example of a

(28)

Bar Chart

When to Use Categorical data

How to construct

– Draw a horizontal line; write the categories or labels below the line at regularly spaced

intervals

– Draw a vertical line; label the scale using frequency or relative frequency

– Place equal-width rectangular bars above

(29)

Bar Chart (continued)

What to Look For

Frequently or infrequently occurring categories

Collect the following data and then display the data in a bar chart:

What is your favorite ice cream flavor?

(30)

Dotplot

When to Use Small numerical data sets

How to construct

– Draw a horizontal line and mark it with an appropriate numerical scale

(31)

Dotplot (continued)

What to Look For

– The representative or typical value

– The extent to which the data values spread out

– The nature of the distribution along the number line – The presence of unusual values

Collect the following data and then display the data in a dotplot:

References

Related documents

Start assigning numbers for each carbon in the parent chain beginning at the terminal carbon nearest the principal functional group or the first branch point (in alkanes and

En este trabajo se ha llevado a cabo una revisión y actualización de diferentes aspectos relacionados con la calidad del aceite de oliva virgen, se han estudiado las

We examined the implications of worker sorting for the patterns of domestic production and international trade as well as for the demand for protection, income distribution and

1/4" Soaker hose dripline (aka Drip emitter tubing) is attached directly to ½” solid poly tubing supply line with.. 1/4"

Rain Bird SDI specified under turf grass should be designed in a grid pattern, with supply and flush manifolds so that the individual drip emitters will be set out in a pattern

8 Each application must: (1) identify the qualified borrower expected to own the project, 9 (2) if any of the bonds are expected to be issued as pooled financing bonds, 10

Yet the availability of work as described by the precedence relationships within and between phases is an important constraint on project performance (Rosenau and Moran, 1993; Clark

Attfield’s (1999) review of material culture literature from the late 1990s, for example, traces the interdisciplinary implications of a material culture approach to