Company Logo Company Logo
Sampling Design
Sampling Design
Lecture - 5 Lecture - 5Advanced Research Methods (ARM) Advanced Research Methods (ARM)
What is difference between
What is difference between
data and statistics?
Recall…
Recall…
Statistics is a tool for convertingStatistics is a tool for converting datadata intointo
information information:: Data Data Statistics Statistics Information Information
But where then does
But where then does datadata come from? How is it gathered?come from? How is it gathered?
How do we ensure its accurate? Is
How do we ensure its accurate? Is the data reliable? Is itthe data reliable? Is it
representative of the population from which it was
Sampling
Sampling
Sampling is that part of Sampling is that part of statistical practicestatistical practice
which is concerned with the selection of which is concerned with the selection of
individual observations intended to yield some individual observations intended to yield some knowledge about a population of concern,
knowledge about a population of concern, especially for the purposes of
especially for the purposes of statisticalstatistical inference.
…
…
Sampling
Sampling is the processis the process
of selecting a small number of elements
of selecting a small number of elements
from a larger defined target group
from a larger defined target group
of elements such that
of elements such that
the information gathered
the information gathered
from the small group will allow judgments
from the small group will allow judgments
to be made about the larger groups
What is a sample?
What is a sample?
A sample is a portion of the
A sample is a portion of the
elements of a population. A sample
elements of a population. A sample
is chosen to make inferences about
is chosen to make inferences about
the population by examining or
the population by examining or
measuring the elements in the
measuring the elements in the
sample.
Reasons for Sampling
Reasons for Sampling
Researchers rarely survey the population for two Researchers rarely survey the population for two reasons(Adér, Mellenbergh, & Hand, 2008):
reasons(Adér, Mellenbergh, & Hand, 2008): (1)The cost is too high and
(1)The cost is too high and
(2)The population is dynamic, i.e., the component (2)The population is dynamic, i.e., the component of population could change over time. E.g.
of population could change over time. E.g. patients in a hospital
Advantages of sampling:
Advantages of sampling:
(1) The cost is lower, (1) The cost is lower,
(2) Data collection is faster, and (2) Data collection is faster, and (3) It is possible
(3) It is possible to ensure homogeneity and toto ensure homogeneity and to improve the accuracy and quality of the data improve the accuracy and quality of the data because the data set is smaller.
Basics of Sampling Theory
Basics of Sampling Theory
Population Population Element Element Defined target Defined target population population Sampling unit Sampling unit Sampling frame Sampling frame
Selection of Elements
Selection of Elements
Population Population
Population Element Population Element Sampling Sampling Survey Survey CensusCensus
Definitions
Definitions
Population: The target population is the collection of Population: The target population is the collection of
elements or objects that possess the information sought elements or objects that possess the information sought by the researcher and about which inferences are to be by the researcher and about which inferences are to be
made.
made. The target The target population should population should be defined be defined in termsin terms of elements, sampling units, extent, and
of elements, sampling units, extent, and time.time.
An An elementelement is the object about which or from is the object about which or from whichwhich
the information is desired, e.g., the respondent. the information is desired, e.g., the respondent.
A A sampling unitsampling unit is an element, or a unit containingis an element, or a unit containing
the element, that is available for
the element, that is available for selection at someselection at some stage
stage of the of the sampling sampling process. process. E.g. orE.g. organizationganization
ExtentExtent refers to the geographical boundaries.refers to the geographical boundaries.
Sampling frame
Sampling frame
Sampling frame (synonyms: "sample frame",Sampling frame (synonyms: "sample frame",
"survey frame") is the actual set of
"survey frame") is the actual set of units fromunits from which a sample has been drawn: in the case of a which a sample has been drawn: in the case of a
simple random sample, all units from the simple random sample, all units from the sampling frame have an equal chance to be sampling frame have an equal chance to be drawn and to occur in the sample.
drawn and to occur in the sample.
In the ideal case, the sampling frame shouldIn the ideal case, the sampling frame should
coincide with the population of interest. coincide with the population of interest.
Example
Example
Consider, a survey aimed at establishing the number of potentialConsider, a survey aimed at establishing the number of potential customers for Easypaisa in the population of Islamabad City.
customers for Easypaisa in the population of Islamabad City. The research team has drawn 1000 numbers at random from a The research team has drawn 1000 numbers at random from a telephone directory for the city, made 200 calls each day from telephone directory for the city, made 200 calls each day from Monday to Friday from 8am to 5pm and asked some questions. Monday to Friday from 8am to 5pm and asked some questions.
In this example, populationIn this example, population of interest is all inhabitants of the of interest is all inhabitants of the city; thecity; the sampling frame
sampling frame includes only those dwellers who satisfy all includes only those dwellers who satisfy all the followingthe following conditions:
conditions:
has a telephone;has a telephone;
the telephone number is included in the directory;the telephone number is included in the directory;
likely to be at home from 8am to 5pm from Monday to Friday;likely to be at home from 8am to 5pm from Monday to Friday;
Sampling Plans…
Sampling Plans…
A A sampling plansampling plan is just a method or procedureis just a method or procedure
for specifying how a sample will be taken from a for specifying how a sample will be taken from a population.
What is a Good Sample?
What is a Good Sample?
Accurate: absence of bias Accurate: absence of bias
Is sample unbiased?
Types of Errors
Types of Errors
Sampling andSampling and
Sampling Error
Sampling Error
Sampling error is any type of bias that isSampling error is any type of bias that is
attributable to mistakes in either drawing a sample or attributable to mistakes in either drawing a sample or determining the sample size
determining the sample size
Sampling errors are caused by sampling design. ItSampling errors are caused by sampling design. It
includes: includes: (1)
(1) Selection errorSelection error: Incorrect selection probabilities: Incorrect selection probabilities are used.
are used. (2)
(2) Estimation errorEstimation error: Biased parameter estimate: Biased parameter estimate because of the elements in these samples.
E.g. Two samples of size 10 of 1,000 households. If E.g. Two samples of size 10 of 1,000 households. If
we happened to get the highest income level data
we happened to get the highest income level data
points in our first sample and all the lowest income
points in our first sample and all the lowest income
levels in the second, this delta is due to sampling
levels in the second, this delta is due to sampling
error.
error.
Increasing the sample sizeIncreasing the sample size will will reduce this type of reduce this type of
error.
Non-sampling errors
Non-sampling errors
Nonsampling errorsNonsampling errors are more serious and are due to mistakesare more serious and are due to mistakes
made in the acquisition of data or due to
made in the acquisition of data or due to the sample observationsthe sample observations being selected improperly. Non-sampling errors are caused by the being selected improperly. Non-sampling errors are caused by the
mistakes in data processing. It includes: mistakes in data processing. It includes: (1)
(1) OvercoverageOvercoverage: Inclusion of data from outside of the: Inclusion of data from outside of the population.
population. (2)
(2) UndercoverageUndercoverage: Sampling frame does not : Sampling frame does not include elements ininclude elements in the population.
the population. (3)
(3) Measurement errorMeasurement error: The : The respondent misunderstanrespondent misunderstand thed the question.
question. (4)
(4) Processing errorProcessing error: Mistakes in data coding.: Mistakes in data coding. (5)
Increasing the sample sizeIncreasing the sample size will not will not reduce this type of error.reduce this type of error.
Acquisition errors arise from the recording of incorrect Acquisition errors arise from the recording of incorrect
responses, due to:
responses, due to:
— incorrect measurements being taken because of faulty equipment,— incorrect measurements being taken because of faulty equipment,
— mistakes made during transcription from primary sources,— mistakes made during transcription from primary sources,
— inaccurate recording of data due — inaccurate recording of data due to misinterpretation of terms, orto misinterpretation of terms, or
Sampling Methods
Sampling Methods
Probability Probability sampling sampling Nonprobability Nonprobability sampling samplingSteps in Sampling Design
Steps in Sampling Design
What is the relevant population? What is the relevant population?
What are the parameters of interest? What are the parameters of interest?
What is the What is the sampling framesampling frame??
What is the type of sample? What is the type of sample?
What size sample is needed? What size sample is needed?
Steps
Steps
Define the Population Define the Population
Determine the Sampling Frame Determine the Sampling Frame Select Sampling Technique(s) Select Sampling Technique(s)
Determine the Sample Size Determine the Sample Size Execute the Sampling Process Execute the Sampling Process
Classification of Sampling
Classification of Sampling
Techniques
Techniques
Sampling Techniques Sampling Techniques Nonprobability Nonprobability Sampling Techniques Sampling Techniques Probability Probability Sampling Techniques Sampling Techniques Convenience Convenience Sampling Sampling Judgmental Judgmental Sampling Sampling Quota Quota Sampling Sampling Snowball Snowball Sampling Sampling Systematic Systematic Sampling Sampling Stratified Stratified Sampling Sampling Cluster Cluster Sampling Sampling Other Sampling Other Sampling Techniques Techniques Simple Random Simple Random Sampling SamplingCompany Logo Company Logo
Non-Probability
Non-Probability
Sampling Designs
Sampling Designs
Nonprobabil
Nonprobability
ity Sampling Methods
Sampling Methods
Convenience sampling relies
Convenience sampling relies
upon convenience and access
upon convenience and access
Judgment sampling relies upon belief
Judgment sampling relies upon belief
that participants fit characteristics
that participants fit characteristics
Quota sampling emphasizes representation
Quota sampling emphasizes representation
of specific characteristics
of specific characteristics
Snowball sampling relies upon respondent
Snowball sampling relies upon respondent
referrals
Nonprobability Sampling
Nonprobability Sampling
Reasons to use Reasons to use
Procedure satisfactorily meets the samplingProcedure satisfactorily meets the sampling
objectives objectives
Lower CostLower Cost
Limited TimeLimited Time
Not as much human error as selecting aNot as much human error as selecting a
completely random sample completely random sample
Nonprobability Sampling
Nonprobability Sampling
Convenience SamplingConvenience Sampling
Purposive Sampling Purposive Sampling
Judgment SamplingJudgment Sampling
Quota SamplingQuota Sampling
Convenience Sampling
Convenience Sampling
Convenience samplingConvenience sampling attempts to obtain a sample of attempts to obtain a sample of convenient
convenient elements. elements. Often, rOften, respondents espondents are are selectedselected because they happen to be in
because they happen to be in the right place at the rightthe right place at the right time.
time.
use of students, and members of social organizationsuse of students, and members of social organizations
mall intercept interviews without qualifying themall intercept interviews without qualifying the
respondents respondents
department stores using charge account listsdepartment stores using charge account lists
Judgmental Sampling
Judgmental Sampling
Judgmental samplingJudgmental sampling is a form of convenienceis a form of convenience sampling in which the population elements are
sampling in which the population elements are selectedselected based on the judgment of the researcher.
based on the judgment of the researcher.
test marketstest markets
purchase engineers selected in industrial purchase engineers selected in industrial marketingmarketing
research research
Quota Sampling
Quota Sampling
Quota sampling
Quota sampling may be viewed as two-stage restricted judgmentalmay be viewed as two-stage restricted judgmental sampling.
sampling.
The first stage consists of developing control categories, or quotas, of The first stage consists of developing control categories, or quotas, of
population elements. population elements.
In the second stage, sample elements are selected based onIn the second stage, sample elements are selected based on
convenience or judgment. convenience or judgment.
P
Pooppuullaattiioonn SSaammppllee
ccoommppoossiittiioonn ccoommppoossiittiioonn Control
Control C
Chhaarraacctteerriissttiicc PPeerrcceennttaaggee PPeerrcceennttaaggee NNuummbbeerr Sex Sex M Maallee 4488 4488 448800 F Feemmaallee 5522 5522 552200 _ _______ ________ ________ 110000 110000 11000000
Snowball Sampling
Snowball Sampling
InIn snowball samplingsnowball sampling, an initial group of respondents, an initial group of respondents is selected, usually at random.
is selected, usually at random.
After being interviewed, these respondents are asked After being interviewed, these respondents are asked
to identify others who belong to the target population to identify others who belong to the target population of interest.
of interest.
Subsequent respondents are selected based on theSubsequent respondents are selected based on the
referrals. referrals.
Company Logo Company Logo
Probability Sampling
Probability Sampling
Designs
Designs
Probability Sampling Designs
Probability Sampling Designs
Simple random sampling Simple random sampling
Systematic sampling Systematic sampling
Stratified sampling Stratified sampling
ProportionateProportionate
DisproportionateDisproportionate
Cluster samplingCluster sampling
Simple Random Sampling
Simple Random Sampling
Each element in the population has a Each element in the population has a known and equalknown and equal
probability of selection. probability of selection.
Each possible sample of a given size (n) has a known andEach possible sample of a given size (n) has a known and
equal probability of being the sample actually selected. equal probability of being the sample actually selected.
This implies that every element is This implies that every element is selected independently selected independently
of every other element. of every other element.
Systematic Sampling
Systematic Sampling
The sample is chosen by selecting a random starting point andThe sample is chosen by selecting a random starting point and
then picking every ith element in succession from the sampling then picking every ith element in succession from the sampling frame.
frame.
The sampling interval, i, is The sampling interval, i, is determined by dividing the populationdetermined by dividing the population
size N by the sample size n and rounding to the nearest integer. size N by the sample size n and rounding to the nearest integer.
When the ordering of the elements When the ordering of the elements is related to the is related to the characteristiccharacteristic
of interest, systematic sampling
of interest, systematic sampling increases the representativenessincreases the representativeness of the sample.
of the sample.
For example, there are 100,000 elements in the population and aFor example, there are 100,000 elements in the population and a
sample of 1,000
sample of 1,000 is desired. is desired. In this case In this case the sampling the sampling interval, i, iinterval, i, iss 100.
100. A A random random number number between 1 between 1 and 10and 100 is 0 is selected. selected. If, forIf, for example, this number is 23, the sample consists of elements 23 example, this number is 23, the sample consists of elements 23,, 123, 223, 323, 423, 523, and so on.
Stratified Sampling
Stratified Sampling
A two-step process in which the population is partitioned A two-step process in which the population is partitioned
into subpopulations, or strata. into subpopulations, or strata.
The strata should be mutually exclusive and collectively The strata should be mutually exclusive and collectively
exhaustive in that every population element should be exhaustive in that every population element should be assigned to one and only one stratum and no population assigned to one and only one stratum and no population elements should be omitted.
elements should be omitted.
Next, elements are selected from each stratum by aNext, elements are selected from each stratum by a
random procedure, usually SRS. random procedure, usually SRS.
A major objective of stratified sampling is to increase A major objective of stratified sampling is to increase
precision without increasing cost. precision without increasing cost.
Stratified Sampling
Stratified Sampling
The elements within a stratum should be as homogeneous asThe elements within a stratum should be as homogeneous as
possible, but the elements in different strata should be as possible, but the elements in different strata should be as heterogeneous as possible.
heterogeneous as possible.
Finally, the variables should decrease the cost of the stratificationFinally, the variables should decrease the cost of the stratification
process by being easy to measure and apply. process by being easy to measure and apply.
In proportionate stratified sampling, the size of the sample drawnIn proportionate stratified sampling, the size of the sample drawn
from each stratum is proportionate to the relative size of that from each stratum is proportionate to the relative size of that stratum in the total population.
stratum in the total population.
In disproportionate stratified sampling, the size of the sample fromIn disproportionate stratified sampling, the size of the sample from
each stratum is proportionate to the relative size of that stratum and each stratum is proportionate to the relative size of that stratum and to the standard deviation of the distribution of the characteristic of to the standard deviation of the distribution of the characteristic of interest among all the elements in that stratum.
Cluster Sampling
Cluster Sampling
The target population is first The target population is first divided into mutually exclusive anddivided into mutually exclusive and
collectively exhaustive subpopulations, or clusters. collectively exhaustive subpopulations, or clusters.
Then a random sample of clusters is selected, based on a Then a random sample of clusters is selected, based on a probability probability
sampling technique such as SRS. sampling technique such as SRS.
For each selected cluster, either all the elements are included in the sampleFor each selected cluster, either all the elements are included in the sample
(one-stage) or a sample
(one-stage) or a sample of elements is drawn probabilistically (two-stage).of elements is drawn probabilistically (two-stage).
Elements within a cluster should be as heterogeneous as possible, butElements within a cluster should be as heterogeneous as possible, but
clusters themselves should
clusters themselves should be as be as homogeneous as homogeneous as possible. possible. IdeallyIdeally, , eacheach cluster should be a small-scale representation of the population.
cluster should be a small-scale representation of the population.
InIn probability proportionate to sizeprobability proportionate to size samplingsampling, the clusters are, the clusters are
sampled with probability proportional to
sampled with probability proportional to size. size. In the In the second stage, thesecond stage, the probability of selecting a sampling unit
probability of selecting a sampling unit in a selected in a selected cluster varies inversely cluster varies inversely with the size of the cluster.
Types of Cluster Sampling
Types of Cluster Sampling
Fig. 11.3
Fig. 11.3 Cluster SamplingCluster Sampling
One-Stage One-Stage Sampling Sampling Multistage Multistage Sampling Sampling Two-Stage Two-Stage Sampling Sampling Simple Cluster Simple Cluster Sampling Sampling Probability Probability Proportionate Proportionate to Size Sampling to Size Sampling
Sample vs. Census
Sample vs. Census
Type of Study
Type of Study
1. Budget
1. Budget
Sample Sizes Used in Marketing Sample Sizes Used in Marketing
Research Studies Research Studies Table 11.2 Table 11.2
Type of Study
Type of Study
Problem identificat
Problem identificat
Factors to Consider in Sample Design
Factors to Consider in Sample Design
R
Reesseeaarrcch h oobbjjeeccttiivveess DDeeggrreee e oof f aaccccuurraaccyy
R
Reessoouurrcceess TTiimme e ffrraammee Knowledge of
Knowledge of target population
target population Research scopeResearch scope Statistical analysis needs
How many completed questionnaires do we needHow many completed questionnaires do we need
to have a
to have a representative sample?representative sample?
Generally the larger the better, but that takesGenerally the larger the better, but that takes
more time and money. more time and money.
Answer depends on: Answer depends on:
How different or dispersed the population is.How different or dispersed the population is.
Desired level of confidence.Desired level of confidence.
Desired degree of aDesired degree of accuracy.ccuracy.
Determining Sample Size
Common Methods:Common Methods:
Budget/time availableBudget/time available
Executive decisionExecutive decision
Statistical methodsStatistical methods
Historical data/guidelinesHistorical data/guidelines
Common
Common
Methods
Methods
for
for
Determining
Determining
Sample Size
Factors Affecting Sample Size for Probability
Factors Affecting Sample Size for Probability
Designs
Designs
Variability of the population characteristic Variability of the population characteristic
under investigation under investigation
Level of confidence desired in the estimateLevel of confidence desired in the estimate
Degree of precision desired in Degree of precision desired in estimating theestimating the
population characteristic population characteristic
n = [N
n = [N
22Z
Z
22]/[Ne
]/[Ne
22+ σ
+ σ
22Z
Z
22]
]
Where Where
e is the specified errore is the specified error
σ is the SD of the populationσ is the SD of the population
N is the populationN is the population
Z is the table value Z is the table value of Z-Table. For a 95%of Z-Table. For a 95%
Confidence Interval, value of Z is 1.96 Confidence Interval, value of Z is 1.96
For a simple sample size calculator, click
For a simple sample size calculator, click here:here: http://www.surveysystem.com/sscalc.htm
http://www.surveysystem.com/sscalc.htm
Probability Sampling and
Probability Sampling and
Sample Sizes
Company Logo
Company Logo
Research Design
Measurement
Measurement
Selecting observable empirical eventsSelecting observable empirical events
Using numbers or symbols Using numbers or symbols to represent aspectsto represent aspects
of the events of the events
Applying a Applying a mapping rulemapping rule to connect theto connect the
observation to the symbol observation to the symbol
What is Measured?
What is Measured?
Objects::Objects
Things of ordinary experienceThings of ordinary experience
Some things not concreteSome things not concrete
Characteristics of Data
Characteristics of Data
ClassificationClassification OrderOrder Distance (interval between numbers)Distance (interval between numbers)
Data Types
Data Types
O
Orrd
deerr IIn
ntteerrv
va
all O
Orriig
giin
n
N
Noommiinnaall nnoonnee nnoonnee nnoonnee O
Orrddiinnaall yyeess uunneeqquuaall nnoonnee IInntteerrvvaall yyeess eeqquuaal l oorr nnoonnee
unequal unequal R
Sources of Measurement
Sources of Measurement
Differences
Differences
RespondentRespondent Situational factorsSituational factors
Measurer or researcherMeasurer or researcher
Validity
Validity
Content ValidityContent Validity
Criterion-Related ValidityCriterion-Related Validity
PredictivePredictive
ConcurrentConcurrent
Reliability
Reliability
Stability Stability Test-retestTest-retest Equivalence Equivalence Parallel formsParallel forms
Internal Consistency Internal Consistency
Split-half Split-half
KR20KR20
Practicality
Practicality
Economy Economy ConvenienceConvenience Interpretability InterpretabilityCompany Logo Company Logo
MEASUREMENT
MEASUREMENT
SCALES
SCALES
What is Scaling?
What is Scaling?
Scaling Scaling is assigning numbers to indicants of theis assigning numbers to indicants of the
properties of objects properties of objects
Types of Response Scales
Types of Response Scales
Rating Scales Rating Scales
Ranking Scales Ranking Scales
Types of Rating Scales
Types of Rating Scales
Simple category Simple category
Multiple choice,Multiple choice,
single response single response
Multiple choice,Multiple choice,
multiple response multiple response
Likert scaleLikert scale
Semantic differentialSemantic differential
•
•
Numerical
Numerical
•
•
Multiple rating
Multiple rating
•
•
Fixed sum
Fixed sum
•
•
Stapel
Stapel
•
Rating Scale Errors to Avoid
Rating Scale Errors to Avoid
Leniency Leniency
Negative Leniency Negative Leniency
Positive Leniency Positive Leniency
Central Tendency Central Tendency
Types of Ranking Scales
Types of Ranking Scales
Paired-comparisonPaired-comparison
Forced RankingForced Ranking
Dimensions of a Scale
Dimensions of a Scale
UnidimensionalUnidimensional MultidimensionalMultidimensionalScale Design Techniques
Scale Design Techniques
Arbitrary scaling Arbitrary scaling
Consensus scalingConsensus scaling
Item Analysis scalingItem Analysis scaling
Cumulative scalingCumulative scaling
Sarndal, Swenson, and Wretman (1992), ModelSarndal, Swenson, and Wretman (1992), Model
Assisted Survey Sampling,
Assisted Survey Sampling, Springer-VerlagSpringer-Verlag
Fritz Scheuren (2005). "What is a Margin of Fritz Scheuren (2005). "What is a Margin of
Error?", Chapter 10, in "What is a
Error?", Chapter 10, in "What is a Survey?",Survey?", American Statistical Association,
Company Logo
Company Logo
Thank you for your
Thank you for your
kind attention
kind attention
Go forth and research….
Go forth and research….
….but be careful out there.