Sampling - PowerPoint PPT Presentation

1 / 31

About This Presentation

Title:

Sampling

Description:

it can be troublesome if there are patterns to the data (periodicities) ... One-step procedures can introduce error if clusters are not homogenous. ... – PowerPoint PPT presentation

Number of Views:24

Avg rating:3.0/5.0

Slides: 32

Provided by: John3

Category:

more less

Transcript and Presenter's Notes

Title: Sampling

1
Sampling

Dr. John T. Drea
Professor of Marketing
MKTG 329 - Western Illinois University

2
Basic Sampling Terms

Population
The entire group under study as specified by the
objectives of the research project.
It is the groups to which we want to generalize
the findings.
Sample
A subset of the population that should represent
the entire group.
Census
Different than a sample - it is an accounting of
an entire population. The U.S. Census is an
example - everyone is accounted for.

3
Basic Sampling Terms

Sample Unit
It is the basic level of investigation.
It might be an individual, a household, a
business, etc.
All of the sample units together comprise the
sample.
Sample Frame
A master list of all the sample units in the
population
Sample Frame Error
The extent to which the sample frame misses
elements of the population

4
Example Surveying WIU Students

The sample unit could be
individual students, or
student addresses

Sample 600
Census 13,000
Population 13,000

The sample frame could be
roster of all students currently enrolled at WIU,
or
roster of all current addresses, or
listing of all residence hall rooms.

Non-sampled students 12,400
5
Example Surveying WIU Students
Sample 600
Census 13,000
Question What are the advantages and
disadvantages of doing a survey of this
population as opposed to doing a census?
Population 13,000
Non-sampled students 12,400
6
Example Surveying Other Populations
Question To get the sample frame for WIU
students, we would go to the WIU registrar and
request a roster for the current semester. But
if our research project was to survey consumers
who has purchased a particular brand of
antiperspirant, where would we get that sample
frame?
Population 13,000
Sample Frame - a master list of all the sample
units in the population
7
Two Basic Sampling Methodologies
Probability Each member of the population has an
equal and known probability of being included in
the sample Nonprobability The probability of
selecting members of the population is
unknown/unequal.
8
Probability Samples Simple Random Sampling
Every member of the population has an equal and
known probability of being included in the
sample. Where... P (sample size/population
size)

Blind Draw Method (like drawing numbers out of a
hat)
Table of Random Numbers (assign all units a
number, choose a random starting point in the
table and proceed)

Simple random sampling requires unique
designations for each population member.
9
Probability Samples Systematic Sampling
Uses a skip interval typically drawn from a
printed listing, where every nth sample unit is
included in the sample Skip Interval (pop.
size/sample size)
Systematic Sampling Steps 1. Get the listing
(sample frame) 2. Compute skip interval 3. Use
random numbers to identify starting point for
page, column, and position in column. 4. Apply
skip interval 5. Treat the list as circular.
10
Probability Samples Systematic Sampling
Uses a skip interval typically drawn from a
printed listing, where every nth sample unit is
included in the sample Skip Interval (pop.
size/sample size)

Systematic sampling is efficient...
it doesnt require a listing of the population
it only requires the use of random numbers to
get started
But be careful
it requires the population be known
it can be troublesome if there are patterns to
the data (periodicities)

11
Probability Samples Cluster Sampling
Divides the total population into subgroups, then
one of more of these subgroups is randomly
selected to represent the total population.
Cluster Sampling Steps 1. Determine the area to
be surveyed/identify subgroups (clusters) 2.
Choose a one-step (performing a census on one
random cluster) or two-step procedure (randomly
select multiple clusters, then randomly select
sample units from the cluster) 3. Randomly
select the subgroup(s) 4. For a two-step
procedure, identify clusters and identify random
starting point in each cluster (similar to
systematic sampling)
12
Probability Samples Cluster Sampling
Divides the total population into subgroups, then
one of more of these subgroups is randomly
selected to represent the total population.

Cluster sampling makes sense when the population
is relatively homogenous (each cluster is likely
to also be homogenous.)
More efficient than systematic sampling.
One-step procedures can introduce error if
clusters are not homogenous.

13
Probability Samples Stratified Sampling
Separates the population into different subgroups
(strata,) then samples all of these subgroups. It
is used when the population is not normally
distributed.
Involves the calculation of a weighted mean when
the strata sample sizes are disproportionate to
the population MeanPop. (meanA)(proportionA)
(meanB)(proportionB) ... (meani)(proportioni)
14
Probability Samples Stratified Sampling
Separates the population into different subgroups
(strata,) then samples all of these subgroups. It
is used when the population is not normally
distributed.
Stratified Sampling Steps 1. Confirm that the
population is not normally distributed. 2.
Divide the population into strata. 3. Select a
probability sample from each strata. 4. If strata
sizes are disproportionate to strata sizes in the
population, a weighted mean approach is required.
15
Nonprobability Samples Convenience Sampling
A sample drawn at the convenience of the
researcher. Typically done in high-traffic
areas. Mall-intercepts are an example.

Its faster and easier to generate large sample
sizes with convenience sampling, but it raises
potential problems
Does the sample represent the population?
Likely to miss certain segments of the
population who arent in the high traffic area.

16
Nonprobability Samples Judgment Sampling
An expert decides which sample units will be
included in the sample. Example If I selected
Intro. to Psychology students at WIU to represent
all WIU Fresh/Soph students.

Judgment sampling is common in focus group
research for selecting participants.
Judgment sampling is dependent upon the judgment
of the individual selecting the sample - how will
you know if that judgment was incorrect?

17
Nonprobability Samples Referral (snowball)
Sampling
Respondents provide the names of other potential
respondents
Respondent 1
A
B
C
D
E
H
I
L
M
G
F
J
K
N
O
P

Useful when there is a limited sample frame and
respondents might be otherwise difficult to
identify.
Whats the connection between referral sampling
and the incidence rate?

18
Nonprobability Samples Quota Sampling
Establishes a specific quota for the types of
individuals to be contacted. Ex If we know the
population is 50 female, 50 male, 70
Caucasian, 14 African-American, 10
Hispanic-American, 5 Asian-American, and 1
other, then we select a sample to fit those
parameters (quotas)

Useful when we need certain number of subjects
in each group for analysis purposes.
Can be subjective.

19
Developing a Sample Plan
20
Some comments on the sampling plan

Obtaining a list of the population is not always
easy
Examine the fit between the sample frame and the
population
Who is likely to be missing, and is it
systematic?
Oversampling and resampling are frequently
necessary.
How many surveys do you need to mail to get 300
responses?
Oversampling might select 1,500 names to get 300
responses.
Resampling might involve an initial sample of 300
names that elicited only 60 responses, so we draw
another 300 names from the sample frame.
Validate the sample
How does the sample compare w/ known data about
the population?

21
Determining Sample Size

Two appropriate approaches
Statistical analysis approach
For some tests, you may need to have a minimum
number of subjects per cell
Confidence interval approach
The other approaches discussed by Burns Bush
are really more considerations in determining a
sample size (e.g., judgment, cost)
What is the acceptable level of sampling error?
Consider the implications/costs of mistakes

22
Sampling and Non-Sampling Errors

Sampling Error the difference between the
observed value of a measure and the long-run
average of the observed values in repetitions of
the measurement.
The difference between the sample mean and the
mean of the population (as estimated by multiple
means of the population by the same instrument)
A key to reducing sampling error is increasing
sample size.
Non-Sampling Errors types of errors not related
to the sample (i.e., all other samples)
Sampling errors decrease as sample sizes
increase, but non-sampling errors do not.

23
Basic statistics underlying sample size
determination

Variability the amount of dissimilarity in
respondents answers to a particular question.
(ex quiz, exam scores)
Standard deviation an approximation of the
average absolute distance away from the mean for
all respondents to a particular question.
Standard deviation is needed to determine a
confidence interval.
Confidence interval a range whose endpoints
define a certain percentage of responses to a
question.
1.96 x standard deviation /- mean 95 of the
responses to a given question

Example If we ask a question scaled 1-7, the
standard deviation is 0.8 and the mean is 4.5, we
know that 1.96(0.8) 1.568 4.5-1.568
2.932 4.51.568 6.068 Thus, 95 of the
responses will be between 2.932 and 6.068
24
Basic statistics underlying sample size
determination

Central Limit Theorem As sample size, n,
increases, the distribution of the mean, X, of a
random sample taken from most populations
approaches a normal distribution.
Sampling distribution If you took repeated
samples from the same population, the sampling
distribution is the distribution of these sample
means.
Standard error How far from the true population
value a typical sample result is expect to fall.
The standard error of the mean is the standard
deviation of the population divided by the square
root of the sample size.

What happens is we increase the sample
size from 100 to 300? to 500?
25
Estimating sample size

To estimate a sample size, a researcher must
estimate the standard deviation of the population
(a good rule of thumb is 1/6th of the range)
make a judgment about allowable amounts of error
determine a confidence interval
Once these are known, the formula for calculating
sample size is

where... Z standardized value that corresponds
to the confidence level (95 is 1.96) S sample
standard deviation E acceptable magnitude of
error
Z2S2 E2
n
26
Estimating sample size

Suppose a researcher studying annual expenditures
on lipstick wishes to have a 95 confidence
interval (Z1.96) and a range of error (E) of
less than 2, and an estimate of the standard
deviation is 29.

If we change the range of acceptable error to 4,
sample size falls to 202
(1.962)(292)/42 (3.8416)(841)/16
201.9241
If the range of error is 5, n 129.231
Thus, if we do the research with a and determine
that the mean is 110 w/ a standard deviation of
29, we can be 95 confident that the mean for
the population is between 108-112 ( if n808),
between 106-114 (if n202), or between
105-115 (if n129)
Partial source Zikmund (1999), Essentials of
Marketing Research
27
Estimating sample size

Suppose you wanted to estimate the same size for
a survey which contains the following question
What is your overall attitude towards Hospital X?
Very Good 7 6 5 4 3 2 1 Very Poor
The range of acceptable error is 0.1 points, the
confidence level is 95, and the estimated
standard deviation is 1/6 of the range.

Z2S2 E2
(1.962)(12)/0.12 a sample size of 384
n
If you increase the acceptable error to 0.2, the
sample size drops to n 96!
28
Sample size determination when a proportion is
present
Sp estimate of the std. error of the
proportion p proportion of successes q (1 -
p), or the proportion of failures
pq n
Sp
Suppose that 20 of a sample of 1,200 recall
seeing an ad.
(0.2)(0.8) 1200
Thus, the population proportion who saw the ad is
between 17.8 and 22.2, w/ 95 confidence.
0.0115
Sp
Confidence interval p ZclSp .2
(1.96)(0.0115) .2 .022
29
Sample size determination when a proportion is
present
To determine the sample size for a proportion, we
need to know or estimate the following Z2cl
square of the confidence level in standard error
units (i.e., typically 1.962, or
3.8416) p estimated proportions of successes q
(1 - p), or the proportion of failures E2
square of the maximum allowance for error
We insert this information into the following
formula
n Z2cl(pq) E2
30
Sample size determination when a proportion is
present
Example We estimate that 60 of respondents will
prefer to stay in town X than to drive 20 miles
to see a physician, with a confidence level of
95, and the allowable error is /- 4.
n Z2clpq E2
(1.96)2(.6)(.4) 0.042
(3.8416)(0.24)/0.0016 n 576
If we assume a 70/30 split and if we increase the
maximum allowable error to 5, what would be n?
31
Overall Sample size determination when a
proportion is present

When the split is hypothesized to be 70/30 (95
CI)
1 7,939
2 2,009
3 895
5 322

When the split is hypothesized to be 85/15 (95
CI)
1 4,850
2 1,222
3 544
5 306

Small population sizes typically require a
slightly smaller sample size If population
10,000, the 70/30 split sample sizes would be
14,465 21,678 3823 and 5313

Write a Comment

User Comments (0)