Survey sampling

Short description: Statistical selection process

In statistics, survey sampling is the process of selecting a sample of elements from a target population in order to conduct a survey. The term "survey" may refer to many different types or techniques of observation; in survey sampling it most often involves a questionnaire used to measure the characteristics and/or attitudes of people. A sample is a group or section of a population from which information is to be obtained. The different ways of contacting members of a sample once they have been selected are the subject of survey data collection. The purpose of sampling is to reduce the cost and/or the amount of work that it would take to survey the entire target population. A survey that measures the entire target population is called a census.

Survey samples can be broadly divided into two types: probability samples and non-probability samples. Probability-based samples implement a sampling plan with specified probabilities (perhaps adapted probabilities specified by an adaptive procedure). Probability-based sampling allows design-based inference about the target population: the inferences are based on a known objective probability distribution that was specified in the study protocol. Inferences from probability-based surveys may nevertheless suffer from many types of bias.
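As an illustrative sketch of design-based inference (not taken from the cited sources), the following assumes a Poisson sampling design, in which each unit is included independently with its own specified probability, and uses the Horvitz-Thompson estimator of the population total. The population values and inclusion probabilities are invented for the example. Because the design is fully specified, the expectation of the estimator can be computed exactly by enumerating every possible sample.

```python
from itertools import product

# Hypothetical population values and specified inclusion probabilities
# (both invented for this sketch).
y = [10.0, 20.0, 30.0, 40.0]
pi = [0.2, 0.4, 0.6, 0.8]


def horvitz_thompson_total(sample_idx, y, pi):
    """Estimate the population total by weighting each sampled value by 1/pi."""
    return sum(y[i] / pi[i] for i in sample_idx)


def expected_ht_total(y, pi):
    """Average the estimator over every possible Poisson sample, weighting
    each sample by the probability that the design assigns to it."""
    expectation = 0.0
    for included in product([0, 1], repeat=len(y)):
        prob = 1.0
        for flag, p in zip(included, pi):
            prob *= p if flag else (1.0 - p)
        sample_idx = [i for i, flag in enumerate(included) if flag]
        expectation += prob * horvitz_thompson_total(sample_idx, y, pi)
    return expectation


true_total = sum(y)
design_expectation = expected_ht_total(y, pi)
print(true_total, design_expectation)
```

The two printed numbers agree up to floating-point rounding, which is what unbiasedness with respect to the sampling design means here. No assumption about how the values of y were generated is needed.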

Surveys that are not based on probability sampling have greater difficulty measuring their bias or sampling error.^[1] Surveys based on non-probability samples often fail to represent the people in the target population.^[2]

In academic and government survey research, probability sampling is a standard procedure. In the United States, the Office of Management and Budget's "List of Standards for Statistical Surveys" states that federally funded surveys must be performed by:

selecting samples using generally accepted statistical methods (e.g., probabilistic methods that can provide estimates of sampling error). Any use of nonprobability sampling methods (e.g., cut-off or model-based samples) must be justified statistically and be able to measure estimation error.^[3]

Random sampling and design-based inference are supplemented by other statistical methods, such as model-assisted sampling^[4]^[5] and model-based sampling.^[6]

For example, many surveys have substantial amounts of nonresponse. Even though the units are initially chosen with known probabilities, the nonresponse mechanisms are unknown. For surveys with substantial nonresponse, statisticians have proposed statistical models with which the data sets are analyzed.

Issues related to survey sampling are discussed in several sources, including Salant and Dillman (1994).^[7]

Probability sampling

In a probability sample (also called a "scientific" or "random" sample), each member of the target population has a known and non-zero probability of inclusion in the sample.^[8] A survey based on a probability sample can in theory produce statistical measurements of the target population that are unbiased, in the sense that the expected value of the sample mean equals the population mean, E(ȳ)=μ, or that have a measurable sampling error, which can be expressed as a confidence interval or margin of error.^[9]^[10]
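The property E(ȳ)=μ can be checked by brute force on a small example. The sketch below assumes simple random sampling without replacement, where every subset of size n is equally likely, and uses an invented six-element population. The margin-of-error helper uses the conventional normal multiplier 1.96 with a finite population correction; that choice is an assumption of this sketch and would be crude for a sample this small.

```python
from fractions import Fraction
from itertools import combinations
from math import sqrt

# Hypothetical population values (invented for this sketch).
population = [2, 4, 6, 8, 10, 12]
n = 3

mu = Fraction(sum(population), len(population))

# Under simple random sampling without replacement, every subset of size n
# is equally likely, so E(y-bar) is the plain average of all sample means.
sample_means = [Fraction(sum(s), n) for s in combinations(population, n)]
expected_mean = sum(sample_means, Fraction(0)) / len(sample_means)


def margin_of_error(sample, population_size, z=1.96):
    """Approximate margin of error for the sample mean, using the sample
    variance and a finite population correction (normal approximation)."""
    size = len(sample)
    mean = sum(sample) / size
    s2 = sum((v - mean) ** 2 for v in sample) / (size - 1)
    fpc = 1 - size / population_size
    return z * sqrt(fpc * s2 / size)


print(mu, expected_mean)
print(margin_of_error([2, 6, 10], population_size=len(population)))
```

Exact fractions are used so that the equality between the average of the sample means and the population mean holds without rounding error.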

A probability-based survey sample is created from three components: a list of the target population, called the sampling frame; a randomized process for selecting units from the sampling frame, called a selection procedure; and a method of contacting selected units to enable them to complete the survey, called a data collection method or mode.^[11] For some target populations this process may be easy, for example sampling the employees of a company by using payroll lists. In large, disorganized populations, however, simply constructing a suitable sampling frame is often a complex and expensive task.
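A minimal sketch of the frame and selection procedure for the payroll example might look as follows. The employee identifiers, the frame size, and the function name are hypothetical, and simple random sampling without replacement is only one possible selection procedure.

```python
import random

# Hypothetical sampling frame: a payroll list of employee identifiers.
frame = [f"EMP{i:04d}" for i in range(1, 501)]


def select_simple_random_sample(frame, n, seed=None):
    """Selection procedure: draw n distinct units from the frame, with every
    unit having the same inclusion probability n / len(frame)."""
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return rng.sample(frame, n)


selected = select_simple_random_sample(frame, n=25, seed=2024)
inclusion_probability = 25 / len(frame)
print(len(selected), inclusion_probability)
```

Recording the seed alongside the frame lets the selection be audited later, since the same frame and seed reproduce the same sample.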

Common methods of conducting a probability sample of the household population in the United States are Area Probability Sampling, Random Digit Dial telephone sampling, and more recently, Address-Based Sampling.^[12]

Within probability sampling, there are specialized techniques such as stratified sampling and cluster sampling that improve the precision or efficiency of the sampling process without altering the fundamental principles of probability sampling.

Stratification is the process of dividing members of the population into homogeneous subgroups before sampling, based on auxiliary information about each sample unit. The strata should be mutually exclusive: every element in the population must be assigned to only one stratum. The strata should also be collectively exhaustive: no population element can be excluded. Then methods such as simple random sampling or systematic sampling can be applied within each stratum. Stratification often improves the representativeness of the sample by reducing sampling error.
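The stratification steps can be sketched as below. The sketch assumes proportional allocation (each stratum receives a share of the sample matching its share of the population) and simple random sampling within strata; the regions, unit counts, and function name are invented for illustration. Because each unit is assigned to exactly one stratum by a single lookup, the strata are mutually exclusive and collectively exhaustive by construction.

```python
import random
from collections import defaultdict


def stratified_sample(units, stratum_of, n, seed=None):
    """Split units into strata, then draw a simple random sample within each
    stratum using (rounded) proportional allocation."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for unit in units:
        strata[stratum_of(unit)].append(unit)  # exactly one stratum per unit

    total = len(units)
    sample = {}
    for name, members in strata.items():
        # Rounding means the stratum sizes may not sum exactly to n.
        n_h = max(1, round(n * len(members) / total))
        n_h = min(n_h, len(members))
        sample[name] = rng.sample(members, n_h)
    return sample


# Hypothetical population of 100 units tagged with a region.
units = (
    [(i, "north") for i in range(60)]
    + [(i, "south") for i in range(60, 90)]
    + [(i, "west") for i in range(90, 100)]
)
sample = stratified_sample(units, stratum_of=lambda u: u[1], n=20, seed=7)
print({name: len(drawn) for name, drawn in sample.items()})
```

With these counts the allocation is 12, 6, and 2 units, so even the smallest stratum is guaranteed to appear in the sample, which a simple random sample of 20 would not ensure.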

Non-sampling error in probability sampling

Biases in surveys are undesirable, but often unavoidable. While the sampling errors (the difference between the population quantity and the appropriately estimated equivalent in the sample) can be quantified using appropriate statistical methods, other sources of error are more difficult to assess:

Non-response bias: When individuals or households selected in the survey sample cannot or will not complete the survey, there is the potential for bias to result from this non-response. Non-response bias occurs when the observed value deviates from the population parameter due to differences between respondents and non-respondents.^[13]
Measurement error: inaccurate reporting of the measure of interest due to cognitive difficulty in processing the survey request (e.g. difficulties of placing an event within or outside the requested recall period: "Have you purchased any appliances in the past 12 months?"), unclear labeling of response categories ("How frequently do you consume alcohol? Never, rarely, often"), social desirability bias (underreporting of behaviors or outcomes that are stigmatized by the society, e.g. drug use, and overreporting of the praised behaviors, e.g. voting).
Selection bias: Selection bias occurs when some units have a differing probability of selection that is unaccounted for by the researcher. For example, some households have multiple phone numbers, making them more likely to be selected in a telephone survey than households with only one phone number. This selection bias can be corrected by applying a survey weight equal to 1/(number of phone numbers) to each household.
Self-selection bias: A type of bias in which individuals voluntarily select themselves into a group, thereby potentially biasing the response of that group.
Participation bias: Bias that arises due to the characteristics of those who choose to participate in a survey or poll.
Coverage bias: Coverage bias can occur when population members do not appear in the sampling frame (undercoverage). It occurs when the observed value deviates from the population parameter due to differences between covered and non-covered units. Telephone surveys suffer from a well-known source of coverage bias because they cannot include households, or unhoused individuals, without telephones.
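The telephone weighting described under selection bias above can be sketched as follows. The household records are invented, and the estimator shown, a weighted mean normalised by the sum of the weights, is one common way to apply such weights rather than the only one.

```python
# Hypothetical respondent households: (number of phone numbers, reported value).
households = [(1, 10.0), (2, 30.0), (1, 20.0), (4, 60.0)]


def weighted_mean(records):
    """Weight each household by 1 / (number of phone numbers) so that
    households reachable on several numbers are not over-represented."""
    weights = [1.0 / phones for phones, _ in records]
    values = [value for _, value in records]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


unweighted = sum(value for _, value in households) / len(households)
weighted = weighted_mean(households)
print(unweighted, weighted)
```

In this invented data the households with more phone numbers report larger values, so the unweighted mean (30.0) is pulled upward relative to the weighted mean (about 21.8).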

Both representation and measurement errors are analyzed within the paradigm of the total survey error.^[14]
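To make the non-response item in the list above concrete, the sketch below checks a simple arithmetic identity: the deviation of the respondent mean from the full-sample mean equals the non-response rate multiplied by the difference between respondent and non-respondent means. The values are invented, and in a real survey the non-respondent values are of course not observed, so this is an illustration of the mechanism rather than a practical estimator.

```python
def respondent_mean_bias(respondent_values, nonrespondent_values):
    """Return the deviation of the respondent mean from the full-sample mean,
    computed directly and via (non-response rate) * (difference in means)."""
    n_r = len(respondent_values)
    n_nr = len(nonrespondent_values)
    mean_r = sum(respondent_values) / n_r
    mean_nr = sum(nonrespondent_values) / n_nr
    full_mean = (sum(respondent_values) + sum(nonrespondent_values)) / (n_r + n_nr)

    direct = mean_r - full_mean
    via_identity = (n_nr / (n_r + n_nr)) * (mean_r - mean_nr)
    return direct, via_identity


# Hypothetical values; non-respondent values are unknowable in practice.
direct, via_identity = respondent_mean_bias([4.0, 6.0, 8.0], [1.0, 3.0])
print(direct, via_identity)
```

The identity shows that a high non-response rate alone does not produce bias: the deviation vanishes whenever respondents and non-respondents have the same mean.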

Non-probability sampling

Many surveys are not based on probability samples, but rather on finding a suitable collection of respondents to complete the survey. Some common examples of non-probability sampling are:^[15]

Judgement Samples: A researcher decides which population members to include in the sample based on his or her judgement. The researcher may provide some alternative justification for the representativeness of the sample. The underlying assumption is that the investigator will select units that are characteristic of the population. This method is subject to the researcher's biases and perceptions.^[16]
Snowball Samples: Often used when a target population is rare. Members of the target population recruit other members of the population for the survey.
Quota Samples: The sample is designed to include a designated number of people with certain specified characteristics, for example 100 coffee drinkers. This type of sampling is common in non-probability market research surveys.
Convenience Samples: The sample is composed of whatever persons can be most easily accessed to fill out the survey.
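The quota mechanism described above can be sketched as a simple filter over respondents in the order they become available. The respondent records, category names, and quota sizes are hypothetical. Nothing in the procedure assigns known selection probabilities, which is why it counts as non-probability sampling.

```python
def fill_quotas(respondents, quotas, category_of):
    """Accept respondents in arrival order until each category's quota is
    full; respondents in full or unlisted categories are turned away."""
    remaining = dict(quotas)
    accepted = []
    for person in respondents:
        category = category_of(person)
        if remaining.get(category, 0) > 0:
            accepted.append(person)
            remaining[category] -= 1
        if all(count == 0 for count in remaining.values()):
            break  # every quota is filled, so recruitment stops
    return accepted, remaining


# Hypothetical arrivals: (respondent id, preferred drink).
arrivals = [
    ("a", "coffee"), ("b", "tea"), ("c", "coffee"),
    ("d", "coffee"), ("e", "tea"), ("f", "tea"),
]
accepted, remaining = fill_quotas(
    arrivals, quotas={"coffee": 2, "tea": 2}, category_of=lambda p: p[1]
)
print([person[0] for person in accepted], remaining)
```

Who ends up in the sample depends entirely on arrival order, so the composition within each quota cell reflects whoever was easiest to reach.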

In non-probability samples the relationship between the target population and the survey sample is immeasurable and potential bias is unknowable. Sophisticated users of non-probability survey samples tend to view the survey as an experimental condition, rather than a tool for population measurement, and examine the results for internally consistent relationships.

References

[1] "Non-Probability Sampling - AAPOR". https://www.aapor.org/Education-Resources/Reports/Non-Probability-Sampling.aspx.
[2] Weisberg, Herbert F. (2005). The Total Survey Error Approach. Chicago: University of Chicago Press. p. 231.
[3] "Archived copy". Office of Management and Budget. https://obamawhitehouse.archives.gov/omb/inforeg/statpolicy/standards_stat_surveys.pdf.
[4] Brewer, Ken (2002). Combined Survey Sampling Inference: Weighing Basu's Elephants. Hodder Education Publishers. ISBN 978-0340692295.
[5] Särndal, Carl-Erik; Swensson, Bengt; Wretman, Jan (1992). Model Assisted Survey Sampling. Springer. ISBN 978-0387975283.
[6] Valliant, Richard; Dorfman, Alan H.; Royall, Richard M. (2000). Finite Population Sampling and Inference: A Prediction Approach. New York: Wiley. p. 19.
[7] Salant, Priscilla; Dillman, Don A. (1994). How to Conduct Your Own Survey.
[8] Kish, L. (1965). Survey Sampling. New York: Wiley. p. 20.
[9] Kish, L. (1965). Survey Sampling. New York: Wiley. p. 59.
[10] "Why Sampling Works - AAPOR". http://www.aapor.org/Education-Resources/For-Researchers/Poll-Survey-FAQ/Why-Sampling-Works.aspx.
[11] Groves et al. Survey Methodology. New York: Wiley.
[12] Link, Michael W.; Battaglia, Michael P.; Frankel, Martin R.; Osborn, Larry; Mokdad, Ali H. (Spring 2008). "A Comparison of Address-Based Sampling (ABS) Versus Random-Digit Dialing (RDD) for General Population Surveys". Public Opinion Quarterly 72: 6–27.
[13] "Glossary - NCES Statistical Standards". https://nces.ed.gov/StatProg/2002/glossary.asp.
[14] Groves, Robert M.; Lyberg, Lars (2010). "Total Survey Error: Past, Present, and Future". Public Opinion Quarterly (American Association for Public Opinion Research) 74 (5): 849–879. doi:10.1093/poq/nfq065. https://doi.org/10.1093/poq/nfq065. Retrieved 2024-04-01.
[15] "Survey Sampling Methods". https://www.statpac.com/surveys/sampling.htm.
[16] Statistics Canada (28 January 2009). "Learning resources: Statistics: Power from data! Non-probability sampling". https://www150.statcan.gc.ca/n1/edu/power-pouvoir/ch13/nonprob/5214898-eng.htm.

External links

CRAN Task View Survey Methodology
What is a Survey? Booklet published by National Opinion Research Center and The American Statistical Association
Journal of Information Technology Learning and Performance article Organizational Research: Determining Sample Size in Survey Research
Sample Design and Confidence Intervals
Survey Sampling Methods
Non-probability sampling

Original source: https://en.wikipedia.org/wiki/Survey sampling.
