Differences between revisions 2 and 10 (spanning 8 versions)
Revision 2 as of 2020-10-22 17:37:02
Size: 2509
Comment:
Revision 10 as of 2025-01-10 15:13:43
Size: 2460
Comment: Killing SurveyStatistics page
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
## page was renamed from SurveyStatistics/SurveyStatistics
## page was renamed from SurveyStatistics
Line 51: Line 53:
This is why samples are *randomly* drawn. This is why samples are ''randomly'' drawn.
Line 54: Line 56:
Incompleteness of data can skew samples. Incompleteness of frame data can skew samples.

See [[SurveySamples|here]] for a details on survey sampling.
Line 65: Line 69:
 + remoteness of location (i.e. rural Alaska)  * remoteness of location (i.e. rural Alaska)
Line 74: Line 78:
=== Common Sampling Frames ===

 * Census Bureau surveys are excellent sources of regional, hierarchical data (i.e. states > counties > tracts)
 * U.S. Postal Service Delivery Sequence files
 * Random digit dialing

Survey Statistics

There are a number of considerations for analyzing survey data.


Statistics with Survey Data

There are key differences between the statistics employed in models and in surveys.

Model-based inference

  1. Build a mathematical model that describes a population.
  2. Generate a random sample from that population to generate estimates.
  3. Estimate how the error terms of those estimates would vary if repeated samples were drawn.

In other words, the key is how well the model describes the population.

Design-based inference

  1. Identify a population with fixed descriptives.
  2. Draw a sample from that population to collect measures from.
  3. Estimate how the measures would vary if repeated samples were drawn.

In other words, the key is how well the sample fits the population. If the full population were contacted, measures would be perfect.

Inferential statistics from complex survey data

Using model-based inference while accounting for survey design.


Survey Populations

A sample is contacted, rather than the full population, for reasons of cost and administration. There are costs paid for this decision.

Limitations of Survey Sampling

If a sample is a poor fit for the population, then it will be difficult/impossible to estimate population descriptives. This is why samples are randomly drawn.

But for random sampling to succeed, the target populations needs to be completely identified. Incompleteness of frame data can skew samples.

See here for a details on survey sampling.

Limitations of Surveying

Non-random non-response is an additional roadblock to estimating population descriptives.

Some populations are inherently difficult to contact, due to:

  • political, legal, and ethical considerations (i.e. minors in the population)
  • language and cultural barriers
  • remoteness of location (i.e. rural Alaska)

The availability of contact data often dictates the mode of survey instrument. Some populations are not easily contacted by specific modes, due to:

  • socioeconomic conditions (i.e. lack of telephone connectivity)
  • low accuracy contact data (i.e. outdated addresses)


CategoryRicottone

Statistics/SurveyInference (last edited 2025-04-18 19:40:59 by DominicRicottone)