Differences between revisions 28 and 31 (spanning 3 versions)
Revision 28 as of 2026-02-10 20:36:52
Size: 4617
Comment: Rewrite
Revision 31 as of 2026-02-11 19:20:24
Size: 5205
Comment: Links
Deletions are marked like this. Additions are marked like this.
Line 17: Line 17:
Survey weights begin with a [[Statistics/DesignWeight|design weight]] reflecting [[Statistics/SurveySampling|probability of selection]]. Generally this is simply the inverse of the sampling probability: ''n,,k,,/N'' for all strata ''k''. Survey weights begin with [[Statistics/DesignWeights|design weights]] reflecting [[Statistics/SurveySampling|probability of selection]]. Generally this is simply the inverse of the sampling probability: ''n,,k,,/N'' for all strata ''k''.
Line 54: Line 54:
 * [[CalibrationEstimatorsInSurveySampling|Calibration estimators in survey sampling]], Jean-Claude Deville and Carl-Erik Särndal, 1992
Line 57: Line 58:
 * [[RandomEffectsModelsForSmoothingPoststratificationWeights|Random-effects Models for Smoothing Poststratitication Weights]], Laura C. Lazzeroni and Roderick J.A. Little, 1998
 * [[TheGeneralizedExponentialModelForSamplingWeightCalibrationForExtremeValuesNonresponseAndPostStratification|The generalized exponential model for sampling weight calibration for extreme values, nonresponse, and poststratification]], R.E. Folsom and A.C. Singh, 2000

Survey Weights

Survey weights account for the survey design, sampling error, and non-sampling error.


Description

Survey data is collected through a mechanism which can be specified statistically. If it is not specified, bias can be introduced and estimates can be over-confident.

Inverse variance weights are related, but not the same.

Survey weights begin with design weights reflecting probability of selection. Generally this is simply the inverse of the sampling probability: nk/N for all strata k.

All real surveys feature non-sampling error, especially nonresponse. If nonresponse is uncorrelated with key metrics, it is negligible. Otherwise there is potential for nonresponse bias. This bias can be corrected through survey weights in a few ways:

Modeling on insignificant or uncorrelated attributes does not introduce bias, but it does inflate variance.

Calibration can be used to:

  • make estimates be consistent with known true population proportions
  • correct sampling error like undercoverage or overcoverage

  • further correct for non-sampling error like nonresponse bias

The methods here include:


Weighted Estimators

Survey weights w are designed such that a population proportion μ can be calculated using the weighted estimator Σ(wx) / Σw.

In the case that all cases have equal weight, it is straightforward to show that the variance of that estimator is w2σ2.

In any other case, the variance is given by Σ(w2σ2) / (Σw)2. This ratio must then be linearized or simulated to arrive at an approximate variance. Taylor expansion is a common strategy for linearization.


Reading Notes


CategoryRicottone

Statistics/SurveyWeights (last edited 2026-02-26 22:09:47 by DominicRicottone)