= A Comparison of Methods of Weighting Adjustment for Nonresponse =

'''A Comparison of Methods of Weighting Adjustment for Nonresponse''' was written by Graham Kalton and Dalisay S. Maligalig in 1991. It was published in the proceedings of the 1991 Annual Research Conference of the [[UnitedStates/CensusBureau|Census Bureau]]. The scan can be found online at https://books.google.com/books?lr=&id=Cy22AAAAIAAJ&pg=PA409.

The estimator for population proportion ''Y̅'' is {{attachment:est.svg}} where ''π,,i,,'' is the probability of element ''i'' being sampled.

Given element nonresponse, the estimator becomes {{attachment:estnr.svg}} where ''r'' is the number of respondents. The bias is approximated by {{attachment:biasestnr.svg}} where ''ϕ,,i,,'' is the probability of element ''i'' responding if sampled. This bias relates to the covariance of ''Y,,i,,'' and ''ϕ,,i,,''; if covariance is 0, then bias is 0.

If ''ϕ,,i,,'' are known, use the corrected estimator {{attachment:estknown.svg}}. But realistically we can only ''estimate'' those probabilities, and then use {{attachment:estmod.svg}}. Three methods follow:
 1. The simplest model is to assume constant probability of responding if sampled, i.e. ''ϕ,,i,, = ϕ ∀ i''.
 2. The recommendation is to model ''ϕ,,i,,'' using [[Statistics/LogisticModel|logistic regression]], i.e. ''log(ϕ,,i,,/(1-ϕ,,i,,)) = x,,i,,β'' given some auxiliary information ''x,,i,,''.
 3. The remainder of the paper addresses three alternative methods.



=== Population-based adjustment cell weighting ===

The first method the authors introduce is a population-based adjustment cell weighting, partitioning the population into cells indexed by ''h''. The estimator is {{attachment:estcal1.svg}} where ''W,,h,, = N,,h,,/N'', ''r,,h,,'' is the number of respondents in cell ''h'', and the cell mean is given by {{attachment:estcal2.svg}}. Alternatively, {{attachment:estcalalt.svg}} where ''w,,hi,,'' is an element's weight. The bias of this estimator is {{attachment:biasestcal.svg}}.

Similar to before, this estimator's bias relates to the covariance of ''Y,,hi,,'' and ''ϕ,,hi,,''. Importantly though it derives from covariance ''within the cell''. Therefore if the probability to respond is constant within a cell, i.e. ''ϕ,,hi,, = ϕ,,h,,'', there is no bias. ''E[y̅,,p,,] = Y̅'' and ''MSE(y̅,,p,,) = Var(y̅,,p,,) = ΣW^2^,,h,,S^2^,,h,,/r,,h,,'' where ''S^2^,,h,,'' is the element variance within cell ''h''.

Consider two schemes:
 * scheme 1 uses ''H'' cells
 * scheme 2 collapses cells 1 and 2 together, rendering ''H-1'' cells

The collapse leads to the second scheme having lower variance. At the same time, the bias becomes {{attachment:biasestcalcoll.svg}}. Making assumptions about element variance, ''MSE(y̅,,p1,,) > MSE(y̅,,p2,,)'' if {{attachment:compcoll.svg}}.



=== Sample-based adjustment cell weighting ===

The authors also introduce a sample-based method. Importantly, making parallel assumptions, they arrive to the same expression for when collapsing yields a lower MSE.



=== Raking ratio weighting ===

Finally, the authors introduce a raking method with two dimensions, one indexed by ''h'' and the other indexed by ''k''. The estimator is {{attachment:estcal1.svg}} where ''w̃,,hk,,'' estimates ''W,,hk,,'' in the joint distribution through iterative fitting. More formally, ''E[w̃,,hk,,] = W,,hk,,''. More concretely, at convergence, the weights reflect the marginal distributions expressed as ''W,,k,, = Σ,,h,,W,,hk,, = Σ,,h,,w̃,,hk,,'' and ''W,,h,, = Σ,,k,,W,,hk,, = Σ,,k,,w̃,,hk,,''.

The authors make a parallel assumption to the above: that the probability to respond is constant within a cell, i.e. ''ϕ,,hki,, = ϕ,,hk,,''. At the same time, they loosen the assumption that ''w̃,,hk,,'' converges to ''W,,hk,,''.

If ''E[w̃,,hk,,] = W̃,,hk,,'', then bias of this estimator is then given by ''Bias(y̅,,p,,) = ΣΣ(W̃,,hk,, - W,,hk,,)(Y̅,,hk,, - Y̅,,h,, - Y̅,,k,, + Y̅)''. Therefore, even when ''w̃,,hk,,'' is a biased estimator for ''W,,hk,,'', this can be an unbiased estimator for ''Y̅'' if there is no interaction in ''Y,,hk,,'' for the two-way classification.

If ''W,,hk,,'' are known, the authors demonstrate that variance under adjustment cell weighting is lower than or equal to variance under raking ratio weighting. "An argument advanced for the use of raking is that it deals with the problem of small cells. To the extent that it does so, it operates in an indirect manner. When the ''W,,hk,,'' distribution is known, it is not clear why raking should be preferred to adjustment cell weighting. With the latter procedure, weights can be trimmed and cells collapsed in a way that is tailor-made for the survey variables under study and for the particular sample configuration encountered. Further research is needed in this area."



== Reading notes ==

The authors do actually discuss how the selection of estimators occurs after observing the response patterns, so they take ''r̃'' as given, i.e. ''E[y̅,,p,,|r̃] = Y̅''. I omit this from my notes for brevity and because ''r̃'' is inconvenient to type.



----
CategoryRicottone CategoryReadingNotes