|
Size: 7926
Comment: Link
|
Size: 16126
Comment: Reorg
|
| Line 7: | Line 7: |
| * [[Statistics/Collider|Collider]] * [[Statistics/Combinations|Combinations]] * [[Statistics/Confounder|Confounder]] * [[Statistics/Correlation|Correlation]] * [[Statistics/Logit|Logit]] * [[Statistics/MahalanobisDistance|Mahalanobis distance]] * [[Statistics/Mediator|Mediator]] * [[Statistics/Moderator|Moderator]] * [[Statistics/Permutations|Permutations]] == Uncertainty == |
|
| Line 8: | Line 20: |
| * [[Statistics/Combinations|Combinations]] * [[Statistics/CovarianceMatrices|Covariance matrices]] |
* [[Statistics/Covariance|Covariance]] |
| Line 11: | Line 22: |
| * [[Statistics/Permutations|Permutations]] * [[Statistics/MahalanobisDistance|Mahalanobis distance]] |
* [[Statistics/Entropy|Entropy]] * [[Statistics/Variance|Variance]] * [[Statistics/PooledVariance|Pooled variance]] |
| Line 17: | Line 29: |
| * [[Statistics/BayesianNotation|Bayesian notation]] | |
| Line 19: | Line 30: |
| * [[Statistics/Independence|Independence]] | |
| Line 21: | Line 31: |
| * [[Statistics/Logit|Logit]] * [[Statistics/ProbabilityNotation|Probability notation]] |
|
| Line 26: | Line 34: |
| == Probability distributions == * [[Statistics/BernoulliDistribution|Bernoulli]] * [[Statistics/BinomialDistribution|Binomial]] * [[Statistics/ChiSquaredDistribution|Chi-squared]] * [[Statistics/FDistribution|F]] * [[Statistics/NormalDistribution|Normal]] * [[Statistic/StudentsTDistribution|Student's t]] * [[Statistics/UniformDistribution|Uniform]] == Probability tests == |
== Prediction == * [[Statistics/BayesianNotation|Bayesian notation]] * [[Statistics/ConditionalExpectations|Conditional expectations]] * [[Statistics/Moments|Moments]] == Tests == |
| Line 39: | Line 43: |
| * [[Statistics/CronbachsAlpha|Cronbach's alpha]] * [[Statistics/FTest|F test]] * [[Statistics/GrangerCausalityTest|Granger causality test]] |
|
| Line 40: | Line 47: |
| * [[Statistics/HotellingsTSquaredTest|Hotelling's t-squared test]] | |
| Line 43: | Line 51: |
| * [[Statistics/PearsonTest|Pearson test]] | * [[Statistics/MardiasTest|Mardia's test]] * [[Statistics/MillsRatio|Mills' ratio]] * [[Statistics/PearsonsChiSquaredTest|Pearson's chi-squared test]] |
| Line 45: | Line 55: |
| * [[Statistics/StudentsTTest|Student's t test]] | |
| Line 46: | Line 57: |
| * [[Statistics/WaldWolfowitzRunsTest|Wald-Wolfowitz runs test]] | |
| Line 49: | Line 61: |
| * [[Statistics/MultistageSample|Multistage sample]] * [[Statistics/NeymanAllocation|Neyman allocation]] * [[Statistics/ProbabilityProportionalToSizeSample|Probability proportional to size sample]] |
|
| Line 51: | Line 66: |
| * [[Statistics/SurveyFrame|Survey frame]] | |
| Line 52: | Line 68: |
| * SurveySamples == Prediction == * [[Statistics/ConditionalExpectations|Conditional expectations]] * [[Statistics/ExpectedValues|Expected values]] |
|
| Line 61: | Line 71: |
| * [[Statistics/AnalysisOfVariance|Analysis of variance (ANOVA)]] | |
| Line 64: | Line 75: |
| * [[Statistics/EconometricsNotation|Econometrics notation]] | * [[Statistics/CoxProportionalHazardsModel|Cox proportional hazards model]] * [[Statistics/CrossValidation|Cross-validation]] * [[Statistics/GeneralizedEstimatingEquation|Generalized estimating equation]] * [[Statistics/GeneralizedLeastSquares|Generalized least squares]] |
| Line 66: | Line 80: |
| * [[Statistics/InverseVarianceWeights|Inverse variance weights]] * [[Statistics/IterativelyReweightedLeastSquares|Iteratively reweighted least squares]] |
|
| Line 67: | Line 83: |
| * [[Statistics/LogisticModel|Logistic Model]] * [[Statistics/MultilevelModel|Multilevel Model]] * [[Statistics/MultilevelRegressionWithPoststratification|Multilevel Regression with poststratification]] |
* [[Statistics/LogisticModel|Logistic model]] * [[Statistics/Matching|Matching]] * [[Statistics/MaximumLikelihoodEstimation|Maximum likelihood estimation]] * [[Statistics/MultilevelModel|Multilevel model]] * [[Statistics/MixedModel|Mixed model]] * [[Statistics/MultilevelRegressionWithPoststratification|Multilevel regression with poststratification]] |
| Line 73: | Line 92: |
| * [[Statistics/StructuralEquationModeling|Structural equation modeling]] == Panel modeling == |
* [[Statistics/Residuals|Residuals]] == Econometrics == * [[Statistics/AutoregressiveModels|Autoregressive models]] * [[Statistics/CensoredAndTruncatedRegressionModels|Censored and Truncated regression models]] * [[Statistics/DifferenceInDifferences|Difference in differences]] * [[Statistics/EconometricsNotation|Econometrics notation]] |
| Line 79: | Line 102: |
| * [[Statistics/InstrumentalVariablesMethod|Instrumental variables method]] | |
| Line 80: | Line 104: |
| * [[Statistics/ProbitModel|Probit model]] | |
| Line 82: | Line 107: |
| * [[Statistics/VectorAutoregression|Vector autoregression]] == Psychometrics == * [[Statistics/FactorAnalysis|Factor analysis]] * [[Statistics/MediationAnalysis|Mediation analysis]] * [[Statistics/StructuralEquationModeling|Structural equation modeling]] (and related reading notes) |
|
| Line 92: | Line 123: |
| == Analysis == * [[Statistics/FactorAnalysis|Factor analysis]] * [[Statistics/MediationAnalysis|Mediation analysis]] |
|
| Line 99: | Line 125: |
| * [[Statistics/Calibration|Calibration]] * [[Statistics/DesignWeights|Design weights]] * [[Statistics/DoubleListExperiment|Double list experiment]] * [[Statistics/ExperienceSamplingMethod|Experience sampling method]] * [[Statistics/FocusGroup|Focus group]] * [[Statistics/GeneralizedRegressionEstimator|GREG estimator]] * [[Statistics/InverseProbabilityWeights|Inverse probability weights]] * [[Statistics/MarginOfError|Margin of error]] * [[Statistics/NonresponseBias|Nonresponse bias]] * [[Statistics/OnlineBulletinBoard|Online bulletin board]] * [[Statistics/QualitativeCoding|Qualitative coding]] * [[Statistics/ResponseRate|Response rate]] |
|
| Line 100: | Line 138: |
| * [[Statistics/FocusGroup|Focus group]] * [[Statistics/SurveyFrame|Survey frame]] * [[Statistics/MarginOfError|Margin of error]] * [[Statistics/NonResponseBias|Non-response bias]] * [[Statistics/OnlineBulletinBoard|Online bulletin board]] * [[Statistics/ResponseRate|Response rate]] * [[Statistics/QualitativeCoding|Qualitative coding]] |
|
| Line 108: | Line 139: |
| * [[Statistics/SurveyWeights|Survey weights]] | * [[Statistics/SurveyNonresponse|Survey nonresponse]] (and related reading notes) * [[Statistics/SurveyWeights|Survey weights]] (and related reading notes) |
| Line 110: | Line 142: |
| * [[Statistics/UnexpectedEventDuringSurveyDesignFramework|Unexpected event during survey design framework]] * [[Statistics/WeightingClassAdjustment|Weighting class adjustment]] |
|
| Line 123: | Line 157: |
| Note: reading notes for the above topics are listed on the respective pages, not here. * [[EstimationOfRelationshipsForLimitedDependentVariables|Estimation of Relationships for Limited Dependent Variables]], James Tobin, 1958 * [[MultipleFrameSurveys|Multiple Frame Surveys]], H.O. Hartley, 1962 * [[TheCommonStructureOfStatisticalModelsOfTruncationSampleSelectionAndLimitedDependentVariablesAndASimpleEstimatorForSuchModels|The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models]], James J. Heckman, 1976 * [[SequentialSampleSelectionMethods|Sequential Sample Selection Methods]], James R. Chromy, 1979 * [[TheCentralRoleOfThePropensityScoreInObservationalStudiesForCausalEffects|The central role of the propensity score in observational studies for causal effects]], Paul R. Rosenbaum and Donald B. Rubin, 1983 * [[SamplingRarePopulations|Sampling Rare Populations]], Graham Kalton and Dallas W. Anderson, 1986 |
|
| Line 124: | Line 166: |
| * [[TheEffectOfWeightTrimmingOnNonlinearSurveyEstimates|The Effect of Weight Trimming on Nonlinear Survey Estimates]], Frank J. Potter, 1993 | |
| Line 126: | Line 167: |
| * [[SamplingWeightsAndRegressionAnalysis|Sampling Weights and Regression Analysis]], Christopher Winship and Larry Radbill, 1994 * [[ImprovingOnProbabilityWeightingForHouseholdSize|Improving on Probability Weighting for Household Size]], Andrew Gelman and Thomas C. Little, 1998 |
* [[EstimationInDualFrameSurveysWithComplexDesigns|Estimation in Dual Frame Surveys With Complex Designs]], J.N.K. Rao and C.J. Skinner, 1996 |
| Line 129: | Line 169: |
| * [[MeasurementValidity|Measurement Validity: A Shared Standard for Qualitative and Quantitative Research]], Robert Adcock and David Collier, 2001 * [[DoubleSampling|Double Sampling]], Michael Hidiroglou, 2001 |
|
| Line 130: | Line 172: |
| * [[TheInfluenceOfViolationsOfAssumptionsOnMultilevelParameterEstimatesAndTheirStandardErrors|The influence of violations of assumptions on multilevel parameter estimates and their standard errors]], Cora J.M. Maas and Joop J. Hox, 2003 | |
| Line 131: | Line 174: |
| * [[StrugglesWithSurveyWeightingAndRegressionModeling|Struggles with Survey Weighting and Regression Modeling]], Andrew Gelman, 2007 | * [[ASimulationStudyOfCellCollapsingInPoststratification|A simulation study of cell collapsing in poststratification]]; Jay J. Kim, Linda Tompkins, Jianzhu Li, and Richard Valliant; 2005 * [[IsOLSWithABinaryDependentVariableReallyOK|Is OLS with a binary dependent variable really OK?: Estimating (mostly) TSCS models with binary dependent variables and fixed effects]], Nathaniel Beck, 2011 * [[AlternativeSurveySampleDesigns|Alternative survey sample designs: Sampling with multiple overlapping frames]], Sharon L. Lohr, 2011 |
| Line 133: | Line 178: |
| * [[RespondentUseOfStraightliningAsAResponseStrategyInEducationSurveyResearch|Respondent use of straight-lining as a response strategy in education survey research: Prevalence and implications]]; James S. Cole, Alexander C. Mc``Cormick, Robert M. Gonyea; 2012 | |
| Line 134: | Line 180: |
| * [[WhyAskWhy|Why ask why? Forward causal inference and reverse causal questions]], Andrew Gelman and Guido Imbens, 2013 * [[BeyondPowerCalculations|Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors]], Andrew Gelman and John Carlin, 2014 |
|
| Line 135: | Line 183: |
| * [[StraitliningInWebSurveyPanelsOverTime|Straightlining in Web survey panels over time]], Matthias Schonlau and Vera Toepoel, 2015 * [[APractitionersGuideToClusterRobustInference|A Practitioner’s Guide to Cluster-Robust Inference]], A. Colin Cameron and Douglas L. Miller, 2015 * [[ImputationUnderInformativeSampling|Imputation Under Informative Sampling]]; Emily Berg, Jae-Kwang Kim, and Chris Skinner; 2016 |
|
| Line 137: | Line 188: |
| * [[WhyPropensityScoresShouldNotBeUsedForMatching|Why Propensity Scores Should Not Be Used for Matching]], Gary King and Richard Nielsen, 2019 | |
| Line 138: | Line 190: |
| * [[UnexpectedEventDuringSurveyDesign|Unexpected Event during Surveys Design: Promise and Pitfalls for Causal Inference]]; Jordi Muñoz, Albert Falcó-Gimeno, and Enrique Hernández; 2020 * [[APermutationTestOnComplexSampleData|A Permutation Test on Complex Sample Data]], Daniell Toth, 2020 * [[ExactAdaptiveConfidenceIntervalsForSmallAreas|Exact Adaptive Confidence Intervals for Small Areas]], Kyle C. Burris and Peter D. Hoff, 2020 |
|
| Line 139: | Line 194: |
| * [[UsingHierarchicalModelsToEstimateHeterogeneousEffects|Using Hierarchical Models to Estimate Heterogeneous Effects]], Joshua Alley, 2023 * [[OutOfOneMany|Out of One, Many: Using Language Models to Simulate Human Samples]]; Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua R. Gubler, Christopher Rytting, and David Wingate; 2023 * [[CausalModelsForLongitudinalAndPanelData|Causal Models for Longitudinal and Panel Data: A Survey]], Dmitry Arkhangelsky and Guido Imbens, 2023 |
|
| Line 141: | Line 199: |
| * [[TheMicroTaskMarketForLemons|The micro-task market for lemons: data quality on Amazon’s Mechanical Turk]]; Douglas J. Ahler, Carolyn E. Roush, and Gaurav Sood; 2024 * [[AdaptingToMisspecification|Adapting to Misspecification]]; Timothy B. Armstrong, Patrick Kline, and Liyang Sun; 2024 * [[LinkingSurveyAndLinkedInData|Linking Survey and LinkedIn Data: Understanding Usage and Consent Patterns]]; Tarek Al Baghal, Alexander Wenz, Paulo Serôdio, Shujin Liu, Curtis Jessop, and Luke Sloan; 2024 * [[SmallAreaPredictionForExponentialDispersionFamiliesUnderInformativeSampling|Small Area Prediction for Exponential Dispersion Families Under Informative Sampling]], Emily Berg and Abdulhakeem Eideh, 2024 * [[AreaLevelModelBasedSmallAreaEstimationOfDivergenceIndexesInTheSpanishLabourForceSurvey|Area-Level Model-Based Small Area Estimation of Divergence Indexes in the Spanish Labour Force Survey]]; Esteban Cabello, Domingo Morales, Agustín Pérez; 2024 * [[TextMessagesToFacilitateTheTransitionToWebFirstSequentialMixedModeDesignsInLongitudinalSurveys|Text Messages to Facilitate the Transition to Web-First Sequential Mixed-Mode Designs in Longitudinal Surveys]], Pablo Cabrera-Álvarez and Peter Lynn, 2024 |
|
| Line 142: | Line 206: |
| * [[SelfReportingNewsUseInSituAndInRetrospect|Self-Reporting News Use in Situ and in Retrospect]]; Danit Shalev, Teresa K Naab, and Yariv Tsfati; 2025 * [[WhereToPlaceSensitiveQuestions|Where to place sensitive questions? Experiments on survey response order and measures of discriminatory attitudes]]; Amanda Sahar d’Urso, Tabitha Bonilla, and Genni Bogdanowicz; 2025 * [[DifferenceInDifferencesDesigns|Difference-in-Differences Designs: A Practitioner’s Guide]]; Andrew Baker, Brantly Callaway, Scott Cunningham, Andrew Goodman-Bacon, and Pedro H. C. Sant’Anna; 2025 * [[InferringAPopulationCompositionFromSurveyDataWithNonignorableNonresponse|Inferring a Population Composition From Survey Data With Nonignorable Nonresponse: Borrowing Information From External Sources]], Veronica Ballerini and Brunero Liseo, 2025 * [[BayesianSampleSizeCalculationsForExternalValidationStudiesOfRiskPredictionModels|Bayesian Sample Size Calculations for External Validation Studies of Risk Prediction Models]]; Mohsen Sadatsafavi, Paul Gustafson, Solmaz Setayeshgar, Laure Wynants, and Richard D. Riley; 2026 |
Statistics
A branch of mathematics.
Foundations
Uncertainty
Probability
Prediction
Tests
Samples
Modeling
Econometrics
Psychometrics
Structural equation modeling (and related reading notes)
Non-parametric modeling
Survey analysis
Survey nonresponse (and related reading notes)
Survey weights (and related reading notes)
Natural language processing
Reading Notes
Note: reading notes for the above topics are listed on the respective pages, not here.
Estimation of Relationships for Limited Dependent Variables, James Tobin, 1958
Multiple Frame Surveys, H.O. Hartley, 1962
The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models, James J. Heckman, 1976
Sequential Sample Selection Methods, James R. Chromy, 1979
The central role of the propensity score in observational studies for causal effects, Paul R. Rosenbaum and Donald B. Rubin, 1983
Sampling Rare Populations, Graham Kalton and Dallas W. Anderson, 1986
Measurement Error Models, Wayne A. Fuller, 1987
Evidence on the Validity of Cross-sectional and Longitudinal Labor Market Data; John Bound, Charles Brown, Greg J. Duncan, and Willard L. Rodgers; 1994
Estimation in Dual Frame Surveys With Complex Designs, J.N.K. Rao and C.J. Skinner, 1996
Statistical Modeling: The Two Cultures, Leo Breiman, 2001
Measurement Validity: A Shared Standard for Qualitative and Quantitative Research, Robert Adcock and David Collier, 2001
Double Sampling, Michael Hidiroglou, 2001
Hierarchical Linear Models: Applications and Data Analysis Methods, Stephen W. Raudenbush and Anthony S. Bryk, 2002
The influence of violations of assumptions on multilevel parameter estimates and their standard errors, Cora J.M. Maas and Joop J. Hox, 2003
Ascertaining the validity of individual protocols from Web-based personality inventories, John A. Johnson, 2004
A simulation study of cell collapsing in poststratification; Jay J. Kim, Linda Tompkins, Jianzhu Li, and Richard Valliant; 2005
Is OLS with a binary dependent variable really OK?: Estimating (mostly) TSCS models with binary dependent variables and fixed effects, Nathaniel Beck, 2011
Alternative survey sample designs: Sampling with multiple overlapping frames, Sharon L. Lohr, 2011
Identifying Careless Responses in Survey Data, Andrew Meade and S. Bartholomew Craig, 2012
Respondent use of straight-lining as a response strategy in education survey research: Prevalence and implications; James S. Cole, Alexander C. McCormick, and Robert M. Gonyea; 2012
Estimating Measurement Error in Annual Job Earnings, John M. Abowd and Martha H. Stinson, 2013
Why ask why? Forward causal inference and reverse causal questions, Andrew Gelman and Guido Imbens, 2013
Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors, Andrew Gelman and John Carlin, 2014
How Robust Standard Errors Expose Methodological Problems They Do Not Fix, and What to Do About It, Gary King and Margaret E. Roberts, 2015
Straightlining in Web survey panels over time, Matthias Schonlau and Vera Toepoel, 2015
A Practitioner’s Guide to Cluster-Robust Inference, A. Colin Cameron and Douglas L. Miller, 2015
Imputation Under Informative Sampling; Emily Berg, Jae-Kwang Kim, and Chris Skinner; 2016
Sampling-based vs. Design-based Uncertainty in Regression Analysis; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017
When Should You Adjust Standard Errors for Clustering?; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017
Why Propensity Scores Should Not Be Used for Matching, Gary King and Richard Nielsen, 2019
Regression and Other Stories; Andrew Gelman, Jennifer Hill, and Aki Vehtari; 2020
Unexpected Event during Survey Design: Promise and Pitfalls for Causal Inference; Jordi Muñoz, Albert Falcó-Gimeno, and Enrique Hernández; 2020
A Permutation Test on Complex Sample Data, Daniell Toth, 2020
Exact Adaptive Confidence Intervals for Small Areas, Kyle C. Burris and Peter D. Hoff, 2020
The Independent Contractor Workforce: New Evidence on Its Size and Composition and Ways to Improve Its Measurement in Household Surveys; Katharine G. Abraham, Brad J. Hershbein, Susan N. Houseman, and Beth C. Truesdale; 2023
Using Hierarchical Models to Estimate Heterogeneous Effects, Joshua Alley, 2023
Out of One, Many: Using Language Models to Simulate Human Samples; Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua R. Gubler, Christopher Rytting, and David Wingate; 2023
Causal Models for Longitudinal and Panel Data: A Survey, Dmitry Arkhangelsky and Guido Imbens, 2023
The effect of online interviews on the University of Michigan Survey of Consumer Sentiment, Ryan Cummings and Ernie Tedeschi, 2024
The micro-task market for lemons: data quality on Amazon’s Mechanical Turk; Douglas J. Ahler, Carolyn E. Roush, and Gaurav Sood; 2024
Adapting to Misspecification; Timothy B. Armstrong, Patrick Kline, and Liyang Sun; 2024
Linking Survey and LinkedIn Data: Understanding Usage and Consent Patterns; Tarek Al Baghal, Alexander Wenz, Paulo Serôdio, Shujin Liu, Curtis Jessop, and Luke Sloan; 2024
Small Area Prediction for Exponential Dispersion Families Under Informative Sampling, Emily Berg and Abdulhakeem Eideh, 2024
Area-Level Model-Based Small Area Estimation of Divergence Indexes in the Spanish Labour Force Survey; Esteban Cabello, Domingo Morales, and Agustín Pérez; 2024
Text Messages to Facilitate the Transition to Web-First Sequential Mixed-Mode Designs in Longitudinal Surveys, Pablo Cabrera-Álvarez and Peter Lynn, 2024
Measurement error when surveying issue positions: a MultiTrait MultiError approach; Kim Backström, Alexandru Cernat, Rasmus Sirén, and Peter Söderlund; 2025
Self-Reporting News Use in Situ and in Retrospect; Danit Shalev, Teresa K. Naab, and Yariv Tsfati; 2025
Where to place sensitive questions? Experiments on survey response order and measures of discriminatory attitudes; Amanda Sahar d’Urso, Tabitha Bonilla, and Genni Bogdanowicz; 2025
Difference-in-Differences Designs: A Practitioner’s Guide; Andrew Baker, Brantly Callaway, Scott Cunningham, Andrew Goodman-Bacon, and Pedro H. C. Sant’Anna; 2025
Inferring a Population Composition From Survey Data With Nonignorable Nonresponse: Borrowing Information From External Sources, Veronica Ballerini and Brunero Liseo, 2025
Bayesian Sample Size Calculations for External Validation Studies of Risk Prediction Models; Mohsen Sadatsafavi, Paul Gustafson, Solmaz Setayeshgar, Laure Wynants, and Richard D. Riley; 2026
