|
Size: 6669
Comment: Add link
|
Size: 14811
Comment: Links
|
| Deletions are marked like this. | Additions are marked like this. |
| Line 5: | Line 5: |
| Line 9: | Line 7: |
| * [[Statistics/AverageAbsoluteDeviation|Average Absolute Deviation]] | |
| Line 11: | Line 8: |
| * [[Statistics/CovarianceMatrices|Covariance Matrices]] * [[Statistics/DegreesOfFreedom|Degrees of Freedom]] |
* [[Statistics/Correlation|Correlation]] * [[Statistics/Logit|Logit]] * [[Statistics/MahalanobisDistance|Mahalanobis distance]] |
| Line 14: | Line 12: |
| * [[Statistics/MahalanobisDistance|Mahalanobis Distance]] |
== Uncertainty == * [[Statistics/AverageAbsoluteDeviation|Average absolute deviation]] * [[Statistics/Covariance|Covariance]] * [[Statistics/DegreesOfFreedom|Degrees of freedom]] * [[Statistics/Entropy|Entropy]] * [[Statistics/Variance|Variance]] * [[Statistics/PooledVariance|Pooled variance]] |
| Line 20: | Line 24: |
| * [[Statistics/BayesRule|Bayes' Rule]] * [[Statistics/BayesianNotation|Bayesian Notation]] * [[Statistics/ConditionalProbability|Conditional Probability]] * [[Statistics/Independence|Independence]] * [[Statistics/JointProbability|Joint Probability]] * [[Statistics/Logit|Logit]] * [[Statistics/ProbabilityNotation|Probability Notation]] * [[Statistics/SigmaAlgebraNotation|σ Algebra Notation]] * [[Statistics/TestStatistic|Test Statistic]] == Probability Distributions == |
* [[Statistics/BayesRule|Bayes' rule]] * [[Statistics/ConditionalProbability|Conditional probability]] * [[Statistics/JointProbability|Joint probability]] * [[Statistics/SigmaAlgebraNotation|σ Algebra notation]] * [[Statistics/TestStatistic|Test statistic]] == Prediction == * [[Statistics/BayesianNotation|Bayesian notation]] * [[Statistics/ConditionalExpectations|Conditional expectations]] * [[Statistics/ExpectedValues|Expected values]] * [[Statistics/Moments|Moments]] == Probability distributions == |
| Line 38: | Line 43: |
| * [[Statistics/HotellingsTSquaredDistribution|Hotelling's T-squared]] * [[Statistics/MillsRatio|Mills' ratio]] |
|
| Line 39: | Line 46: |
| * [[Statistic/StudentsTDistribution|Student's t]] | * [[Statistics/StudentsTDistribution|Student's t]] |
| Line 41: | Line 48: |
== Probability Tests == |
* [[Statistics/WeibullDistribution|Weibull]] == Probability tests == |
| Line 47: | Line 53: |
| * [[Statistics/CronbachsAlpha|Cronbach's alpha]] * [[Statistics/FTest|F test]] * [[Statistics/GrangerCausalityTest|Granger causality test]] |
|
| Line 48: | Line 57: |
| * [[Statistics/HotellingsTSquaredTest|Hotelling's t-squared test]] | |
| Line 51: | Line 61: |
| * [[Statistics/PearsonTest|Pearson test]] | * [[Statistics/MardiasTest|Mardia's test]] * [[Statistics/PearsonsChiSquaredTest|Pearson's chi-squared test]] |
| Line 53: | Line 64: |
| * [[Statistics/StudentsTTest|Student's t test]] | |
| Line 54: | Line 66: |
| * [[Statistics/WaldWolfowitzRunsTest|Wald-Wolfowitz runs test]] | |
| Line 59: | Line 70: |
| * [[Statistics/PostStratification|Post-Stratification]] | * [[Statistics/MultistageSample|Multistage sample]] * [[Statistics/NeymanAllocation|Neyman allocation]] * [[Statistics/ProbabilityProportionalToSizeSample|Probability proportional to size sample]] * [[Statistics/SimpleRandomSample|Simple random sample]] |
| Line 61: | Line 75: |
| * [[Statistics/SurveySampling|Survey Sampling]] * [[Statistics/UnequalWeightingAndDesignEffects|Unequal Weighting and Design Effects]] * SurveySamples == Prediction == * [[Statistics/ConditionalExpectations|Conditional Expectations]] * [[Statistics/ExpectedValues|Expected Values]] * [[Statistics/StandardError|Standard Error]] |
* [[Statistics/SurveyFrame|Survey frame]] * [[Statistics/SurveySampling|Survey sampling]] |
| Line 77: | Line 80: |
| * [[Statistics/BayesianHierarchicalModel|Bayesian Hierarchical Model]] | * [[Statistics/AnalysisOfVariance|Analysis of variance (ANOVA)]] * [[Statistics/BayesianHierarchicalModel|Bayesian hierarchical model]] |
| Line 79: | Line 83: |
| * [[Statistics/CausalInference|Causal Inference]] * [[Statistics/EconometricsNotation|Econometrics Notation]] * [[Statistics/Exogeneity|Exogeneity]] * [[Statistics/GeneralizedLinearModel|Generalized Linear Model]] * [[Statistics/Homoskedasticity|Homoskedasticity]] |
* [[Statistics/CausalInference|Causal inference]] * [[Statistics/CoxProportionalHazardsModel|Cox proportional hazards model]] * [[Statistics/CrossValidation|Cross-validation]] * [[Statistics/GeneralizedEstimatingEquation|Generalized estimating equation]] * [[Statistics/GeneralizedLeastSquares|Generalized least squares]] * [[Statistics/GeneralizedLinearModel|Generalized linear model]] * [[Statistics/InverseVarianceWeights|Inverse variance weights]] * [[Statistics/IterativelyReweightedLeastSquares|Iteratively reweighted least squares]] |
| Line 85: | Line 92: |
| * [[Statistics/LogisticModel|Logistic Model]] * [[Statistics/MultilevelModel|Multilevel Model]] * [[Statistics/MultilevelRegressionWithPoststratification|Multilevel Regression with Poststratification]] * [[Statistics/OrdinaryLeastSquares|Ordinary Least Squares]] * [[Statistics/StructuralEquationModeling|Structural Equation Modeling]] == Non-parametric Modeling == |
* [[Statistics/LogisticModel|Logistic model]] * [[Statistics/Matching|Matching]] * [[Statistics/MaximumLikelihoodEstimation|Maximum likelihood estimation]] * [[Statistics/MultilevelModel|Multilevel model]] * [[Statistics/MixedModel|Mixed model]] * [[Statistics/MultilevelRegressionWithPoststratification|Multilevel regression with poststratification]] * [[Statistics/OrdinaryLeastSquares|Ordinary least squares]] * [[Statistics/PostStratification|Post-stratification]] * [[Statistics/StandardErrors|Standard errors]] * [[Statistics/Residuals|Residuals]] == Econometrics == * [[Statistics/AutoregressiveModels|Autoregressive models]] * [[Statistics/CensoredAndTruncatedRegressionModels|Censored and Truncated regression models]] * [[Statistics/DifferenceInDifferences|Difference in differences]] * [[Statistics/EconometricsNotation|Econometrics notation]] * [[Statistics/FirstDifferencedEstimator|First-differenced estimator]] * [[Statistics/FixedEffectsModel|Fixed effects model]] * [[Statistics/InstrumentalVariablesMethod|Instrumental variables method]] * [[Statistics/PooledOrdinaryLeastSquaresModel|Pooled OLS model]] * [[Statistics/ProbitModel|Probit model]] * [[Statistics/RandomEffectsModel|Random effects model]] * [[Statistics/UnobservedComponentsModel|Unobserved components model]] * [[Statistics/VectorAutoregression|Vector autoregression]] == Psychometrics == * [[Statistics/FactorAnalysis|Factor analysis]] * [[Statistics/MediationAnalysis|Mediation analysis]] * [[Statistics/StructuralEquationModeling|Structural equation modeling]] (and related reading notes) == Non-parametric modeling == |
| Line 97: | Line 128: |
| * [[Statistics/GradientBoosting|Gradient Boosting]] | * [[Statistics/GradientBoosting|Gradient boosting]] |
| Line 99: | Line 130: |
| * [[Statistics/SupportVectorMachines|Support-Vector Machines]] == Analysis == * [[Statistics/FactorAnalysis|Factor analysis]] * [[Statistics/MediationAnalysis|Mediation analysis]] == Surveying == * [[Statistics/SurveyDisposition|Survey Disposition]] * [[Statistics/FocusGroup|Focus Group]] * [[Statistics/SurveyFrame|Survey Frame]] * [[Statistics/MarginOfError|Margin of Error]] * [[Statistics/NonResponseBias|Non-response Bias]] * [[Statistics/OnlineBulletinBoard|Online Bulletin Board]] * [[Statistics/ResponseRate|Response Rate]] * [[Statistics/QualitativeCoding|Qualitative Coding]] * [[Statistics/SurveyInference|Survey Inference]] * [[Statistics/SurveyWeights|Survey Weights]] == Natural Language Processing == * [[Statistics/BagOfWordsModel|Bag of Words Model]] |
* [[Statistics/SupportVectorMachines|Support-vector machines]] == Survey analysis == * [[Statistics/Calibration|Calibration]] * [[Statistics/DesignWeights|Design weights]] * [[Statistics/DoubleListExperiment|Double list experiment]] * [[Statistics/ExperienceSamplingMethod|Experience sampling method]] * [[Statistics/FocusGroup|Focus group]] * [[Statistics/GeneralizedRegressionEstimator|GREG estimator]] * [[Statistics/InverseProbabilityWeights|Inverse probability weights]] * [[Statistics/MarginOfError|Margin of error]] * [[Statistics/NonresponseBias|Nonresponse bias]] * [[Statistics/OnlineBulletinBoard|Online bulletin board]] * [[Statistics/QualitativeCoding|Qualitative coding]] * [[Statistics/ResponseRate|Response rate]] * [[Statistics/SurveyDisposition|Survey disposition]] * [[Statistics/SurveyInference|Survey inference]] * [[Statistics/SurveyNonresponse|Survey nonresponse]] (and related reading notes) * [[Statistics/SurveyWeights|Survey weights]] (and related reading notes) * [[Statistics/UnequalWeightingAndDesignEffects|Unequal weighting and design effects]] * [[Statistics/UnexpectedEventDuringSurveyDesignFramework|Unexpected event during survey design framework]] * [[Statistics/WeightingClassAdjustment|Weighting class adjustment]] == Natural language processing == * [[Statistics/BagOfWordsModel|Bag of words model]] |
| Line 129: | Line 158: |
| * [[Statistics/WordEmbedding|Word Embedding]] * [[Statistics/RecursiveSequencing|Recursive Sequencing]] * [[Statistics/TopicModel|Topic Model]] * [[Statistics/SentimentAnalysis|Sentiment Analysis]] * [[Statistics/TextClassification|Text Classification]] |
* [[Statistics/WordEmbedding|Word embedding]] * [[Statistics/RecursiveSequencing|Recursive sequencing]] * [[Statistics/TopicModel|Topic model]] * [[Statistics/SentimentAnalysis|Sentiment analysis]] * [[Statistics/TextClassification|Text classification]] |
| Line 139: | Line 166: |
| Note: reading notes for the above topics are listed on the respective pages, not here. * [[EstimationOfRelationshipsForLimitedDependentVariables|Estimation of Relationships for Limited Dependent Variables]], James Tobin, 1958 * [[MultipleFrameSurveys|Multiple Frame Surveys]], H.O. Hartley, 1962 * [[TheCommonStructureOfStatisticalModelsOfTruncationSampleSelectionAndLimitedDependentVariablesAndASimpleEstimatorForSuchModels|The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models]], James J. Heckman, 1976 * [[SequentialSampleSelectionMethods|Sequential Sample Selection Methods]], James R. Chromy, 1979 * [[TheCentralRoleOfThePropensityScoreInObservationalStudiesForCausalEffects|The central role of the propensity score in observational studies for causal effects]], Paul R. Rosenbaum and Donald B. Rubin, 1983 * [[SamplingRarePopulations|Sampling Rare Populations]], Graham Kalton and Dallas W. Anderson, 1986 |
|
| Line 140: | Line 175: |
| * [[TheEffectOfWeightTrimmingOnNonlinearSurveyEstimates|The Effect of Weight Trimming on Nonlinear Survey Estimates]], Frank J. Potter, 1993 | |
| Line 142: | Line 176: |
| * [[SamplingWeightsAndRegressionAnalysis|Sampling Weights and Regression Analysis]], Christopher Winship and Larry Radbill, 1994 * [[ImprovingOnProbabilityWeightingForHouseholdSize|Improving on Probability Weighting for Household Size]], Andrew Gelman and Thomas C. Little, 1998 |
|
| Line 145: | Line 177: |
| * [[MeasurementValidity|Measurement Validity: A Shared Standard for Qualitative and Quantitative Research]], Robert Adcock and David Collier, 2001 * [[DoubleSampling|Double Sampling]], Michael Hidiroglou, 2001 * [[HierarchicalLinearModels|Hierarchical Linear Models: Applications and Data Analysis Methods]], Stephen W. Raudenbush and Anthony S. Bryk, 2002 * [[TheInfluenceOfViolationsOfAssumptionsOnMultilevelParameterEstimatesAndTheirStandardErrors|The influence of violations of assumptions on multilevel parameter estimates and their standard errors]], Cora J.M. Maas and Joop J. Hox, 2003 |
|
| Line 146: | Line 182: |
| * [[StrugglesWithSurveyWeightingAndRegressionModeling|Struggles with Survey Weighting and Regression Modeling]], Andrew Gelman, 2007 | * [[ASimulationStudyOfCellCollapsingInPoststratification|A simulation study of cell collapsing in poststratification]]; Jay J. Kim, Linda Tompkins, Jianzhu Li, and Richard Valliant; 2005 * [[IsOLSWithABinaryDependentVariableReallyOK|Is OLS with a binary dependent variable really OK?: Estimating (mostly) TSCS models with binary dependent variables and fixed effects]], Nathaniel Beck, 2011 |
| Line 148: | Line 185: |
| * [[RespondentUseOfStraightliningAsAResponseStrategyInEducationSurveyResearch|Respondent use of straight-lining as a response strategy in education survey research: Prevalence and implications]]; James S. Cole, Alexander C. Mc``Cormick, Robert M. Gonyea; 2012 | |
| Line 149: | Line 187: |
| * [[WhyAskWhy|Why ask why? Forward causal inference and reverse causal questions]], Andrew Gelman and Guido Imbens, 2013 * [[BeyondPowerCalculations|Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors]], Andrew Gelman and John Carlin, 2014 * [[HowRobustStandardErrorsExposeMethodologicalProblemsTheyDoNotFix|How Robust Standard Errors Expose Methodological Problems They Do Not Fix, and What to Do About It]], Gary King and Margaret E. Roberts, 2015 * [[StraitliningInWebSurveyPanelsOverTime|Straightlining in Web survey panels over time]], Matthias Schonlau and Vera Toepoel, 2015 * [[SamplingBasedVsDesignBasedUncertaintyInRegressionAnalysis|Sampling-based vs. Design-based Uncertainty in Regression Analysis]]; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017 * [[WhenShouldYouAdjustStandardErrorsForClustering|When Should You Adjust Standard Errors for Clustering?]]; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017 * [[WhyPropensityScoresShouldNotBeUsedForMatching|Why Propensity Scores Should Not Be Used for Matching]], Gary King and Richard Nielsen, 2019 |
|
| Line 150: | Line 195: |
| * [[UnexpectedEventDuringSurveyDesign|Unexpected Event during Surveys Design: Promise and Pitfalls for Causal Inference]]; Jordi Muñoz, Albert Falcó-Gimeno, and Enrique Hernández; 2020 * [[APermutationTestOnComplexSampleData|A Permutation Test on Complex Sample Data]], Daniell Toth, 2020 |
|
| Line 151: | Line 198: |
| * [[UsingHierarchicalModelsToEstimateHeterogeneousEffects|Using Hierarchical Models to Estimate Heterogeneous Effects]], Joshua Alley, 2023 * [[OutOfOneMany|Out of One, Many: Using Language Models to Simulate Human Samples]]; Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua R. Gubler, Christopher Rytting, and David Wingate; 2023 * [[CausalModelsForLongitudinalAndPanelData|Causal Models for Longitudinal and Panel Data: A Survey]], Dmitry Arkhangelsky and Guido Imbens, 2023 |
|
| Line 153: | Line 203: |
| * [[TheMicroTaskMarketForLemons|The micro-task market for lemons: data quality on Amazon’s Mechanical Turk]]; Douglas J. Ahler, Carolyn E. Roush, and Gaurav Sood; 2024 * [[AdaptingToMisspecification|Adapting to Misspecification]]; Timothy B. Armstrong, Patrick Kline, and Liyang Sun; 2024 * [[LinkingSurveyAndLinkedInData|Linking Survey and LinkedIn Data: Understanding Usage and Consent Patterns]]; Tarek Al Baghal, Alexander Wenz, Paulo Serôdio, Shujin Liu, Curtis Jessop, and Luke Sloan; 2024 * [[MeasurementErrorWhenSurveyingIssuePositions|Measurement error when surveying issue positions: a MultiTrait MultiError approach]]; Kim Backström, Alexandru Cernat, Rasmus Sirén, and Peter Söderlund; 2025 * [[SelfReportingNewsUseInSituAndInRetrospect|Self-Reporting News Use in Situ and in Retrospect]]; Danit Shalev, Teresa K Naab, and Yariv Tsfati; 2025 * [[WhereToPlaceSensitiveQuestions|Where to place sensitive questions? Experiments on survey response order and measures of discriminatory attitudes]]; Amanda Sahar d’Urso, Tabitha Bonilla, and Genni Bogdanowicz; 2025 * [[DifferenceInDifferencesDesigns|Difference-in-Differences Designs: A Practitioner’s Guide]]; Andrew Baker, Brantly Callaway, Scott Cunningham, Andrew Goodman-Bacon, and Pedro H. C. Sant’Anna; 2025 * [[InferringAPopulationCompositionFromSurveyDataWithNonignorableNonresponse|Inferring a Population Composition From Survey Data With Nonignorable Nonresponse: Borrowing Information From External Sources]], Veronica Ballerini and Brunero Liseo, 2025 |
Statistics
A branch of mathematics.
Foundations
Uncertainty
Probability
Prediction
Probability distributions
Probability tests
Samples
Modeling
Econometrics
Psychometrics
Structural equation modeling (and related reading notes)
Non-parametric modeling
Survey analysis
Survey nonresponse (and related reading notes)
Survey weights (and related reading notes)
Natural language processing
Reading Notes
Note: reading notes for the above topics are listed on the respective pages, not here.
Estimation of Relationships for Limited Dependent Variables, James Tobin, 1958
Multiple Frame Surveys, H.O. Hartley, 1962
The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models, James J. Heckman, 1976
Sequential Sample Selection Methods, James R. Chromy, 1979
The central role of the propensity score in observational studies for causal effects, Paul R. Rosenbaum and Donald B. Rubin, 1983
Sampling Rare Populations, Graham Kalton and Dallas W. Anderson, 1986
Measurement Error Models, Wayne A. Fuller, 1987
Evidence on the Validity of Cross-sectional and Longitudinal Labor Market Data, John Bound, Charles Brown, Greg J. Duncan, and Willard L. Rodgers, 1994
Statistical Modeling: The Two Cultures, Leo Breiman, 2001
Measurement Validity: A Shared Standard for Qualitative and Quantitative Research, Robert Adcock and David Collier, 2001
Double Sampling, Michael Hidiroglou, 2001
Hierarchical Linear Models: Applications and Data Analysis Methods, Stephen W. Raudenbush and Anthony S. Bryk, 2002
The influence of violations of assumptions on multilevel parameter estimates and their standard errors, Cora J.M. Maas and Joop J. Hox, 2003
Ascertaining the validity of individual protocols from Web-based personality inventories, John A. Johnson, 2004
A simulation study of cell collapsing in poststratification; Jay J. Kim, Linda Tompkins, Jianzhu Li, and Richard Valliant; 2005
Is OLS with a binary dependent variable really OK?: Estimating (mostly) TSCS models with binary dependent variables and fixed effects, Nathaniel Beck, 2011
Identifying Careless Responses in Survey Data, Andrew Meade, S. Bartholomew Craig, 2012
Respondent use of straight-lining as a response strategy in education survey research: Prevalence and implications; James S. Cole, Alexander C. McCormick, Robert M. Gonyea; 2012
Estimating Measurement Error in Annual Job Earnings, John M. Abowd and Martha H. Stinson, 2013
Why ask why? Forward causal inference and reverse causal questions, Andrew Gelman and Guido Imbens, 2013
Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors, Andrew Gelman and John Carlin, 2014
How Robust Standard Errors Expose Methodological Problems They Do Not Fix, and What to Do About It, Gary King and Margaret E. Roberts, 2015
Straightlining in Web survey panels over time, Matthias Schonlau and Vera Toepoel, 2015
Sampling-based vs. Design-based Uncertainty in Regression Analysis; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017
When Should You Adjust Standard Errors for Clustering?; Alberto Abadie, Susan Athey, Guido W. Imbens, and Jeffrey M. Wooldridge; 2017
Why Propensity Scores Should Not Be Used for Matching, Gary King and Richard Nielsen, 2019
Regression and Other Stories, Andrew Gelman, Jennifer Hill, and Aki Vehtari, 2020
Unexpected Event during Surveys Design: Promise and Pitfalls for Causal Inference; Jordi Muñoz, Albert Falcó-Gimeno, and Enrique Hernández; 2020
A Permutation Test on Complex Sample Data, Daniell Toth, 2020
The Independent Contractor Workforce: New Evidence on Its Size and Composition and Ways to Improve Its Measurement in Household Surveys; Katharine G. Abraham, Brad J. Hershbein, Susan N. Houseman, and Beth C. Truesdale; 2023
Using Hierarchical Models to Estimate Heterogeneous Effects, Joshua Alley, 2023
Out of One, Many: Using Language Models to Simulate Human Samples; Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua R. Gubler, Christopher Rytting, and David Wingate; 2023
Causal Models for Longitudinal and Panel Data: A Survey, Dmitry Arkhangelsky and Guido Imbens, 2023
The effect of online interviews on the University of Michigan Survey of Consumer Sentiment, Ryan Cummings and Ernie Tedeschi, 2024
The micro-task market for lemons: data quality on Amazon’s Mechanical Turk; Douglas J. Ahler, Carolyn E. Roush, and Gaurav Sood; 2024
Adapting to Misspecification; Timothy B. Armstrong, Patrick Kline, and Liyang Sun; 2024
Linking Survey and LinkedIn Data: Understanding Usage and Consent Patterns; Tarek Al Baghal, Alexander Wenz, Paulo Serôdio, Shujin Liu, Curtis Jessop, and Luke Sloan; 2024
Measurement error when surveying issue positions: a MultiTrait MultiError approach; Kim Backström, Alexandru Cernat, Rasmus Sirén, and Peter Söderlund; 2025
Self-Reporting News Use in Situ and in Retrospect; Danit Shalev, Teresa K Naab, and Yariv Tsfati; 2025
Where to place sensitive questions? Experiments on survey response order and measures of discriminatory attitudes; Amanda Sahar d’Urso, Tabitha Bonilla, and Genni Bogdanowicz; 2025
Difference-in-Differences Designs: A Practitioner’s Guide; Andrew Baker, Brantly Callaway, Scott Cunningham, Andrew Goodman-Bacon, and Pedro H. C. Sant’Anna; 2025
Inferring a Population Composition From Survey Data With Nonignorable Nonresponse: Borrowing Information From External Sources, Veronica Ballerini and Brunero Liseo, 2025
