Differences between revisions 6 and 10 (spanning 4 versions)
Revision 6 as of 2023-10-28 05:32:36
Size: 1022
Comment: Estimates and residuals
Revision 10 as of 2024-06-07 15:14:21
Size: 2471
Comment: Rewrite 2
Deletions are marked like this. Additions are marked like this.
Line 5: Line 5:
== Data == == Observations and Measurements ==
Line 9: Line 9:
The outcome variable is ''y''. For observation ''i'', the outcome value is ''y,,i,,''. The outcome variable is ''y''. The outcome measurement for observation ''i'' is ''y,,i,,''.
Line 11: Line 11:
The treatment variable is ''x,,1,,''. For observation ''i'', the treatment value is ''x,,1i,,''. If there is a single predictor, it may be specified as ''x''; the measurement is ''x,,i,,''. More commonly, there is a set of predictors specified like ''x,,1,,'', ''x,,2,,'', and so on. The measurements are then ''x,,1i,,'', ''x,,2i,,'', and so on.
Line 13: Line 13:
The control variables are ''x,,2,,'' through ''x,,k,,'' (up to ''k'' - 1 control variables). For observation ''i'', a control value might be ''x,,2i,,''. When expressing data with [[LinearAlgebra|linear algebra]], the outcome measurements are composed into vector ''y'' with size ''n'', and the predictor measurements are composed into matrix '''''X''''' of shape ''n'' by ''p''.

A very common exception: income is usually represented by ''Y'' or ''y''. In relevant literature, expect to see different letters.



== Error Terms ==

Error terms are variably represented by ''ε'', ''e'', ''u'', or ''v''. The error term for observation ''i'' would be represented like ''ε,,i,,''.
Line 19: Line 27:
The average outcome is:


== Distributions ==

The [[Statistics/NormalDistribution|normal distribution]] is frequently expressed in econometrics. The typical notation is ''x,,i,, ~ N(μ, σ)''.

For multiple variables, at minimum the distribution is specified as ''NI'' to emphasize independence of the distributions. Some pieces of [[LinearAlgebra|linear algebra]] notation are also introduced. For example, the joint statement of [[Econometrics/Exogeneity|exogeneity]] and [[Econometrics/Homoskedasticity|homoskedasticity]] is:

{{attachment:exo.svg}}

Note how the covariance matrix is fully expressed as the [[LinearAlgebra/SpecialMatrices#Diagonal_Matrices|diagonal matrix]] of each term's variance.



== Statistics ==

There is a mixture of notations for scalar statistics. The conventional estimators for population mean ''μ'', variance ''σ^2^'', standard deviation ''σ'', covariance ''σ,,xy,,'', and correlation ''ρ,,xy,,'' are:
Line 23: Line 48:
The variance is:
Line 26: Line 49:

The standard deviation is:
Line 31: Line 52:
The covariance between the treatment and outcome is:
Line 34: Line 53:

The correlation between the treatment and outcome is:
Line 39: Line 56:
Based on [[Econometrics/LinearRegression|regression]], the estimated outcome for observation ''i'' is: Based on [[Econometrics/OrdinaryLeastSquares|OLS regression]], the estimated outcome for observation ''i'' is:
Line 43: Line 60:
And the residual is: No matter the regression method, the residual is:
Line 46: Line 63:

And the coefficient of determination, a.k.a. the ''R^2^'', is:

{{attachment:rsquared.svg}}

Econometrics Notation

Observations and Measurements

The number of observations is n.

The outcome variable is y. The outcome measurement for observation i is yi.

If there is a single predictor, it may be specified as x; the measurement is xi. More commonly, there is a set of predictors specified like x1, x2, and so on. The measurements are then x1i, x2i, and so on.

When expressing data with linear algebra, the outcome measurements are composed into vector y with size n, and the predictor measurements are composed into matrix X of shape n by p.

A very common exception: income is usually represented by Y or y. In relevant literature, expect to see different letters.

Error Terms

Error terms are variably represented by ε, e, u, or v. The error term for observation i would be represented like εi.

Statistics

Distributions

The normal distribution is frequently expressed in econometrics. The typical notation is xi ~ N(μ, σ).

For multiple variables, at minimum the distribution is specified as NI to emphasize independence of the distributions. Some pieces of linear algebra notation are also introduced. For example, the joint statement of exogeneity and homoskedasticity is:

exo.svg

Note how the covariance matrix is fully expressed as the diagonal matrix of each term's variance.

Statistics

There is a mixture of notations for scalar statistics. The conventional estimators for population mean μ, variance σ2, standard deviation σ, covariance σxy, and correlation ρxy are:

average.svg

variance.svg

sd.svg

covariance.svg

correlation.svg

Based on OLS regression, the estimated outcome for observation i is:

estimate.svg

No matter the regression method, the residual is:

residual.svg

And the coefficient of determination, a.k.a. the R2, is:

rsquared.svg


CategoryRicottone

Statistics/EconometricsNotation (last edited 2025-01-10 14:15:50 by DominicRicottone)