|
Size: 2153
Comment: Rewrite 1
|
← Revision 19 as of 2025-11-03 01:33:40 ⇥
Size: 3008
Comment: Links
|
| Deletions are marked like this. | Additions are marked like this. |
| Line 25: | Line 25: |
| == Distributions == | == Statistics == |
| Line 27: | Line 27: |
| The [[Statistics/NormalDistribution|normal distribution]] is frequently expressed in econometrics. The typical notation is ''x,,i,, ~ N(μ, σ)''. | There is a mixture of notations for scalar statistics. The conventional estimators for population mean ''μ'', [[Statistics/Variance|variance]] ''σ^2^'', standard deviation ''σ'', [[Statistics/Covariance|covariance]] ''σ,,xy,,'', and [[Statistics/Correlation|correlation]] ''ρ,,xy,,'' are: |
| Line 29: | Line 29: |
| For multiple variables, pieces of [[LinearAlgebra|linear algebra]] notation are introduced. For example, the joint statement of [[Econometrics/Exogeneity|exogeneity]] and [[Econometrics/Homoskedasticity|homoskedasticity]] is: | {{attachment:average.svg}} |
| Line 31: | Line 31: |
| {{attachment:exo.svg}} | {{attachment:variance.svg}} {{attachment:sd.svg}} {{attachment:covariance.svg}} {{attachment:correlation.svg}} Frequently for multiple variable statistics, some pieces of [[LinearAlgebra|linear algebra]] notation are introduced. For example, covariances are frequently expressed in a covariance matrix. Covariances of ''x'' and ''y'' are specified as ''σ,,xy,,''; variances are expressed as covariances of ''x'' and ''x''. {{attachment:covariancem.svg}} |
| Line 35: | Line 45: |
| == Statistics == | == Distributions == |
| Line 37: | Line 47: |
| The average outcome is: | The [[Statistics/NormalDistribution|normal distribution]] is commonly used in econometrics, and a shorthand notation has emerged as ''x,,i,, ~ N(μ, σ)''. |
| Line 39: | Line 49: |
| {{attachment:average.svg}} | For multiple variables, at minimum the distribution is specified as ''NI'' to emphasize independence of the distributions. Some pieces of [[LinearAlgebra|linear algebra]] notation are also introduced. |
| Line 41: | Line 51: |
| The variance is: | |
| Line 43: | Line 52: |
| {{attachment:variance.svg}} | |
| Line 45: | Line 53: |
| The standard deviation is: | == Models == |
| Line 47: | Line 55: |
| {{attachment:sd.svg}} | A linear model of ''k'' variables specifies constant ''β,,0,,'' and coefficients ''β,,1,,'' through ''β,,k,,''. The [[LinearAlgebra|linear algebra]] notation uses a coefficient vector ''β'' of size ''p''. |
| Line 49: | Line 57: |
| The covariance between the treatment and outcome is: | When the model is fit using [[Statistics/OrdinaryLeastSquares|regression]], the estimated coefficients are notated using a hat, as in ''ˆβ,,0,,''. The linear algebra notation uses ''b'' instead. |
| Line 51: | Line 59: |
| {{attachment:covariance.svg}} | The predicted outcome is also notated using a hat, as in ''ŷ,,i,,''. |
| Line 53: | Line 61: |
| The correlation between the treatment and outcome is: | The generic calculation of the [[Statistics/Residuals|residual]] for observation ''i'' is ''y,,i,, - ŷ,,i,,''. The residual sum of squares (RSS) is what is minimized to fit a model. |
| Line 55: | Line 63: |
| {{attachment:correlation.svg}} Based on [[Econometrics/OrdinaryLeastSquares|OLS regression]], the estimated outcome for observation ''i'' is: {{attachment:estimate.svg}} No matter the regression method, the residual is: {{attachment:residual.svg}} And the coefficient of determination, a.k.a. the ''R^2^'', is: |
And the coefficient of determination is: |
Econometrics Notation
Observations and Measurements
The number of observations is n.
The outcome variable is y. The outcome measurement for observation i is yi.
If there is a single predictor, it may be specified as x; the measurement is xi. More commonly, there is a set of predictors specified like x1, x2, and so on. The measurements are then x1i, x2i, and so on.
When expressing data with linear algebra, the outcome measurements are composed into vector y with size n, and the predictor measurements are composed into matrix X of shape n by p.
A very common exception: income is usually represented by Y or y. In relevant literature, expect to see different letters.
Error Terms
Error terms are variably represented by ε, e, u, or v. The error term for observation i would be represented like εi.
Statistics
There is a mixture of notations for scalar statistics. The conventional estimators for population mean μ, variance σ2, standard deviation σ, covariance σxy, and correlation ρxy are:
Frequently for multiple variable statistics, some pieces of linear algebra notation are introduced. For example, covariances are frequently expressed in a covariance matrix. Covariances of x and y are specified as σxy; variances are expressed as covariances of x and x.
Distributions
The normal distribution is commonly used in econometrics, and a shorthand notation has emerged as xi ~ N(μ, σ).
For multiple variables, at minimum the distribution is specified as NI to emphasize independence of the distributions. Some pieces of linear algebra notation are also introduced.
Models
A linear model of k variables specifies constant β0 and coefficients β1 through βk. The linear algebra notation uses a coefficient vector β of size p.
When the model is fit using regression, the estimated coefficients are notated using a hat, as in ˆβ0. The linear algebra notation uses b instead.
The predicted outcome is also notated using a hat, as in ŷi.
The generic calculation of the residual for observation i is yi - ŷi. The residual sum of squares (RSS) is what is minimized to fit a model.
And the coefficient of determination is:
