Size: 2153
Comment: Rewrite 1
|
Size: 2471
Comment: Rewrite 2
|
Deletions are marked like this. | Additions are marked like this. |
Line 25: | Line 25: |
== Statistics == |
|
Line 29: | Line 34: |
For multiple variables, pieces of [[LinearAlgebra|linear algebra]] notation are introduced. For example, the joint statement of [[Econometrics/Exogeneity|exogeneity]] and [[Econometrics/Homoskedasticity|homoskedasticity]] is: | For multiple variables, at minimum the distribution is specified as ''NI'' to emphasize independence of the distributions. Some pieces of [[LinearAlgebra|linear algebra]] notation are also introduced. For example, the joint statement of [[Econometrics/Exogeneity|exogeneity]] and [[Econometrics/Homoskedasticity|homoskedasticity]] is: |
Line 32: | Line 37: |
Note how the covariance matrix is fully expressed as the [[LinearAlgebra/SpecialMatrices#Diagonal_Matrices|diagonal matrix]] of each term's variance. |
|
Line 37: | Line 44: |
The average outcome is: | There is a mixture of notations for scalar statistics. The conventional estimators for population mean ''μ'', variance ''σ^2^'', standard deviation ''σ'', covariance ''σ,,xy,,'', and correlation ''ρ,,xy,,'' are: |
Line 41: | Line 48: |
The variance is: |
|
Line 44: | Line 49: |
The standard deviation is: |
|
Line 49: | Line 52: |
The covariance between the treatment and outcome is: |
|
Line 52: | Line 53: |
The correlation between the treatment and outcome is: |
Econometrics Notation
Observations and Measurements
The number of observations is n.
The outcome variable is y. The outcome measurement for observation i is yi.
If there is a single predictor, it may be specified as x; the measurement is xi. More commonly, there is a set of predictors specified like x1, x2, and so on. The measurements are then x1i, x2i, and so on.
When expressing data with linear algebra, the outcome measurements are composed into vector y with size n, and the predictor measurements are composed into matrix X of shape n by p.
A very common exception: income is usually represented by Y or y. In relevant literature, expect to see different letters.
Error Terms
Error terms are variably represented by ε, e, u, or v. The error term for observation i would be represented like εi.
Statistics
Distributions
The normal distribution is frequently expressed in econometrics. The typical notation is xi ~ N(μ, σ).
For multiple variables, at minimum the distribution is specified as NI to emphasize independence of the distributions. Some pieces of linear algebra notation are also introduced. For example, the joint statement of exogeneity and homoskedasticity is:
Note how the covariance matrix is fully expressed as the diagonal matrix of each term's variance.
Statistics
There is a mixture of notations for scalar statistics. The conventional estimators for population mean μ, variance σ2, standard deviation σ, covariance σxy, and correlation ρxy are:
Based on OLS regression, the estimated outcome for observation i is:
No matter the regression method, the residual is:
And the coefficient of determination, a.k.a. the R2, is: