Econometrics Notation
Observations and Measurements
The number of observations is n.
The outcome variable is y. The outcome measurement for observation i is yi.
If there is a single predictor, it may be specified as x; the measurement is xi. More commonly, there is a set of predictors specified like x1, x2, and so on. The measurements are then x1i, x2i, and so on.
When expressing data with linear algebra, the outcome measurements are composed into vector y with size n, and the predictor measurements are composed into matrix X of shape n by p.
A very common exception: income is usually represented by Y or y. In relevant literature, expect to see different letters.
Error Terms
Error terms are variably represented by ε, e, u, or v. The error term for observation i would be represented like εi.
Statistics
There is a mixture of notations for scalar statistics. The conventional estimators for population mean μ, variance σ2, standard deviation σ, covariance σxy, and correlation ρxy are:
Frequently for multiple variable statistics, some pieces of linear algebra notation are introduced. For example, covariances are frequently expressed in a covariance matrix. Covariances of x and y are specified as σxy; variances are expressed as covariances of x and x.
Distributions
The normal distribution is commonly used in econometrics, and a shorthand notation has emerged as xi ~ N(μ, σ).
For multiple variables, at minimum the distribution is specified as NI to emphasize independence of the distributions. Some pieces of linear algebra notation are also introduced. For example, the joint statement of exogeneity and homoskedasticity is:
Note how the covariance matrix is fully expressed as the diagonal matrix of each term's variance.
Modeling
A univariate model is specified with a constant term α and a coefficient term β. A multivariate model of j variables specifies constant β0 and coefficients β1 through βj. A linear algebra notation uses a coefficient vector β of size p.
In any case, when a model is estimated, the estimated coefficients are notated differently. Scalar notations attach a hat, as in βˆ0. The linear algebra notation replaces β with b.
The predicted outcome from a model is also marked as an estimate by attaching a hat: yˆ.
The generic calculation of the residual for observation i is yi - yˆi. The sum of square residuals (SSR) is what is minimized to fit a model.
And the coefficient of determination is: