Differences between revisions 9 and 21 (spanning 12 versions)
Revision 9 as of 2023-10-28 16:49:33
Size: 1408
Comment:
Revision 21 as of 2024-06-05 23:20:59
Size: 2060
Comment: Simplify
Deletions are marked like this. Additions are marked like this.
Line 13: Line 13:
The regression line passes through two points: Given one independent variable and one dependent (outcome) variable, the OLS model is specified as:
Line 15: Line 15:
{{attachment:regression1.svg}} {{attachment:model.svg}}
Line 17: Line 17:
and It is estimated as:
Line 19: Line 19:
{{attachment:regression2.svg}} {{attachment:estimate.svg}}
Line 21: Line 21:
These points, with the generic equation for a line, can [[Econometrics/OrdinaryLeastSquares/UnivariateProof|prove]] that the slope of the regression line is equal to: This model describes (1) the mean observation and (2) the marginal changes to the outcome per unit changes in the independent variable.
Line 23: Line 23:
{{attachment:b12.svg}}

The generic formula for the regression line is:

{{attachment:b13.svg}}
The derivation can be seen [[Econometrics/OrdinaryLeastSquares/Univariate|here]].
Line 33: Line 29:
== Linear Model == == Multivariate ==
Line 35: Line 31:
The linear model can be expressed as: Given ''k'' independent variables, the OLS model is specified as:
Line 37: Line 33:
{{attachment:model1.svg}} {{attachment:mmodel.svg}}

It is estimated as:

{{attachment:mestimate.svg}}

More conventionally, this is estimated with [[LinearAlgebra|linear algebra]] as:

{{attachment:matrix.svg}}

The derivation can be seen [[Econometrics/OrdinaryLeastSquares/Multivariate|here]].

----



== Estimated Coefficients ==
Line 47: Line 59:
Then OLS is the best linear unbiased estimator ('''BLUE''') for these coefficients. Then OLS is the best linear unbiased estimator ('''BLUE''') for regression coefficients.
Line 49: Line 61:
Using the computation above, the coefficients are estimated to produce: The variances for each coefficient are:
Line 51: Line 63:
{{attachment:model3.svg}} {{attachment:homo1.svg}}
Line 53: Line 65:
The variance for each coefficient is estimated as: Note that the standard deviation of the population's parameter is unknown, so it's estimated like:
Line 55: Line 67:
{{attachment:model4.svg}} {{attachment:homo2.svg}}
Line 57: Line 69:
Where R^2^ is calculated as: If the homoskedasticity assumption does not hold, then the estimators for each coefficient are actually:
Line 59: Line 71:
{{attachment:model5.svg}} {{attachment:hetero1.svg}}
Line 61: Line 73:
Note also that the standard deviation of the population's parameter is unknown, so it's estimated like: Wherein, for example, ''r,,1j,,'' is the residual from regressing ''x,,1,,'' onto ''x,,2,,'', ... ''x,,k,,''.
Line 63: Line 75:
{{attachment:model6.svg}} The variances for each coefficient can be estimated with the Eicker-White formula:

{{attachment:hetero2.svg}}

See [[https://www.youtube.com/@kuminoff|Nicolai Kuminoff's]] video lectures for the derivation of the robust estimators.

Ordinary Least Squares

Ordinary Least Squares (OLS) is a linear regression method. It minimizes root mean square errors.


Univariate

Given one independent variable and one dependent (outcome) variable, the OLS model is specified as:

model.svg

It is estimated as:

estimate.svg

This model describes (1) the mean observation and (2) the marginal changes to the outcome per unit changes in the independent variable.

The derivation can be seen here.


Multivariate

Given k independent variables, the OLS model is specified as:

mmodel.svg

It is estimated as:

mestimate.svg

More conventionally, this is estimated with linear algebra as:

matrix.svg

The derivation can be seen here.


Estimated Coefficients

If these assumptions can be made:

  1. Linearity
  2. Exogeneity

  3. Random sampling
  4. No perfect multicolinearity
  5. Homoskedasticity

Then OLS is the best linear unbiased estimator (BLUE) for regression coefficients.

The variances for each coefficient are:

homo1.svg

Note that the standard deviation of the population's parameter is unknown, so it's estimated like:

homo2.svg

If the homoskedasticity assumption does not hold, then the estimators for each coefficient are actually:

hetero1.svg

Wherein, for example, r1j is the residual from regressing x1 onto x2, ... xk.

The variances for each coefficient can be estimated with the Eicker-White formula:

hetero2.svg

See Nicolai Kuminoff's video lectures for the derivation of the robust estimators.


CategoryRicottone

Statistics/OrdinaryLeastSquares (last edited 2025-01-10 14:33:38 by DominicRicottone)