Differences between revisions 6 and 12 (spanning 6 versions)
Revision 6 as of 2023-10-28 07:04:18
Size: 2049
Comment:
Revision 12 as of 2023-10-28 17:49:29
Size: 1660
Comment:
Deletions are marked like this. Additions are marked like this.
Line 21: Line 21:
Take the generic equation form of a line:

{{attachment:b01.svg}}

Insert the first point into this form.

{{attachment:b02.svg}}

This can be trivially rewritten to solve for ''a'' in terms of ''b'':

{{attachment:b03.svg}}

Insert the second point into the original form.

{{attachment:b04.svg}}

Now additionally insert the solution for ''a'' in terms of ''b''.

{{attachment:b05.svg}}

Expand all terms to produce:

{{attachment:b06.svg}}

This can now be eliminated into:

{{attachment:b07.svg}}

Giving a solution for ''b'':

{{attachment:b08.svg}}

This solution is trivially rewritten as:

{{attachment:b09.svg}}

Expand the formula for correlation as:

{{attachment:b10.svg}}

This can now be eliminated into:

{{attachment:b11.svg}}

Finally, ''b'' can be eloquently written as:
These points, with the generic equation for a line, can [[Econometrics/OrdinaryLeastSquares/UnivariateProof|prove]] that the slope of the regression line is equal to:
Line 69: Line 25:
Giving a generic formula for the regression line: The generic formula for the regression line is:
Line 86: Line 42:
 2. Exogeneity

{{attachm
ent:model2.svg}}

 3.#3 Random sampling
 2. [[Econometrics/Exogeneity|Exogeneity]]
 3. Random sampling
Line 92: Line 45:
 5. Heteroskedasticity  5. [[Econometrics/Homoskedasticity|Homoskedasticity]]
Line 100: Line 53:
The variance for each coefficient is estimated as: The variances for each coefficient are:
Line 103: Line 56:

Where R^2^ is calculated as:

{{attachment:model5.svg}}
Line 112: Line 61:
If the homoskedasticity assumption does not hold, then the estimators for each coefficient are actually:

{{attachment:hetero1.svg}}

It follows that the variances for each coefficient are:

{{attachment:hetero2.svg}}

These variances can be estimated with the Eicker-White formula:

{{attachment:hetero3.svg}}

Ordinary Least Squares

Ordinary Least Squares (OLS) is a linear regression method. It minimizes root mean square errors.


Univariate

The regression line passes through two points:

[ATTACH]

and

[ATTACH]

These points, with the generic equation for a line, can prove that the slope of the regression line is equal to:

[ATTACH]

The generic formula for the regression line is:

[ATTACH]


Linear Model

The linear model can be expressed as:

model1.svg

If these assumptions can be made:

  1. Linearity
  2. Exogeneity

  3. Random sampling
  4. No perfect multicolinearity
  5. Homoskedasticity

Then OLS is the best linear unbiased estimator (BLUE) for these coefficients.

Using the computation above, the coefficients are estimated to produce:

[ATTACH]

The variances for each coefficient are:

[ATTACH]

Note also that the standard deviation of the population's parameter is unknown, so it's estimated like:

[ATTACH]

If the homoskedasticity assumption does not hold, then the estimators for each coefficient are actually:

[ATTACH]

It follows that the variances for each coefficient are:

[ATTACH]

These variances can be estimated with the Eicker-White formula:

[ATTACH]


CategoryRicottone

Statistics/OrdinaryLeastSquares (last edited 2025-05-17 03:48:23 by DominicRicottone)