Differences between revisions 3 and 9 (spanning 6 versions)
Revision 3 as of 2024-06-05 20:57:05
Size: 1253
Comment: Rewrite 2
Revision 9 as of 2024-06-05 23:20:10
Size: 1577
Comment: Simplify
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= Ordinary Least Squares Univariate Proof = = OLS Univariate Derivation =
Line 6: Line 6:

The model is fit by a minimization problem:

{{attachment:min.svg}}
Line 15: Line 19:
where:

 * ''X‾'' is the sample mean of ''X'' (estimating ''μ,,X,,'')
 * ''Y‾'' is the sample mean of ''Y'' (estimating ''μ,,Y,,'')
 * ''s,,X,,'' is the sample standard deviation of ''X'' (estimating ''σ,,X,,'')
 * ''s,,Y,,'' is the sample standard deviation of ''Y'' (estimating ''σ,,Y,,'')
 * and ''r,,XY,,'' is the sample correlation coefficient between ''X'' and ''Y'' (estimating ''ρ,,XY,,'')
Line 21: Line 33:
Insert the second point into the original form. Insert the second point and the solution for ''α'' into the estimation.
Line 23: Line 35:
{{attachment:b04.svg}} {{attachment:beta1.svg}}
Line 25: Line 37:
Now additionally insert the solution for ''a'' in terms of ''b''. {{attachment:beta2.svg}}
Line 27: Line 39:
{{attachment:b05.svg}} {{attachment:beta3.svg}}
Line 29: Line 41:
Expand all terms to produce: This reduced form can be quickly solved for ''β''.
Line 31: Line 43:
{{attachment:b06.svg}} {{attachment:beta4.svg}}
Line 33: Line 45:
This can now be eliminated into: Because the correlation coefficient can be expressed in terms of covariance and standard deviations...
Line 35: Line 47:
{{attachment:b07.svg}} {{attachment:correlation.svg}}
Line 37: Line 49:
Giving a solution for ''b'': ...the solution for ''β'' can be further reduced.
Line 39: Line 51:
{{attachment:b08.svg}} {{attachment:beta5.svg}}
Line 41: Line 53:
This solution is trivially rewritten as: Therefore, the regression line is estimated to be:
Line 43: Line 55:
{{attachment:b09.svg}}

Expand the formula for correlation as:

{{attachment:b10.svg}}

This can now be eliminated into:

{{attachment:b11.svg}}

Finally, ''b'' can be eloquently written as:

{{attachment:b12.svg}}

Giving a generic formula for the regression line:

{{attachment:b13.svg}}
{{attachment:regression.svg}}

OLS Univariate Derivation

The model is constructed like:

model1.svg

The model is fit by a minimization problem:

min.svg

This is estimated as:

model2.svg

This line must pass through the mean and the slope of the line must be the marginal change in Y given a unit change in X. In other words, the line must pass through two points:

model3.svg

where:

  • X‾ is the sample mean of X (estimating μX)

  • Y‾ is the sample mean of Y (estimating μY)

  • sX is the sample standard deviation of X (estimating σX)

  • sY is the sample standard deviation of Y (estimating σY)

  • and rXY is the sample correlation coefficient between X and Y (estimating ρXY)

Insert the first point into the estimation. This is quickly solved for α.

alpha1.svg

alpha2.svg

Insert the second point and the solution for α into the estimation.

beta1.svg

beta2.svg

beta3.svg

This reduced form can be quickly solved for β.

beta4.svg

Because the correlation coefficient can be expressed in terms of covariance and standard deviations...

correlation.svg

...the solution for β can be further reduced.

beta5.svg

Therefore, the regression line is estimated to be:

regression.svg


CategoryRicottone

Statistics/OrdinaryLeastSquares/Univariate (last edited 2025-01-10 14:33:54 by DominicRicottone)