Differences between revisions 3 and 6 (spanning 3 versions)
Revision 3 as of 2024-06-05 20:57:05
Size: 1253
Comment: Rewrite 2
Revision 6 as of 2024-06-05 21:22:43
Size: 1518
Comment: Rewrite 5 (final)
Deletions are marked like this. Additions are marked like this.
Line 15: Line 15:
where:

 * ''X‾'' is the sample mean of ''X'' (estimating ''μ,,X,,'')
 * ''Y‾'' is the sample mean of ''Y'' (estimating ''μ,,Y,,'')
 * ''s,,X,,'' is the sample standard deviation of ''X'' (estimating ''σ,,X,,'')
 * ''s,,Y,,'' is the sample standard deviation of ''Y'' (estimating ''σ,,Y,,'')
 * and ''r,,XY,,'' is the sample correlation coefficient between ''X'' and ''Y'' (estimating ''ρ,,XY,,'')
Line 21: Line 29:
Insert the second point into the original form. Insert the second point and the solution for ''α'' into the estimation.
Line 23: Line 31:
{{attachment:b04.svg}} {{attachment:beta1.svg}}
Line 25: Line 33:
Now additionally insert the solution for ''a'' in terms of ''b''. {{attachment:beta2.svg}}
Line 27: Line 35:
{{attachment:b05.svg}} {{attachment:beta3.svg}}
Line 29: Line 37:
Expand all terms to produce: This reduced form can be quickly solved for ''β''.
Line 31: Line 39:
{{attachment:b06.svg}} {{attachment:beta4.svg}}
Line 33: Line 41:
This can now be eliminated into: Because the correlation coefficient can be expressed in terms of covariance and standard deviations...
Line 35: Line 43:
{{attachment:b07.svg}} {{attachment:correlation.svg}}
Line 37: Line 45:
Giving a solution for ''b'': ...the solution for ''β'' can be further reduced.
Line 39: Line 47:
{{attachment:b08.svg}} {{attachment:beta5.svg}}
Line 41: Line 49:
This solution is trivially rewritten as: Therefore, the regression line is estimated to be:
Line 43: Line 51:
{{attachment:b09.svg}}

Expand the formula for correlation as:

{{attachment:b10.svg}}

This can now be eliminated into:

{{attachment:b11.svg}}

Finally, ''b'' can be eloquently written as:

{{attachment:b12.svg}}

Giving a generic formula for the regression line:

{{attachment:b13.svg}}
{{attachment:regression.svg}}

Ordinary Least Squares Univariate Proof

The model is constructed like:

model1.svg

This is estimated as:

model2.svg

This line must pass through the mean and the slope of the line must be the marginal change in Y given a unit change in X. In other words, the line must pass through two points:

model3.svg

where:

  • X‾ is the sample mean of X (estimating μX)

  • Y‾ is the sample mean of Y (estimating μY)

  • sX is the sample standard deviation of X (estimating σX)

  • sY is the sample standard deviation of Y (estimating σY)

  • and rXY is the sample correlation coefficient between X and Y (estimating ρXY)

Insert the first point into the estimation. This is quickly solved for α.

alpha1.svg

alpha2.svg

Insert the second point and the solution for α into the estimation.

beta1.svg

beta2.svg

beta3.svg

This reduced form can be quickly solved for β.

beta4.svg

Because the correlation coefficient can be expressed in terms of covariance and standard deviations...

correlation.svg

...the solution for β can be further reduced.

beta5.svg

Therefore, the regression line is estimated to be:

regression.svg


CategoryRicottone

Statistics/OrdinaryLeastSquares/Univariate (last edited 2025-01-10 14:33:54 by DominicRicottone)