Differences between revisions 1 and 2
Revision 1 as of 2024-06-05 22:43:56
Size: 1599
Comment: Partial
Revision 2 as of 2024-06-05 22:49:27
Size: 1749
Comment: Partial
Deletions are marked like this. Additions are marked like this.
Line 7: Line 7:
where:

 * ''y'' and ''ε'' are vectors of size ''n''
 * ''β'' is a vector of size ''p''
 * '''''X''''' is a matrix of shape ''n'' by ''p''
Line 11: Line 17:
----
Line 16: Line 20:

----

Ordinary Least Squares Univariate Proof

The model is constructed like:

model1.svg

where:

  • y and ε are vectors of size n

  • β is a vector of size p

  • X is a matrix of shape n by p

The model is fit by a minimization problem:

[ATTACH]

This is estimated as:

model2.svg


This line must pass through the mean and the slope of the line must be the marginal change in Y given a unit change in X. In other words, the line must pass through two points:

model3.svg

where:

  • X‾ is the sample mean of X (estimating μX)

  • Y‾ is the sample mean of Y (estimating μY)

  • sX is the sample standard deviation of X (estimating σX)

  • sY is the sample standard deviation of Y (estimating σY)

  • and rXY is the sample correlation coefficient between X and Y (estimating ρXY)

Insert the first point into the estimation. This is quickly solved for α.

[ATTACH]

[ATTACH]

Insert the second point and the solution for α into the estimation.

[ATTACH]

[ATTACH]

[ATTACH]

This reduced form can be quickly solved for β.

[ATTACH]

Because the correlation coefficient can be expressed in terms of covariance and standard deviations...

[ATTACH]

...the solution for β can be further reduced.

[ATTACH]

Therefore, the regression line is estimated to be:

[ATTACH]


CategoryRicottone

Statistics/OrdinaryLeastSquares/Multivariate (last edited 2025-01-10 14:34:03 by DominicRicottone)