⇤ ← Revision 1 as of 2024-06-05 22:43:56
Size: 1599
Comment: Partial
|
Size: 1749
Comment: Partial
|
Deletions are marked like this. | Additions are marked like this. |
Line 7: | Line 7: |
where: * ''y'' and ''ε'' are vectors of size ''n'' * ''β'' is a vector of size ''p'' * '''''X''''' is a matrix of shape ''n'' by ''p'' |
|
Line 11: | Line 17: |
---- |
|
Line 16: | Line 20: |
---- |
Ordinary Least Squares Univariate Proof
The model is constructed like:
where:
y and ε are vectors of size n
β is a vector of size p
X is a matrix of shape n by p
The model is fit by a minimization problem:
This is estimated as:
This line must pass through the mean and the slope of the line must be the marginal change in Y given a unit change in X. In other words, the line must pass through two points:
where:
X‾ is the sample mean of X (estimating μX)
Y‾ is the sample mean of Y (estimating μY)
sX is the sample standard deviation of X (estimating σX)
sY is the sample standard deviation of Y (estimating σY)
and rXY is the sample correlation coefficient between X and Y (estimating ρXY)
Insert the first point into the estimation. This is quickly solved for α.
Insert the second point and the solution for α into the estimation.
This reduced form can be quickly solved for β.
Because the correlation coefficient can be expressed in terms of covariance and standard deviations...
...the solution for β can be further reduced.
Therefore, the regression line is estimated to be: