Size: 1253
Comment: Rewrite 2
|
Size: 1437
Comment: Rewrite 3
|
Deletions are marked like this. | Additions are marked like this. |
Line 15: | Line 15: |
where: * ''X‾'' is the sample mean of ''X'' (estimating ''μ,,X,,'') * ''Y‾'' is the sample mean of ''Y'' (estimating ''μ,,Y,,'') * ''s,,X,,'' is the sample standard deviation of ''X'' (estimating ''σ,,X,,'') * ''s,,Y,,'' is the sample standard deviation of ''Y'' (estimating ''σ,,Y,,'') * and ''r,,XY,,'' is the sample correlation coefficient between ''X'' and ''Y'' (estimating ''ρ,,XY,,'') |
|
Line 21: | Line 29: |
Insert the second point into the original form. | Insert the second point and the solution for ''α'' into the estimation. |
Line 23: | Line 31: |
{{attachment:b04.svg}} | {{attachment:beta1.svg}} |
Line 25: | Line 33: |
Now additionally insert the solution for ''a'' in terms of ''b''. | {{attachment:beta2.svg}} |
Line 27: | Line 35: |
{{attachment:b05.svg}} | {{attachment:beta3.svg}} |
Line 29: | Line 37: |
Expand all terms to produce: {{attachment:b06.svg}} This can now be eliminated into: {{attachment:b07.svg}} Giving a solution for ''b'': {{attachment:b08.svg}} This solution is trivially rewritten as: {{attachment:b09.svg}} |
{{attachment:beta4.svg}} |
Ordinary Least Squares Univariate Proof
The model is constructed like:
This is estimated as:
This line must pass through the mean and the slope of the line must be the marginal change in Y given a unit change in X. In other words, the line must pass through two points:
where:
X‾ is the sample mean of X (estimating μX)
Y‾ is the sample mean of Y (estimating μY)
sX is the sample standard deviation of X (estimating σX)
sY is the sample standard deviation of Y (estimating σY)
and rXY is the sample correlation coefficient between X and Y (estimating ρXY)
Insert the first point into the estimation. This is quickly solved for α.
Insert the second point and the solution for α into the estimation.
Expand the formula for correlation as:
This can now be eliminated into:
Finally, b can be eloquently written as:
Giving a generic formula for the regression line: