Differences between revisions 3 and 29 (spanning 26 versions)
Revision 3 as of 2023-10-28 05:37:48
Size: 1349
Comment:
Revision 29 as of 2025-09-03 02:08:40
Size: 2065
Comment: Apparently mutlivariate regression ~= multiple regression
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
## page was renamed from Econometrics/LinearRegression
Line 4: Line 3:
'''Ordinary Least Squares''' ('''OLS''') is a linear regression method. It minimizes root mean square errors. '''Ordinary Least Squares''' ('''OLS''') is a linear regression method, and is effectively synonymous with the '''linear regression model'''.
Line 12: Line 11:
== Univariate == == Description ==
Line 14: Line 13:
The regression line passes through two points: A linear model is expressed as either {{attachment:model.svg}} (univariate) or {{attachment:mmodel.svg}} (multivariate with ''k'' terms). Either way, a crucial assumption is that the expected value of the error term is 0, such that the [[Statistics/Moments|first moment]] is ''E[y,,i,,|x,,i,,] = α + βx,,i,,''.
Line 16: Line 15:
{{attachment:regression1.svg}}
Line 18: Line 16:
and
Line 20: Line 17:
{{attachment:regression2.svg}} === Single Regression ===
Line 22: Line 19:
Take the generic equation form of a line: In the case of a single predictor, the OLS regression is:
Line 24: Line 21:
{{attachment:b01.svg}} {{attachment:estimate.svg}}
Line 26: Line 23:
Insert the first point into this form. This formulation leaves the components explicit: the y-intercept term is the mean outcome at ''x=0'', and the slope term is marginal change to the outcome per a unit change in ''x''.
Line 28: Line 25:
{{attachment:b02.svg}} The derivation can be seen [[Statistics/OrdinaryLeastSquares/Single|here]].
Line 30: Line 27:
This can be trivially rewritten to solve for ''a'' in terms of ''b'':
Line 32: Line 28:
{{attachment:b03.svg}}
Line 34: Line 29:
Insert the second point into the original form.
Line 36: Line 30:
{{attachment:b04.svg}} === Multiple Regression ===
Line 38: Line 32:
Now additionally insert the solution for ''a'' in terms of ''b''. In the case of multiple predictors, the regression is fit like:
Line 40: Line 34:
{{attachment:b05.svg}} {{attachment:mestimate.svg}}
Line 42: Line 36:
Expand all terms to produce: But conventionally, this OLS system is solved using [[LinearAlgebra|linear algebra]] as:
Line 44: Line 38:
{{attachment:b06.svg}} {{attachment:matrix.svg}}
Line 46: Line 40:
This can now be eliminated into: Note that using a ''b'' here is [[Statistics/EconometricsNotation#Models|intentional]].
Line 48: Line 42:
{{attachment:b07.svg}} The derivation can be seen [[Statistics/OrdinaryLeastSquares/Multiple|here]].
Line 50: Line 44:
Giving a solution for ''b'': ----
Line 52: Line 46:
{{attachment:b08.svg}}
Line 54: Line 47:
This solution is trivially rewritten as:
Line 56: Line 48:
{{attachment:b09.svg}} == Estimated Coefficients ==
Line 58: Line 50:
Expand the formula for correlation as: The '''Gauss-Markov theorem''' demonstrates that (with some assumptions) the OLS estimations are the '''best linear unbiased estimators''' ('''BLUE''') for the regression coefficients. The assumptions are:
Line 60: Line 52:
{{attachment:b10.svg}}  1. Linearity
 2. Exogeneity, i.e. predictors are independent of the outcome and the error term
 3. Random sampling
 4. No perfect [[LinearAlgebra/Basis|multicolinearity]]
 5. Homoskedasticity, i.e. error terms are constant across observations
Line 62: Line 58:
This can now be eliminated into:

{{attachment:b11.svg}}

Finally, ''b'' can be eloquently written as:

{{attachment:b12.svg}}

Giving a generic formula for the regression line:

{{attachment:b13.svg}}
#5 mostly comes into the estimation of [[Statistics/StandardErrors|standard errors]], and there are alternative estimators that are robust to heteroskedasticity.

Ordinary Least Squares

Ordinary Least Squares (OLS) is a linear regression method, and is effectively synonymous with the linear regression model.


Description

A linear model is expressed as either model.svg (univariate) or mmodel.svg (multivariate with k terms). Either way, a crucial assumption is that the expected value of the error term is 0, such that the first moment is E[yi|xi] = α + βxi.

Single Regression

In the case of a single predictor, the OLS regression is:

estimate.svg

This formulation leaves the components explicit: the y-intercept term is the mean outcome at x=0, and the slope term is marginal change to the outcome per a unit change in x.

The derivation can be seen here.

Multiple Regression

In the case of multiple predictors, the regression is fit like:

mestimate.svg

But conventionally, this OLS system is solved using linear algebra as:

matrix.svg

Note that using a b here is intentional.

The derivation can be seen here.


Estimated Coefficients

The Gauss-Markov theorem demonstrates that (with some assumptions) the OLS estimations are the best linear unbiased estimators (BLUE) for the regression coefficients. The assumptions are:

  1. Linearity
  2. Exogeneity, i.e. predictors are independent of the outcome and the error term
  3. Random sampling
  4. No perfect multicolinearity

  5. Homoskedasticity, i.e. error terms are constant across observations

#5 mostly comes into the estimation of standard errors, and there are alternative estimators that are robust to heteroskedasticity.


CategoryRicottone

Statistics/OrdinaryLeastSquares (last edited 2025-09-03 02:08:40 by DominicRicottone)