MATH 4780 / MSSC 5780 Regression Analysis
Regression is a statistical technique for investigating and modeling the relationships between variables.
Can you come up with any real-world examples describing relationships between variables deterministically?
Can you provide some real examples that the variables are related each other, but not perfectly related?
💵 In general, one with more years of education earns more.
💵 Any two with the same years of education may have different annual income.
What are the unexplained variation coming from?
What other factors (variables) may affect a person’s income?
your income = f(years of education, major, GPA, college, parent's income, ...)
income
years of education
, which is known and fixed.In Intro Stats, what is the form of \(f\) and what assumptions you made on the random error \(\epsilon\) ?
income
and years of education
.Big problem: \(f(x)\) is unknown and needs to be estimated.
In Intro Stats, what is our estimated regression function \(\hat{f}\)?
We are interested in
age
, education level
, gender
, etc affect salary
?salary
increases/decreases as age
increases one unit?salary
and age
is linear, quadratic or more complicated?Parametric (Linear regression)
Nonparametric (LOESS)
👉 Some nonlinear models can be transformed to an equivalent linear model.
Which nonlinear model above can be transformed into a linear model?