Topic 11 Revisiting Old Tools
11.1 Discussion
Logistic regression is fit with a technique called maximum likelihood. What is this technique?
Example: flipping a coin 3 times, unknown probability $p$ of getting heads
- The data: 2 heads, 1 tail
- 3 ways that could have happened:
  - T, H, H
  - H, T, H
  - H, H, T
- Each way has probability $p^2(1-p)$
- The probability of seeing my data (as a function of $p$) is $3p^2(1-p)$
- Goal: Find the $p$ that maximizes that probability
- The 3 doesn’t matter for this, so we usually remove such constant terms.
- $p^2(1-p)$ is the likelihood function.
- Imagine (in a non-coin-flipping situation) that we wanted $p$ to depend on predictors…can we relate this back to logistic regression?
- Yes!
  $$p = \frac{e^{\beta_0 + \beta_1 x_1 + \cdots + \beta_p x_p}}{1 + e^{\beta_0 + \beta_1 x_1 + \cdots + \beta_p x_p}}$$
- The value of the likelihood function at the best $p$ is a number: the likelihood of the data (a numerical sketch of the coin example follows this list).
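To make the maximization concrete, here is a minimal numerical sketch of the coin example (assuming only numpy and scipy; the data are hard-coded as 2 heads and 1 tail). Setting the derivative of $p^2(1-p)$ to zero gives $\hat{p} = 2/3$, and the optimizer agrees:

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Likelihood for 2 heads and 1 tail, constant dropped: L(p) = p^2 (1 - p).
# Maximizing L(p) is the same as minimizing the negative log-likelihood.
def neg_log_lik(p):
    return -(2 * np.log(p) + np.log(1 - p))

result = minimize_scalar(neg_log_lik, bounds=(1e-6, 1 - 1e-6), method="bounded")
print(result.x)  # about 0.6667, i.e. p-hat = 2/3
```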
Connecting to stepwise selection
- For logistic regression, stepwise selection works exactly the same, except that the likelihood is used instead of RSS (see the sketch after this list).
- Cross-validated accuracy is an option too.
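As a sketch of what one forward step could look like with this criterion (assuming a recent scikit-learn, a numeric predictor matrix `X`, and binary labels `y`; the helpers `log_likelihood` and `forward_step` are hypothetical names, not from these notes):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

def log_likelihood(model, X, y):
    # log_loss is the average negative log-likelihood,
    # so flip the sign and undo the averaging
    return -log_loss(y, model.predict_proba(X)) * len(y)

def forward_step(X, y, current, candidates):
    # Try adding each remaining predictor to the current model;
    # keep whichever addition gives the largest log-likelihood
    best_j, best_ll = None, -np.inf
    for j in candidates:
        cols = current + [j]
        model = LogisticRegression(penalty=None).fit(X[:, cols], y)  # plain ML fit
        ll = log_likelihood(model, X[:, cols], y)
        if ll > best_ll:
            best_j, best_ll = j, ll
    return best_j, best_ll
```

Repeating this step until no addition helps (or scoring candidate models by cross-validated accuracy instead) gives the full forward procedure.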
Connecting to GAMs
- There can be nonlinear relationships between quantitative predictors and log odds as well.
- e.g. $\log \text{odds}(\text{foreclosure}) = \beta_0 + f_1(\text{Age}) + f_2(\text{Price})$
- Build $f_1$ and $f_2$ from LOESS or splines
- If the relationships are truly nonlinear, this should improve prediction accuracy (see the sketch below).
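Here is a minimal sketch of the foreclosure example, assuming the pygam package; the data below are entirely synthetic, made up just so the example runs end to end:

```python
import numpy as np
from pygam import LogisticGAM, s

rng = np.random.default_rng(0)
n = 500
X = np.column_stack([rng.uniform(20, 80, n),     # Age
                     rng.uniform(50, 500, n)])   # Price (made-up units)
# Invented nonlinear log odds, only to generate a binary outcome
log_odds = -1 + 0.002 * (X[:, 0] - 50) ** 2 - 0.005 * X[:, 1]
y = rng.binomial(1, 1 / (1 + np.exp(-log_odds)))

# s(0) and s(1) are smooth spline terms for Age and Price,
# playing the roles of f1 and f2
gam = LogisticGAM(s(0) + s(1)).fit(X, y)
print(gam.predict_proba(X[:5]))  # estimated foreclosure probabilities
```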
11.2 Exercises
- Consider how the LASSO would be extended to the logistic regression setting. Using the penalized least squares criterion as a reference (restated below), how would you write a penalized criterion for logistic regression using the likelihood?
- On a different note, unrelated to likelihood: consider the KNN algorithm for regression. How would you modify the algorithm to…
  - make a hard classification?
  - produce a “soft” classification? A “soft” classification for a test case is an estimated probability of being in each of the K classes.
A concrete example to frame your answers: say that for a particular test case, you found its 10 nearest neighbors in the training set. 5 were of Class A, 3 of Class B, and 2 of Class C. (A short code check of this example appears below.)
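For reference on the first exercise, the penalized least squares (LASSO) criterion in the regression setting is

$$\sum_{i=1}^{n}\left(y_i - \beta_0 - \sum_{j=1}^{p}\beta_j x_{ij}\right)^2 + \lambda\sum_{j=1}^{p}|\beta_j|,$$

so the question amounts to deciding what should replace the RSS term when the model is fit by maximum likelihood.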
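For the KNN exercise, here is one way to check your answers against scikit-learn, with the 10 neighbors from the concrete example encoded as a tiny made-up training set (all points sit at the same location, so every training point is a nearest neighbor of the test case):

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Made-up data matching the concrete example:
# 10 neighbors, 5 of Class A, 3 of Class B, 2 of Class C
X_train = np.zeros((10, 1))
y_train = np.array(["A"] * 5 + ["B"] * 3 + ["C"] * 2)
x_test = np.zeros((1, 1))

knn = KNeighborsClassifier(n_neighbors=10).fit(X_train, y_train)
print(knn.predict(x_test))        # hard classification: majority class
print(knn.predict_proba(x_test))  # soft classification: one probability per class
```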