How should we estimate the logistic regression parameters?
Unlike linear regression (Gaussian + identity), logistic regression has no closed-form solution. We cannot simply compute $\beta = (X'X)^{-1}X'y$. The logit link creates a non-linear equation that must be solved iteratively.
We have our model structure defined. Now we need to estimate the parameters ($\beta_0, \beta_1, \ldots, \beta_5$) that best fit the observed data.
Click on a card to select it.
Maximum Likelihood via IRLS is the standard approach for GLMs, and it's required for logistic regression since there's no closed-form solution.
R's glm() and Python's statsmodels use IRLS (a form of Fisher scoring)
by default. For our heart disease model, it typically converges in about 5 iterations.