ARTICLE

bias-variance tradeoff

Bias-Variance Tradeoff The Bias-Variance Tradeoff is a core concept in supervised learning and statistics describing the tension between two sources of prediction error. It is fund

浏览 0 更新 2025-11-21

Bias-Variance Tradeoff

The Bias-Variance Tradeoff is a core concept in supervised learning and statistics describing the tension between two sources of prediction error. It is fundamental for diagnosing model performance and selecting appropriate model complexity.

Error Decomposition

Let $y = f(x) + \epsilon$ be the true relationship, with $\epsilon$ as irreducible noise ( $E[\epsilon]=0$ , Var $(\epsilon)=\sigma^2$ ). For estimator $\hat{f}$ , the expected mean squared error decomposes as:

E[(y - \hat{f})^2] = \underbrace{(E[\hat{f}] - f)^2}_{\text{Bias}^2} + \underbrace{E[(\hat{f} - E[\hat{f}])^2]}_{\text{Variance}} + \underbrace{\sigma^2}_{\text{Irreducible Error}}

Bias

Bias measures systematic error from simplifying reality. High bias (underfitting) occurs when models like linear regression cannot capture complex patterns. Low bias means the model's assumptions fit the data well.

Variance

Variance measures prediction sensitivity to training data fluctuations. High variance (overfitting) occurs when flexible models like high-degree polynomial regression fit noise. Low variance yields stable predictions across datasets.

Irreducible Error

Data-inherent noise that no model can surpass—the fundamental error floor from unmeasured variables or measurement error.

The Tradeoff

Bias and variance are inversely related through model complexity:

Simple models (e.g., linear regression, shallow decision trees): low variance, high bias.
Complex models (e.g., deep neural networks): low bias, high variance.

Optimal complexity minimizes total error, producing a U-shaped test-error curve.

Diagnosis

Learning curves: high bias → both errors high and close; high variance → large gap between low training error and high validation error.

Remedies

High bias: increase model complexity, add features, reduce regularization.
High variance: add training data, simplify model, increase regularization (L1, L2), use Bagging (e.g., Random Forest).

Example: KNN

In KNN, $k$ controls the tradeoff: small $k$ → low bias, high variance; large $k$ → high bias, low variance. Cross-validation finds the optimal balance.

关于知经 KNOWECON

知经 KNOWECON 是深圳市卢可教育科技有限公司旗下的教育科技品牌，长期面向北京大学、清华大学、中国人民大学等顶尖院校，提供经济学、金融学、统计学、管理学等相关科目的专业课考研辅导与复试辅导。每年都有数十名同学在我们的帮助下完成系统备考，并成功进入理想院校。

知经主讲人喵喵学长毕业于北京大学汇丰商学院经济学专业和新加坡国立大学金融工程专业，获经济学硕士与金融工程硕士学位。他同时也是软件工程师和教育科技创业者，长期探索用讲义、题库、记忆系统、智能答疑与学习数据工具改善专业课学习体验。

我们相信，好的考研辅导不只是押题和陪跑，更是把复杂知识讲清楚、把复习路径设计清楚，并用技术让学习过程更可追踪、更可反馈、更可坚持。