李宏毅機器學習筆記（Where does the error come from ）

原創

2019-07-03 10:24

Where does the error come from 誤差來自哪裏？

一：

提出問題：不同的model對同一個testing data的performance是不同的，而且不一定越複雜的model表現越好，Error來自哪裏？

答案：Error 的來源主要是來自 bias 或者 variance

二：

提出問題：what is bias and variance？

答案： Bias：槍瞄的準不準，最後的期望值能夠落在目標上
即模型的期望輸出與其真實輸出之間的差異。 bias越小，越接近靶心。
Variance：槍性能好不好，打的散不散。
即方差表示數據的離散程度。越小越聚集

三：

提出問題：how bias and variance working？

答案：

bias：簡單model，大的bias
複雜model，小的bias，接近靶心

variance：越簡單的模型，受訓練數據的影響越小

四：

提出問題： bias v.s. variance

答案：

1. error來自bias會欠擬合

error來自variance會過擬合

2.簡單的model，bias大，variance小

複雜的model，bias小，variance大

五：

提出問題：大的bias，大的variance怎麼處理？

答案：首先判斷

1.如果您的模型不能匹配訓練樣例--------------->大的bias

2.如果你能擬合訓練數據，但測試數據誤差較大------------》較大的variance

解決：

對於bias：重新設計你的model ，可增加更多的特性作爲輸入，可使用更復雜的模型。

對於variance：最有效的方法首先是增加訓練數據，其次是Regularization（在調整regularization的位置在variance和bias中取得平衡）

六：

提出問題：如何選擇model？

答案：1.把training set 分爲 training set 和validation set ，先把各個model在trainingset套用後，看在validation上的error

2. 交叉驗證，即將全部的數據在選中的模型上驗證

3.此時的error 才代表public set上的error

ps.如果分壞了，或者可以多分

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

李宏毅學習筆記33.GAN.04.Theory behind GAN

文章目錄簡介MLEMLE=Minimize KL DivergenceGeneratorDiscriminatorD∗D^*D∗和divergence的關係證明GD Algorithm for GAN實作Algorithm for

2020-06-15 20:27:56

李宏毅學習筆記34.GAN.05.fGAN: General Framework of GAN

文章目錄簡介f-divergenceFenchel ConjugateConnection with GANMode CollapseMode Dropping問題分析解決Mode Collapse 簡介上節在講原文GAN的時候

2020-06-15 20:27:56

李宏毅學習筆記36.GAN.06.Feature Extraction

文章目錄簡介InfoGANWhat is InfoGAN?結果VAE-GAN具體算法BiGANAlgorithmTriple GANDomain-adversarial trainingFeature Disentangle 簡介

2020-06-15 20:27:56

李宏毅學習筆記35.GAN.06.Tips for Improving GAN

文章目錄簡介JS divergence來衡量分佈的問題What is the problem of JS divergence?Least Square GAN (LSGAN)Wasserstein GAN (WGAN): Ear

2020-06-15 20:27:56

無監督學習（unsupervised learning） 1.線性方法

無監督學習（unsupervised learning） 1.線性方法 1 unspervised learning Reduction(化繁爲簡)：Clustering & Dimension，只有輸入 Generation

2020-06-14 20:16:14

半監督學習（semi-supervised learning）

# 半監督學習（semi-supervised learning） 1 introduction why semi-supervised learning? 收集數據很貴，收集有標籤的數據更貴！ superviesd：D

2020-06-14 20:16:14

無監督學習（unsupervised learning） 5.生成模型

無監督學習（unsupervised learning） 5.生成模型 1 PixelRNN 每次生成一個像素，下一個像素由之前所有的pixel決定應用：image、audio tips：每個像素用 1-of-N encod

2020-06-14 20:16:04

無監督學習（unsupervised learning） 2.詞嵌入

無監督學習（unsupervised learning） 2.詞嵌入 Word Embedding 1-of-N Encoding：每一個詞用一個向量表示，該詞對應其中的一維 ↓ word class：詞分類 ↓ word

2020-06-14 20:16:04

李宏毅學習筆記34.GAN.04.Theory behind GAN

2020-05-09 14:14:07

李宏毅學習筆記35.GAN.05.fGAN: General Framework of GAN

2020-05-09 14:14:07

李宏毅學習筆記31.GAN.02.Conditional Generation by GAN

2020-05-06 04:15:58

李宏毅學習筆記30.GAN.01.Introduction of Generative Adversarial Network

2020-05-06 04:15:58

李宏毅學習筆記32.GAN.03.Unsupervised Conditional Generation

2020-05-06 04:15:58

李宏毅學習筆記29.Anomaly Detection

2020-04-29 14:51:45

李宏毅學習筆記28.MORE ABOUT AUTO-ENCODER

2020-04-29 14:51:45

24小時熱門文章

DAPPER 事務 TRANSACTION

最新文章

最新評論文章