吳恩達機器學習之評估，判斷

原創

Polya_Xue

2020-06-26 01:40

1.When it's going on a training, the data should be divided into three parts:

training data（訓練集）,cross validation data（交叉驗證集）,and test data（測試集）

There will be three types error:

The three error can be very vital when people suppose to decide what degree of polynominal(多項式次數) to fit to a data set. We should use cross validation data and test data at the same time.

2.If a learning algorithm dosen't do as well as people are hoping, almost al the time it will be because it has either a high bias problem or a high variance problem.

So it's about underfitting problem or overfitting problem.

Here is a way to adjudge the learning algorithm is underfitting or overfitting:

increase the degree of polynominal and watch the change of both test error and cross validation error.

So, if the training error and cv error decrease at the same time, it is the underfitting problem.

Or, if the cv error far more large than train error, it is the overfitting problem.

train data作爲測試集，是最直觀，或者說貼合的反應數據的準確程度。隨着多項式的增多，函數愈發複雜，在多次嘗試下，learning algorithm逐漸趨向與完全貼合測試集，最後看上去似乎是一點失誤都沒有。而實際上，其實已經過擬合了。

但是交叉驗證集就很粗暴了，這組數據同反覆錘鍊的learning algorithm貼合度不高。所以當遇到欠擬合問題時，error很高；遇到過擬合問題時，error很高。單獨比較cv error曲線是不明顯的，拿它和train error比較，結果就很明顯了。

下面的選定合適的正則化參數也是用了相似的辦法。

3.choosing regularization parameter

normally, the defination of cost function includes regularization.Regularization can prevent overfitting.

The choosing of parameter of regularization should be suitable:

The defination of all functions:

The result:

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

吳恩達機器學習之評估，判斷

10分鐘搞定Mysql主從部署配置

如何使用 JS 判斷用戶是否處於活躍狀態

「Pygors跨平臺GUI」2：安裝MinGW-w64、MSYS2還是WSL2

[轉帖]

python列出centos7內存使用前50的進程信息

「Pygors跨平臺GUI」1：Pygors跨平臺GUI應用研究

一鍵自動化博客發佈工具,用過的人都說好(掘金篇)

lightdb數據庫超時相關控制參數

lightdb秒級增加列和刪除列（not null帶默認值）

Java ThreadPoolShutdown

雙目立體視覺-特徵提取之SURF算法

特徵提取之旋轉不變性和尺度不變性的通俗理解

雙目立體視覺-特徵檢測與特徵匹配總結

雙目立體視覺-特徵提取之SIFT算法

轉接

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結