KNN: The K-Nearest Neighbors Algorithm

Original

2020-04-17 02:33

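KNN: k-Nearest Neighbor (the k nearest neighbors)
The core idea of KNN is that if most of the k samples closest to a given sample in feature space belong to one class, then that sample is assigned to the same class and is assumed to share the characteristics of the samples in it.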

In one sentence: birds of a feather flock together.
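The majority-vote idea described above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not how sklearn implements it: the function name `knn_classify`, the toy points, and the labels "A" and "B" are all made up for the example, and Euclidean distance is assumed as the distance measure.

```python
from collections import Counter
import math


def knn_classify(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training point
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep only the k closest neighbours
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    # The most common label among those neighbours wins
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters
train_points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
                (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_labels = ["A", "A", "A", "B", "B", "B"]

print(knn_classify(train_points, train_labels, (1.1, 1.0)))  # A
print(knn_classify(train_points, train_labels, (5.1, 5.0)))  # B
```

A query point takes the label of whichever cluster it sits in, because all three of its nearest neighbours come from that cluster.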

Related sklearn code

from sklearn.neighbors import KNeighborsRegressor
from sklearn.metrics import mean_squared_error

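# X_train, Y_train, X_test and Y_test are assumed to be prepared beforehand
knn = KNeighborsRegressor()
knn.fit(X_train, Y_train)  # train the model
predictions = knn.predict(X_test)  # predict on the test set

mse = mean_squared_error(Y_test, predictions)  # mean squared error
rmse = mse ** (1 / 2)  # evaluate the model with root mean squared error (RMSE)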

sklearn.neighbors.KNeighborsRegressor
The target is predicted by local interpolation of the targets associated with the nearest neighbors in the training set.

fit(self, X, y)
Fit the model using X as training data and y as target values

predict(self, X)
Predict the target for the provided data
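To make "local interpolation" concrete, here is a small sketch using `fit` and `predict` on made-up one-dimensional data. The data values and the choice of `n_neighbors=2` are assumptions for illustration only; with the default uniform weights, the prediction is the plain average of the neighbours' targets.

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# Hypothetical training data: the target simply equals the feature
X_train = np.array([[0.0], [1.0], [2.0], [3.0], [10.0]])
y_train = np.array([0.0, 1.0, 2.0, 3.0, 10.0])

# n_neighbors=2 is chosen for illustration; weights default to "uniform"
knn = KNeighborsRegressor(n_neighbors=2)
knn.fit(X_train, y_train)

# The two nearest neighbours of 1.4 are 1.0 and 2.0,
# so the prediction is the mean of their targets: (1.0 + 2.0) / 2
prediction = knn.predict(np.array([[1.4]]))
print(prediction[0])  # 1.5
```

The far-away sample at 10.0 has no influence on this prediction, because only the two nearest neighbours are averaged.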

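Advantages of KNN

The model is simple and needs no explicit training phase: the training samples are simply stored, and the real work happens at prediction time.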

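Disadvantages of KNN

Low efficiency and slow prediction: every query sample is compared against every sample in the training set.
Heavy dependence on the training data and poor tolerance of errors in it: if one or two training samples are wrong and happen to lie right next to the sample being classified, the prediction will be inaccurate.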
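Both drawbacks can be seen in a small plain-Python sketch. Everything here is hypothetical: the helper `knn_predict`, the five toy points, and the deliberately wrong label. The helper computes one distance per training point for every query (the brute-force cost), and a single mislabeled point next to the query flips the answer when k=1. Using a larger k is shown only as one possible way to soften the effect, not as a general fix.

```python
from collections import Counter
import math


def knn_predict(points, labels, query, k):
    """Brute-force KNN: one distance computation per training point."""
    ranked = sorted(zip(points, labels),
                    key=lambda pl: math.dist(pl[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: every point truly belongs to class "A"
points = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (0.3, 0.2), (0.15, 0.15)]
clean_labels = ["A", "A", "A", "A", "A"]
noisy_labels = ["A", "A", "A", "A", "B"]  # last label is deliberately wrong

query = (0.16, 0.16)  # lies right next to the mislabeled point

print(knn_predict(points, clean_labels, query, k=1))  # A
print(knn_predict(points, noisy_labels, query, k=1))  # B (the bad label wins)
print(knn_predict(points, noisy_labels, query, k=3))  # A (outvoted 2 to 1)
```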
