：：：：：：：：線性迴歸：：：：：：：：

第一式

第二式

從式一到式二，需要添加一個 $x_{0}$ 項，其中 $x_{0}$ 爲 $x_{0}$ = 1 的常數量。只是爲了容易寫成代碼而已。

真實值=預測值+誤差（誤差是獨立且具有相同的分佈，通常認爲服從均值爲0的方差爲 $\theta ^{2}$ 的高斯分佈。）

此式意思是要找到一個θ值使得該θ與x的組合完之後，使得組合值接近y真實值的概率最大化。

爲了使得概率最大，我們用到了似然函數。

我們所希望的到的L(θ)的值是越大越好——代表了所有的y(i)與其真實值都是儘可能相等的。擊球什麼樣的θ可以使得L(θ)的整體值是最大的。

爲了使得求解變得簡單一些，我們引入對數似然函數 l(θ) = ln L(θ)

牢記，咱們要求的是似然函數L(θ)的值儘可能大，也就是使對數似然函數l(θ)的最大值，通過化簡的到上式，所以咱們要做的就是使右式J(θ)值最小。！

關於J(θ)的求解：

（上面第二步是對 θ 求偏導操作，矩陣求導不做解釋，不過可以從上圖看出一二）

#以下代碼是對以上原理的簡單應用。目前我的環境尚未搭建妥當，所以還沒有去跑代碼，先碼在這裏，等之後參考

import matplotlib.pyplot as plt
import numpy as np
from sklearn import datasets

class LinearRegression():
    def __init__(self):
        self.w = None
    def fit(self,X,y):
        #訓練階段
        #Insert constant ones for bias weights
        print (X.shape)
        #x0 = 1 
        X=np.insert(X,0,1,axis=1)
        print (X.shape)
        #對X的轉置取逆操作。
        X_ = np.linalg.iniv(X.T.dot(X))
        self.w = X_.dot(X.T).dot(y)
    def predict(self,X):
        #測試階段
        #Insert constant ones for bias weights
        X = np.insert(X,0,1,axis=1)
        y_pred = X.dot(self.w)
        return y_pred

def mean_squared_error(y_true, ypred):
    mse = np.mean(np.power(y_true - y_pred, 2))
    return mse


def main():
    #Load the diabetes dataset
    diabetes = datasets.load_diabetes()
    
    #Use only one feature
    X = diabetes.data[:, np.newaxis, 2]
    print(X.shape)
    
    #Split the data into training/testing sets
    x_train, x_test = X[:-20],X[-20:]
    
    #Split the targets into training/testing sets
    y_train, y_test = diabetes.target[:-20], diabetes.target[-20:]
    
    clf = LinearRegression()
    clf.fit(x_train, y_train)
    y_pred = clf.predict(x_test)
    
    #Print the mean squared error
    print ("Mean Souared Error:"mean_squared_error(y_test, y_pred))
    
    #Plot the results
    plt.scatter(x_test[:,0], y_test, color='black')
    plt.plot(x_test[:,0], y_pred, color='blue',linewidth=3)
    plt.show()

參考：

唐_機器學習

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

【ML_Algorithm 1】線性迴歸(Linear Regression)——算法推導及代碼實現

：：：：：：：：線性迴歸：：：：：：：：

第一式

第二式

【C進階】int &x，int* x，int *x，int**。解釋說明

JAVA工程師面試常見問題集錦【轉載，實用乾貨，致謝原作者】

【基本題型】數據結構中基礎題型

【通信2】Wi-Fi模塊（ESP8266-01/ESP-01）入網，成功往IOT平臺POST數據！！！

【通信1】Wi-Fi模塊（ESP8266-01/ESP-01）使用流程

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結