Convex and Non-Convex Optimisation Problems in Machine Learning

Original

大眼呆萌君

2020-07-04 18:42

Question (145): Among the optimisation problems in machine learning, which are convex and which are non-convex? Give one example of each.

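Definition of convex optimisation: formula, geometric insight
Convex optimisation problems: logistic regression; convexity verified via positive semidefiniteness of the Hessian matrix; every local optimum is a global optimum
Non-convex optimisation problems: PCA; how PCA is solved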

Convex optimisation problems

Logistic regression

$L_i(\theta) = \log(1+\exp(-y_i \theta^T x_i))$, with labels $y_i \in \{-1,+1\}$

Derivation of the loss function
logistic regression model:
$\log \frac{p}{1-p}=\theta^T x \Rightarrow p = \frac{\exp(\theta^T x)}{1+\exp(\theta^T x)}$

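Maximising the likelihood is equivalent to minimising the negative log-likelihood:
$\max_\theta \text{likelihood} \iff \min_\theta \left(-\log \text{likelihood}\right) =: \min_\theta L(x,y;\theta)$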

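$\begin{aligned} L &= - (y \log p + (1-y) \log (1-p)) \\ &= - y \log \frac{1}{1+\exp(-\theta^T x)} - (1-y) \log \frac{1}{1+\exp(\theta^T x)}\\ &= y \log (1+\exp(-\theta^T x)) + (1-y) \log (1+\exp(\theta^T x))\\ &=\log (1+\exp(-\tilde{y}\, \theta^T x)), \end{aligned}$

where $y \in \{0,1\}$, $p=P(Y=1|X=x)$, and $\tilde{y} = 2y-1 \in \{-1,+1\}$ is the relabelled target: $y=1$ selects the first term of the third line and $y=0$ the second. The last line is the loss $L_i(\theta)$ above, which uses labels in $\{-1,+1\}$.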
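As a quick numerical sanity check (a sketch, not part of the original derivation), the snippet below compares the cross-entropy for labels in $\{0,1\}$ with the compact loss $\log(1+\exp(-\tilde{y}\,\theta^T x))$ for $\tilde{y}=2y-1$. It also checks that the Hessian of the mean loss, $\frac{1}{n}\sum_i p_i(1-p_i)\,x_i x_i^T$, has no negative eigenvalues, which is the positive-semidefiniteness criterion for convexity. The synthetic data, the helper names and the seed are assumptions made for illustration; here each row of `X` is one sample.

```python
import numpy as np

def cross_entropy(theta, X, y01):
    """Per-sample cross-entropy for labels in {0, 1}."""
    p = 1.0 / (1.0 + np.exp(-(X @ theta)))
    return -(y01 * np.log(p) + (1 - y01) * np.log(1 - p))

def logistic_loss(theta, X, ypm):
    """Per-sample log(1 + exp(-y * theta^T x)) for labels in {-1, +1}."""
    return np.log1p(np.exp(-ypm * (X @ theta)))

def hessian(theta, X):
    """Hessian of the mean loss: (1/n) * sum_i p_i (1 - p_i) x_i x_i^T."""
    p = 1.0 / (1.0 + np.exp(-(X @ theta)))
    w = p * (1 - p)                      # non-negative weights
    return (X * w[:, None]).T @ X / X.shape[0]

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))             # 50 synthetic samples, 3 features
y01 = rng.integers(0, 2, size=50)        # labels in {0, 1}
ypm = 2 * y01 - 1                        # relabelled to {-1, +1}
theta = rng.normal(size=3)

same_loss = np.allclose(cross_entropy(theta, X, y01),
                        logistic_loss(theta, X, ypm))
min_eig = np.linalg.eigvalsh(hessian(theta, X)).min()
print("losses agree:", same_loss)
print("smallest Hessian eigenvalue:", min_eig)
```

Because the weights $p_i(1-p_i)$ are non-negative for every $\theta$, the Hessian is a non-negative combination of the rank-one matrices $x_i x_i^T$, so the check passes at any point, not only at the sampled `theta`.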

$\textcolor{red}{\text{\small Other examples: SVM, linear regression}}$

Non-convex optimisation problems

PCA

$\min_{V V^T = I_k}L(V)= \| X-V^T V X\|_F^2$

$\textcolor{gray}{\textit{\small (minimise the reconstruction error)}}$

$\textcolor{red}{\text{\small Formulation from the perspective of maximising the variance}}$

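Verifying that this objective is non-convex: check the definition
If $V^\ast$ is a minimiser, then $-V^\ast$ is also a minimiser, since $L(V^\ast)=L(-V^\ast)$ .
$\begin{aligned} L\Big(\frac{1}{2} V^\ast + \frac{1}{2} (-V^\ast) \Big)=L(0)&=\|X\|_F^2 \\ &> \| X-V^{\ast T} V^\ast X\|_F^2=\frac{1}{2} L(V^\ast) + \frac{1}{2} L(-V^\ast) \end{aligned}$
A convex function would require $L(\frac{1}{2}V_1+\frac{1}{2}V_2) \le \frac{1}{2}L(V_1)+\frac{1}{2}L(V_2)$ for all $V_1, V_2$, so $L$ is not convex.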
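The midpoint argument can be reproduced numerically. The following is a minimal sketch on synthetic data, assuming $X$ is stored as a $d \times n$ matrix (one sample per column) and $V$ as a $k \times d$ matrix with orthonormal rows; the function name `recon_loss` and the dimensions are illustrative choices, not from the original post.

```python
import numpy as np

def recon_loss(V, X):
    """Reconstruction error ||X - V^T V X||_F^2 for V (k x d) and X (d x n)."""
    return np.linalg.norm(X - V.T @ V @ X, ord="fro") ** 2

rng = np.random.default_rng(0)
d, n, k = 5, 40, 2
X = rng.normal(size=(d, n))
X = X - X.mean(axis=1, keepdims=True)    # centre each feature

# Rows of V_star: top-k left singular vectors of X (an optimal choice).
U, s, _ = np.linalg.svd(X, full_matrices=False)
V_star = U[:, :k].T

loss_pos = recon_loss(V_star, X)
loss_neg = recon_loss(-V_star, X)            # sign flip leaves V^T V unchanged
loss_mid = recon_loss(np.zeros((k, d)), X)   # midpoint of V* and -V*

print("L(V*)  =", loss_pos)
print("L(-V*) =", loss_neg)
print("L(0)   =", loss_mid, " vs average of endpoints:", 0.5 * (loss_pos + loss_neg))
```

The value at the midpoint exceeds the average of the two endpoint values, which is exactly the violation of the convexity inequality.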

Solution: $\textcolor{red}{\text{\small SVD}}$
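Although the objective is non-convex, a global minimiser can be read off from the SVD. The sketch below assumes the same layout as the objective ($X$ is $d \times n$, $V$ is $k \times d$): the rows of $V$ are taken as the top-$k$ left singular vectors of $X$, and the resulting loss is compared with the sum of the squared discarded singular values and with random feasible points. The helper `pca_by_svd`, the synthetic data and the number of random trials are hypothetical choices for illustration.

```python
import numpy as np

def pca_by_svd(X, k):
    """Return V (k x d) with the top-k left singular vectors of X (d x n) as rows."""
    U, s, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :k].T, s

def recon_loss(V, X):
    """Reconstruction error ||X - V^T V X||_F^2."""
    return np.linalg.norm(X - V.T @ V @ X, ord="fro") ** 2

rng = np.random.default_rng(1)
d, n, k = 6, 100, 2
# Synthetic data whose features have unequal variances.
X = rng.normal(size=(d, n)) * np.arange(1, d + 1)[:, None]
X = X - X.mean(axis=1, keepdims=True)

V, s = pca_by_svd(X, k)
best = recon_loss(V, X)
tail = np.sum(s[k:] ** 2)                # energy in the discarded singular values

# Random feasible points: orthonormal rows obtained from a reduced QR.
random_losses = []
for _ in range(200):
    Q, _ = np.linalg.qr(rng.normal(size=(d, k)))
    random_losses.append(recon_loss(Q.T, X))

print("loss at SVD solution:", best)
print("sum of discarded s_i^2:", tail)
print("best random feasible loss:", min(random_losses))
```

The SVD solution attains the sum of the squared discarded singular values, and no random feasible $V$ does better, consistent with the best rank-$k$ approximation property of the truncated SVD.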

$\textcolor{red}{\text{\small Other examples: low-rank model (e.g. matrix decomposition), deep neural network}}$

References:

《百面機器學習》
