核稀疏表示公式推導

本文是對文章kernel sparse representation with local patterns
for face recognition中2.2節公式的推導
在優化求解中,很多人可能都會見過這個公式:

min β 1 2 ∥ X β - y ∥ 22

而根據正則項的不同,l1正則項,l2正則項,可以進行細分:
http://blog.csdn.net/liyuan123zhouhui/article/details/51882926

對於稀疏表示中的稀疏求解:

min β 1 2 ∥ X β - y ∥ 22 + λ ∥ β ∥ 1

左邊項是待求解的係數,右邊那一項是正則項,是對解的約束,
是對解的稀疏性的約束,至於爲什麼加這個正則項就會產生稀疏解
可以看看上面的網址

X表示的是字典,bata表示的是稀疏表示係數,lambda是l1正則項的係數

按照2.2節中的公式3:φ(⋅) 是隱式的特徵映射,將向量映射到核空間中,我們假設該映射滿足以下條件:
φ(x)Tφ(x)=1 when ∥x∥2=1
即,當X的二範數爲1時,其核空間的內積也爲1
對於最優解,採用座標下降法進行求解,那麼將其他βi(i=1⋯n,i≠j) 固定,只對βj 進行求偏導數:

0 = φ (x j) T [\sum i = 1 n β i φ (x i) - φ (y)] + λ sign (β j)

對於正則項的求導,由於是 λ 乘以βj 的l1範數,也就是所有的 β 絕對值的和,那麼求導後就只剩下λ∣∣βj∣∣ ,對一個數的絕對值進行求導,得到的就是他的符號函數,對公式進行下一步推導:

0 = φ (x j) T φ (x j) β j + φ (x j) T ⎡ ⎣ \sum i = 1, i \neq j n β i φ (x i) - φ (y) ⎤ ⎦ + λ sign (β j)

↓

β j + λ sign (β j) = φ (x j) T ⎡ ⎣ φ (y) - \sum i = 1, i \neq j n β i φ (x i) ⎤ ⎦

↓

(∣ ∣ β j ∣ ∣ + λ) sign (β j) = φ (x j) T ⎡ ⎣ φ (y) - \sum i = 1, i \neq j n β i φ (x i) ⎤ ⎦

左邊括號內都大於0:

sign (β j) = sign ⎛ ⎝ φ (x j) T ⎡ ⎣ φ (y) - \sum i = 1, i \neq j n β i φ (x i) ⎤ ⎦ ⎞ ⎠

↓

β j = α - λ sign (α)

α = φ (x j) T ⎡ ⎣ φ (y) - \sum i = 1, i \neq j n β i φ (x i) ⎤ ⎦

↓

β j = sign (α) (| α | - λ) +

還有這樣一個約束:

(s) + = {s, s > 0 0, o t h e r w i s e

這個約束是因爲

sign(α) 和

sign(βj) 的符號是相等的,因此

|α|−λ 只能夠大於等於0
由於

φ(⋅) 是核映射,因此,上面的公式在覈空間可以重新寫做:

α = K (x j, y) - \sum n i = 1, i \neq j β i K (x j, x i)

有了l1範數,再自己推導l2範數就簡單了,直接給出結果:
代價函數:

min 1 2 ∥ X β - y ∥ 22 + λ ∥ β ∥ 22

β j = ⎡ ⎣ K (x j, y) - \sum i = 1, i \neq j n β i K (x j, x i) ⎤ ⎦ / (1 + 2 λ)

elastic net:
代價函數:

min 1 2 ∥ X β - y ∥ 22 + λ ∥ β ∥ 1 + (1 - λ) ∥ β ∥ 22

β j = s i g n ( α ) { | α | - λ } + ( 3 - 2 λ )

其中:

α = φ (x j) T ⎡ ⎣ φ (y) - \sum i = 1, i \neq j n β i φ (x i) ⎤ ⎦

↓

(s) + = {s, s > 0 0, o t h e r w i s e

下面給出matlab代碼:


function [beta iter] = KernelCoorDescent ( R, Z, opt )

% Kernel Coordinate Descent (KCD) Algorithm Version 1.0
% KCD is for learning sparse representation in kernel space.
%
% Input:
% R = K(X,X) is a P-by-P kernel matrix, where P is the number of samples.
%    For example, R = X' * X in linear case.X是訓練樣本？
% Z = K(X,Y) is a P-by-1 kernel vector. For example, Z = X' * Y in linear
%    case.
% opt is a structure containing options for the algorithm.
% opt.lambda is the parameter for the l1 penality.
% opt.tol is the tolerance for convergence.
% opt.iter_num is the maximum number of iterations.
%
% Output:
% beta: the cofficient vector for the sparse representation
%
% Reference:
%    Cuicui Kang, Shengcai Liao, Shiming Xiang, and Chunhong Pan. 
%    "Kernel Sparse Representation With Pixel-level and Region-level 
%    Image Feature Kernels For Face Recognition", Neurocomputing, 
%    Volume 133, Pages141-152, 2014.
%
% Author: Kang Cuicui
% Email : [email protected]
% Date : 2013/11/20

if isfield( opt, 'lambda')
    lambda  = opt.lambda ;
else
    lambda  = 0.01 ;
end

if isfield( opt, 'tol')
    tol  = opt.tol ;
else
    tol  = 1e-6 ;
end

if isfield( opt, 'iter_num')
    iter_num  = opt.iter_num ;
else
    iter_num  = 50 ;
end

% initializition
[P, N]= size(R);
beta = zeros( N, 1 );

for iter = 1: iter_num

    prebeta = beta;

    for j = 1 : P

        a = Z(j) - R( j, : )*beta;
        a = a + beta(j, :);

        if abs( a ) < lambda        
            beta( j, : ) = 0;        
        else        
            beta( j, : ) = sign( a ).*( abs( a ) - lambda );        
        end

    end

%     if mod ( iter, 5) == 0
%         fprintf('CoorDescent Iteration %d of %d \n', iter, iter_num);
%     end
    if norm(beta-prebeta, 'fro' )/norm(prebeta, 'fro' )  < tol
        break;
    end

end

參考文獻:
kernel sparse representation with local patterns for face recognition
Regularization path for generalized linear models vis coordinate descent

核稀疏表示公式推導

MySQL 核心模塊揭祕 | 18 期 | 鎖在內存里長什麼樣*

使用perf工具生成火焰圖

大齡程序員思考

響應式界面控件DevExtreme * 更強的數據分析和可視化功能

HttpSecurity 是如何組裝過濾器鏈的

數說海南——近6年海南各市縣人口簡單看

長序列中Transformers的高級注意力機制總結

WebStorm 創建 Vue 項目

caffe用python產生prototxt文件

selu激活函數和自歸一化網絡(SNN)

tf moving average

tensorflow調參總結（不斷更新中）

caffe group參數

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結