清晰易懂版相關濾波推導

讀研期間做相關濾波視覺跟蹤，讀了很多相關濾波的論文，但是由於當年矩陣論和數字信號處理基礎太菜，一直沒搞明白相關濾波的閉合解是怎麼推出來的，各種論文裏面的推導都是直接給個結果，過程不詳。最近終於搞明白了推導的過程和原理，在這裏記錄一下。我覺得應該是全網最清晰最易懂的相關濾波推導了。

如果不加說明，以下粗體字 $\mathbf{x}$ 代表列向量，加帽 $\hat{\mathbf{x}}$ 代表 $\mathbf{x}$ 的傅里葉變換

在推導之前首先回顧幾個定理：

循環卷積定理 若 $\mathbf{x}$ 的傅里葉變換爲 $\hat{\mathbf{x}}$ ， $\mathbf{y}$ 的傅里葉變換爲 $\hat{\mathbf{y}}$ ，則 $\mathbf{x}$ 與 $\mathbf{y}$ 的卷積 $\mathbf{x}\otimes\mathbf{y}$ 的傅里葉變換爲 $\hat{\mathbf{x}}$ 與 $\hat{\mathbf{y}}$ 的點積 $\hat{\mathbf{x}}\odot\hat{\mathbf{y}}$ 。

帕斯瓦爾定理
$||\mathbf{x}||^2=\frac{1}{N}||\hat{\mathbf{x}}||^2$
$N$ 爲信號長度。

還有一個小結論，信號的反序的傅里葉變換等於原信號傅里葉變換的共軛

證明一下，記 $\mathbf{y}$ 爲 $\mathbf{x}$ 的反序，即 $\mathbf{y}[k]=\mathbf{x}[N-k]$
$\hat{\mathbf{y}}[k]=\sum_{n=0}^{N-1}\mathbf{x}[N-n]e^{-j\frac{2\pi}{N}n}$
令 $N-n=t$
$\hat{\mathbf{y}}[k]=\sum_{t=0}^{N-1}\mathbf{x}[t]e^{-j\frac{2\pi}{N}(N-t)}\\ =\sum_{t=0}^{N-1}\mathbf{x}[t]e^{j\frac{2\pi}{N}t}=\hat{\mathbf{x}^{*}}[k]$

在卷積操作中，信號要經過反褶操作，而信號的互相關沒有經過反褶操作，因此則 $\mathbf{x}$ 與 $\mathbf{y}$ 的互相關 $\mathbf{x}*\mathbf{y}$ 的傅里葉變換爲 $\hat{\mathbf{x}^{*}}$ 與 $\hat{\mathbf{y}}$ 的點積 $\hat{\mathbf{x}^{*}}\odot\hat{\mathbf{y}}$

有了這幾個定理，現在開始推導吧。

相關濾波是要學習一個與 $\mathbf{x}$ 維度相同的濾波器 $\mathbf{h}$ ，使兩者的互相關 $\mathbf{h}*\mathbf{x}$ 儘量接近目標函數 $\mathbf{y}$ ，使下面的目標函數最小。
$E(\mathbf{h})=||\mathbf{h}*\mathbf{x}-\mathbf{y}||^2+\lambda||\mathbf{h}||^2$
根據帕斯瓦爾定理，最小化上式相當於最小化
$E(\hat{\mathbf{h}})=||\hat{\mathbf{h}}\odot\hat{\mathbf{x}}-\hat{\mathbf{y}}||^2+\lambda||\hat{\mathbf{h}}||^2$
把點積寫成矩陣相乘的形式
$E(\hat{\mathbf{h}})=||diag(\hat{\mathbf{x}})\hat{\mathbf{h}}-\hat{\mathbf{y}}||^2+\lambda||\hat{\mathbf{h}}||^2$
往下推還需要知道幾個結論
$||\mathbf{x}||^2=\mathbf{x}^H\mathbf{x}$
H表示共軛轉置

下面這兩個結論參見《matrix cookbook》
$\frac{\partial \mathbf{x^TAx}}{\partial \mathbf{x}}=\mathbf{A+A^T}$

$\frac{\partial \mathbf{x^T a}}{\partial \mathbf{x}}=\frac{\partial \mathbf{a^T x}}{\partial \mathbf{x}}=\mathbf{a}$

接着往下推
$E(\hat{\mathbf{{h}}})=(diag(\hat{\mathbf{x}})\hat{\mathbf{h}}-\hat{\mathbf{y}})^H(diag(\hat{\mathbf{x}})\hat{\mathbf{h}}-\hat{\mathbf{y}})+\lambda \hat{\mathbf{h}}^H \hat{\mathbf{h}}\\ =\hat{\mathbf{h}}^Hdiag(\hat{\mathbf{x}^*})diag(\hat{\mathbf{x}})\hat{\mathbf{h}}-\hat{\mathbf{h}}^Hdiag(\hat{\mathbf{x}^*})\hat{\mathbf{y}}-\hat{\mathbf{y}}^Hdiag(\hat{\mathbf{x}})\hat{\mathbf{h}}+\hat{\mathbf{y}}^H\hat{\mathbf{y}}+\lambda \hat{\mathbf{h}}^H\hat{\mathbf{h}}$
將目標函數對 $\hat{\mathbf{h}}$ 求偏導(共軛轉置和轉置不太一樣，但是驗證一下也是成立的，不太嚴謹地求個導)
$\frac{\partial E(\hat{\mathbf{h}})}{\partial \hat{\mathbf{h}}}=2diag(\hat{\mathbf{x}^{*}}\odot{\hat{\mathbf{x}}})\hat{\mathbf{h}}-2diag(\hat{\mathbf{x}^{*}})\hat{\mathbf{y}}+2\lambda \hat{\mathbf{h}}$
令 $\frac{\partial E(\hat{\mathbf{h}})}{\partial \hat{\mathbf{h}}}=0$ ，解得
$\hat{\mathbf{h}}=(diag(\hat{\mathbf{x}^{*}}\odot{\hat{\mathbf{x}}})+\lambda I)^{-1}diag(\hat{\mathbf{x}^{*}})\hat{\mathbf{y}}\\ =\frac{\hat{\mathbf{x}^{*}}\odot\hat{\mathbf{y}}}{\hat{\mathbf{x}^{*}}\odot{\hat{\mathbf{x}}}+\lambda}$

這就是論文裏常見的那個公式的由來了。

多通道情況的推導參見fDSST論文的附錄，利用了每個像素之間的獨立性。

另外 Henriques的KCF論文[2]裏是利用循環矩陣傅里葉變換對角化的性質去推導的，很多博主的推導都指出原論文推導結果錯誤，推出的濾波器結果是
$\hat{\mathbf{w}}=\frac{\hat{\mathbf{x}}\odot\hat{\mathbf{y}}}{\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}}+\lambda}$
分母上的 $\hat{\mathbf{x}}$ 沒有共軛，而且從論文的附錄5的推導看，確實推不到式(57)，後來看了一下作者的學位論文，推導這個公式的起點不是論文中帶有共軛轉置的(3)式，而是論文的(2)式。
$\mathbf{w}=(X^TX+\lambda I)^{-1}X^Ty$
也就是說樣本矩陣 $X$ 學習到的濾波器 $\mathbf{w}$ 在空域中都是實數。

回顧一下論文裏的樣本矩陣
$X=C(\mathbf{x})=\left[\begin{array}{ccccc}{x_{1}} & {x_{2}} & {x_{3}} & {\cdots} & {x_{n}} \\ {x_{n}} & {x_{1}} & {x_{2}} & {\cdots} & {x_{n-1}} \\ {x_{n-1}} & {x_{n}} & {x_{1}} & {\cdots} & {x_{n-2}} \\ {\vdots} & {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ {x_{2}} & {x_{3}} & {x_{4}} & {\cdots} & {x_{1}}\end{array}\right]$
再使用這個性質
$X=F diag(\hat{\mathbf{x}})F^{H}$
這個時候發現 $X^T$ 的第一行正好是 $\mathbf{x}$ 的倒序，因此它的傅里葉變換爲 $\hat{\mathbf{x}^*}$ , 所以
$X^TX=Fdiag(\hat{\mathbf{x}^*})F^HFdiag(\hat{\mathbf{x}})F^H=Fdiag(\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}})F^H$
再代入上面的式子
$\mathbf{w}=(Fdiag(\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}})F^H+\lambda FF^H)^{-1}Fdiag(\hat{\mathbf{x}^*})F^H\mathbf{y}\\ =Fdiag(\frac{1}{\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}}+\lambda})F^{H}Fdiag(\hat{\mathbf{x}^*})F^H\mathbf{y}\\ =Fdiag(\frac{\hat{\mathbf{x}}}{\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}}+\lambda})\hat{\mathbf{y}^*}$
兩邊乘 $F^H$
$\hat{\mathbf{w}}^*=diag(\frac{\hat{\mathbf{x}}}{\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}}+\lambda})\hat{\mathbf{y}^*}$
兩邊取共軛
$\hat{\mathbf{w}}=\frac{\hat{\mathbf{x}^{*}}\odot\hat{\mathbf{y}}}{\hat{\mathbf{x}^*}\odot\hat{\mathbf{x}}+\lambda}$
結果相同，因此原論文結果正確，只是推導過程錯誤。

記錄一點自己的理解，希望可以幫到大家。

參考博文

KCF目標跟蹤方法分析與總結

KCF公式推導錯誤及驗證

MOSSE（DSST）類和KCF類中濾波器推導結果不一致的解釋

參考文獻

[1] Petersen K B, Pedersen M S The Matrix Cookbook book

[2] Danelljan M , Häger, Gustav, Khan F S , et al. Discriminative Scale Space Tracking[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, 39(8):1561-1575. paper

[3] Henriques J F, Rui C, Martins P, et al. High-Speed Tracking with Kernelized Correlation Filters[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015, 37(3):583-596. paper

清晰易懂版相關濾波推導

參考博文

參考文獻

再談23種設計模式（3）：行爲型模式（學習筆記）

Power Automate Desktop 安裝完，登錄後老是提示one driver 錯誤

微前端學習筆記(4):從微前端到微模塊之EMP與hel-micro方案探索

微前端學習筆記（1）：微前端總體架構概述，從微服務發微

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

記一次 .NET某工控視覺自動化系統卡死分析

WindowsServer--SQL Server搭建主從同步實現讀寫分離 - 事務性分發

快速下載Visual Tracker Benchmark數據集

Reliable Re-Detection for Long-Term Tracking筆記

Performance Evaluation Methodology for Long-Term Visual Object Tracking 閱讀筆記

相關濾波代碼中常用的matlab函數總結

將Temple Color數據集標註轉換爲tracker benchmark所需格式

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結