Log-Sum-Exp Pooling

Original

2019-08-06 19:11

Papers

From Image-level to Pixel-level Labeling with Convolutional Networks
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases

LSE Pooling

Before reading these two papers, the pooling operations I knew as commonly used were Max Pooling and Average Pooling. Both papers instead use Log-Sum-Exp (LSE) Pooling, defined as:

$x_p=\frac{1}{r}\cdot \log\left[\frac{1}{S}\cdot \sum_{(i,j)\in\mathbf{S}}\exp(r\cdot x_{ij})\right]$

Here $x_{ij}$ is the activation at position $(i,j)$, where $(i,j)$ is a point of the pooling region $\mathbf{S}$; $S=s\times s$ is the total number of points in $\mathbf{S}$; and $r$ is a hyperparameter.
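A minimal sketch of this definition in plain Python, assuming the pooling region has already been flattened into a list and that $r>0$. The function name `lse_pool` and the sample activations are hypothetical. Subtracting the maximum before exponentiating is a common numerical-stability trick added here; it does not change the value of the formula.

```python
import math

def lse_pool(values, r):
    """Log-Sum-Exp pooling of a flat list of activations, for r > 0."""
    if r <= 0:
        raise ValueError("r must be positive")
    m = max(values)
    # Factor exp(r * m) out of the sum so every exponent is <= 0 (no overflow).
    mean_exp = sum(math.exp(r * (x - m)) for x in values) / len(values)
    return m + math.log(mean_exp) / r

# A 2x2 pooling region flattened to a list (made-up numbers).
region = [0.1, 0.4, 0.2, 0.9]
pooled = lse_pool(region, r=5.0)
print(pooled)
```

The stable form and the literal formula agree because $\frac{1}{r}\log\left[e^{r m}\cdot\frac{1}{n}\sum e^{r(x_i-m)}\right] = m + \frac{1}{r}\log\left[\frac{1}{n}\sum e^{r(x_i-m)}\right]$.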

In the first paper, the authors describe the effect of LSE Pooling as follows:

The hyper-parameter r controls how smooth one wants the approximation to be: high r values implies having an effect similar to the max, very low values will have an effect similar to the score averaging. The advantage of this aggregation is that pixels having similar scores will have a similar weight in the training procedure, r controlling this notion of “similarity”.

In the second paper, the authors describe it as follows:

By controlling the hyper-parameter, r, the pooled value ranges from the maximum in S (when $r\to\infty$) to average ($r\to0$).
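The behaviour both papers describe can be checked numerically. The sketch below sweeps $r$ over a few values for one made-up pooling region; `lse_pool` is a hypothetical helper written for this illustration, not code from either paper.

```python
import math

def lse_pool(values, r):
    """Log-Sum-Exp pooling of a flat list of activations, for r > 0."""
    m = max(values)
    # Shift by the maximum so the exponentials cannot overflow.
    mean_exp = sum(math.exp(r * (x - m)) for x in values) / len(values)
    return m + math.log(mean_exp) / r

region = [0.1, 0.4, 0.2, 0.9]  # made-up activations
average = sum(region) / len(region)
maximum = max(region)

# Small r should land near the average, large r near the maximum.
sweep = {r: lse_pool(region, r) for r in (1e-3, 0.1, 1.0, 10.0, 100.0, 1000.0)}
for r, pooled in sweep.items():
    print(f"r = {r:>8}: pooled = {pooled:.4f}")
print(f"average = {average:.4f}, max = {maximum:.4f}")
```

The pooled value rises monotonically with $r$, moving from the average towards the maximum.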

The original post illustrates this with a figure here (figure not recovered).

Mathematical Proof

As a rigorous university student, I am not going to stop at intuition, so on to the mathematical proof!

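Before the proof, let's simplify the expression a little by indexing the $n$ points of the pooling region with a single index: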

$x_p=\frac{1}{r}\cdot \log\left[\frac{1}{n}\cdot \sum_{i=1}^{n}\exp(r\cdot x_i)\right]$

Proof that $r\to0$ recovers Average Pooling

First, we need the inequality of arithmetic and geometric means (AM-GM). For positive numbers $a_1,\dots,a_n$:

$\frac{a_1+a_2+\cdots+a_n}{n}\ge\sqrt[n]{a_1\cdot a_2\cdots a_n}$

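Equality holds if and only if $a_1=a_2=\cdots=a_n$.

Next, move the factor $\frac{1}{r}$ inside the logarithm:

$\begin{aligned} x_p &= \frac{1}{r}\cdot \log\left[\frac{1}{n}\cdot \sum_{i=1}^{n}\exp(r\cdot x_i)\right] \\ &= \log\left[\left(\frac{1}{n}\cdot\sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}}\right] \end{aligned}$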

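Applying the AM-GM inequality with $a_i=e^{r\cdot x_i}$ and $r>0$:

$\begin{aligned} \left(\frac{1}{n}\cdot \sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}} &\ge \left(\prod_{i=1}^{n} e^{r\cdot x_i}\right)^{\frac{1}{n}\cdot\frac{1}{r}} \\ &= \left(\prod_{i=1}^{n} e^{x_i}\right)^{\frac{1}{n}} \end{aligned}$

Equality requires all $e^{r\cdot x_i}$ to be equal. The expression is undefined at $r=0$ itself because of the factor $\frac{1}{r}$, but as $r\to0$ every $e^{r\cdot x_i}$ tends to $1$, so the two sides approach each other. Substituting into the full expression gives a lower bound for every $r>0$:

$\begin{aligned} x_p &= \log\left[\left(\frac{1}{n}\cdot\sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}}\right] \\ &\ge \log\left[\left(\prod_{i=1}^{n} e^{x_i}\right)^{\frac{1}{n}}\right] \\ &= \frac{1}{n}\sum_{i=1}^{n}x_i \end{aligned}$

The inequality alone only bounds $x_p$ from below. To pin down the limit, write $\bar{x}=\frac{1}{n}\sum_{i=1}^{n}x_i$ and expand $e^{r\cdot x_i}=1+r\cdot x_i+O(r^2)$ as $r\to0$:

$\begin{aligned} \frac{1}{n}\sum_{i=1}^{n}e^{r\cdot x_i} &= 1+r\cdot\bar{x}+O(r^2) \\ x_p &= \frac{1}{r}\cdot\log\left[1+r\cdot\bar{x}+O(r^2)\right] = \bar{x}+O(r) \end{aligned}$

Hence $x_p\to\bar{x}$ as $r\to0$, which proves that $r\to0$ recovers Average Pooling.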
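As a sanity check on this limit, the sketch below measures the gap $x_p-\frac{1}{n}\sum x_i$ for shrinking $r$ on made-up data. The helper `lse_pool` is hypothetical. The comparison value $r\cdot\mathrm{Var}(x)/2$ is not from the original post; it is the next term in the expansion of the log of the mean exponential, included here as an assumption to compare against.

```python
import math

def lse_pool(values, r):
    """Log-Sum-Exp pooling of a flat list of activations, for r > 0."""
    m = max(values)
    # Shift by the maximum so the exponentials cannot overflow.
    mean_exp = sum(math.exp(r * (x - m)) for x in values) / len(values)
    return m + math.log(mean_exp) / r

xs = [0.1, 0.4, 0.2, 0.9]  # made-up activations
mean = sum(xs) / len(xs)
# Population variance, used only for the second-order estimate of the gap.
variance = sum((x - mean) ** 2 for x in xs) / len(xs)

gaps = {}
for r in (1.0, 0.1, 0.01, 0.001):
    gaps[r] = lse_pool(xs, r) - mean
    print(f"r = {r}: x_p - mean = {gaps[r]:.3e}, r * var / 2 = {r * variance / 2:.3e}")
```

The gap stays positive, as the AM-GM bound requires, and shrinks roughly in proportion to $r$.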

Proof that $r\to\infty$ recovers Max Pooling

$\begin{aligned} x_p &= \frac{1}{r}\cdot \log\left[\frac{1}{n}\cdot \sum_{i=1}^{n}\exp(r\cdot x_i)\right] \\ &= \log\left[\left(\sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}}\right] - \frac{1}{r}\cdot \log(n) \end{aligned}$

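Because $r>0$, the sum is at least its largest term and at most $n$ times its largest term, so:

$\left[\max_i e^{r\cdot x_i}\right]^{\frac{1}{r}} \le \left(\sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}} \le \left[n\cdot \max_i e^{r\cdot x_i}\right]^{\frac{1}{r}}$

Taking logarithms, and using $\left[\max_i e^{r\cdot x_i}\right]^{\frac{1}{r}}=e^{\max_i x_i}$, gives:

$\max_i x_i\le \log\left[\left(\sum_{i=1}^{n}e^{r\cdot x_i}\right)^{\frac{1}{r}}\right] \le \frac{1}{r}\cdot \log(n)+\max_i x_i$

Subtracting $\frac{1}{r}\cdot\log(n)$ throughout turns this into a bound on $x_p$ itself:

$\max_i x_i-\frac{1}{r}\cdot \log(n)\le x_p \le \max_i x_i$

As $r\to\infty$ we have $\frac{1}{r}\cdot \log(n)\to0$, so $x_p\to\max_i x_i$, which proves that $r\to\infty$ recovers Max Pooling.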
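The squeeze argument can also be checked numerically. The sketch below, using made-up activations and a hypothetical helper `lse_pool`, verifies that the pooled value lies between $\max_i x_i-\frac{1}{r}\log(n)$ and $\max_i x_i$ for several values of $r$.

```python
import math

def lse_pool(values, r):
    """Log-Sum-Exp pooling of a flat list of activations, for r > 0."""
    m = max(values)
    # Shift by the maximum so the exponentials cannot overflow.
    mean_exp = sum(math.exp(r * (x - m)) for x in values) / len(values)
    return m + math.log(mean_exp) / r

xs = [0.1, 0.4, 0.2, 0.9]  # made-up activations
n = len(xs)

checks = []
for r in (1.0, 10.0, 100.0, 1000.0):
    pooled = lse_pool(xs, r)
    lower = max(xs) - math.log(n) / r  # lower end of the sandwich
    upper = max(xs)                    # upper end of the sandwich
    checks.append((r, lower, pooled, upper))
    print(f"r = {r:>7}: {lower:.5f} <= {pooled:.5f} <= {upper:.5f}")
```

The width of the interval is $\frac{1}{r}\log(n)$, so a larger pooling region needs a larger $r$ to get equally close to the maximum.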
