論文SDP + RCNN | Exploit All the Layers: Fast and Accurate CNN Object Detector with SDP and CRC

Abstract

本文提出了兩種目標檢測的措施，兼具精度與效率：1.scale-dependent pooling （精度）2. layer wise casaded rejection classifiers（效率）

1 Introduction

首先作者簡要介紹了RCNN等方法，FRCNN的缺點：
1. Fast-RCNN由於是從pooling層bounding box所以不能準確的識別小物體，不能判斷，不能判斷bbox是否太小。
2. Multi-scale輸入因爲儲存和計算量的問題不容易在非常深的網絡應用。
3. 將數以百計的bounding box池化並送入高維fc層，會非常的耗時
本文通過scale-dependent pooling（SDP）層來處理目標的scale variation等問題。具體思路是：不同尺寸的物體可能在不同的層上得到不同的反應，比如可能小的object會在淺層得到一個strong activation，但是大的物體可能在深層得到strong activation。
本文的第二個貢獻是提出了cascaded rejection classifier。根據boosting classifiers 的原理，作者因爲前面基層是一個弱分類器，他可以快速的否定一個easy negative。於是得到了下面的框架：

P. A. Viola and M. J. Jones. Rapid object detection using a boosted
cascade of simple features. In CVPR, pages 511–518, 2001. 2, 4

Z. Cai, M. Saberian, and N. Vasconcelos. Learning complexity- aware
cascades for deep pedestrian detection. In ICCV, 2015. 6

M. Mathias, R. Benenson, R. Timofte, and L. J. V. Gool. Han- dling
occlusions with franken-classifiers. In ICCV, pages 1505–1512, 2013.2, 4

A. Angelova, A. Krizhevsky, V. Vanhoucke, A. Ogale, and D. Fergu- son.
Real-time pedestrian detection with deep network cascades. In BMVC,
2015. 2, 4

本文方法與他們不同的是這種級聯是在同一個網絡中的級聯，幾乎不增加額外的計算量。
Using convolutional features

S. Xie and Z. Tu. Holistically-nested edge detection. CoRR,
abs/1504.06375, 2015. 3, 4 C. Lee, S. Xie, P. W. Gallagher, Z. Zhang,
and Z. Tu. Deeply- supervised nets. CoRR, abs/1409.5185, 2014. 3

本文沒有直接的將各層特徵融合，而是建立了各自的分類器

3 Scale-Dependent pooling

如上圖所示，SDP是將Fast rcnn中不同尺寸的proposal輸入不同的sdp層中，如0-64像素高的proposal就給第三個卷積層的sdp，如果大於128就輸入到第五個。每個sdp層後都有fc層。這樣做的好處就是不需要大量的resize圖片，節省了計算，另外不同的特徵層處理不同的proposal可以得到更consistent signal。
本文共有3個SDP，每個有2個fc+relu dropout.

4 Cascaded Rejection classifiers

加了SDP的Fast rcnn也加入了額外的4個fc層，所以爲了減少計算量作者提出了CRC來減少proposal的數量，結構如下：

作者根據adaboost來學習分類器：

2248, 2010. 2, 3 [10] Y. Freund, R. Schapire, and N. Abe. A short introduction to boosting. Journal-Japanese Society For Artificial Intelligence, 14(771780):1612, 1999. 2, 4, 5

具體思路是：當一個proposal經過卷基層時會的到一組特徵，使用fastrcnn裏的roipooling策略得到mxmxc個CRC Rls的特徵，對於所有的proposal如果是foreground其分類標籤是1，否則是0，於是轉化爲了adaboost的訓練形式，每個Rls訓練50個弱分類器，訓練之後再進入Rl+1s繼續訓練。
如上圖所示，第一個fc層用來轉變特徵，第二個fc用來集成weak-leraners。測試時使用CRC可以加快速度3.2倍，如果再使用SVD可加快4.6倍。

5 Experiments

作者使用的是Edgebox proposal ，augment with ACF。

ACF P. Doll ́ar, R. Appel, S. Belongie, and P. Perona. Fast feature
pyra- mids for object detection. Pattern Analysis and Machine
Intelligence, IEEE Transactions on, 36(8):1532–1545, 2014. 3, 5

轉載自：
https://blog.csdn.net/bea_tree/article/details/51880175

論文SDP + RCNN | Exploit All the Layers: Fast and Accurate CNN Object Detector with SDP and CRC

Abstract

1 Introduction

3 Scale-Dependent pooling

4 Cascaded Rejection classifiers

5 Experiments

容器中nginx無法使用同一個網絡下的容器域名

Python: SunMoonTimeCalculator

NETCore中實現一個輕量無負擔的極簡任務調度ScheduleTask

docker使用特定的網絡

使用c#強大的表達式樹實現對象的深克隆之解決循環引用的問題

「Pygors跨平臺GUI」2：安裝MinGW-w64、MSYS2還是WSL2

「Pygors跨平臺GUI」1：Pygors跨平臺GUI應用研究

nodejs學習07——API

避免DbContext同時在多個線程調用

GPT-4o 引領人機交互新風向，向量數據庫賽道沸騰了

論文Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Python爬蟲走代理

windows下python3安裝opencv

記一次編寫python爬蟲遇到的問題

python調試方法

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結

論文SDP + RCNN | Exploit All the Layers: Fast and Accurate CNN Object Detector with SDP and CRC

Abstract

1 Introduction

2 Related works Neural network cascades

3 Scale-Dependent pooling

4 Cascaded Rejection classifiers

5 Experiments