Paper閱讀小結

原創

zhoutongchi

2020-02-21 21:53

Detection of Human Actions from a Single Example

其中paper“Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels”爲其期刊版

關鍵點：

1，提出一種衡量局部時空特性的度量；

2，通過PCA對所得度量進行主成份提取；

3，通過MCS求query與target的相似性：RV；

4，對RV所得的值通過經驗數據進行分析（與兩個閾值對比）。

Multiclass Object Recognition with Sparse, Localized Features

其中paper“Object class recognition and localization using sparse features with limited receptive ﬁelds”

關鍵點：

1，主導思想爲HMAX即層次最大化模型；

2，在HMAX模型中的改進：sparsification、lateral inhibition；

3，將SVM對權重比高的特徵進行訓練。

An Experimental Comparison of Min-Cut/Max-Flow Algorithm for Energy Minimization in Vision

關鍵點：

在源點與匯點均使用廣度優先搜索

The chains model for detecting parts by their context

關鍵點：

1，通過SIFT提取特徵；

2，對特徵進行選擇（興趣點相鄰需保持一定尺度）；

3，定義chain模型，並運用貝葉斯概率理論進行概率計算；

4， Full Object Detection 則將其退化成類似於STAR的模型。

Action Recognition using Context and Appearance Distribution Features

關鍵點：

1，檢測興趣點，並得到Multi-scale 時空上下文特徵（即時空分佈特徵）和局部視頻（此處的局部視頻，即爲興趣點分佈的XYT立體）分佈（對其進行PCA降維）；

2，運用AFMKL將時空上下文特徵與局部特徵分佈進行融合；

3，計算其得分，並選出最優解。

Joint Segmentation and Classiﬁcation of Human Actions in Video

關鍵點：

1，與一般的先segment圖片再進行Action detection不同，本文將segment與classify結合在一起，從而達到更快速的效果；

2，有監督的訓練：運用SVM訓練樣本（視頻），提取出含有時空特徵的庫（特徵選取很巧妙）；

3，動態規劃進行segment視頻，通過得到的score進行取最優解。

Real-Time Human Pose Recognition in Parts from Single Depth Images

關鍵點：

1，採用深度圖像，可以更好的模擬人眼；

2，一個特色：通過現實數據，然後運用機器合成圖像（500k），並選出100k作爲所需圖像（保證兩兩圖像之間的距離至少大於5cm）；

3，訓練，通過body part標記（深度圖像裏的特徵）、骨骼點位置推算；

4，通過隨機化決策森林得到Joint position。

Optimal Spatio-Temporal Path Discovery for Video Event Detection

關鍵點：

1，對路徑連通條件進行限制；

2，對所有路徑的discriminative score進行計算，運用“最優路徑算法”。

Robust Real-Time Face Detection

關鍵點：

1，運用了一種新奇的圖像表示方法，“integral image”；

2，分類器使用adaboost分類器；

3，多種分類器的綜合應用“cascade”模型；

4，主要特點是，先用一種可以得到含有99%人臉的算法，使數據量將爲原來的50%，然後在對含有人臉的部分運用“cascade”式的分類器運用。

Action Recognition with Multiscale Spatio-Temporal Contexts

關鍵點：

1，運用bag of words方法；

2，提取特徵->局部特徵分析（讀取XYT三個方向上的上下文特徵）->綜合分析特徵（integration）->MKL（在文中有詳細流程圖）。

Baby Talk: Understanding and Generating Simple Image Descriptions

關鍵點：

將一幅圖片運用一句話表示，其中包括：名詞、動詞、形容詞。

Learning Context for Collective Activity Recognition

關鍵點：

當檢測到的目標爲同樣的動作時，可通過crowd中其他人的位置與動作判斷目標的動作。

A Unified Framework for Locating and Recognizing Human Actions

關鍵點：

對deformable part進行訓練，並將其與input video進行score，然後通過score對興趣點進行分析及SVM分類。

Human Action Recognition by Learning Bases of Action Attributes and Parts

關鍵點：

通過名詞+動詞方法，通過識別基本parts來判斷動詞，從而得到圖片內容及action

zhoutongchi

發佈了18 篇原創文章 · 獲贊 3 · 訪問量 8萬+

私信關注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Paper閱讀小結

Detection of Human Actions from a Single Example

Multiclass Object Recognition with Sparse, Localized Features

An Experimental Comparison of Min-Cut/Max-Flow Algorithm for Energy Minimization in Vision

The chains model for detecting parts by their context

Action Recognition using Context and Appearance Distribution Features

Joint Segmentation and Classiﬁcation of Human Actions in Video

Real-Time Human Pose Recognition in Parts from Single Depth Images

Optimal Spatio-Temporal Path Discovery for Video Event Detection

Robust Real-Time Face Detection

Action Recognition with Multiscale Spatio-Temporal Contexts

Baby Talk: Understanding and Generating Simple Image Descriptions

Learning Context for Collective Activity Recognition

A Unified Framework for Locating and Recognizing Human Actions

Human Action Recognition by Learning Bases of Action Attributes and Parts

Reading Group

典型相關分析(Canonical Correlation Analysis）

Compressive Sensing

壓縮感知--續

計算機視覺領域的一些牛人博客，超有實力的研究機構等的網站鏈接

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結