論文筆記：SORT

這是發表在ICIP 2016上的一篇文章。文章提出了一個基於卡爾曼濾波和匈牙利算法的多目標跟蹤算法，實現了高速跟蹤。

之前的算法建立的模型夠過於複雜，因此算法的速度很慢或者精度不高。本算法只採用一些簡單的信息（位置、速度、尺度，跟蹤過程不使用外形特徵），從而實現高速且較快的多目標跟蹤。

使用faster rcnn的檢測結果，只取其中行人部分（且輸出概率大於80%）。

每個目標的狀態表示爲如下向量：
$\mathbf{x}=[u,\ v,\ s,\ r,\dot{u},\dot{v},\dot{s}]^{T}$
如果檢測框跟一個跟蹤目標匹配，就更新目標的狀態（速度用卡爾曼濾波處理）。如果沒有檢測框與跟蹤目標匹配，就用線性速度模型簡單地更新目標狀態。

用匈牙利算法匹配檢測位置和跟蹤目標的狀態，使之擁有儘可能大的IOU。此外，IOU低於IOU $_{min}$ 的框忽略。

當一個檢測結果與現存的被跟蹤目標IOU低於IOU $_{min}$ ，就初始化一個新的跟蹤目標並把速度設爲0 。
當存在 $T_{LOST}$ 幀沒有匹配到一個檢測結果，這個跟蹤目標就自動刪除。考慮到線性速度模型精度太差，文中 $T_{LOST}$ 取1 。

MOTA (↑): Multi-object tracking accuracy.
MOTP (↑): Multi-object tracking precision.
FAF (↓): number of false alarms per frame.
MT (↑): number of mostly tracked trajectories. I.e. target has the same label for at least 80% of its life span.
ML (↓): number of mostly lost trajectories. i.e. target is not tracked for at least 20% of its life span.
FP (↓): number of false detections.
FN (↓): number of missed detections.
ID SW (↓): number of times an ID switches to a different previously tracked object.
Frag (↓): number of fragmentations where a track is interrupted by miss detection.

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

論文筆記：SIMPLE ONLINE AND REALTIME TRACKING（SORT）