A single neural network predicts bounding boxes and class probabilities in one evaluation
Introduction
(1)simple
We reframe object detection as a single regression problem, straight from image pixels to bounding box coordinates and class probabilities.
(2)global image
Unlike sliding window and region proposal-based techniques, YOLO sees the entire image during training and test time, so it implicitly encodes contextual information about classes as well as their appearance.
In contrast, Fast R-CNN mistakes background patches for objects because it can't see the larger context.
(3)YOLO learns generalizable representations of objects
when generalizing to new domains (e.g. artwork), it outperforms other detectors by a wide margin
(4)YOLO lags behind state-of-the-art detection systems in accuracy, especially at localizing small objects
2 Unified Detection
entire image -> bounding boxes
predicts all bounding boxes across all classes for an image simultaneously
end to end
define confidence as Pr(Object) * IOU
if no object is in the cell, Pr(Object) = 0, so confidence = 0
if an object is in the cell, Pr(Object) = 1, so confidence = IOU (intersection over union) between the predicted box and the ground truth
IOU measures how much the predicted box overlaps the ground-truth box; its minimum is 0 and its maximum is 1. For the exact calculation, see the article below.
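A minimal sketch of the IOU calculation described above (the helper name and the (x1, y1, x2, y2) corner format are my own choices, not from the paper):

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2) corners."""
    # Corners of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to 0 so disjoint boxes give zero intersection area.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

Identical boxes give 1.0, disjoint boxes give 0.0, matching the min/max range noted above.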
each bounding box consists of 5 predictions: x, y, w, h, confidence
each grid cell also predicts C conditional class probabilities, Pr(Class_i | Object)
At test time we multiply the conditional class probabilities by the individual box confidence predictions:
Pr(Class_i | Object) * Pr(Object) * IOU = Pr(Class_i) * IOU
this gives class-specific confidence scores for each box
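The multiplication above can be sketched as follows (function name is hypothetical; box_conf is the box's Pr(Object) * IOU confidence):

```python
def class_scores(class_probs, box_conf):
    """Class-specific confidence scores for one box.

    class_probs: the C conditional probabilities Pr(Class_i | Object)
                 predicted by the box's grid cell.
    box_conf:    that box's confidence, Pr(Object) * IOU.
    """
    # Pr(Class_i | Object) * Pr(Object) * IOU = Pr(Class_i) * IOU
    return [p * box_conf for p in class_probs]
```

Each resulting score encodes both how likely the class is and how well the box fits the object.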
Question: how do the box confidences map onto the grid cells?
a grid cell is responsible for an object if the center of the object's box falls in that cell
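The center-to-cell assignment can be sketched like this (helper name is my own; it assumes box center coordinates normalized to [0, 1]):

```python
def responsible_cell(x_center, y_center, S=7):
    """Return the (row, col) of the S*S grid cell containing the box center.

    x_center, y_center: box center, normalized to [0, 1] of image size.
    """
    # Scale to grid units and truncate; clamp so x = 1.0 stays in the last cell.
    col = min(int(x_center * S), S - 1)
    row = min(int(y_center * S), S - 1)
    return row, col
```

Only this one cell's predictions are "responsible" for the object during training.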
Model:
(1)detection as regression problem
(2)image -> divide into S*S grid cells
(3)cell ->B bounding boxes
(4)the predictions form an S*S*(B*5+C) tensor
parameters: for YOLO on PASCAL VOC, S = 7 and B = 2. The data has 20 labelled classes, so C = 20. The prediction is a 7*7*(5*2+20) = 7*7*30 tensor
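The tensor size works out as a quick check (values taken from the notes above):

```python
# YOLO on PASCAL VOC: grid size, boxes per cell, number of classes.
S, B, C = 7, 2, 20
depth = B * 5 + C       # each cell predicts B boxes * 5 values + C class probs
total = S * S * depth   # full prediction tensor for one image
print(S, S, depth, total)  # → 7 7 30 1470
```

So one forward pass of the network emits 1470 numbers per image, decoded into boxes, confidences, and class probabilities.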