image caption筆記（三）：《Show, Attend and Tell_Neural Image Caption》

原創

2020-06-04 11:05

一、模型結構

對LSTM部分做出的改動，其餘與NIC相同。

與原本的lstm公式相比多了一個，就是attention應用的結果。

首先我們給不同位置的特徵設置權重權重的值和爲1 這很自然就會想到使用softmax

在每個時刻t，我們都要設置不同位置的權重。在每個時刻，根據前一刻的狀態確定當前的權重，權重不同，代表對不同位置的關注度不同。

是第i個位置的圖像特徵，是softmax歸一化之後的t時刻的權重

是一個多層感知器，也就是簡單的全連接網絡。得到權重以後，

這裏的有兩種 hard attention 和soft attention ，因爲soft簡單，只介紹soft。

在得到當前時刻的後，產生概率預測。

是前一個時刻的輸出，也就是當前時刻的輸入。

二、總結

就是在每個時刻的輸入圖像特徵加了權重對不同位置的特徵加了不同的關注度。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

image caption筆記（二）：《Show and Tell : A Neural Image Caption Generator》

2020-06-04 11:05:43

image caption筆記（五）：《Knowing When to Look: Adaptive Attention》

2020-06-04 11:05:43

image caption筆記(一)：RNN、LSTM和GRU的理解

2020-06-04 11:05:32

image caption筆記（六）：《self_critical (scst)》

2020-06-04 11:05:32

image caption筆記（四）：《Image Captioning with Semantic Attention》

2020-06-04 11:05:32

image caption筆記（五）：《SCA-CNN》

2020-06-04 11:05:32

image caption筆記（七）：《Bottom-Up and Top-Down Attention》

2020-06-04 11:05:32

Image captioning with visual attention（TF2.0基於注意機制的圖像字幕）

2020-05-23 02:28:41

Positional encodings

2020-02-21 23:41:20

畢業前的計劃

2020-02-20 13:43:42

subprocess.py報錯：FileNotError: [Errno 2] No such file or directory: java: java

在運行coco計算ImageCaption得分時，出現以下錯誤： subprocess.py報錯：FileNotError: [Errno 2] No such file or directory: 'java': 'java' 原因：

清晨的光明

2020-07-08 02:37:26

面向遙圖像數據的Image Caption研究附源碼

面向遙感圖像數據的Image Caption 相關理論知識請參見其他文章，這裏只從工程角度進行描寫，重點是源代碼。參考網址： 1.面向遙感圖像的Image caption 數據集：【乾貨】讓遙感圖像活起來：遙感圖像描述生成的模型與數據

Jerry_liu20080504

2020-06-15 20:10:19

image caption筆記（二）：《Show and Tell : A Neural Image Caption Generator》

2020-06-04 11:05:43

image caption筆記（五）：《Knowing When to Look: Adaptive Attention》

2020-06-04 11:05:43

image caption筆記(一)：RNN、LSTM和GRU的理解

2020-06-04 11:05:32

24小時熱門文章

最新文章

最新評論文章