基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Images are complex multimedia data which contain rich semantic information.Most of current image description generator algorithms only generate plain description,with the lack of distinction between primary and secondary object,leading to insufficient high-level semantic and accuracy under public evaluation criteria.The major issue is the lack of effective network on high-level semantic sentences generation,which contains detailed description for motion and state of the principal object.To address the issue,this paper proposes the Attention-based Feedback Long Short-Term Memory Network(AFLN).Based on existing codec framework,there are two independent sub tasks in our method:attention-based feedback LSTM network during decoding and the Convolutional Block Attention Module(CBAM)in the coding phase.First,we propose an attentionbased network to feedback the features corresponding to the generated word from the previous LSTM decoding unit.We implement feedback guidance through the related field mapping algorithm,which quantifies the correlation between previous word and latter word,so that the main object can be tracked with highlighted detailed description.Second,we exploit the attention idea and apply a lightweight and general module called CBAM after the last layer of VGG 16 pretraining network,which can enhance the expression of image coding features by combining channel and spatial dimension attention maps with negligible overheads.Extensive experiments on COCO dataset validate the superiority of our network over the state-of-the-art algorithms.Both scores and actual effects are proved.The BLEU 4 score increases from 0.291 to 0.301 while the CIDEr score rising from 0.912 to 0.952.
推荐文章
基于LSTM-Attention神经网络的文本特征提取方法
LSTM-Attention
注意力机制
文本分类
神经网络
文本特征提取
softmax
一种改进的Attention-Based LSTM特征选择模型
高校学术活动
信息提取
文本分类
结合注意力机制的长短期记忆网络
重点信息
基于MAC-LSTM的问题分类研究
问答系统
问题分类
注意力机制
疑问词注意力机制
卷积神经网络
长短时记忆模型
基于CNN-LSTM的QAR数据特征提取与预测
深度学习
融合卷积神经网络
长短时记忆网络
特征提取
时间序列预测
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Feedback LSTM Network Based on Attention for Image Description Generator
来源期刊 计算机、材料和连续体(英文) 学科 工学
关键词 Image DESCRIPTION GENERATOR FEEDBACK LSTM NETWORK ATTENTION CBAM
年,卷(期) 2019,(5) 所属期刊栏目
研究方向 页码范围 575-589
页数 15页 分类号 TP3
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Image
DESCRIPTION
GENERATOR
FEEDBACK
LSTM
NETWORK
ATTENTION
CBAM
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
计算机、材料和连续体(英文)
月刊
1546-2218
江苏省南京市浦口区东大路2号东大科技园A
出版文献量(篇)
346
总下载数(次)
4
总被引数(次)
0
论文1v1指导