文献详情 >Visuals to Text:A Comprehensiv... 收藏

Visuals to Text:A Comprehensive Review on Automatic Image Captioning

Visuals to Text: A Comprehensive Review on Automatic Image Captioning

作者：Yue Ming Nannan Hu Chunxiao Fan Fan Feng Jiangwan Zhou Hui Yu Yue Ming;Nannan Hu;Chunxiao Fan;Fan Feng;Jiangwan Zhou;Hui Yu

作者机构：Beijing University of Posts and TelecommunicationsBeijing 100876China School of Creative TechnologiesUniversity of PortsmouthPortsmouth PO12DJUK

出版物：《IEEE/CAA Journal of Automatica Sinica》 (自动化学报（英文版）)

年卷期：2022年第9卷第8期

页面：1339-1365页

核心收录：

学科分类：12[管理学] 1201[管理学-管理科学与工程(可授管理学、工学学位)] 081104[工学-模式识别与智能系统] 08[工学] 080203[工学-机械设计及理论] 0835[工学-软件工程] 0802[工学-机械工程] 0811[工学-控制科学与工程] 0812[工学-计算机科学与技术（可授工学、理学学位）]

基　　金：supported by Beijing Natural Science Foundation of China(L201023) the Natural Science Foundation of China(62076030)

主　　题：Artificial intelligence attention mechanism encoder-decoder framework image captioning multi-modal understanding training strategies

摘要：Image captioning refers to automatic generation of descriptive texts according to the visual content of *** is a technique integrating multiple disciplines including the computer vision(CV),natural language processing(NLP)and artificial *** recent years,substantial research efforts have been devoted to generate image caption with impressive *** summarize the recent advances in image captioning,we present a comprehensive review on image captioning,covering both traditional methods and recent deep learning-based ***,we first briefly review the early traditional works based on the retrieval and *** deep learning-based image captioning researches are focused,which is categorized into the encoder-decoder framework,attention mechanism and training strategies on the basis of model structures and training manners for a detailed *** that,we summarize the publicly available datasets,evaluation metrics and those proposed for specific requirements,and then compare the state of the art methods on the MS COCO ***,we provide some discussions on open challenges and future research directions.

本地馆藏 |

1、借阅数量：每证可借书6册，期刊2册，团体读者证可借书刊300册。 2、借阅时间：个人借期为30天，每本书可续借1次，借期为30天；团体借期为90天。 3、归还地点：3楼服务台、自助借还设备、还书箱、各分馆 4、馆际互借：读者未能在本馆获取所需文献资料，可至参考咨询阅览室服务台填写《南通市图书馆馆际互借读者申请表》，根据馆际互借协议，我馆将为读者向其他馆代借文献。馆际互借过程中所产生的费用（资料复印、邮寄费等），由读者个人承担。 5、服务电话续借：59003605 59003606 咨询：81100100 59003600

电子资源

目录详情 | 试阅读 |

读者评论与其他读者分享你的观点

学校读者

用户名:未登录

我的评分

欢迎您,

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

Visuals to Text:A Comprehensive Review on Automatic Image Captioning

读者评论与其他读者分享你的观点

请选择收藏分类：

欢迎您,

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

Visuals to Text:A Comprehensive Review on Automatic Image Captioning

读者评论 与其他读者分享你的观点

请选择收藏分类： 新增自定义分类 确定 取消

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

读者评论与其他读者分享你的观点

请选择收藏分类：