Refine search results

Document type

  • 6 journal articles
  • 1 conference paper

Collection scope

  • 7 electronic documents
  • 0 print holdings

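Date distribution

Subject classification

  • 6 Engineering
    • 4 Mechanical Engineering
    • 3 Computer Science and Technology...
    • 2 Software Engineering
    • 1 Control Science and Engineering
  • 2 Management
    • 2 Management Science and Engineering (...
  • 1 Literature
    • 1 Foreign Languages and Literatures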

Topics

  • 7 visual question ...
  • 1 counterfactual
  • 1 language bias
  • 1 transformer
  • 1 deep learning
  • 1 multi-modal feat...
  • 1 large language m...
  • 1 natural language...
  • 1 cross-model fusi...
  • 1 curriculum learn...
  • 1 multi-modal lear...
  • 1 knowledge inject...
  • 1 text feature ext...
  • 1 knowledge-based ...
  • 1 concept learning
  • 1 image feature pr...
  • 1 attention mechan...
  • 1 dynamic network
  • 1 computer vision
  • 1 weakly-supervise...

Institutions

  • 1 school of data s...
  • 1 academy for engi...
  • 1 data61 commonwea...
  • 1 school of automa...
  • 1 school of comput...
  • 1 jd explore acade...
  • 1 school of comput...
  • 1 department of ge...
  • 1 school of data s...
  • 1 the key laborato...
  • 1 college of resou...
  • 1 university of sy...
  • 1 school of comput...
  • 1 school of artifi...
  • 1 school of public...
  • 1 beijing universi...
  • 1 southeast univer...
  • 1 school of statis...
  • 1 school of comput...

Authors

  • 1 peng yang
  • 1 huahu xu
  • 1 zhengtong yin
  • 1 boyue wang
  • 1 mingzhe liu
  • 1 xiang-yang xue
  • 1 dikai fang
  • 1 da-cheng tao
  • 1 yan-wei fu
  • 1 yuan meng
  • 1 wenfeng zheng
  • 1 xuan liu
  • 1 lirong yin
  • 1 xingyu liu
  • 1 yang xue-jiao
  • 1 yueming ding
  • 1 xiaoyan li
  • 1 qiang sun
  • 1 xiaoqian ju
  • 1 zhongjian hu

Language

  • 6 English
  • 1 Chinese
Search query: "Subject term = Visual question answering"
7 records; showing 1-10
Learning a Mixture of Conditional Gating Blocks for Visual Question Answering
Journal of Computer Science & Technology, 2024, Vol. 39, No. 4, pp. 912-928
Authors: Qiang Sun, Yan-Wei Fu, Xiang-Yang Xue. Affiliations: School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China; Academy for Engineering and Technology, Fudan University, Shanghai 200433, China; School of Data Science, Fudan University, Shanghai 200433, China; School of Computer Science, Fudan University, Shanghai 200433, China
As a Turing test in multimedia, visual question answering (VQA) aims to answer the textual question with a given ***, the “dynamic” property of neural networks has been explored as one of the most promising ways of improv...
Source: VIP Journal Database
Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
Big Data Mining and Analytics, 2024, Vol. 7, No. 3, pp. 843-857
Authors: Zhongjian Hu, Peng Yang, Fengyuan Liu, Yuan Meng, Xingyu Liu. Affiliations: School of Computer Science and Engineering, Southeast University, and the Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education of the People’s Republic of China, Nanjing 211189, China; Southeast University-Monash University Joint Graduate School (Suzhou), Southeast University, Suzhou 215125, China
Previous works employ the Large Language Model (LLM) like GPT-3 for knowledge-based visual question answering (VQA). We argue that the inferential capacity of LLM can be enhanced through knowledge *** methods that utilize...
Source: VIP Journal Database
Improved Blending Attention Mechanism in Visual Question Answering
Computer Systems Science & Engineering, 2023, Vol. 47, No. 10, pp. 1149-1161
Authors: Siyu Lu, Yueming Ding, Zhengtong Yin, Mingzhe Liu, Xuan Liu, Wenfeng Zheng, Lirong Yin. Affiliations: School of Automation, University of Electronic Science and Technology of China, Chengdu 610054, China; College of Resource and Environment Engineering, Guizhou University, Guiyang 550025, China; School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou 325000, China; School of Public Affairs and Administration, University of Electronic Science and Technology of China, Chengdu 611731, China; Department of Geography and Anthropology, Louisiana State University, Baton Rouge, LA 70803, USA
Visual question answering (VQA) has attracted more and more attention in computer vision and natural language *** are committed to studying how to better integrate image features and text features to achieve better resu...
Source: VIP Journal Database
A survey of deep learning-based visual question answering
Journal of Central South University, 2021, Vol. 28, No. 3, pp. 728-746
Authors: HUANG Tong-yuan, YANG Yu-ling, YANG Xue-jiao. Affiliations: School of Artificial Intelligence, Chongqing University of Technology, Chongqing 401135, China; School of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
With the warming up and continuous development of machine learning, especially deep learning, the research on visual question answering field has made significant progress, with important theoretical research significanc...
Source: VIP Journal Database; Tongfang Journal Database
Contrastive Visual-Question-Caption Counterfactuals on Biased Samples for Visual Question Answering
43rd Chinese Control Conference
Authors: Xiaoqian Ju, Boyue Wang, Xiaoyan Li. Affiliation: Beijing University of Technology
The issue of language priors persists in existing visual question answering (VQA) models, hindering their ability to generalize across diverse QA distributions. Traditional strategies for counterfactual sample synthesi...
Source: CNKI Conference
Visual Superordinate Abstraction for Robust Concept Learning
Machine Intelligence Research, 2023, Vol. 20, No. 1, pp. 79-91
Authors: Qi Zheng, Chao-Yue Wang, Dadong Wang, Da-Cheng Tao. Affiliations: University of Sydney, Sydney 2008, Australia; JD Explore Academy, Beijing 100176, China; DATA61, Commonwealth Scientific and Industrial Research Organisation, Sydney 2122, Australia
Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks. Although promising progress has been made, existing concept learners are st...
Source: VIP Journal Database; Tongfang Journal Database
Improving VQA via Dual-Level Feature Embedding Network
Intelligent Automation & Soft Computing, 2024, Vol. 39, No. 3, pp. 397-416
Authors: Yaru Song, Huahu Xu, Dikai Fang. Affiliation: School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Visual question answering (VQA) has sparked widespread interest as a crucial task in integrating vision and *** primarily uses attention mechanisms to effectively answer questions to associate relevant visual regions wi...
Source: VIP Journal Database