咨询与建议

限定检索结果

文献类型

  • 6 篇 期刊文献

馆藏范围

  • 6 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 6 篇 工学
    • 5 篇 机械工程
    • 5 篇 计算机科学与技术...
    • 1 篇 控制科学与工程
  • 2 篇 理学
    • 2 篇 数学
  • 1 篇 法学
    • 1 篇 社会学
  • 1 篇 教育学
    • 1 篇 心理学(可授教育学...
  • 1 篇 医学
    • 1 篇 基础医学(可授医学...

主题

  • 6 篇 multi-modal lear...
  • 2 篇 weakly-supervise...
  • 2 篇 knowledge graph
  • 1 篇 machine learning...
  • 1 篇 transformer
  • 1 篇 deep learning
  • 1 篇 generalization
  • 1 篇 visual transform...
  • 1 篇 curriculum learn...
  • 1 篇 optimization
  • 1 篇 image generation
  • 1 篇 low-level vision
  • 1 篇 multi-task learn...
  • 1 篇 diagnosis
  • 1 篇 entity linking
  • 1 篇 poly encoders
  • 1 篇 image synthesis
  • 1 篇 knowledge graph ...
  • 1 篇 high-level visio...
  • 1 篇 concept learning

机构

  • 2 篇 key laboratory o...
  • 2 篇 school of comput...
  • 2 篇 school of cyber ...
  • 1 篇 data61 commonwea...
  • 1 篇 national institu...
  • 1 篇 school of artifi...
  • 1 篇 university of sy...
  • 1 篇 school of artifi...
  • 1 篇 school of scienc...
  • 1 篇 medical school o...
  • 1 篇 shanghai clinica...
  • 1 篇 institute for in...
  • 1 篇 basira laborator...
  • 1 篇 youtu lab tencen...
  • 1 篇 jd explore acade...
  • 1 篇 casia-llvision j...
  • 1 篇 nlpr institute o...
  • 1 篇 state key labora...
  • 1 篇 school of biomed...
  • 1 篇 department of re...

作者

  • 2 篇 qiushuo zheng
  • 2 篇 meng wang
  • 2 篇 guilin qi
  • 2 篇 hao wen
  • 1 篇 weiming dong
  • 1 篇 huapeng wei
  • 1 篇 minxuan lin
  • 1 篇 fan tang
  • 1 篇 da-cheng tao
  • 1 篇 kekai sheng
  • 1 篇 yifan xu
  • 1 篇 yingying deng
  • 1 篇 junfeng zhang
  • 1 篇 chen gan
  • 1 篇 dadong wang
  • 1 篇 wen ji
  • 1 篇 qian wang
  • 1 篇 kelei he
  • 1 篇 huang yu
  • 1 篇 mengdan zhang

语言

  • 6 篇 英文
检索条件"主题词=Multi-modal learning"
6 条 记 录,以下是1-10 订阅
排序:
Visual Entity Linking via multi-modal learning
收藏 引用
Data Intelligence 2022年 第1期4卷 1-19页
作者: Qiushuo Zheng Hao Wen Meng Wang Guilin Qi School of Cyber Science and Engineering Southeast UniversityNanjing 211189China School of Computer Science and Engineering Southeast UniversityNanjing 211189China Key Laboratory of Computer Network and Information Integration(Southeast University) Ministry of EducationNanjing 211189China
Existing visual scene understanding methods mainly focus on identifying coarse-grained concepts about the visual objects and their relationships,largely neglecting fine-grained scene understanding.In fact,many data-dr... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
A survey of multi-modal learning theory
收藏 引用
中山大学学报:自然科学版(中英文) 2023年 第5期62卷 38-49页
作者: HUANG Yu HUANG Longbo Institute for Interdisciplinary Information Sciences Tsinghua UniversityBeijing 100084China
Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empi... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
Transformers in computational visual media:A survey
收藏 引用
Computational Visual Media 2022年 第1期8卷 33-62页
作者: Yifan Xu Huapeng Wei Minxuan Lin Yingying Deng Kekai Sheng Mengdan Zhang Fan Tang Weiming Dong Feiyue Huang Changsheng Xu NLPR Institute of AutomationChinese Academy of SciencesBeijing 100190China School of Artificial Intelligence University of Chinese Academy of SciencesBeijing 100040China School of Artificial Intelligence Jilin UniversityChangchun 130012China Youtu Lab Tencent Inc.Shanghai 200233China CASIA-LLVISION Joint Lab Beijing 100190China
Transformers,the dominant architecture for natural language processing,have also recently attracted much attention from computational visual media researchers due to their capacity for long-range representation and hi... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
Visual Superordinate Abstraction for Robust Concept learning
收藏 引用
Machine Intelligence Research 2023年 第1期20卷 79-91页
作者: Qi Zheng Chao-Yue Wang Dadong Wang Da-Cheng Tao University of Sydney Sydney 2008Australia JD Explore Academy Beijing 100176China DATA61 Commonwealth Scientific and Industrial Research OrganisationSydney 2122Australia
Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks. Although promising progress has been made, existing concept learners are st... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
Transformers in medical image analysis
收藏 引用
Intelligent Medicine 2023年 第1期3卷 59-78页
作者: Kelei He Chen Gan Zhuoyuan Li Islem Rekik Zihao Yin Wen Ji Yang Gao Qian Wang Junfeng Zhang Dinggang Shen Medical School of Nanjing University NanjingJiangsu 210093China National Institute of Healthcare Data Science at Nanjing University NanjingJiangsu 210093China BASIRA Laboratory Faculty of Computer and Informatics EngineeringIstanbul Technical UniversityIstanbulTurkey School of Science and Engineering ComputingUniversity of DundeeUK State Key Laboratory for Novel Software Technology Nanjing UniversityNanjingJiangsu 210093China School of Biomedical Engineering ShanghaiTech UniversityShanghai 201210China Department of Research and Development Shanghai United Imaging Intelligence Co.Ltd.Shanghai 200030China Shanghai Clinical Research and Trial Center Shanghai 201703China
Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
Faster Zero-shot multi-modal Entity Linking via Visual-Linguistic Representation
收藏 引用
Data Intelligence 2022年 第3期4卷 493-508页
作者: Qiushuo Zheng Hao Wen Meng Wang Guilin Qi Chaoyu Bai School of Cyber Science and Engineering Southeast UniversityNanjing 211189China School of Computer Science and Engineering Southeast UniversityNanjing 211189China Key Laboratory of Computer Network and Information Integration(Southeast University) Ministry of EducationNanjing 211189China
multi-modal entity linking plays a crucial role in a wide range of knowledge-based modal-fusion tasks, i.e., multi-modal retrieval and multi-modal event extraction. We introduce the new ZEro-shot multi-modal Entity Li... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论