咨询与建议

限定检索结果

文献类型

  • 1 篇 期刊文献

馆藏范围

  • 1 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 1 篇 工学
    • 1 篇 计算机科学与技术...

主题

  • 1 篇 artificial intel...
  • 1 篇 natural language...
  • 1 篇 speech processin...
  • 1 篇 neural network
  • 1 篇 multimodal appro...

机构

  • 1 篇 key laboratory o...
  • 1 篇 department of co...
  • 1 篇 college of elect...

作者

  • 1 篇 xia li
  • 1 篇 guoquan wang
  • 1 篇 yawei wang
  • 1 篇 jiale ren
  • 1 篇 hong liu
  • 1 篇 yidi li

语言

  • 1 篇 英文
检索条件"主题词=multimodal approaches"
1 条 记 录,以下是1-10 订阅
排序:
Audio-visual keyword transformer for unconstrained sentence-level keyword spotting
收藏 引用
CAAI Transactions on Intelligence Technology 2024年 第1期9卷 142-152页
作者: Yidi Li Jiale Ren Yawei Wang Guoquan Wang Xia Li Hong Liu Key Laboratory of Machine Perception Peking UniversityShenzhen Graduate SchoolShenzhenChina College of Electronics and Information Engineering Sichuan UniversityChengduChina Department of Computer Science ETH ZurichZurichSwitzerland
As one of the most effective methods to improve the accuracy and robustness of speech tasks,the audio-visual fusion approach has recently been introduced into the field of Keyword Spotting(KWS).However,existing audio-... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论