Multi-Head Self-Attention via Vision Transformer for Zero-Shot Learning - 百度学术

高级搜索

包含全部检索词

包含精确检索词

包含至少一个检索词

不包含检索词

出现检索词的位置

文章任何位置

位于文章标题

作者

机构

出版物期刊

会议

发表时间 -

语言检索范围不限

不限英文中文

论文查重优惠 

Multi-Head Self-Attention via Vision Transformer for Zero-Shot Learning

来自 arXiv.org

阅读量：

165

作者：

F Alamri，A Dutta

展开

摘要：

Zero-Shot Learning (ZSL) aims to recognise unseen object classes, which are not observed during the training phase. The existing body of works on ZSL mostly relies on pretrained visual features and lacks the explicit attribute localisation mechanism on images. In this work, we propose an attention-based model in the problem settings of ZSL to learn attributes useful for unseen class recognition. Our method uses an attention mechanism adapted from Vision Transformer to capture and learn discriminative attributes by splitting images into small patches. We conduct experiments on three popular ZSL benchmarks (i.e., AWA2, CUB and SUN) and set new state-of-the-art harmonic mean results {on all the three datasets}, which illustrate the effectiveness of our proposed method.

展开

DOI：

10.48550/arXiv.2108.00045

年份：

2021

收藏引用批量引用报错分享

全部来源免费下载求助全文

arXiv.org

ResearchGate

钛学术

学术范

钛学术 (全网免费下载) 查看更多

学术范 (全网免费下载)

学术范 (全网免费下载)

钛学术 (全网免费下载)

通过文献互助平台发起求助，成功后即可免费获取论文全文。

请先登入

我们已与文献出版商建立了直接购买合作。

你可以通过身份认证进行实名认证，认证成功后本次下载的费用将由您所在的图书馆支付

您可以直接购买此文献，1~5分钟即可下载全文，部分资源由于网络原因可能需要更长时间，请您耐心等待哦~

身份认证全文购买

相似文献

参考文献

引证文献

研究点推荐

Zero-Shot Learning Multi-Head Self-Attention Vision Transformer

辅助模式

0

引用

文献可以批量引用啦~
欢迎点我试用！