Simulating Human Visual System Based on Vision Transformer
Document type: Conference paper
Authors | Qiu, Mengyu2; Guo, Yi1; Zhang, Mingguang2; Zhang, Jingwei2; Lan, Tian2; Liu, Zhilin2 |
Publication date | 2023 |
Conference date | 2023-10-13 |
Conference location | Sydney, AUSTRALIA |
Keywords | Visual scanpath prediction; fixation duration prediction; saccade sequences; visual attention; scene analysis |
DOI | 10.1145/3607822.3616408 |
Abstract | The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods. |
Institution ownership rank | 2 |
Proceedings | ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023 |
Proceedings publisher | ASSOC COMPUTING MACHINERY |
Language | English |
ISBN | 979-8-4007-0281-5 |
WOS accession number | WOS:001138802600058 |
Source URL | [http://ir.opt.ac.cn/handle/181661/97182] |
Collection | Xi'an Institute of Optics and Precision Mechanics_Center for Optical Imagery Learning and Analysis |
Corresponding author | Zhang, Mingguang |
Author affiliations | 1. Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China; 2. Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China |
Recommended citation (GB/T 7714) | Qiu, Mengyu,Guo, Yi,Zhang, Mingguang,et al. Simulating Human Visual System Based on Vision Transformer[C]. In:. Sydney, AUSTRALIA. 2023-10-13. |
Ingest method: OAI harvesting
Source: Xi'an Institute of Optics and Precision Mechanics