中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Simulating Human Visual System Based on Vision Transformer

文献类型:会议论文

作者Qiu, Mengyu2; Guo, Yi1; Zhang, Mingguang2; Zhang, Jingwei2; Lan, Tian2; Liu, Zhilin2
出版日期2023
会议日期2023-10-13
会议地点Sydney, AUSTRALIA
关键词Visual scanpath prediction fixation duration prediction saccade Sequences visual attention scene analysis
DOI10.1145/3607822.3616408
英文摘要

The human visual system (HVS) is capable of responding in real-time to complex visual environments. During the process of freely observing visual scenes, predicting eye movements and visual fixations is a task known as scanpath prediction, which aims to simulate the HVS. In this paper, we propose a visual transformer-based model to study the attentional processes of the human visual system in analyzing visual scenes, thereby achieving scanpath prediction. This technology has important applications in human-computer interaction, virtual reality, augmented reality, and other fields. We have significantly simplified the workflow of scanpath prediction and the overall model architecture, achieving performance superior to existing methods.

产权排序2
会议录ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023
会议录出版者ASSOC COMPUTING MACHINERY
语种英语
ISBN号979-8-4007-0281-5
WOS记录号WOS:001138802600058
源URL[http://ir.opt.ac.cn/handle/181661/97182]  
专题西安光学精密机械研究所_光学影像学习与分析中心
通讯作者Zhang, Mingguang
作者单位1.Chinese Acad Sci Xian, Xian Inst Opt & Precis Mech, Xian, Peoples R China
2.Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
推荐引用方式
GB/T 7714
Qiu, Mengyu,Guo, Yi,Zhang, Mingguang,et al. Simulating Human Visual System Based on Vision Transformer[C]. 见:. Sydney, AUSTRALIA. 2023-10-13.

入库方式: OAI收割

来源:西安光学精密机械研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。