Cross-model retrieval with deep learning for business application
文献类型:会议论文
作者 | Wang, Yufei1; Wang, Huanting2,3; Yang, Jiating2; Chen, Jianbo3 |
出版日期 | 2021-03-09 |
会议日期 | 2020-11-14 |
会议地点 | Busan, Korea, Republic of |
关键词 | Cross-modal retrieval Audio features Deep hashing Useful information |
卷号 | 1802 |
期号 | 3 |
DOI | 10.1088/1742-6596/1802/3/032035 |
英文摘要 | Cross-modal retravel has been used in many fields, such as business and search engines. Most search engines for business are text-based, but text-based search engines are limited by equipment and the strict requirement for knowledge. Text-based search needs keyboards to finish the search process, which requires users to have the knowledge of using keyboards. Compared to the text-based search, audio-based search has advantages. First, it avoids the traditional ways of inputting information. And it gets rid of the gap in time between inputting information for searching and getting useful information. In this paper, we propose a way to use audio to search images for business applications. We use deep learning to implement cross-modal retrieval systems between images and audio. We first extract features from images and audio respectively. And then we implement a neural network with two identical networks to learn the correspondence between images and audio. The first network extracts the features from images and audio further for calculation, and the second network learns whether two features from different modalities are related. This research provides a new way for business applications to search for information more instantly. © Published under licence by IOP Publishing Ltd. |
产权排序 | 2 |
会议录 | 7th International Conference on Computer-Aided Design, Manufacturing, Modeling and Simulation, CDMMS 2020 - 2. Algorithm Design and Computational Science
![]() |
会议录出版者 | IOP Publishing Ltd |
语种 | 英语 |
ISSN号 | 17551307;17551315 |
源URL | [http://ir.opt.ac.cn/handle/181661/94577] ![]() |
专题 | 西安光学精密机械研究所_光学影像学习与分析中心 |
通讯作者 | Yang, Jiating |
作者单位 | 1.Simon Fraser University, 8888 University Dr, Bumaby; BC; V5A 1S6, Canada 2.Xian Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xian, China 3.University of Chinese Academy of Sciences, Beijing; 100049, China |
推荐引用方式 GB/T 7714 | Wang, Yufei,Wang, Huanting,Yang, Jiating,et al. Cross-model retrieval with deep learning for business application[C]. 见:. Busan, Korea, Republic of. 2020-11-14. |
入库方式: OAI收割
来源:西安光学精密机械研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。