Chinese Academy of Sciences Institutional Repositories Grid
Cross-model retrieval with deep learning for business application

Document type: Conference paper

Authors: Wang, Yufei (1); Wang, Huanting (2,3); Yang, Jiating (2); Chen, Jianbo (3)
Publication date: 2021-03-09
Conference date: 2020-11-14
Conference location: Busan, Korea, Republic of
Keywords: Cross-modal retrieval; Audio features; Deep hashing; Useful information
Volume: 1802
Issue: 3
DOI: 10.1088/1742-6596/1802/3/032035
Abstract

Cross-modal retrieval has been used in many fields, such as business and search engines. Most search engines for business are text-based, but text-based search is limited by equipment and by what it demands of the user: it needs a keyboard to complete the search, and users must know how to use one. Audio-based search has advantages over text-based search. First, it avoids the traditional ways of inputting information. Second, it removes the delay between entering a query and getting useful information. In this paper, we propose a way to use audio to search images for business applications. We use deep learning to implement a cross-modal retrieval system between images and audio. We first extract features from images and audio respectively. We then implement a neural network with two identical networks to learn the correspondence between images and audio. The first network further extracts features from images and audio for calculation, and the second network learns whether two features from different modalities are related. This research provides a new way for business applications to search for information more quickly. © Published under licence by IOP Publishing Ltd.
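The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the layer sizes, the feature dimensions (`IMG_DIM`, `AUD_DIM`, `SHARED_DIM`), the use of separate projection weights per modality, and all function names are assumptions made for illustration, and the weights are random rather than trained. It only shows how pre-extracted image and audio features might be projected into a shared space and then scored for relatedness.

```python
import math
import random

random.seed(0)

def init_mlp(sizes):
    """Random weights and zero biases for a small fully connected network."""
    return [([[random.gauss(0, 0.1) for _ in range(n)] for _ in range(m)], [0.0] * n)
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp(params, x):
    """Forward pass for one vector: ReLU on hidden layers, linear output."""
    for i, (w, b) in enumerate(params):
        x = [sum(xi * row[j] for xi, row in zip(x, w)) + b[j] for j in range(len(b))]
        if i < len(params) - 1:
            x = [max(v, 0.0) for v in x]
    return x

# Hypothetical sizes for features that were extracted beforehand.
IMG_DIM, AUD_DIM, SHARED_DIM = 32, 16, 8

# Stage 1 (assumed): project each modality into a shared space.
img_proj = init_mlp([IMG_DIM, 16, SHARED_DIM])
aud_proj = init_mlp([AUD_DIM, 16, SHARED_DIM])
# Stage 2 (assumed): score whether an (image, audio) pair is related.
matcher = init_mlp([2 * SHARED_DIM, 8, 1])

def match_score(img_feat, aud_feat):
    """Relatedness score in (0, 1) for one image and one audio query."""
    joint = mlp(img_proj, img_feat) + mlp(aud_proj, aud_feat)  # list concatenation
    logit = mlp(matcher, joint)[0]
    return 1.0 / (1.0 + math.exp(-logit))

def retrieve(img_feats, aud_feat, k=3):
    """Return the indices of the k best-scoring images, plus all scores."""
    scores = [match_score(f, aud_feat) for f in img_feats]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k], scores

# Random stand-ins for real image and audio features.
images = [[random.gauss(0, 1) for _ in range(IMG_DIM)] for _ in range(10)]
query = [random.gauss(0, 1) for _ in range(AUD_DIM)]
top, scores = retrieve(images, query, k=3)
```

With untrained weights the ranking is meaningless; in practice the projection and matching networks would be trained on paired image and audio examples before `retrieve` is used.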

Ownership order: 2
Proceedings: 7th International Conference on Computer-Aided Design, Manufacturing, Modeling and Simulation, CDMMS 2020 - 2. Algorithm Design and Computational Science
Proceedings publisher: IOP Publishing Ltd
Language: English
ISSN: 1755-1307; 1755-1315
Source URL: http://ir.opt.ac.cn/handle/181661/94577
Collection: Xi'an Institute of Optics and Precision Mechanics_Optical Imagery Learning and Analysis Center
Corresponding author: Yang, Jiating
Author affiliations: 1. Simon Fraser University, 8888 University Dr, Burnaby, BC V5A 1S6, Canada
2. Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an, China
3. University of Chinese Academy of Sciences, Beijing 100049, China
Recommended citation
GB/T 7714
Wang, Yufei,Wang, Huanting,Yang, Jiating,et al. Cross-model retrieval with deep learning for business application[C]. In:. Busan, Korea, Republic of. 2020-11-14.

Ingest method: OAI harvesting

Source: Xi'an Institute of Optics and Precision Mechanics

