Chinese Academy of Sciences Institutional Repositories Grid
Remote Sensing Image Generation From Audio

Document type: Journal article

Authors: Zheng, Zhiyuan (1, 2); Chen, Jun (2); Zheng, Xiangtao (1); Lu, Xiaoqiang (1)
Journal: IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
Publication date: 2021-06
Volume: 18; Issue: 6; Pages: 994-998
Keywords: Remote sensing; Semantics; Feature extraction; Gallium nitride; Neural networks; Sensors; Mel frequency cepstral coefficient; Cross-modal generation; reranking
ISSN: 1545-598X; 1558-0571
DOI: 10.1109/LGRS.2020.2992324
Ownership rank: 1
Abstract

Generating images from data of other modalities has attracted much attention in cross-modal studies, since a generated image offers intuitive visual information. Unlike previous works that generate an image from text, a novel task is introduced: generating an image from audio. However, a semantic gap intrinsically exists in cross-modal data, which disturbs the generated results. To explore the relevance between audio and image, a novel reranking audio-image translation method is proposed. The proposed method: 1) maps the audio and image into a uniform feature space; 2) designs an audio-audio matching network to match the related audio; and 3) adopts an audio-image matching network for every matched audio to generate a related image, and the most frequent image is voted as the final result. Extensive experiments on two remote sensing cross-modal data sets demonstrate that the proposed method can visualize the content of audio.
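
The three steps listed in the abstract can be illustrated with a minimal sketch. This record does not describe the authors' networks, so the sketch makes several assumptions: audio and image features are taken as already embedded in one shared space (step 1), plain cosine similarity stands in for both learned matching networks (steps 2 and 3), and a gallery image is retrieved instead of being generated. The names `cosine_sim`, `rerank_vote`, and `k` are hypothetical.

```python
from collections import Counter

import numpy as np


def cosine_sim(query, gallery):
    """Cosine similarity between one vector and every row of a matrix."""
    q = query / np.linalg.norm(query)
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    return g @ q


def rerank_vote(query_audio, audio_feats, image_feats, k=3):
    """Pick an image for a query audio clip by voting over matched audio.

    Assumption: every feature vector already lives in the shared space.
    """
    # Stand-in for the audio-audio matching network: the k nearest clips.
    audio_scores = cosine_sim(query_audio, audio_feats)
    matched = np.argsort(-audio_scores)[:k]

    # Stand-in for the audio-image matching network: each matched clip
    # nominates the image most similar to it.
    votes = [int(np.argmax(cosine_sim(audio_feats[i], image_feats)))
             for i in matched]

    # The most frequent nomination wins; on a tie the earliest vote wins.
    winner = Counter(votes).most_common(1)[0][0]
    return winner, votes


# Toy data: three images on the coordinate axes, four gallery audio clips.
image_feats = np.eye(3)
audio_feats = np.array([[0.9, 0.1, 0.0],
                        [0.8, 0.2, 0.0],
                        [0.1, 0.9, 0.0],
                        [0.0, 0.0, 1.0]])
query = np.array([0.7, 0.3, 0.0])
winner, votes = rerank_vote(query, audio_feats, image_feats, k=3)
```

In this toy case two of the three matched clips nominate image 0, so the vote returns image 0 even though one matched clip points to image 1. Outvoting a single mismatched clip in this way is the intuition behind reranking.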

Language: English
WOS accession number: WOS:000652799700012
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Source URL: http://ir.opt.ac.cn/handle/181661/94861
Collection: Xi'an Institute of Optics and Precision Mechanics / Center for Optical Imagery Analysis and Learning
Corresponding author: Zheng, Xiangtao
Author affiliations:
1. Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Peoples R China
2. Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
Recommended citation:
GB/T 7714: Zheng, Zhiyuan, Chen, Jun, Zheng, Xiangtao, et al. Remote Sensing Image Generation From Audio[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18(6): 994-998.
APA: Zheng, Zhiyuan, Chen, Jun, Zheng, Xiangtao, & Lu, Xiaoqiang. (2021). Remote Sensing Image Generation From Audio. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 18(6), 994-998.
MLA: Zheng, Zhiyuan, et al. "Remote Sensing Image Generation From Audio". IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6 (2021): 994-998.

Ingest method: OAI harvesting

Source: Xi'an Institute of Optics and Precision Mechanics


Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.