Remote Sensing Image Generation From Audio
Document type: Journal article
Authors | Zheng, Zhiyuan1,2; Chen, Jun2; Zheng, Xiangtao1 |
Journal | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS |
Publication date | 2021-06 |
Volume | 18; Issue: 6; Pages: 994-998 |
Keywords | Remote sensing; Semantics; Feature extraction; Gallium nitride; Neural networks; Sensors; Mel frequency cepstral coefficient; Cross-modal generation; reranking |
ISSN | 1545-598X; 1558-0571 |
DOI | 10.1109/LGRS.2020.2992324 |
Affiliation rank | 1 |
Abstract (English) | Generating image from other modal data has attracted much attention in cross-modal studies, since the generated image offers intuitive vision information. Unlike the previous works which generate an image from text, a novel task is introduced, generating an image from audio. However, semantic gap intrinsically exists in cross-modal data, which disturbs the generative results. In order to explore the relevance between the audio and image, a novel reranking audio-image translation method is proposed. The proposed method: 1) maps the audio and image into a uniform feature space; 2) designs an audio-audio matching network to match the related audio; and 3) adopts an audio-image matching network for every matched audio to generate a related image, and the most frequent image is voted as the final result. Extensive experiments on two remote sensing cross-modal data sets demonstrate that the proposed method can visualize the content of audio. |
Language | English |
WOS accession number | WOS:000652799700012 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
Source URL | http://ir.opt.ac.cn/handle/181661/94861 |
Collection | Xi'an Institute of Optics and Precision Mechanics_Center for Optical Imagery Analysis and Learning |
Corresponding author | Zheng, Xiangtao |
Author affiliations | 1. Chinese Acad Sci, Xian Inst Opt & Precis Mech, Key Lab Spectral Imaging Technol CAS, Xian 710119, Peoples R China; 2. Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China |
Recommended citation (GB/T 7714) | Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,et al. Remote Sensing Image Generation From Audio[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,2021,18(6):994-998. |
APA | Zheng, Zhiyuan,Chen, Jun,Zheng, Xiangtao,&Lu, Xiaoqiang.(2021).Remote Sensing Image Generation From Audio.IEEE GEOSCIENCE AND REMOTE SENSING LETTERS,18(6),994-998. |
MLA | Zheng, Zhiyuan,et al."Remote Sensing Image Generation From Audio".IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 18.6(2021):994-998. |
Ingest method: OAI harvesting
Source: Xi'an Institute of Optics and Precision Mechanics
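
Note on the abstract: the match-then-vote procedure it describes (steps 2 and 3) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes step 1 has already been done, so audio and image features are plain vectors in one shared space, and it substitutes cosine-similarity nearest-neighbour lookup for the paper's learned matching networks. The function names, the parameter `k`, and the toy vectors are hypothetical.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def top_k(query, gallery, k):
    """Indices of the k gallery vectors most similar to the query."""
    order = sorted(range(len(gallery)),
                   key=lambda i: cosine(query, gallery[i]),
                   reverse=True)
    return order[:k]

def rerank_audio_to_image(query_audio, audio_feats, image_feats, k=3):
    """Return the index of the image voted for by the k matched audio clips."""
    # Step 2 (stand-in): audio-audio matching by similarity in the shared space.
    matched = top_k(query_audio, audio_feats, k)
    # Step 3 (stand-in): each matched audio picks its closest image,
    # and the most frequently picked image is the final result.
    votes = Counter(top_k(audio_feats[i], image_feats, 1)[0] for i in matched)
    return votes.most_common(1)[0][0]

# Toy data, assumed to be already mapped into the shared feature space (step 1).
audio_feats = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.2], [0.0, 1.0]]
image_feats = [[1.0, 0.05], [0.0, 1.0]]
result = rerank_audio_to_image([0.95, 0.05], audio_feats, image_feats, k=3)
```

Voting over several matched clips, rather than trusting the single best audio-image match, is what makes this a reranking scheme: one noisy match is outvoted by the others.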