中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Stereo Depth Estimation with Echoes

文献类型:会议论文

作者Zhang, Chenghao; Tian, Kun; Ni, Bolin; Meng, Gaofeng; Fan, Bin; Zhang, Zhaoxiang; Pan, Chunhong
出版日期2022-10
会议日期2022.10.24
会议地点以色列特拉维夫
英文摘要
Stereo depth estimation is particularly amenable to local  textured regions while echoes have good depth estimations for global textureless regions, thus the two modalities complement each other. Motivated by the reciprocal relationship between both modalities, in this paper, we propose an end-to-end framework named StereoEchoes for stereo depth estimation with echoes. A Cross-modal Volume Refinement module is designed to transfer the complementary knowledge of the audio modality to the visual modality at feature level. A Relative Depth Uncertainty Estimation module is further proposed to yield pixel-wise confidence for multimodal depth fusion at output space. As there is no dataset for this new problem, we introduce two Stereo-Echo datasets named Stereo-Replica and Stereo-Matterport3D for the first time. Remarkably, we show empirically that our StereoEchoes, on Stereo-Replica and Stereo-Matterport3D, outperforms stereo depth estimation methods by 25%/13.8% RMSE, and surpasses the state-of-the-art audio-visual depth prediction method by 25.3%/42.3% RMSE.
语种英语
源URL[http://ir.ia.ac.cn/handle/173211/51492]  
专题自动化研究所_模式识别国家重点实验室_遥感图像处理团队
通讯作者Meng, Gaofeng
作者单位1.School of Artificial Intelligence, University of Chinese Academy of Sciences
2.University of Science and Technology Beijing
3.NLPR, Institute of Automation, Chinese Academy of Sciences
4.CAIR, HK Institute of Science and Innovation, Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Zhang, Chenghao,Tian, Kun,Ni, Bolin,et al. Stereo Depth Estimation with Echoes[C]. 见:. 以色列特拉维夫. 2022.10.24.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。