Stereo Depth Estimation with Echoes
文献类型:会议论文
作者 | Zhang, Chenghao; Tian, Kun; Ni, Bolin; Meng, Gaofeng; Fan, Bin; Zhang, Zhaoxiang; Pan, Chunhong |
出版日期 | 2022-10 |
会议日期 | 2022.10.24 |
会议地点 | 以色列特拉维夫 |
英文摘要 | Stereo depth estimation is particularly amenable to local textured regions while echoes have good depth estimations for global textureless regions, thus the two modalities complement each other. Motivated by the reciprocal relationship between both modalities, in this paper, we propose an end-to-end framework named StereoEchoes for stereo depth estimation with echoes. A Cross-modal Volume Refinement module is designed to transfer the complementary knowledge of the audio modality to the visual modality at feature level. A Relative Depth Uncertainty Estimation module is further proposed to yield pixel-wise confidence for multimodal depth fusion at output space. As there is no dataset for this new problem, we introduce two Stereo-Echo datasets named Stereo-Replica and Stereo-Matterport3D for the first time. Remarkably, we show empirically that our StereoEchoes, on Stereo-Replica and Stereo-Matterport3D, outperforms stereo depth estimation methods by 25%/13.8% RMSE, and surpasses the state-of-the-art audio-visual depth prediction method by 25.3%/42.3% RMSE. |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/51492] |
专题 | 自动化研究所_模式识别国家重点实验室_遥感图像处理团队 |
通讯作者 | Meng, Gaofeng |
作者单位 | 1.School of Artificial Intelligence, University of Chinese Academy of Sciences 2.University of Science and Technology Beijing 3.NLPR, Institute of Automation, Chinese Academy of Sciences 4.CAIR, HK Institute of Science and Innovation, Chinese Academy of Sciences |
推荐引用方式 GB/T 7714 | Zhang, Chenghao,Tian, Kun,Ni, Bolin,et al. Stereo Depth Estimation with Echoes[C]. 见:. 以色列特拉维夫. 2022.10.24. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。