MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition
文献类型:期刊论文
| 作者 | Luequan Wang; Hongbin Xu; Wenxiong Kang |
| 刊名 | Machine Intelligence Research
![]() |
| 出版日期 | 2023 |
| 卷号 | 20期号:6页码:872-883 |
| 关键词 | Multi view, unsupervised pretraining, contrastive learning, 3D vision, shape recognition |
| ISSN号 | 2731-538X |
| DOI | 10.1007/s11633-023-1430-z |
| 英文摘要 | 3D shape recognition has drawn much attention in recent years. The view-based approach performs best of all. However, the current multi-view methods are almost all fully supervised, and the pretraining models are almost all based on ImageNet. Although the pretraining results of ImageNet are quite impressive, there is still a significant discrepancy between multi-view datasets and ImageNet. Multi-view datasets naturally retain rich 3D information. In addition, large-scale datasets such as ImageNet require considerable cleaning and annotation work, so it is difficult to regenerate a second dataset. In contrast, unsupervised learning methods can learn general feature representations without any extra annotation. To this end, we propose a three-stage unsupervised joint pretraining model. Specifically, we decouple the final representations into three fine-grained representations. Data augmentation is utilized to obtain pixel level representations within each view. And we boost the spatial invariant features from the view level. Finally, we exploit global information at the shape level through a novel extract-and-swap module. Experimental results demonstrate that the proposed method gains significantly in 3D object classification and retrieval tasks, and shows generalization to cross-dataset tasks. |
| 源URL | [http://ir.ia.ac.cn/handle/173211/56015] ![]() |
| 专题 | 自动化研究所_学术期刊_International Journal of Automation and Computing |
| 作者单位 | School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China |
| 推荐引用方式 GB/T 7714 | Luequan Wang,Hongbin Xu,Wenxiong Kang. MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition[J]. Machine Intelligence Research,2023,20(6):872-883. |
| APA | Luequan Wang,Hongbin Xu,&Wenxiong Kang.(2023).MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition.Machine Intelligence Research,20(6),872-883. |
| MLA | Luequan Wang,et al."MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition".Machine Intelligence Research 20.6(2023):872-883. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。

