中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images

文献类型:期刊论文

作者Mengqi Rong1,2,3; Hainan Cui1,2,3; Shuhan Shen1,2,3
刊名IEEE Transactions on Image Processing
出版日期2023-06-20
卷号32页码:3521-3535
文献子类期刊论文
英文摘要

Inspired by Active Learning and 2D-3D semantic fusion, we proposed a novel framework for 3D scene semantic segmentation based on rendered 2D images, which could efficiently achieve semantic segmentation of any large-scale 3D scene with only a few 2D image annotations. In our framework, we first render perspective images at certain positions in the 3D scene. Then we continuously fine-tune a pre-trained network for image semantic segmentation and project all dense predictions to the 3D model for fusion. In each iteration, we evaluate the 3D semantic model and re-render images in several representative areas where the 3D segmentation is not stable and send them to the network for training after annotation. Through this iterative process of rendering-segmentation-fusion, it can effectively generate difficult-to-segment image samples in the scene, while avoiding complex 3D annotations, so as to achieve label-efficient 3D scene segmentation. Experiments on three large-scale indoor and outdoor 3D datasets demonstrate the effectiveness of the proposed method compared with other state-of-the-art.

源URL[http://ir.ia.ac.cn/handle/173211/52435]  
专题精密感知与控制研究中心_精密感知与控制
通讯作者Shuhan Shen
作者单位1.CASIA-SenseTime Research Group, Beijing, China
2.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
3.Institute of Automation, Chinese Academy of Sciences, Beijing, China
推荐引用方式
GB/T 7714
Mengqi Rong,Hainan Cui,Shuhan Shen. Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images[J]. IEEE Transactions on Image Processing,2023,32:3521-3535.
APA Mengqi Rong,Hainan Cui,&Shuhan Shen.(2023).Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images.IEEE Transactions on Image Processing,32,3521-3535.
MLA Mengqi Rong,et al."Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images".IEEE Transactions on Image Processing 32(2023):3521-3535.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。