中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Text-Free Controllable 3-D Point Cloud Generation

文献类型:期刊论文

作者Xiao, Haihong6; Kang, Wenxiong4,5,6; Li YQ(李玉琼)1,2,3; Xu, Hongbin6
刊名IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT
出版日期2024
卷号73页码:12
关键词3-D point cloud text-free controllable point cloud generation text-guided 3-D generative modeling
ISSN号0018-9456
DOI10.1109/TIM.2024.3353839
通讯作者Kang, Wenxiong(auwxkang@scut.edu.cn) ; Li, Yuqiong(liyuqiong@imech.ac.cn)
英文摘要Generating 3-D shapes with text inputs has long been a peculiar challenge in computer vision, which requires methodological know-how as well as a sense of art. Recently, text-to-image generation has driven remarkable progress, raising tremendous interest in text-guided shape generation, which further paves the way for industrial design. Nevertheless, prior efforts on text-guided 3-D synthesis either lack geometric details, are limited by the simple text input, or need expensive optimization and additional postprocessing, which make them unfriendly for novices. In this research, we present TFCNet, a novel approach for text-free controllable point cloud generation. In the training phase, we first design an empirically robust cross-modal skeletal point generator (CM-SPG) to predict skeletal points of the specific shape conditioned on the single image input. Then, we develop a diffusion-based dense point generator, which takes skeletal points as geometric guidance to produce dense point clouds that are faithful to the input images. In the inference phase, we propose an efficient text-free nonparametric transfer regime, which does not require separate training and can directly generate point cloud shapes while being semantically faithful to the provided text input. As evidenced by our experiments on the ShapeNet(v2) and CO3D datasets, our proposed method outperforms existing state of-the-art methods both quantitatively and qualitatively.
分类号一类
资助项目National Natural Science Foundation of China
WOS研究方向Engineering ; Instruments & Instrumentation
语种英语
WOS记录号WOS:001174112800006
资助机构National Natural Science Foundation of China
其他责任者Kang, Wenxiong ; Li, Yuqiong
源URL[http://dspace.imech.ac.cn/handle/311007/94750]  
专题力学研究所_流固耦合系统力学重点实验室(2012-)
力学研究所_非线性力学国家重点实验室
作者单位1.Guangdong Aerosp Res Acad, Guangzhou 511458, Peoples R China
2.Chinese Acad Sci, Inst Mech, State Key Lab Nonlinear Mech, Beijing 100190, Peoples R China;
3.Chinese Acad Sci, Key Lab Mech Fluid Solid Coupling Syst, Inst Mech, Beijing 100190, Peoples R China;
4.Pazhou Lab, Young Scholar Project Ctr, Guangzhou 510335, Peoples R China;
5.South China Univ Technol, Sch Future Technol, Guangzhou 510641, Peoples R China;
6.South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 511442, Peoples R China;
推荐引用方式
GB/T 7714
Xiao, Haihong,Kang, Wenxiong,Li YQ,et al. Text-Free Controllable 3-D Point Cloud Generation[J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT,2024,73:12.
APA Xiao, Haihong,Kang, Wenxiong,李玉琼,&Xu, Hongbin.(2024).Text-Free Controllable 3-D Point Cloud Generation.IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT,73,12.
MLA Xiao, Haihong,et al."Text-Free Controllable 3-D Point Cloud Generation".IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT 73(2024):12.

入库方式: OAI收割

来源:力学研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。