Chinese Academy of Sciences Institutional Repositories Grid
Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

Document type: Journal article

Authors: Zhang, Pingping (2); Wang, Shiqi (2,3); Wang, Meng (2); Li, Jiguo (4); Wang, Xu (1,5); Kwong, Sam (2,3)
Journal: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Publication date: 2023-08-01
Volume: 33; Issue: 8; Pages: 4441-4445
ISSN: 1051-8215
Keywords: Semantic image compression; cross-modality; scalable coding
DOI: 10.1109/TCSVT.2023.3241225
Abstract: This article proposes the scalable cross-modality compression (SCMC) paradigm, in which the image compression problem is further cast into a representation task by hierarchically sketching the image with different modalities. Herein, we adopt the conceptual organization philosophy to model the overwhelmingly complicated visual patterns, based upon the semantic, structure, and signal level representation accounting for different tasks. The SCMC paradigm that incorporates the representation at different granularities supports diverse application scenarios, such as high-level semantic communication and low-level image reconstruction. The decoder, which enables the recovery of the visual information, benefits from the scalable coding based upon the semantic, structure, and signal layers. Qualitative and quantitative results demonstrate that the SCMC can convey accurate semantic and perceptual information of images, especially at low bitrates, and promising rate-distortion performance has been achieved compared to state-of-the-art methods. The code will be available online at https://github.com/ppingzhang/SCMC.
Funding: National Natural Science Foundation of China [62022002]; National Natural Science Foundation of China [61871270]; Shenzhen Science and Technology Program [JCYJ20220530140816037]; Shenzhen Natural Science Foundation [JCYJ20200109110410133]; Hong Kong Innovation and Technology Commission (InnoHK); Hong Kong General Research Fund-Research Grants Council (GRF-RGC) [11209819]; Hong Kong General Research Fund-Research Grants Council (GRF-RGC) [9042816]; Hong Kong General Research Fund-Research Grants Council (GRF-RGC) [11203820]; Hong Kong General Research Fund-Research Grants Council (GRF-RGC) [9042598]
WOS research area: Engineering
Language: English
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
WOS accession number: WOS:001045167400070
Source URL: http://119.78.100.204/handle/2XEOYT63/21371
Collection: Institute of Computing Technology, Chinese Academy of Sciences, Journal Articles (English)
Corresponding author: Wang, Shiqi
Author affiliations:
1. Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
2. City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
3. City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
4. Univ Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China
5. Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518060, Peoples R China
Recommended citation:
GB/T 7714: Zhang, Pingping, Wang, Shiqi, Wang, Meng, et al. Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33(8): 4441-4445.
APA: Zhang, Pingping, Wang, Shiqi, Wang, Meng, Li, Jiguo, Wang, Xu, & Kwong, Sam. (2023). Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 33(8), 4441-4445.
MLA: Zhang, Pingping, et al. "Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer". IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 33.8 (2023): 4441-4445.

Ingest method: OAI harvesting

Source: Institute of Computing Technology


Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.