|
作者 | Wu ZQ(吴仲琦)1,2 ; Guo JW(郭建伟)1,2 ; Zhuang CQ(庄传青)1; Xiao J(肖俊)1; Yan DM(严冬明)1,2 ; Zhang XP(张晓鹏)1,2
|
刊名 | Computational Visual Media
 |
出版日期 | 2022
|
页码 | 14 |
关键词 | specular highlight detection
specular highlight removal
Unet
Transformer
|
英文摘要 | Specular highlight detection and removal is
a fundamental problem in computer vision and image
processing. In this paper, we present an efficient end-
to-end deep learning model for automatically detecting
and removing specular highlights in a single image.
In particular, an encoder-decoder network is utilized
to detect specular highlights, then a novel Unet-
Transformer network performs highlight removal; we
append transformer modules instead of feature maps
in the Unet architecture. We also introduce a highlight
detection module as a mask to guide the removal task.
Thus, these two networks can be jointly trained in
an effective manner. Thanks to the hierarchical and
global properties of the transformer mechanism, our
framework is able to establish relationships between
continuous self-attention layers, making it possible
to directly model the mapping between the diffuse
area and the specular highlight area, and reduce
indeterminacy within areas containing strong specular
highlight reflection. Experiments on public benchmark
and real-world images demonstrate that our approach
outperforms state-of-the-art methods for both highlight
detection and removal tasks. |
语种 | 英语
|
源URL | [http://ir.ia.ac.cn/handle/173211/48692]  |
专题 | 模式识别国家重点实验室_三维可视计算
|
通讯作者 | Guo JW(郭建伟); Xiao J(肖俊) |
作者单位 | 1.中国科学院大学 2.中国科学院自动化研究所
|
推荐引用方式 GB/T 7714 |
Wu ZQ,Guo JW,Zhuang CQ,et al. Joint Specular Highlight Detection and Removal in Single Images via Unet-Transformer[J]. Computational Visual Media,2022:14.
|
APA |
Wu ZQ,Guo JW,Zhuang CQ,Xiao J,Yan DM,&Zhang XP.(2022).Joint Specular Highlight Detection and Removal in Single Images via Unet-Transformer.Computational Visual Media,14.
|
MLA |
Wu ZQ,et al."Joint Specular Highlight Detection and Removal in Single Images via Unet-Transformer".Computational Visual Media (2022):14.
|