Sharing Weights in Shallow Layers via Rotation Group Equivariant Convolutions
文献类型:期刊论文
作者 | Zhiqiang Chen4![]() ![]() ![]() ![]() |
刊名 | Machine Intelligence Research
![]() |
出版日期 | 2022 |
卷号 | 19期号:2页码:115-126 |
关键词 | Convolutional neural networks (CNNs) group equivariance higher-degree weight sharing parameter efficiency |
ISSN号 | 2731-538X |
DOI | 10.1007/s11633-022-1324-5 |
英文摘要 | The convolution operation possesses the characteristic of translation group equivariance. To achieve more group equivariances, rotation group equivariant convolutions (RGEC) are proposed to acquire both translation and rotation group equivariances. However, previous work paid more attention to the number of parameters and usually ignored other resource costs. In this paper, we construct our networks without introducing extra resource costs. Specifically, a convolution kernel is rotated to different orientations for feature extractions of multiple channels. Meanwhile, much fewer kernels than previous works are used to ensure that the output channel does not increase. To further enhance the orthogonality of kernels in different orientations, we construct the non-maximum-suppression loss on the rotation dimension to suppress the other directions except the most activated one. Considering that the low-level-features benefit more from the rotational symmetry, we only share weights in the shallow layers (SWSL) via RGEC. Extensive experiments on multiple datasets (i.e., ImageNet, CIFAR, and MNIST) demonstrate that SWSL can effectively benefit from the higher-degree weight sharing and improve the performances of various networks, including plain and ResNet architectures. Meanwhile, the convolutional kernels and parameters are much fewer (e.g., 75%, 87.5% fewer) in the shallow layers, and no extra computation costs are introduced. |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/55936] ![]() |
专题 | 自动化研究所_学术期刊_International Journal of Automation and Computing |
作者单位 | 1.School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing 100191, China 2.Ningbo HwaMei Hospital, University of Chinese Academy of Sciences, Ningbo 315012, China 3.Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing 100190, China 4.Research Center for Brain-inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China |
推荐引用方式 GB/T 7714 | Zhiqiang Chen,Ting-Bing Xu,Jinpeng Li,et al. Sharing Weights in Shallow Layers via Rotation Group Equivariant Convolutions[J]. Machine Intelligence Research,2022,19(2):115-126. |
APA | Zhiqiang Chen,Ting-Bing Xu,Jinpeng Li,&Huiguang He.(2022).Sharing Weights in Shallow Layers via Rotation Group Equivariant Convolutions.Machine Intelligence Research,19(2),115-126. |
MLA | Zhiqiang Chen,et al."Sharing Weights in Shallow Layers via Rotation Group Equivariant Convolutions".Machine Intelligence Research 19.2(2022):115-126. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。