CoConv: Learning Dynamic Cooperative Convolution for Image Recognition

Published in International Conference on Multimedia & Expo (Oral), 2021

Abstract. In this paper, we present a conceptually simple, yet powerful method for image recognition. The method, called Cooperative Dynamic Convolution (CoConv), introduces a cooperative learning of dynamic convolution from multiple convolutional experts. CoConv can be used as a substitute for the traditional static convolution, and can be seamlessly integrated in various visual models. Moreover, CoConv is easy to train with only a minimal computational overhead introduced in the inference phase. CoConv is trained by using multiple convolutional experts simultaneously, and the convolutional weights are merged by a weighted summation before convolutional operations for efficiency during inference. Results from extensive experiments show that CoConv leads to consistent improvement for image classification on various datasets, independent of the choice of the base convolutional network. Remarkably, CoConv improves the top-1 classification accuracy of ResNet18 by 3.06% on ImageNet.

Recommended citation: Kien X. Nguyen, Tiffany Ryu, Jocelyn Zhang, Xu Ma, Qing Yang, Song Fu, Paparao Palacharla, Nannan Wang, and Xi Wang. "CoConv: Learning Dynamic Cooperative Convolution for Image Recognition." In IEEE International Conference on Multimedia & Expo, 2021.
Download Paper | Download Bibtex