Zernike Pooling: Generalizing Average Pooling Using Zernike Moments

Abstract:


Most of the established neural network architectures in computer vision are essentially composed of the same building blocks (e.g., convolutional, normalization, regularization and pooling layers), with their main difference being the connectivity of these components within the architecture and not the components themselves. In this paper we propose a generalization of the traditional average pooling operator. Based on the requirements of efficiency (to provide information without repetition), equivalence (to be able to produce the same output as average pooling) and extendability (to provide a natural way of obtaining novel information), we arrive at a formulation that generalizes average pooling using the Zernike moments. Experimental results on the CIFAR-10, CIFAR-100 and Rotated MNIST datasets showed that the proposed method was able to outperform the two baseline approaches, global average pooling and 2x2 average pooling, as well as the two variants of Stochastic pooling and AlphaMEX in every case. A worst-case performance analysis on CIFAR-100 showed that significant gains in classification accuracy can be realised with only a modest 10% increase in training time.
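
The equivalence requirement can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the pooling window is inscribed in the unit disk so that every pixel contributes, and that the discrete moment is normalised by the pixel count. The names `radial_poly` and `zernike_pool` are hypothetical. Under these assumptions the zero-order moment reduces to the plain mean of the window, and higher orders add further descriptors.

```python
from math import factorial

import numpy as np


def radial_poly(n, m, rho):
    """Zernike radial polynomial R_n^|m|(rho); needs n - |m| even and >= 0."""
    m = abs(m)
    out = np.zeros_like(rho, dtype=float)
    for k in range((n - m) // 2 + 1):
        coeff = ((-1) ** k * factorial(n - k)
                 / (factorial(k)
                    * factorial((n + m) // 2 - k)
                    * factorial((n - m) // 2 - k)))
        out += coeff * rho ** (n - 2 * k)
    return out


def zernike_pool(patch, orders):
    """Complex Zernike moments of a 2-D window for the given (n, m) pairs.

    Assumption (not taken from the paper): the window is scaled to sit
    inside the unit disk and moments are normalised by the pixel count.
    """
    h, w = patch.shape
    # Pixel centres in [-1, 1], then shrunk by sqrt(2) so corners stay in the disk.
    ys = ((np.arange(h) + 0.5) / h * 2 - 1) / np.sqrt(2)
    xs = ((np.arange(w) + 0.5) / w * 2 - 1) / np.sqrt(2)
    x, y = np.meshgrid(xs, ys)
    rho = np.hypot(x, y)
    theta = np.arctan2(y, x)
    moments = []
    for n, m in orders:
        # Complex conjugate of the Zernike basis function V_nm.
        basis = radial_poly(n, m, rho) * np.exp(-1j * m * theta)
        moments.append((n + 1) * np.mean(patch * basis))
    return np.array(moments)


patch = np.arange(16.0).reshape(4, 4)
z = zernike_pool(patch, [(0, 0), (1, 1), (2, 0)])
print(np.isclose(z[0].real, patch.mean()))
```

With only the order `(0, 0)` requested, the sketch behaves like average pooling over the window; the additional orders are where any new information would come from.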


  • T. Theodoridis, K. Loumponias, N. Vretos, P. Daras, "Zernike Pooling: Generalizing Average Pooling Using Zernike Moments", IEEE Access, Volume 9, pp. 121128-121136, 2021. DOI: https://doi.org/10.1109/ACCESS.2021.3108630

    Contact Information

    Dr. Petros Daras, Research Director
    6th km Charilaou – Thermi Rd, 57001, Thessaloniki, Greece
    P.O.Box: 60361
    Tel.: +30 2310 464160 (ext. 156)
    Fax: +30 2310 464164
    Email: daras(at)iti(dot)gr