This paper addresses three issues in integrating part-based representations into convolutional neural networks (CNNs) for object recognition. First, most part-based mod-els rely on a few pre-specified object parts. However, the optimal object parts for recognition often vary from cat-egory to category. Second, acquiring training data with part-level annotation is labor-intensive. Third, modeling spatial relationships between parts in CNNs often involves an exhaustive search of part templates over multiple net-work streams. We tackle the three issues by introducing a new network layer, called co-occurrence layer. It can ex-tend a convolutional layer to encode the co-occurrence be-tween the visual parts detected by the numerous neurons, instead of a few pre-specified parts. To this end, the feature maps serve as both filters and images, and mutual correla-tion filtering is conducted between them. The co-occurrence layer is end-to-end trainable. The resultant co-occurrence features are rotation-and translation-invariant, and are ro-bust to object deformation. By applying this new layer to the VGG-16 and ResNet-152, we achieve the recogni-tion rates of 83.6% and 85.8% on the Caltech-UCSD bird benchmark, respectively. The source code is available at https://github.com/yafangshih/Deep-COOC.
|Original language||American English|
|Title of host publication||2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)|
|Publisher||Institute of Electrical and Electronics Engineers Inc.|
|Number of pages||10|
|State||Published - 6 Nov 2017|
|Name||Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017|
Shih, Y. F., Yeh, Y. M., Lin, Y. Y., Weng, M. F., Lu, Y. C., & Chuang, Y. Y. (2017). Deep co-occurrence feature learning for visual object recognition. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 7302-7311). (Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017; Vol. 2017-January). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CVPR.2017.772