CDN4: A cross-view Deep Nearest Neighbor Neural Network for fine-grained few-shot classification (original) (raw)
Li, Xiaoxu, Ding, Shuo, Xie, Jiyang, Yang, Xiaochen ORCID: https://orcid.org/0000-0002-9299-5951, Ma, Zhanyu and Xue, Jing-Hao(2025) CDN4: A cross-view Deep Nearest Neighbor Neural Network for fine-grained few-shot classification.Pattern Recognition, 163, 111466. (doi: 10.1016/j.patcog.2025.111466)
Abstract
The fine-grained few-shot classification is a challenging task in computer vision, aiming to classify images with subtle and detailed differences given scarce labeled samples. A promising avenue to tackle this challenge is to use spatially local features to densely measure the similarity between query and support samples. Compared with image-level global features, local features contain more low-level information that is rich and transferable across categories. However, methods based on spatially localized features have difficulty distinguishing subtle category differences due to the lack of sample diversity. To address this issue, we propose a novel method called Cross-view Deep Nearest Neighbor Neural Network (CDN4). CDN4 applies a random geometric transformation to augment a different view of support and query samples and subsequently exploits four similarities between the original and transformed views of query local features and those views of support local features. The geometric augmentation increases the diversity between samples of the same class, and the cross-view measurement encourages the model to focus more on discriminative local features for classification through the cross-measurements between the two branches. Extensive experiments validate the superiority of CDN4, which achieves new state-of-the-art results in few-shot classification across various fine-grained benchmarks.
| Item Type: | Articles |
|---|---|
| Keywords: | Few-shot learning, fine-grained image classification, deep neural network, data augmentation. |
| Status: | Published |
| Refereed: | Yes |
| Glasgow Author(s) Enlighten ID: | Yang, Dr Xiaochen |
| Creator Roles: | Yang, X.Writing – review and editing, Visualization, Formal analysis |
| Authors: | Li, X., Ding, S., Xie, J., Yang, X., Ma, Z., and Xue, J.-H. |
| College/School: | College of Science and Engineering > School of Mathematics and Statistics > Statistics |
| Journal Name: | Pattern Recognition |
| Publisher: | Elsevier |
| ISSN: | 0031-3203 |
| ISSN (Online): | 1873-5142 |
| Published Online: | 18 February 2025 |
| Copyright Holders: | Copyright © 2025 The Authors |
| First Published: | First published in Pattern Recognition 163:111466 |
| Publisher Policy: | Reproduced under a Creative Commons licence |
University Staff: Request a correction | Enlighten Editors: Update this record
Deposit and Record Details
| ID Code: | 348748 |
|---|---|
| Depositing User: | Ms Gail Annan |
| Datestamp: | 20 Feb 2025 10:38 |
| Last Modified: | 18 Mar 2025 11:04 |
| Date of acceptance: | 10 February 2025 |
| Date of first online publication: | 18 February 2025 |
| Date Deposited: | 20 February 2025 |
| Data Availability Statement: | Yes |