CDN4: A cross-view Deep Nearest Neighbor Neural Network for fine-grained few-shot classification (original) (raw)

Li, Xiaoxu, Ding, Shuo, Xie, Jiyang, Yang, Xiaochen ORCID logo ORCID: https://orcid.org/0000-0002-9299-5951, Ma, Zhanyu and Xue, Jing-Hao(2025) CDN4: A cross-view Deep Nearest Neighbor Neural Network for fine-grained few-shot classification.Pattern Recognition, 163, 111466. (doi: 10.1016/j.patcog.2025.111466)

Abstract

The fine-grained few-shot classification is a challenging task in computer vision, aiming to classify images with subtle and detailed differences given scarce labeled samples. A promising avenue to tackle this challenge is to use spatially local features to densely measure the similarity between query and support samples. Compared with image-level global features, local features contain more low-level information that is rich and transferable across categories. However, methods based on spatially localized features have difficulty distinguishing subtle category differences due to the lack of sample diversity. To address this issue, we propose a novel method called Cross-view Deep Nearest Neighbor Neural Network (CDN4). CDN4 applies a random geometric transformation to augment a different view of support and query samples and subsequently exploits four similarities between the original and transformed views of query local features and those views of support local features. The geometric augmentation increases the diversity between samples of the same class, and the cross-view measurement encourages the model to focus more on discriminative local features for classification through the cross-measurements between the two branches. Extensive experiments validate the superiority of CDN4, which achieves new state-of-the-art results in few-shot classification across various fine-grained benchmarks.

Item Type:	Articles
Keywords:	Few-shot learning, fine-grained image classification, deep neural network, data augmentation.
Status:	Published
Refereed:	Yes
Glasgow Author(s) Enlighten ID:	Yang, Dr Xiaochen
Creator Roles:	Yang, X.Writing – review and editing, Visualization, Formal analysis
Authors:	Li, X., Ding, S., Xie, J., Yang, X., Ma, Z., and Xue, J.-H.
College/School:	College of Science and Engineering > School of Mathematics and Statistics > Statistics
Journal Name:	Pattern Recognition
Publisher:	Elsevier
ISSN:	0031-3203
ISSN (Online):	1873-5142
Published Online:	18 February 2025
Copyright Holders:	Copyright © 2025 The Authors
First Published:	First published in Pattern Recognition 163:111466
Publisher Policy:	Reproduced under a Creative Commons licence

University Staff: Request a correction | Enlighten Editors: Update this record

Deposit and Record Details

ID Code:	348748
Depositing User:	Ms Gail Annan
Datestamp:	20 Feb 2025 10:38
Last Modified:	18 Mar 2025 11:04
Date of acceptance:	10 February 2025
Date of first online publication:	18 February 2025
Date Deposited:	20 February 2025
Data Availability Statement:	Yes