CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME

Pierre Le Jeune, Anissa Mokraoui

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Lecture 10 Oct 2023

Small object detection is a challenging task in computer vision. In the few-shot regime, this problem is reinforced. Leveraging useful information from only a few examples is difficult, in particular with small objects. We hypothesize that features extracted from small objects are noisy and often dominated by background information. In addition, recent detectors rely on multi-scale features and visually similar objects of different sizes may have unaligned representations. We address these issues with Cross-Scale Query-Support Alignment (XQSA) a novel attention mechanism that combines features from query and support images at different scales. This allows matching objects of different sizes and therefore improves Few-Shot Object Detection (FSOD) performance. Extensive experiments are conducted on four distinct datasets, including natural images (Pascal VOC and MS COCO) and aerial images (DOTA and DIOR). XQSA improves the detection of small objects on all tested datasets. In aerial images, which contain smaller objects, it yields significant gains for the overall detection and outperforms the state-of-the-art results on DOTA and DIOR.

Tags:

object detection

few-shot learning

attention mechanisms

aerial images

CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME

Pierre Le Jeune, Anissa Mokraoui

More Like This

KEYNOTE: Keras, A shortcut to master AI

KEYNOTE: Learning from Data in post-Foundation Models Era: bringing learning and reasoning together

AN L2-NORMALIZED SPATIAL ATTENTION NETWORK FOR ACCURATE AND FAST CLASSIFICATION OF BRAIN TUMORS IN 2D T1-WEIGHTED CE-MRI IMAGES

Join an IEEE Society