High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification

Xue Bai, Jun Du, Jia Pan, Heng-Shun Zhou, Chin-Hui Lee, Yan-Hui Tu

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 15:40

04 May 2020

The spectral information of acoustic scenes is diverse and complex, which poses challenges for acoustic scene tasks. To improve the classification performance, a variety of convolutional neural networks (CNNs) are proposed to extract richer semantic information of scene utterances. However, the different regions of the features extracted from CNN-based encoder have different importance. In this paper, we propose a novel strategy for acoustic scene classification, namely high-resolution attention network with acoustic segment model (HRAN-ASM). In this approach, we utilize fully CNN to obtain high-level semantic information and then adopt two-stage attention strategy to select the relevant acoustic scene segments. Besides, the acoustic segment model (ASM) proposed in our recent work provides embedding vectors for this attention mechanism. The performance is evaluated on DCASE 2018 Task 1a, showing 70.5% good classification accuracy under single system and no data expansion, which is superior to CNN-based self-attention mechanism and highly competitive.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification

Xue Bai, Jun Du, Jia Pan, Heng-Shun Zhou, Chin-Hui Lee, Yan-Hui Tu

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join an IEEE Society