Confidence Estimation For Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

Alexandros Kastanos, Anton Ragni, Mark Gales

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 14:37

04 May 2020

Recently, there has been growth in providers of speech transcription services enabling others to leverage technology they would not normally be able to use. As a result, speech-enabled solutions have become commonplace. Their success critically relies on the quality, accuracy, and reliability of the underlying speech transcription systems. Those black box systems, however, offer limited means for quality control as only word sequences are typically available. This paper examines this limited resource scenario for confidence estimation, a measure commonly used to assess transcription reliability. In particular, it explores what other sources of word and sub-word level information available in the transcription process could be used to improve confidence scores. To encode all such information this paper extends lattice recurrent neural networks to handle sub-words. Experimental results using the IARPA OpenKWS 2016 evaluation system show that the use of additional information yields significant gains in confidence estimation accuracy. The implementation for this model can be found online.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Confidence Estimation For Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

Alexandros Kastanos, Anton Ragni, Mark Gales

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join an IEEE Society