Incorporating Written Domain Numeric Grammars Into End-To-End Contextual Speech Recognition Systems For Improved Recognition Of Numeric Sequences

Ben Haynor, Petar Aleksic

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 12:10

04 May 2020

Accurate recognition of numeric sequences is crucial for many contextual speech recognition applications. For example, a user might create a calendar event and be prompted by a virtual assistant for the time, date, and duration of the event. We propose a modular and scalable solution for improved recognition of numeric sequences. We use finite state transducers built from written domain numeric grammars to increase the likelihood of hypotheses containing matching numeric entities during beam search in an end-to-end speech recognition system. Using our technique results in relative reduction in word error rate of up to 59\% on a variety of numeric sequence recognition tasks (times, percentages, digit sequences, ...).

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Incorporating Written Domain Numeric Grammars Into End-To-End Contextual Speech Recognition Systems For Improved Recognition Of Numeric Sequences

Ben Haynor, Petar Aleksic

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join an IEEE Society