  • SPS
    Members: Free
    IEEE Members: $11.00
    Non-members: $15.00
    Length: 00:11:26
13 May 2022

Self-supervised model pretraining has recently garnered significant interest. However, using additional resources in fine-tuning these models has received less attention. We demonstrate how universal phoneset acoustic models can leverage cross-lingual supervision to improve transfer of pretrained self-supervised representations to new languages. We also show how target-language text can be used to enable and improve fine-tuning with the lattice-free maximum mutual information (LF-MMI) objective. In three low-resource languages, these techniques greatly improved few-shot learning performance.
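The MMI criterion mentioned above maximizes the log-likelihood of the reference sequence relative to the summed likelihoods of all competing sequences. A minimal numeric sketch of that criterion (not the authors' implementation; real LF-MMI computes the denominator efficiently over a phone-level graph, whereas here the competing paths and all scores are simply hypothetical enumerated values):

```python
import numpy as np

def logsumexp(xs):
    # numerically stable log(sum(exp(xs)))
    m = np.max(xs)
    return m + np.log(np.sum(np.exp(xs - m)))

def mmi_objective(num_logprob, den_logprobs):
    # MMI: log p(reference path) minus log of the total probability
    # mass assigned to all competing (denominator) paths
    return num_logprob - logsumexp(den_logprobs)

# toy log-scores: one reference path and three competing hypotheses
num = -10.0
dens = np.array([-10.0, -12.5, -14.0])  # denominator includes the reference
obj = mmi_objective(num, dens)
# obj approaches 0 as the reference path dominates the competitors
```

Maximizing this quantity pushes probability mass toward the reference transcription and away from competing hypotheses, which is why fine-tuning with it benefits from target-language text to build the supervision graphs.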
