12 nov 2021

IIT Madras, 2AI4Bharat, 3Microsoft, 4RBCDSAI

In this paper, the authors study if the wav2vec style pre-training transfers to Indic languages (yes, it does). To answer the question the authors curate 17,314 hours of raw audio data for pre-training across 40 languages from 4 language families. The authors do ablation studies on pre-training corpus, fine-tuning data, and task-specific language information.

Step 1: Curated 17,314 hours of raw audio data for pre-training across 40 languages.

Step 2: IndicWav2Vec: A multilingual ASR model for Indian Languages

Step 3: Results and discussion.