A Robin Hood approach to forced alignment: English-trained algorithms and their use on Australian languages

Authors

  • Sarah Babinski Yale University
  • Rikker Dockum Yale University
  • J. Hunter Craft Yale University
  • Anelisa Fergus Yale University
  • Dolly Goldenberg Yale University
  • Claire Bowern Yale University

DOI:

https://doi.org/10.3765/plsa.v4i1.4468

Keywords:

Australian languages, phonetics, forced alignment, language documentation, Yidiny

Abstract

Forced alignment automatically aligns audio recordings of spoken language with transcripts at the segment level, greatly reducing the time required to prepare data for phonetic analysis. However, existing algorithms are mostly trained on a few well-documented languages. We test the performance of three algorithms against manually aligned data. For at least some tasks, unsupervised alignment (either based on English or trained from a small corpus) is sufficiently reliable for it to be used on legacy data for low-resource languages. Descriptive phonetic work on vowel inventories and prosody can be accurately captured by automatic alignment with minimal training data. Consonants provided significantly more challenges for forced alignment.

Downloads

Published

2019-03-15

How to Cite

Babinski, Sarah, Rikker Dockum, J. Hunter Craft, Anelisa Fergus, Dolly Goldenberg, and Claire Bowern. 2019. “A Robin Hood Approach to Forced Alignment: English-Trained Algorithms and Their Use on Australian Languages”. Proceedings of the Linguistic Society of America 4 (1): 3:1–12. https://doi.org/10.3765/plsa.v4i1.4468.