IJML 2024 Vol.14(2): 43-47
DOI: 10.18178/ijml.2024.14.2.1156

Automatic Speech Recognition Variance: Consecutive Runs of Low-Resource Languages in Whisper

Laurel Lord* and Mark Newman
Department of Data Sciences, Harrisburg University of Science and Technology, Harrisburg, USA
Email: lalord@my.harrisburgu.edu (L.L.); mnewman@harrisburgu.edu (M.N.)
*Corresponding author

Manuscript received September 15, 2023; revised November 26, 2023; accepted January 19, 2024; published April 26, 2024

Abstract—This study uses OpenAI’s Whisper to explore how variance manifests in an Automatic Speech Recognition (ASR) system. Three languages on which Whisper is currently trained (English, French, and Haitian Kreyòl) and one on which it is not (Saint Lucian Kwéyòl) were each put through thirty consecutive runs, across five model sizes. Etymologically complex yet orthographically simple, mutually intelligible languages may challenge ASR system capabilities. However, the model for a phonetically similar trained language generated approximate phonetic transcripts for the untrained one. Despite implicit sources of variance such as non-determinism and data deficiencies, ASR systems may aid in documenting high-orality, low-resource languages.

Keywords—automatic speech recognition, creole, low-resource languages, Whisper
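
The repeated-run design described in the abstract can be illustrated with a short Python sketch. This is not the authors' code: `transcribe` is a hypothetical callable standing in for whichever ASR call is used, and the mean pairwise word-level edit distance is an assumed measure of run-to-run variance that the abstract does not specify.

```python
from itertools import combinations
from typing import Callable, List


def word_edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two word sequences."""
    prev = list(range(len(b) + 1))
    for i, wa in enumerate(a, start=1):
        curr = [i]
        for j, wb in enumerate(b, start=1):
            cost = 0 if wa == wb else 1
            curr.append(min(prev[j] + 1,          # delete a word from a
                            curr[j - 1] + 1,      # insert a word from b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def run_variance(transcribe: Callable[[str], str],
                 audio_path: str,
                 runs: int = 30) -> dict:
    """Transcribe the same audio `runs` times and summarize disagreement.

    `transcribe` is a hypothetical stand-in for an ASR call; the metric
    (mean normalized pairwise edit distance) is an assumption.
    """
    transcripts = [transcribe(audio_path) for _ in range(runs)]
    tokens = [t.lower().split() for t in transcripts]
    distances = []
    for x, y in combinations(tokens, 2):
        longest = max(len(x), len(y)) or 1  # avoid division by zero
        distances.append(word_edit_distance(x, y) / longest)
    mean = sum(distances) / len(distances) if distances else 0.0
    return {
        "unique_transcripts": len(set(transcripts)),
        "mean_pairwise_distance": mean,
    }
```

With the open-source `whisper` package, `transcribe` could wrap a loaded model, for example `lambda path: model.transcribe(path)["text"]`, and the function could be called once per model size and language. A mean distance of 0.0 would indicate identical output on every run.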


Cite: Laurel Lord and Mark Newman, "Automatic Speech Recognition Variance: Consecutive Runs of Low-Resource Languages in Whisper," International Journal of Machine Learning, vol. 14, no. 2, pp. 43-47, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

General Information

  • E-ISSN: 2972-368X
  • Abbreviated Title: Int. J. Mach. Learn.
  • Frequency: Quarterly
  • DOI: 10.18178/IJML
  • Editor-in-Chief: Dr. Lin Huang
  • Executive Editor: Ms. Cherry L. Chen
  • Abstracting/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
  • E-mail: ijml@ejournal.net

