Asr-dictation on smartphones for vowel pronunciation practice
Main Article Content
Abstract
This study aims to explore mobile-assisted Automated Speech Recognition (ASR) dictation systems for vowel pronunciation practice by examining whether ARS can be useful for pronunciation improvement and speech recognition accuracy. Additionally, learners’ attitudes towards using these systems were explored. Twenty-one Macedonian EFL learners practiced pronouncing 26 words with the following minimal pairs: /i/, /ɪ/; /æ/, /ɛ/; /u/, /ʊ/; /ɑ/, /ʌ/. The participants were divided into an experimental group (n=11) and a control group (n=10). This study used a mixed methods approach including qualitative and quantitative analysis. Results demonstrated that while the control group did not show any improvement, the experimental group improved their accuracy. ASR written output and human judgment was also found to be within an acceptable agreement for most vowels. Furthermore, while occasional inaccurate feedback sometimes caused frustration, ASR training was generally enjoyed and considered as a practical and safe environment for practice. The findings provide some support for the use of ASR in EFL classrooms with careful planning and direction from the teacher. Using ASR as a tool for controlled and structured practice with individual words is particularly applicable when the focus is to raise learners’ phonological awareness and perception of English vowel sounds.
Downloads
Article Details
Copyright © 2014 Blaže Koneski Faculty of Philology, Skopje
Journal of Contemporary Philology (JCP)
Современа филологија
References
Chen, H. H.-J. (2011). Developing and evaluating an oral skills training website supported by automatic speech recognition technology. ReCALL, 23(1): 59–78.
Coniam, D. (1999). Voice recognition software accuracy with second language speakers of English. SYSTEM, 27(1): 49-64.
Cucchiarini, C., Neri, A., and Strik, H. (2009). Oral proficiency training in Dutch L2: The contribution of ASR-based corrective feedback. Speech Communication, 51(10), 853-863.
Cucchiarini, C., Strik, H. and Boves, L. (2000). Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithm. Speech Communication, 30(2–3): 109–119.
Dodd, S., and Mills, J. (1996). Phonetics and phonology. Solving language problems: from general to applied linguistics, 13–33. CORE: University of Exeter Press.
Derwing, T. M., Munro, M. J., and Carbonaro, M. (2000). Does popular speech recognition software work with ESL speech? TESOL Quarterly, (34): 592–603.
Ehsani, F., and Knodt, E. (1998). Speech technology in computer-aided language learning: Strengths and limitations of a new CALL paradigm. Language Learning and Technology, 2(1): 54–73.
Eskenazi, M. (1999). Using a computer in foreign language pronunciation training: What advantages? CALICO Journal, (16): 447–469.
Flege, J. E. (1995). Second-language speech learning: Theory, findings, and problems. In Strange, W. (ed.). Speech perception and linguistic experience. Issues in cross-linguistic research, 233-277. Timonium, MD: York Press.
Flege, J. E., (2007). Language contact in biliangualism: Phonetic system interactions. Labarotory Phonology, (9): 353-382.
Kirkova-Naskova, A. (2012). Interlanguage phonology: comparison between the English and the Macedonian vowel systems. In Annual Symposium of the Faculty of Philology ‘Blaze Koneski (38): 141-152 [original] Киркова-Наскова, А. (2012). Меѓујазична фонологија: споредба на вокалните системи на англискиот и на македонскиот јазик. Во: Годишен зборник на Филолошкиот факултет „Блаже Конески“, кн. 38, 141–152.
Kirkova-Naskova, A. (2010). Native speaker perceptions of accented speech: The English pronunciation of Macedonian EFL learners. Research in Language, (8): 1-21.
Levis, J., and Grant, L. (2003). Integrating pronunciation into ESL/EFL classrooms. TESOL Journal, 12(2): 13-19.
Levis, J., and Suvorov, R. (2013). Automatic speech recognition. In C. Chapelle (ed.). The encyclopedia of applied linguistics, 1–8. Hoboken, NJ: Blackwell Publishing.
Liakin, D., Cardoso, W., and Liakina, N. (2014). Learning L2 pronunciation with a mobile speech recognizer: French /y/. CALICO Journal, 32(1): 1-25.
McCrocklin, S. M. (2019). ASR-based dictation practice for second language pronunciation improvement. Journal of Second Language Pronunciation, 5(1): 98-118.
McCrocklin, S. M. (2016). Pronunciation learner autonomy: The potential of Automatic Speech Recognition. System, (57): 25–42.
Mroz, A. (2018). Seeing how people hear you: French learners experiencing intelligibility through automatic speech recognition. Foreign Language Annals, 51(3), 617-637.
Neri, A., Cucchiarini, C., and Strik, H. (2006). Selecting segmental errors in L2 Dutch for optimal pronunciation training. International Review of Applied Linguistics, (44): 357–404.
Neri, A., Cucchiarini, C., and Strik, H. (2008). The effectiveness of computer-based speech corrective feedback for improving segmental quality in L2 Dutch. ReCALL: the Journal of EUROCALL, 20(2): 225.
Neri, A., Cucchiarini, C., Strik, H.Boves, L. (2002). The pedagogy-technology interface in Computer Assisted Pronunciation Training. Computer Assisted Language Learning, 15(5): 441–467.
Swain, M. (1985). Communicative competence: Some roles of comprehensible input and comprehensible output in its development. In S. Gass and C. Madden (eds.). Input in second language acquisition, 235–253. Rowley, MA: Newbury House.
Victori, M., and Lockhart, W. (1995). Enhancing metacognition in self-directed language learning. System, 23(2): 223-234.
McCrocklin, S. M. (2014). The potential of Automatic Speech Recognition for fostering pronunciation learners’ autonomy. Graduate Theses and Dissertations. 13902.
https://lib.dr.iastate.edu/etd/13902
Thomas, D.R. (2003). A general inductive approach for qualitative data analysis. [Online] Available from: www.health.auckland.ac.nz/hrmas/Inductive2003.pdf . [Accessed: Маy 5th, 2020]