Real-Time ASR Transcription as Cognitive Scaffolding: Enhancing Intelligibility of Indonesian-Accented English in ELF Communication
DOI:
https://doi.org/10.56393/didactica.v6i1.3820
Keywords:
Automatic Speech Recognition, Intelligibility, Indonesian-accented English, English as a Lingua Franca, Scaffolding
Abstract
English as a Lingua Franca (ELF) communication prioritizes intelligibility over native-like accuracy among speakers with diverse linguistic backgrounds. This mixed-methods study examines whether real-time automatic speech recognition (ASR) transcription enhances the intelligibility of Indonesian-accented English (IAE) in ELF contexts. Data were collected from students at Universitas PGRI Kanjuruhan Malang through pre- and post-intelligibility transcription tasks, completed with and without Google Live Transcribe, as well as questionnaires and semi-structured interviews. Quantitative results revealed a statistically significant improvement in listener intelligibility when ASR support was available, particularly for low-frequency vocabulary, technical terms, and sentence-final elements, alongside reduced performance variability. Qualitative findings indicated that users perceived real-time transcription as accessible, user-friendly, and supportive of comprehension and communicative confidence, despite occasional transcription errors. Overall, the findings suggest that real-time ASR transcription functions as cognitive scaffolding that mitigates accent-related processing challenges in ELF communication. The study contributes to the ELF and CALL literature by providing empirical evidence that ASR-mediated interaction supports listener intelligibility in multilingual English use.

