Real-Time ASR Transcription as Cognitive Scaffolding: Enhancing Intelligibility of Indonesian-Accented English in ELF Communication

Authors

DOI:

https://doi.org/10.56393/didactica.v6i1.3820

Keywords:

Automatic Speech Recognition, Intelligibility, Indonesian-accented English, English as a Lingua Franca, Scaffolding

Abstract

English as a Lingua Franca (ELF) communication prioritizes intelligibility over native-like accuracy among speakers with diverse linguistic backgrounds. This mixed-methods study examines whether real-time automatic speech recognition (ASR) transcription enhances the intelligibility of Indonesian-accented English (IAE) in ELF contexts. Data were collected from Universitas PGRI Kanjuruhan Malang students through pre- and post-intelligibility transcription tasks, completed with and without Google Live Transcribe, as well as questionnaires and semi-structured interviews. Quantitative results revealed a statistically significant improvement in listener intelligibility when ASR support was available, particularly for low-frequency vocabulary, technical terms, and sentence-final elements, alongside reduced performance variability. Qualitative findings indicated positive user perceptions of real-time transcription as accessible, user-friendly, and supportive of comprehension and communicative confidence, despite occasional transcription errors. Overall, the findings suggest that real-time ASR transcription functions as cognitive scaffolding that mitigates accent-related processing challenges in ELF communication. This study contributes to ELF and CALL literature by providing empirical evidence that ASR-mediated interaction functions as cognitive scaffolding, supporting listener intelligibility in multilingual English use, particularly in the processing of low-frequency vocabulary, technical terms, and sentence-final elements.

Downloads

Download data is not yet available.

Author Biographies

Andy Andy, Universitas PGRI Kanjuruhan Malang

Universitas PGRI Kanjuruhan Malang

Mallaury Milliard, Université Catholique de Lille

Université Catholique de Lille, France

References

Alhalangy, A., & AbdAlgane, M. (2023). Exploring the impact of AI on the EFL context: A case study of Saudi universities. Journal of Intercultural Communication, 23(2), 41–49. https://doi.org/10.36923/jicc.v23i2.125

Ardini, S. N. (2024). An acoustic study of Jonglish community: Javanese-accented speech. Forum for Linguistic Studies, 6(2), 1167. https://doi.org/10.59400/fls.v6i2.1167

Barrot, J. S. (2023). Using ChatGPT for second language writing: Pitfalls and potentials. Assessing Writing, 57, 100745. https://doi.org/10.1016/j.asw.2023.100745

Bashori, M., van Hout, R., Strik, H., & Cucchiarini, C. (2024). I can speak: Improving English pronunciation through automatic speech recognition-based language learning systems. Innovation in Language Learning and Teaching, 18(5), 443–461. https://doi.org/10.1080/17501229.2024.2315101

Cogo, A., Fang, F., Kordia, S., Sifakis, N., & Siqueira, S. (2021). Developing ELF research for critical language education. AILA Review, 34(2), 187–211. https://doi.org/10.1075/aila.21007.cog

Dizon, G. (2023). Affordances and constraints of intelligent personal assistants for second-language learning. RELC Journal, 54(3), 848–855. https://doi.org/10.1177/00336882211020548

Farida, F., Supardi, S., Abduh, A., Muchtar, J., Rosmaladewi, R., & Arham, M. (2024). Technology and hybrid multimedia for language learning and cross-cultural communication in higher education. ASEAN Journal of Science and Engineering, 4(2), 331–348. https://doi.org/10.17509/ajse.v4i2.72609

Fiedler, S. (2022). English as a lingua franca and linguistic justice: Insights from exchange students’ experiences. International Journal of the Sociology of Language, 2022(277), 17–32. https://doi.org/10.1515/ijsl-2021-0075

Foster, M., & Welsh, A. (2021). English usage in the linguistic landscape of Balikpapan’s main thoroughfares. Indonesia and the Malay World, 49(145), 448–469. https://doi.org/10.1080/13639811.2021.1959162

Giles, H., Edwards, A. L., & Walther, J. B. (2023). Communication accommodation theory: Past accomplishments, current trends, and future prospects. Language Sciences, 99, 101571. https://doi.org/10.1016/j.langsci.2023.101571

Goudarzi, A., & Moya-Galé, G. (2021). Automatic speech recognition in noise for Parkinson's disease: A pilot study. Frontiers in Artificial Intelligence, 4, 809321. https://doi.org/10.3389/frai.2021.809321

Gottardi, W., Almeida, J. F. D., & Tumolo, C. H. S. (2022). Tecnologias de reconhecimento automático da fala e texto-fala para o aprimoramento da pronúncia em L2: Reflexões das suas aplicabilidades. Texto Livre, 15, e36736. https://doi.org/10.35699/1983-3652.2022.36736

Hannah, L., Kim, H., & Jang, E. E. (2022). Investigating the effects of task type and linguistic background on accuracy in automated speech recognition systems: Implications for use in language assessment of young learners. Language Assessment Quarterly, 19(3), 289–313. https://doi.org/10.1080/15434303.2022.2038172

Inceoglu, S., Chen, W. H., & Lim, H. (2023). Assessment of L2 intelligibility: Comparing L1 listeners and automatic speech recognition. ReCALL, 35(1), 89–104. https://doi.org/10.1017/S0958344022000192

Jiang, M. Y. C., Jong, M. S. Y., Wu, N., Shen, B., Chai, C. S., Lau, W. W. F., & Huang, B. (2022). Integrating automatic speech recognition technology into vocabulary learning in a flipped English class for Chinese college students. Frontiers in Psychology, 13, 902429. https://doi.org/10.3389/fpsyg.2022.902429

Johnson, C., & Cardoso, W. (2024). Hey Google, let’s write. CALICO Journal, 41(2), 122–145. https://doi.org/10.1558/cj.22431

Kim, S. (2024). English as a lingua franca in Japan: Multilingual postgraduate students’ attitudes towards English accents. Journal of Multilingual and Multicultural Development, 45(2), 536–550. https://doi.org/10.1080/01434632.2021.1909053

Kusumaningputri, R. (2024). Negotiating voices in English as a lingua franca: Indonesian multilingual identity in English digital interactions. Journal of Multilingual and Multicultural Development, 45(10), 4554–4571. https://doi.org/10.1080/01434632.2023.2173758

Lai, K. W. K., & Chen, H. J. H. (2024). An exploratory study on the accuracy of three speech recognition software programs for young Taiwanese EFL learners. Interactive Learning Environments, 32(5), 1582–1596. https://doi.org/10.1080/10494820.2022.2122511

Lee, S., Jeon, J., & Choe, H. (2025). Enhancing pre-service teachers' Global Englishes awareness with technology: A focus on AI chatbots in 3D metaverse environments. TESOL Quarterly, 59(1), 49–74. https://doi.org/10.1002/tesq.3300

Leong, H. J., Badiozaman, I. F., & Yap, A. (2023). Negotiating the challenges in speaking English for Indonesian undergraduate students in an ESL university. Studies in English Language and Education, 10(2), 822–840. https://doi.org/10.24815/siele.v10i2.26563

Lim, S. M. (2023). ELF teacher talk: Examining speech modification in Japanese classrooms. Asian Englishes, 25(3), 468–484. https://doi.org/10.1080/13488678.2022.2080425

Liu, W., & Wang, Y. (2024). The effects of using AI tools on critical thinking in English literature classes among EFL learners: An intervention study. European Journal of Education, 59(4), e12804. https://doi.org/10.1111/ejed.12804

Ngo, T. T. A. (2023). The perception by university students of the use of ChatGPT in education. International Journal of Emerging Technologies in Learning, 18(17), 4. https://doi.org/10.3991/ijet.v18i17.39019

Ngo, T. T. N., Chen, H. H. J., & Lai, K. K. W. (2024). The effectiveness of automatic speech recognition in ESL/EFL pronunciation: A meta-analysis. ReCALL, 36(1), 4–21. https://doi.org/10.1017/S0958344023000113

Octaberlina, L. R., Afif, I. M., & Rofiki, I. (2022). An investigation on the speaking constraints and strategies used by college students studying English as EFL learners. International Journal of Learning, Teaching and Educational Research, 21(9), 232–249. https://doi.org/10.26803/ijlter.21.9.14

O’Neal, G. (2021). What is the effect of successive segmental repair on the mutual intelligibility of ELF users? System, 103, 102683. https://doi.org/10.1016/j.system.2021.102683

Purwati, A. A., Hamzah, Z., Hamzah, M. L., & Deli, M. M. (2023). Digital and entrepreneurial literacy in increasing students' entrepreneurial interest in the technological era. In International Conference on Business Management and Accounting (Vol. 2, No. 1, pp. 34–43). https://doi.org/10.35145/icobima.v2i1.3498

Radzikowski, K., Wang, L., Yoshie, O., & Nowak, R. (2021). Accent modification for speech recognition of non-native speakers using neural style transfer. EURASIP Journal on Audio, Speech, and Music Processing, 2021(1), 11. https://doi.org/10.1186/s13636-021-00199-3

Sari, D. K., Amelia, R., Dharmajaya, R., Sari, L. M., & Fitri, N. K. (2021). Positive correlation between general public knowledge and attitudes regarding COVID-19 outbreak 1 month after first cases reported in Indonesia. Journal of Community Health, 46(1), 182–189. https://doi.org/10.1007/s10900-020-00866-0

Syam, A. R., Gardner, S., & Cribb, M. (2024). Pronunciation features of Indonesian-accented English. Languages, 9(6), 222. https://doi.org/10.3390/languages9060222

Sun, W. (2023). The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: A mixed methods investigation. Frontiers in Psychology, 14, 1210187. https://doi.org/10.3389/fpsyg.2023.1210187

Tsai, S. C. (2023). Learning with mobile augmented reality and automatic speech recognition-based materials for English listening and speaking skills: Effectiveness and perceptions of non-English major EFL students. Journal of Educational Computing Research, 61(2), 444–465. https://doi.org/10.1177/07356331221111203

Thir, V. (2023). Co-text, context, and listening proficiency as crucial variables in intelligibility among nonnative users of English. Studies in Second Language Acquisition, 45(5), 1210–1231. https://doi.org/10.1017/S0272263123000207

Wulandari, B. A., Piscioneri, M., & Ikram, W. (2021). Examining students' challenges in oracy in academic context classes. International Journal of Language Education, 5(1), 598–615. https://doi.org/10.26858/ijole.v5i1.16002

Xiao, W., & Park, M. (2021). Using automatic speech recognition to facilitate English pronunciation assessment and learning in an EFL context: Pronunciation error diagnosis and pedagogical implications. International Journal of Computer-Assisted Language Learning and Teaching, 11(3), 74–91. https://doi.org/10.4018/IJCALLT.2021070105

Zhao, X. (2023). Leveraging artificial intelligence (AI) technology for English writing: Introducing Wordtune as a digital writing assistant for EFL writers. RELC Journal, 54(3), 890–894. https://doi.org/10.1177/00336882221094089

Downloads

Published

2026-01-24

How to Cite

Andy, A., Muzammil, L., & Milliard, M. (2026). Real-Time ASR Transcription as Cognitive Scaffolding: Enhancing Intelligibility of Indonesian-Accented English in ELF Communication. Didactica : Jurnal Kajian Pendidikan Dan Pembelajaran, 6(1), 1–11. https://doi.org/10.56393/didactica.v6i1.3820