User Acceptance of Automatic Speech Recognition in University Oral English Instruction: An Analysis from the Perspective of Learners’ Learning Styles
DOI:
https://doi.org/10.53469/jssh.2025.7(03).13Keywords:
Automatic Speech Recognition (ASR), User acceptance, Learning styles, Oral English instructionAbstract
Situated in an era when artificial intelligence has been applied across a wide range of fields, the domain of foreign language education has also gradually integrated new technologies. One representative technology applied in this field is Automatic Speech Recognition (ASR). The present study, based on the Unified Theory of Acceptance and Use of Technology (UTAUT) model, designs a user acceptance questionnaire for ASR and aims to explore the impact of learning styles on students’ acceptance levels. The results demonstrate that visual, introverted, leveler and deductive learning styles are positively correlated with user acceptance of ASR; kinesthetic and sharpener learning styles are negatively linked to the user acceptance; while auditory and extraverted learning styles do not have significant impact on user acceptance of ASR. The results of this study provide important theoretical foundations and practical guidance for the application of ASR in foreign language education, particularly oral English instruction.
References
Bashori, M., van Hout, R., Strik, H., & Cucchiarini, C. (2021). Effects of ASR-based websites on EFL learners’ vocabulary, speaking anxiety, and language enjoyment. System, 99, 102496. https://doi.org/10.1016/j.system.2021.102496
Brown, S. A., & Venkatesh, V. (2005). Model of adoption of technology in households: A baseline model test and extension incorporating household life cycle. MIS Quarterly, 29(3), 399–426. https://doi.org/10.2307/25148690
Cavus, N., & Ibrahim, D. (2017). Learning English using children’s stories in mobile devices. British Journal of Educational Technology, 48(2), 625–641. https://doi.org/10.1111/bjet.12427
Chen, C.H., Koong, C.S., & Liao, C. (2022). Influences of integrating dynamic assessment into a speech recognition learning design to support students’ English speaking skills, learning anxiety, and cognitive load. Educational Technology & Society, 25(1), 1–14. https://doi.org/10.30191/ETS.202201_25(1).0001
Chiu, T. L., Liou, H. C., & Yeh, Y. (2007). A Study of web-based oral activities enhanced by Automatic Speech Recognition for EFL college learning. Computer Assisted Language Learning, 20(3), 209–233. https://doi.org/10.1080/09588220701489374
Cohen, A., Oxford, R. L., & Chi, J. C. (2001). Learning style survey: Assessing your own learning styles. Center for Advanced Research on Language Acquisition, University of Minnesota.
Dandu, G., Charyulu, G., Kumari, K., Dandu, G., Charyulu, G. M., & Kumari, K. L. (2024). AI-driven language learning: The impact of Rosetta Stone on ESL students’ speaking proficiency and self-control. Rupkatha Journal, 16(4). https://doi.org/10.21659/rupkatha.v16n4.09
de Vries, B. P., Cucchiarini, C., Bodnar, S., Strik, H., & van Hout, R. (2015). Spoken grammar practice and feedback in an ASR-based CALL system. Computer Assisted Language Learning, 28(6), 550–576. https://doi.org/10.1080/09588221.2014.889713
Elimat, A. K., & AbuSeileek, A. F. (2014). Automatic speech recognition technology as an effective means for teaching pronunciation. JALT CALL Journal, 10(1), 21–47. https://doi.org/10.29140/jaltcall.v10n1.166
Evers, K., & Chen, S. (2021). Effects of automatic speech recognition software on pronunciation for adults with different learning styles. Journal of Educational Computing Research, 59(4), 669–685. https://doi.org/10.1177/0735633120972011
Griffiths, C., & Soruç, A. (2020). Individual differences in language learning: A complex systems theory perspective. Palgrave Macmillan. https://doi.org/10.1007/978-3-030-52900-0
Hamuddin, B., & Dahler. (2018). Blogs as Powerful Learning Tools: The Perception from EFL Students in Riau Main Island Indonesia. IOP Conference Series: Earth and Environmental Science, 156(1), 12060. https://doi.org/10.1088/1755-1315/156/1/012060
Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A. R., Jaitly, N., ... & Kingsbury, B. (2012). Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Processing Magazine, 29(6), 82–97. https://doi.org/10.1109/MSP.2012.2205597
Jiang, M. Y. C., Jong, M. S. Y., Lau, W. W. F., Chai, C. S., & Wu, N. (2021). Using automatic speech recognition technology to enhance EFL learners’ oral language complexity in a flipped classroom. Australasian journal of educational technology, 37(2), 110–131. https://doi.org/10.14742/ajet.6798
Liu, X., Xu, M., Li, M., Han, M., Chen, Z., Mo, Y., Chen, X., & Liu, M. (2019). Improving English pronunciation via automatic speech recognition technology. International Journal of Innovation and Learning, 25(2), 126–140. https://doi.org/10.1504/IJIL.2019.097674
McCrocklin, S. (2016). Pronunciation learner autonomy: The potential of automatic speech recognition. System, 57, 25–42. https://doi.org/10.1016/j.system.2015.12.013
McCrocklin, S. (2019). Learners’ feedback regarding ASR-based dictation practice for pronunciation learning. CALICO Journal, 36(2), 119–137. https://doi.org/10.1558/cj.34738
Michael, C. (2017). Automated Speech Recognition in language learning: Potential models, benefits and impact[J]. Training, Language and Culture, 1(1), 46–61.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv.Org. https://doi.org/10.48550/arxiv.1301.3781
Nguyen, T.-H., Hwang, W.-Y., Pham, X.-L., & Ma, Z.-H. (2018). User-oriented EFL speaking through application and exercise: Instant speech translation and shadowing in authentic context. Educational Technology & Society, 21(4), 129–142.
Rawas, S. (2024). AI: the future of humanity. Discover Artificial Intelligence, 4(1), 25–14. https://doi.org/10.1007/s44163-024-00118-3
Shadiev, R., Wu, T.-T., Sun, A., & Huang, Y.-M. (2018). Applications of speech-to-text recognition and computer-aided translation for facilitating cross-cultural learning through a learning activity: Issues and their solutions. Educational Technology Research and Development, 66(1), 191–214. https://doi.org/10.1007/s11423-017-9556-8
Tsai, S.-C. (2023). Learning with mobile augmented reality- and automatic speech recognition-based materials for English listening and speaking skills: Effectiveness and perceptions of non-English major English as a foreign language students. Journal of Educational Computing Research, 61(2), 444–465. https://doi.org/10.1177/07356331221111203
Venkatesh, V., Morris, M. G., Davis, G. B., & Davis, F. D. (2003). User acceptance of information technology: Toward a unified view. MIS Quarterly, 27(3), 425–478. https://doi.org/10.2307/30036540
Venkatesh, V., Thong, J. Y. L., & Xu, X. (2012). Consumer acceptance and use of information technology: Extending the unified theory of acceptance and use of technology. MIS Quarterly, 36(1), 157–178. https://doi.org/10.2307/41410412
Yaniafari, R. P., Olivia, V., & S. (2022). The potential of ASR for improving English pronunciation: A review. KnE Social Sciences, 7(7), 281–289. https://doi.org/10.18502/kss.v7i7.10670
Young, S., Evermann, G., Kershaw, D., Gales, M., Odell, J., Ollason, D., ... & Woodland, P. (2002). The HTK book (for HTK version 3.2). Cambridge University Engineering Department.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Yuan Fan

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.