Deep Reinforcement Learning and BigBird-BiLSTM Models in Automated Essay Scoring: An Exploratory Study

Authors

  • Gopaldas Harinath King Abdul-Aziz University, Jeddah, KSA
  • Venkateswar Rao King Abdul-Aziz University, Jeddah, KSA

DOI:

https://doi.org/10.53469/jrve.2025.7(02).02

Keywords:

Automated essay grading, deep reinforcement learning, BigBird-BiLSTM, semantic features, evaluation metrics

Abstract

This paper evaluates the potential of Deep Reinforcement Learning (DRL) and BigBird-BiLSTM models in enhancing Automated Essay Grading (AEG) systems. Leveraging the Hewlett dataset, the study examines how these models handle semantic features and scalability challenges compared to existing frameworks. Evaluation metrics such as Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R-squared (R2) highlight the strengths and limitations of each model.

References

Page, Ellis B. "Project Essay Grade: PEG." Handbook of Writing Research, 1968.

Diederich, P. B., et al. "The Nature and Growth of Predictive Judgments in Scoring Essays." ETS Research, 1961.

Shermis, M. D., & Burstein, J. "Automated Essay Scoring: A Cross-Disciplinary Perspective." Routledge, 2003.

Attali, Y., & Burstein, J. "Automated Essay Scoring with e-rater." ETS, 2006.

Haendchen Filho, A., et al. "Using Support Vector Machines for Automated Essay Scoring." ICML, 2018.

Larkey, L. S. "Automatic Essay Grading Using Text Categorization Techniques." Proceedings of SIGIR, 1998.

Hinton, G., et al. "Learning Representations by Back- Propagating Errors." Nature, 1986.

Schuster, M., & Paliwal, K. K. "Bidirectional Recurrent Neural Networks." IEEE Transactions, 1997.

Hochreiter, S., & Schmidhuber, J. "Long Short-Term Memory." Neural Computation, 1997.

Mikolov, T., et al. "Word2Vec: Efficient Estimation of Word Representations." Google AI, 2013.

Pennington, J., et al. "GloVe: Global Vectors for Word Representation." EMNLP, 2014.

Devlin, J., et al. "BERT: Pre-training of Deep Bidirectional Transformers." NAACL-HLT, 2019.

Vaswani, A., et al. "Attention Is All You Need." NeurIPS, 2017.

Zaheer, M., et al. "BigBird: Transformers for Longer Sequences." NeurIPS, 2020.

Beltagy, I., et al. "Longformer: The Long-Document Transformer." arXiv preprint, 2020.

Zhang, H., & Litman, D. "Improving AEG with BiLSTM and Co-Attention Layers." IEEE TLT, 2020.

Liu, Y., et al. "Multi-way Attention for AEG." ACL, 2019.

Mnih, V., et al. "Human-Level Control through Deep Reinforcement Learning." Nature, 2015.

Silver, D., et al. "Mastering the Game of Go with Deep Neural Networks and Tree Search." Nature, 2016.

Sutton, R. S., & Barto, A. G. "Reinforcement Learning: An Introduction." MIT Press, 1998.

Hasanah, U., et al. "Domain-Specific Adaptation for AEG." IJCAI, 2020.

Chen, Z., et al. "Cross-Domain AEG with LSTM." EMNLP, 2021.

Ridley, R., et al. "Multi-Task Learning for AEG." COLING, 2021.

Nadeem, F., et al. "Hybrid Approaches for AEG." AAAI, 2019.

Ormerod, C., et al. "Exploring Efficient Transformer Architectures for AEG." ACL, 2021.

Page, Ellis B. "Project Essay Grade: PEG." Handbook of Writing Research, 1968.

Haendchen Filho, Aluizio, et al. "Using Support Vector Machines for Automated Essay Scoring." Proceedings of the International Conference on Machine Learning, 2018.

Li, Xia, et al. "BiLSTM with Word2Vec for Automated Essay Scoring." Journal of Artificial Intelligence Research, 2019.

Devlin, Jacob, et al. "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding." Proceedings of NAACL-HLT, 2019.

Zaheer, Manzil, et al. "BigBird: Transformers for Longer Sequences." NeurIPS, 2020.

Zhang, Haoran, and Diane Litman. "Improving Automated Essay Scoring with BiLSTM and Co- Attention Layers." IEEE Transactions on Learning Technologies, 2020.

Mnih, Volodymyr, et al. "Human-level Control through Deep Reinforcement Learning." Nature, 2015.

Silver, David, et al. "Mastering the Game of Go with Deep Neural Networks and Tree Search." Nature, 2016.

Downloads

Published

2025-02-28

How to Cite

Harinath, G., & Rao, V. (2025). Deep Reinforcement Learning and BigBird-BiLSTM Models in Automated Essay Scoring: An Exploratory Study. Journal of Research in Vocational Education, 7(2), 7–11. https://doi.org/10.53469/jrve.2025.7(02).02

Issue

Section

Articles