Detection and Correction of English Grammar Errors Using Natural Language Processing Methods

Date
2025
Authors
Yazici, M.
Baygin, M.
Karabey Aksakalli, I.K.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
This study proposes a two-stage approach for the automatic detection and correction of grammatical errors in English. In the first stage, incorrect sentences were identified using a Random Forest classifier, which performed best among the machine learning algorithms compared: Random Forest, Decision Tree, Multi-Layer Perceptron, and Linear Support Vector Machine. In the second stage, grammatical correction was applied only to the sentences flagged as incorrect, using the transformer-based BART and T5 models. The dataset used is the Pranav GEC dataset published on Kaggle. After preprocessing, labeling, and balancing, sentence representations were generated using TF-IDF vectorization. Experimental results indicate that the Random Forest model achieved a high F1 score in the classification task, while the BART model provided highly effective corrections, especially in terms of precision. The proposed method reduces computational cost while improving overall accuracy, and it provides more balanced results than traditional seq2seq-based approaches. The study offers a technically and pedagogically effective solution for grammatical error correction systems, with practical potential for integration into language learning technologies. It also provides a foundation for future research in areas such as multilingual support, user feedback-based learning, and context-sensitive correction. © 2025 IEEE.
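The detect-then-correct flow described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: `tfidf_vectors` uses one common smoothed TF-IDF variant (an assumption, since the abstract does not state the exact weighting), and `toy_detector` and `toy_corrector` are hypothetical stand-ins for the trained Random Forest classifier and the BART/T5 models. The sketch only shows how gating sends flagged sentences, and nothing else, to the correction stage.

```python
import math
from collections import Counter
from typing import Callable, Dict, List


def tfidf_vectors(sentences: List[str]) -> List[Dict[str, float]]:
    """Return one L2-normalised TF-IDF vector (as a sparse dict) per sentence."""
    tokenized = [s.lower().split() for s in sentences]
    n_docs = len(tokenized)
    # Document frequency: the number of sentences in which each token occurs.
    doc_freq = Counter(tok for toks in tokenized for tok in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        # Smoothed IDF, ln((1 + n) / (1 + df)) + 1 (assumed variant).
        vec = {
            tok: count * (math.log((1 + n_docs) / (1 + doc_freq[tok])) + 1.0)
            for tok, count in counts.items()
        }
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({tok: w / norm for tok, w in vec.items()})
    return vectors


def correct_flagged(
    sentences: List[str],
    is_incorrect: Callable[[str], bool],
    corrector: Callable[[str], str],
) -> List[str]:
    """Stage 1 flags sentences; stage 2 rewrites only the flagged ones."""
    return [corrector(s) if is_incorrect(s) else s for s in sentences]


# Hypothetical stand-in for the trained classifier (not a Random Forest).
def toy_detector(sentence: str) -> bool:
    tokens = sentence.lower().rstrip(".").split()
    return ("she", "go") in zip(tokens, tokens[1:])


corrector_calls: List[str] = []


# Hypothetical stand-in for a seq2seq corrector such as BART or T5.
def toy_corrector(sentence: str) -> str:
    corrector_calls.append(sentence)
    return sentence.replace("She go ", "She goes ")


sentences = ["She go to school every day.", "He reads a book."]
corrected = correct_flagged(sentences, toy_detector, toy_corrector)
print(corrected)

# Tokens shared by every sentence receive less weight than distinctive ones.
weights = tfidf_vectors(["the cat sat", "the dog ran"])
print(weights[0])
```

In this sketch the corrector is invoked once for the two input sentences, which is the mechanism behind the reduced computational cost the abstract reports: the expensive sequence-to-sequence model runs only on sentences the lightweight classifier flags.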
Keywords
BART, Deep Learning, Grammatical Error Correction, Natural Language Processing, Random Forest
WoS Q
N/A
Scopus Q
N/A
Source
9th International Artificial Intelligence and Data Processing Symposium, IDAP 2025 -- 2025-09-06 through 2025-09-07 -- Malatya -- 215321
