Bilgilendirme: Kurulum ve veri kapsamındaki çalışmalar devam etmektedir. Göstereceğiniz anlayış için teşekkür ederiz.
 

Detection and Correction of English Grammar Errors Using Natural Language Processing Methods

Loading...
Publication Logo

Date

2025

Authors

Yazici, M.
Baygin, M.
Karabey Aksakalli, I.K.

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Journal Issue

Abstract

This study proposes a two-stage approach for the automatic detection and correction of grammatical errors in English. In the first stage, incorrect sentences were identified using the Random Forest classifier, which showed the highest success among various machine learning algorithms such as Random Forest, Decision Tree, Multi-Layer Perceptron, and Linear Support Vector Machines. In the second stage, grammatical correction was performed on only these sentences using the transformer-based BART model and T5 model. The dataset utilized is the Pranav GEC dataset published on Kaggle. After preprocessing, labeling, and balancing, sentence representations were generated using the TF-IDF vectorization technique. Experimental results indicate that the Random Forest model achieved a high F1 score in the classification task, while the BART model provided highly effective corrections, especially in terms of precision. The proposed method reduces computational cost while improving overall accuracy and provides more balanced results compared to traditional seq2seq-based approaches. This study offers a technically and pedagogically effective solution model for grammatical error correction systems and has practical potential for integration with language learning technologies. It also provides a foundation for future research in areas such as multilingual support, user feedback-based learning, and contextsensitive correction. © 2025 IEEE.

Description

Keywords

Bart, Deep Learning, Grammatical Error Correction, Natural Language Processing, Random Forest

Fields of Science

Citation

WoS Q

N/A

Scopus Q

N/A

Source

-- 9th International Artificial Intelligence and Data Processing Symposium, IDAP 2025 -- 2025-09-06 through 2025-09-07 -- Malatya -- 215321

Volume

Issue

Start Page

End Page

Google Scholar Logo
Google Scholar™

Sustainable Development Goals

SDG data could not be loaded because of an error. Please refresh the page or try again later.