eprintid: 3350
rev_number: 8
eprint_status: archive
userid: 345
dir: disk0/00/00/33/50
datestamp: 2018-12-20 01:45:09
lastmod: 2018-12-20 01:45:09
status_changed: 2018-12-20 01:45:09
type: monograph
metadata_visibility: show
creators_name: Nguyen Binh, Nguyen
creators_name: Nguyen Van, Vinh
creators_id: vinhnv@vnu.edu.vn
corp_creators: VNU University of Engineering and Technology
corp_creators: VNU University of Engineering and Technology
title: Statistical Machine Translation For Vietnamese Grammatical Error Correction
ispublished: unpub
subjects: IT
divisions: fac_fit
abstract: Alongside the development of Natural Language Processing, much research has used Statistical Machine Translation for grammatical error correction. However, few of these studies can be applied to Vietnamese. Our purpose is therefore to implement grammatical error correction for Vietnamese. The problem can be described simply: given an erroneous sentence as input, the model produces the corrected sentence as output. In this research, we focus on applying Statistical Machine Translation, a Machine Learning approach to the grammatical error correction problem, to Vietnamese. First, we try to compile a list of all kinds of Vietnamese errors. We then aim to correct simple errors, such as spelling errors, and develop the system step by step to handle and correct more complex errors. The model needs a large amount of training data, so we collect as many Vietnamese sentences as possible and introduce errors into them to create parallel data. The data is divided into three parts, used for training, tuning, and testing, respectively. In the end, the model achieved some results: sentences with spelling mistakes are corrected better than others. The results are not very good, but they show that Statistical Machine Translation can be applied to the grammatical error correction problem.
date: 2018
publisher: Conference
full_text_status: public
monograph_type: technical_report
citation: Nguyen Binh, Nguyen and Nguyen Van, Vinh (2018) Statistical Machine Translation For Vietnamese Grammatical Error Correction. Technical Report. Conference. (Unpublished)
document_url: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/3350/1/SMT_Grammatical%20error%20correction_Report_Nguyen_Tuan_Vinh-2018%20%281%29.pdf
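
The data preparation described in the abstract (collect correct sentences, turn them into erroneous ones to form parallel data, then split the pairs into training, tuning, and testing parts) might be sketched as follows. This is a minimal illustration and not the report's actual procedure: the function names `corrupt_word`, `make_pair`, and `split_three_ways`, the two spelling-style error types, and the 80/10/10 split ratio are all assumptions made for the example.

```python
import random


def corrupt_word(word, rng):
    """Introduce one spelling-like error (assumed error types, not the report's).

    Either swaps two adjacent characters or drops one character.
    A swap of two identical characters leaves the word unchanged.
    """
    if len(word) < 2:
        return word
    i = rng.randrange(len(word) - 1)
    if rng.random() < 0.5:
        # Swap characters at positions i and i + 1.
        return word[:i] + word[i + 1] + word[i] + word[i + 2:]
    # Drop the character at position i.
    return word[:i] + word[i + 1:]


def make_pair(sentence, rng):
    """Return (erroneous source, correct target) for one correct sentence."""
    words = sentence.split()
    if words:
        j = rng.randrange(len(words))
        words[j] = corrupt_word(words[j], rng)
    return " ".join(words), sentence


def split_three_ways(pairs, rng, train=0.8, tune=0.1):
    """Shuffle the pairs and split them into training, tuning, and test sets.

    The 80/10/10 ratio is a hypothetical default; the remainder goes to test.
    """
    pairs = list(pairs)
    rng.shuffle(pairs)
    n_train = int(len(pairs) * train)
    n_tune = int(len(pairs) * tune)
    return (
        pairs[:n_train],
        pairs[n_train:n_train + n_tune],
        pairs[n_train + n_tune:],
    )


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the corruption is reproducible
    correct = ["Tôi đang học tiếng Việt", "Hôm nay trời rất đẹp"]
    for source, target in (make_pair(s, rng) for s in correct):
        print(source, "->", target)
```

In a Statistical Machine Translation setup, the erroneous side would play the role of the source language and the correct side the target language, so each pair is kept aligned line by line across the three splits.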