Grammatical Error Correction in Low-Resource Scenarios
Grammatical error correction in English is a long studied problem with many existing systems and datasets. However, there has been only a limited research on error correction of other languages. In this paper, we present a new dataset AKCES-GEC on grammatical error correction for Czech. We t hen make experiments on Czech, German and Russian and show that when utilizing synthetic parallel corpus, Transformer neural machine translation mo del can reach new state-of-the-art results on these datasets. AKCES-GEC is published under CC BY-NC-SA 4.0 license at https://hdl.handle.net/11234/1-3057 and the source code of the GEC model is available at https://github.com/ufal/low-resource-gec-wnut2019.
READ FULL TEXT