Automating Error-Tag Annotation of Japanese Learners' Composition Corpora for Language Education Research
In recent years, various types of learner corpora have been compiled and used for linguistic and educational research. As web-based applications for language learners have been developed, large volumes of learner-written text can now be collected online. These learner corpora contain incorrect sentences as well as correct ones. Our objective is to take advantage of these incorrect sentences for linguistic and educational research. In language education, researchers and teachers want to understand why learners make such errors, so that learners can avoid repeating them, and to apply the insights gained from such corpora. However, processing a large corpus is difficult when it has no annotation and no software for searching it. To make the corpora usable for this research, the errors in them must be extracted and annotated with useful information, so that insights can be drawn from real usage. To this end, this study addresses several tasks that facilitate the use of learner corpora. The tasks are listed below.