2019 Fiscal Year Annual Research Report
The Corpus of Kansai Vernacular Japanese
Project/Area Number |
17K02761
|
Research Institution | Kwansei Gakuin University |
Principal Investigator |
HEFFERNAN K 関西学院大学, 総合政策学部, 教授 (60580595)
|
Project Period (FY) |
2017-04-01 – 2020-03-31
|
Keywords | corpus / dialect / morphology / コーパス / 方言 / 形態学 |
Outline of Annual Research Achievements |
The two primary objectives of this research project were to create a corpus of Kansai vernacular Japanese, and to make that corpus available on the internet. In total, 138 sociolinguistic interviews were conducted. Each interview was transcribed, and checked for errors. The transcriptions were parsed and tagged with part of speech data using Mecab. The tagged data was checked by students hired for this job, and mistakes were corrected. I estimate the final accuracy rate is about 98%. The final data, along with supporting documents such as a description of the transcription methods, are available on a google website. The data is shared under a creative commons license. Users may be downloaded and used free of charge. However, users are prohibited from using the data for profit.
|
Remarks |
The corpus data may be downloaded from this website.
|