2019 年度実績報告書

The Corpus of Kansai Vernacular Japanese

研究課題

研究課題/領域番号	17K02761
研究機関	関西学院大学
研究代表者	HEFFERNAN K 関西学院大学, 総合政策学部, 教授 (60580595)
研究期間 (年度)	2017-04-01 – 2020-03-31
キーワード	corpus / dialect / morphology / コーパス / 方言 / 形態学
研究実績の概要	The two primary objectives of this research project were to create a corpus of Kansai vernacular Japanese, and to make that corpus available on the internet. In total, 138 sociolinguistic interviews were conducted. Each interview was transcribed, and checked for errors. The transcriptions were parsed and tagged with part of speech data using Mecab. The tagged data was checked by students hired for this job, and mistakes were corrected. I estimate the final accuracy rate is about 98%. The final data, along with supporting documents such as a description of the transcription methods, are available on a google website. The data is shared under a creative commons license. Users may be downloaded and used free of charge. However, users are prohibited from using the data for profit.
備考	The corpus data may be downloaded from this website.