Development of an algorithm to decide similarities of world languages with considering differences in language names and classifications
Project/Area Number |
23650129
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Library and information science/Humanistic social informatics
|
Research Institution | Yamaguchi University |
Principal Investigator |
|
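Co-Investigator (Kenkyū-buntansha) |
INUI Hideyuki Yamaguchi University, Faculty of Humanities, Associate Professor (10241754)
WU Ren Yamaguchi Junior College, Department of Information Media, Associate Professor (70708015)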
|
Project Period (FY) |
2011-04-28 – 2015-03-31
|
Project Status |
Completed (Fiscal Year 2014)
|
Budget Amount |
¥3,640,000 (Direct Cost: ¥2,800,000, Indirect Cost: ¥840,000)
Fiscal Year 2013: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2012: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2011: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
|
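Keywords | Genealogical language classification / Literary information / Character-state method / Basic vocabulary / Character information / String matching / Algorithm / GIS |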
Outline of Final Research Achievements |
Identifying which languages correspond to each other across two different sets of language data is one of the main problems in matching the world's languages. We proposed a method that performs this identification using two measures, language name similarity and language classification similarity, and succeeded in finding 88% of the languages in one data set that correspond to languages in the other. We further improved the accuracy by taking into account sibling ("brother") information in the language classification tree. This method still had a problem: it could give an inappropriate decision even when one of the two similarities was a complete match. To address this problem, we defined two new measures: a language similarity based on sibling information, and a general language similarity that integrates the similarities of language name and language classification. The effectiveness of this method was confirmed by experiments.
|
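The outline above names three ingredients (name similarity, classification similarity, and sibling similarity) and an integrated general similarity, but this record does not give their formulas. The following Python sketch is therefore only an illustration under stated assumptions: the name measure uses the standard library's `difflib.SequenceMatcher`, the classification and sibling measures use Jaccard overlap, and the general similarity is a weighted average. The function names, the weights, and the record layout are all hypothetical and are not taken from the project.

```python
from difflib import SequenceMatcher


def name_similarity(a: str, b: str) -> float:
    """Name similarity in [0, 1]. Assumption: difflib ratio on lowercased names."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def jaccard(x: set, y: set) -> float:
    """Jaccard overlap of two sets; two empty sets are treated as no evidence (0.0)."""
    if not x and not y:
        return 0.0
    return len(x & y) / len(x | y)


def general_similarity(lang_a: dict, lang_b: dict,
                       w_name: float = 0.4,
                       w_class: float = 0.4,
                       w_sib: float = 0.2) -> float:
    """Weighted combination of the three measures (weights are hypothetical)."""
    name = name_similarity(lang_a["name"], lang_b["name"])
    # Classification: overlap of the nodes on the path from the root to the language.
    cls = jaccard(set(lang_a["classification"]), set(lang_b["classification"]))
    # Siblings: overlap of the languages listed under the same parent node.
    sib = jaccard(set(lang_a["siblings"]), set(lang_b["siblings"]))
    return w_name * name + w_class * cls + w_sib * sib


# Illustrative records from two imagined data sets (not the project's data).
a = {"name": "Kurmanji",
     "classification": ["Indo-European", "Iranian", "Kurdish"],
     "siblings": ["Sorani"]}
b = {"name": "Kurmanji Kurdish",
     "classification": ["Indo-European", "Iranian", "Kurdish"],
     "siblings": ["Sorani", "Southern Kurdish"]}

print(round(general_similarity(a, b), 3))
```

Because the score combines all three measures, a perfect match on the name alone or on the classification alone cannot decide the result by itself, which is the kind of failure the outline says the integrated measure was meant to address. A real system would need weights and thresholds tuned on labeled language pairs.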
Report
(5 results)
Research Products
(8 results)