Project/Area Number |
16200018
|
Research Category |
Grant-in-Aid for Scientific Research (A)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Library and information science / Humanistic social informatics
|
Research Institution | International Research Center for Japanese Studies |
Principal Investigator |
YAMADA Shoji International Research Center for Japanese Studies, Research Division, Associate Professor (20248751)
|
Co-Investigator(Kenkyū-buntansha) |
HAYAKAWA Monta International Research Center for Japanese Studies, Office for Cultural Materials Research Planning, Professor (10208605)
AIBA Atsushi International Research Center for Japanese Studies, Office for Cultural Materials Research Planning, Professor (20273154)
AIDA Mitsuru National Institute for Japanese Literature, Literary Formation Research Division, Research Associate (00249921)
HARA Shoichiro Kyoto University, Integrated Area Information Center, Professor (50218616)
SHIBAYAMA Mamoru Kyoto University, Center for Southeast Asian Studies, Professor (10162645)
|
Project Period (FY) |
2004 – 2006
|
Project Status |
Completed (Fiscal Year 2006)
|
Budget Amount |
¥39,780,000 (Direct Cost: ¥30,600,000, Indirect Cost: ¥9,180,000)
Fiscal Year 2006: ¥13,390,000 (Direct Cost: ¥10,300,000, Indirect Cost: ¥3,090,000)
Fiscal Year 2005: ¥12,480,000 (Direct Cost: ¥9,600,000, Indirect Cost: ¥2,880,000)
Fiscal Year 2004: ¥13,910,000 (Direct Cost: ¥10,700,000, Indirect Cost: ¥3,210,000)
|
Keywords | Data Mining / Kojiruien / Full Text Database / Historical Documents / Knowledge Discovery |
Research Abstract |
We constructed full-text data for the Chi part of Kojiruien (4,200 pages), marked up with XML tags. The fourth-proof version of the complete text of volume I of the Chi part is available, together with its extended characters and a conversion table to Unicode. Volumes II and III have reached their second and first proofs, respectively. In addition, we digitized Kuzushi-ji yoorei jiten (Tokyodo Shoten), the standard dictionary for reading historical documents, to supplement the pre-modern vocabulary. We developed a web-based system for browsing the text of the Ten part of Kojiruien and full-page images of the entire Kojiruien (67,000 pages), linked to the Kojiruien index (42,000 terms) and its phonetic index (64,000 entries). Through this research, we built up electronic data on basic pre-modern terms. The Kojiruien index was converted into a thesaurus; we uploaded the thesaurus and a dictionary usable for Japanese morphological analysis to our project website. The web-based system for the Kojiruien full text and page images is also available at the International Research Center for Japanese Studies and the National Institute for Japanese Literature. We also developed a Wiki-based prototype system and conducted a feasibility study of its effectiveness. As a case study of data mining for the humanities, we applied data mining to the Renga Haikai database of the International Research Center for Japanese Studies. From this investigation we generated a collocation dictionary of pre-modern terms; using the dictionary, we successfully extracted a new finding on the transition of the image of hototogisu in renga. Finally, we published the collocation dictionary of renga.
|