2022 Fiscal Year Final Research Report
Development of a shared text repository for data-driven historical research
Project/Area Number |
20K20138
|
Research Category |
Grant-in-Aid for Early-Career Scientists
|
Allocation Type | Multi-year Fund |
Review Section |
Basic Section 90020:Library and information science, humanistic and social informatics-related
|
Research Institution | National Museum of Japanese History |
Principal Investigator |
Hashimoto Yuta 国立歴史民俗博物館, 大学共同利用機関等の部局等, 准教授 (10802712)
|
Project Period (FY) |
2020-04-01 – 2023-03-31
|
Keywords | データ構造化 / データ駆動型研究 / マークアップ / エンティティリンキング / クラウドソーシング |
Outline of Final Research Achievements |
The objective of this study was to establish a foundation for data-driven research on Jppanese historical documents through the construction of a shared repository for Japanese historical text. Initially, the plan was to focus on the development of a markup language for text structuring. However, the approach was shifted to two methods of structuring: 1) standoff markup and 2) entity linking. Based on these methods, efforts were made to construct a platform for structuring historical text. As a result, the achievements include the publication of "Markup Together [Ansei Edo Earthquake]" (https://markup.honkoku.org/) and its improved version, "Annotate Together" (https://ansei2.vercel.app/), which allow collaborative markup and annotation of historical materials.
|
Free Research Field |
人文情報学
|
Academic Significance and Societal Importance of the Research Achievements |
本研究は,わが国に大量に保存されている歴史資料を構造データ化し,データ駆動型研究の素材として提供するための基礎を構築する研究である.「みんなでマークアップ」および「みんなで注釈」では,実験的に1855年の安政江戸地震の記録史料を対象に構造化を実施しているが,災害被害を地図上に可視化し,計量的に処理することが可能になった.このシステムを他の史料群に適用することで,データサイエンス的手法を駆使した新しいアプローチの歴史研究が可能になることが期待される.
|