Develpment of a Method of Text Encoding for Japanese Historical Texts Accoding to Methdological Commons

Research Project

Project/Area Number	23K28385
Project/Area Number (Other)	23H03696 (2023)
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Multi-year Fund (2024) Single-year Grants (2023)
Section	一般
Review Section	Basic Section 90020:Library and information science, humanistic and social informatics-related
Research Institution	International Institute for Digital Humanities
Principal Investigator	永崎研宣一般財団法人人文情報学研究所, 人文情報学研究部門, 主席研究員 (30343429)
Co-Investigator(Kenkyū-buntansha)	間淵洋子和洋女子大学, 人文学部, 准教授 (10415614) 岡田一祐慶應義塾大学, 文学部(三田), 准教授 (80761220) 中村覚東京大学, 史料編纂所, 助教 (80802743) 後藤真国立歴史民俗博物館, 大学共同利用機関等の部局等, 准教授 (90507138) 王一凡一般財団法人人文情報学研究所, 人文情報学研究部門, 研究員 (20998215)
Project Period (FY)	2023-04-01 – 2026-03-31
Project Status	Granted (Fiscal Year 2024)
Budget Amount *help	¥18,720,000 (Direct Cost: ¥14,400,000、Indirect Cost: ¥4,320,000) Fiscal Year 2025: ¥6,760,000 (Direct Cost: ¥5,200,000、Indirect Cost: ¥1,560,000) Fiscal Year 2024: ¥5,850,000 (Direct Cost: ¥4,500,000、Indirect Cost: ¥1,350,000) Fiscal Year 2023: ¥6,110,000 (Direct Cost: ¥4,700,000、Indirect Cost: ¥1,410,000)
Keywords	TEIガイドライン / テキスト構造化 / 日本語テキスト資料 / 東アジア古典籍 / 日本古典籍 / 日本語歴史コーパス / 古辞書 / 日本史データ / 仏典テキストデータ
Outline of Research at the Start	本研究は、人文学向けテキストデータ構築の国際デファクト標準であるTEI (Text Encoding Initiative) ガイドラインの検討を通じ、日本の歴史的テキストを機械可読性の高い形で横断的に扱えるように構造化する具体的な手法を確立するとともに、日本文化に関わる研究データを国際的な学術流通の遡上にのせ、日本の人文学のためのデジタル時代の国際的な研究基盤を確立することを目指す。
Outline of Annual Research Achievements	2023年度の本研究の実績としては、TEI協会東アジア／日本分科会の定例研究会をオンラインで共催し、ほぼ毎週、49回にわたり、東アジア／日本に関するテキスト構造化に関する議論と実践を行った。定例研究会においては、TEI (Text Encoding Initiative)ガイドライン及びODDの日本語訳、方言談話資料の協働マークアップが主なテーマとなるとともに、オンライン開催のため国内外各地でテキスト構造化に取り組む研究者・実践者が集い、テキスト構造化に関する活発な議論が展開された。この活動をベースとしつつ、日本の歴史的テキストの構造化に関する取り組みが進められた。以下に、テキストのタイプ毎にみてみよう。和歌に関しては、近代短歌、歌合及び虫歌合に関する取り組みに協力する形でテキスト構造化のルールが検討され、それを踏まえた統合ビューワ及び個別対応のビューワの双方の開発が進められた。この成果はTEI国際会議、じんもんこんシンポジウム等で発表された。　近代文学の草稿に関しては、テキスト構造化に関する議論と個別対応の専用ビューワの開発が行われ、これもドイツ・パーダーボルンで開催されたTEI国際会議で発表された。　古辞書に関しては、歴史的辞書の構造化ガイドラインであるTEI-lex0のプロジェクトを率いるToma Tasovacを招聘してワークショップを開催し、議論を深めた。　また、歴史史料全般に関して、RDFとテキスト構造化との関係について検討するとともにこれもTEI国際会議で発表した。　仏典に関するテキスト構造化も議論し、情報処理学会人文科学とコンピュータ研究会や日本印度学仏教学会で共同発表を行った。以上の構造化実践を踏まえ、本年1月に汎用のTEI古典籍ビューワをWeb公開し、誰もが気軽にテキスト構造化に取り組める環境を提供した。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason そのために日本を含む東アジアの古典籍に対応しようとしてきた。なかでも、テキスト構造化のターゲットとしていた分野として、和歌に関する議論をかなり深めることができた。これ以外にも、近代文学草稿、仏教文献、古辞書、歴史史料などのタイプのテキストに関して議論を進めることができ、そのうちのいくつかのタイプについてはそれぞれに定義した構造に基づくビューワを開発し、そこでの表示に至るまで対応し、国内外各地の研究集会で発表を行うことができた。さらに、これに対応する東アジア古典籍向けの汎用ビューワの開発と公開を実現できた。これにより、テキスト構造化の意義をわかりやすく示すことができるようになった。そして、これに対するフィードバックも収集し発表することができたため、今後のさらなる展開につながることになった。さらに、これを発展させる形でAI-OCRと構造化テキストとの自動対照の仕組みを開発した。これは不完全なAIを信頼できるデジタル学術基盤に組込んでいくための手法として非常に有効なものであり、これは国内外各地の研究集会で発表しただけでなく、2023年4月にはウィーン大学での関連国際シンポジウムにて招待講演を行い、それぞれに様々な反響を集めた。
Strategy for Future Research Activity	今後の研究の推進方策としては、対応するテキストのタイプの拡充、既存のタイプのテキストに関するさらなる探究、それらに基づくビューワの機能追加を進めていく。テキストのタイプに関しては、2023年度に議論を行った古辞書に関する構造に本年度は特に力を入れる。ビューワに関しても、古辞書向けのものはこれまでとはまったく異なるものになり、すでに古辞書ビューワは国内外の研究プロジェクトによって開発されているため、そうしたものを参照しつつ互換性のある形で開発を進めていく。一方、資料画像との対応に関してIIIF対応画像との連携を行う。これにもテキスト構造に関する議論とビューワ開発における検討が必要になるため、その両面から研究開発を進めていく。最終年度には国際シンポジウムを開催して本研究の成果を広く公表する予定である。

Report

(1 results)

2023 Annual Research Report

Research Products
(26 results)

All 2024 2023 Other

All Journal Article (8 results) (of which Peer Reviewed: 5 results, Open Access: 1 results) Presentation (13 results) (of which Int'l Joint Research: 13 results, Invited: 3 results) Book (1 results) Remarks (2 results) Funded Workshop (2 results)

[Journal Article] 仏典研究とテキスト構造化2024
- Author(s)
  永崎研宣
- Journal Title
  
  印度学仏教学研究
  
  Volume: 72 Pages: 725-730
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] TEI古典籍ビューワによる構造化テキストの可視化2024
- Author(s)
  永崎研宣, 本間淳, 幾浦裕之, 佐久間祐惟, Wenlu Wang
- Journal Title
  
  研究報告人文科学とコンピュータ（CH）
  
  Volume: 2024-CH-134(10) Pages: 1-5
- Related Report
  2023 Annual Research Report
[Journal Article] 勅撰和歌集の構造化と提示手法に関する試み ―嘉禄二年本『古今和歌集』を事例として―2023
- Author(s)
  幾浦裕之, 永崎研宣, 加藤弓枝
- Journal Title
  
  じんもんこん2023論文集
  
  Volume: 2023 Pages: 183-190
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] 日本語方言談話資料のTEIによる構造化の試み2023
- Author(s)
  中川奈津子, 岡田一祐, 永崎研宣, 北﨑勇帆, 王一凡, 曹芳慧, 藤原静香, 塚越柚季, 乙川文英, 小川潤, 片倉峻平, 左藤仁宏, Wenlu Wang, 石田友梨, 宮川創, 佐久間祐惟, 塩井祥子, 井上慶淳, 村瀬友洋, 関慎太朗, 田良島哲, 嵩井里恵子, 渡邉眞儀, 中町信孝, 幾浦裕之
- Journal Title
  
  じんもんこん2023論文集
  
  Volume: 2023 Pages: 83-90
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] TEIに準拠した近代短歌テキストのマークアップ手法の提案,2023
- Author(s)
  村田祐菜, 永崎研宣
- Journal Title
  
  じんもんこん2023論文集
  
  Volume: 2023 Pages: 99-104
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] OCR の高精度化を踏まえたデジタル学術編集版の新展開2023
- Author(s)
  永崎研宣, 大向一輝, 下田正弘
- Journal Title
  
  じんもんこん2023論文集
  
  Volume: 2023 Pages: 177-182
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] 東アジア古典籍のための動的なデジタル学術編集版の構築2023
- Author(s)
  永崎研宣
- Journal Title
  
  研究報告人文科学とコンピュータ（CH）
  
  Volume: 2023-CH-133(5) Pages: 1-4
- Related Report
  2023 Annual Research Report
[Journal Article] 教の伝統的知識体系の構造化に向けて―日本の漢訳経典注釈書に対するマークアップについての一試論―2023
- Author(s)
  佐久間祐惟, 永崎研宣, 左藤仁宏, 村瀬友洋, 下田正弘
- Journal Title
  
  研究報告人文科学とコンピュータ（CH）
  
  Volume: 2023-CH-132(11) Pages: 1-6
- Related Report
  2023 Annual Research Report
[Presentation] Lessons from the Journey to Unicode: Standardizing Character Encoding in the SAT Daizokyo Text Database2024
- Author(s)
  Kiyonori Nagasaki, Yifan Wang
- Organizer
  Unicode and the Humanities, Carnegie Mellon University
- Related Report
  2023 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] The Formation of Digital Humanities in Japan: From Global and Local Contexts2023
- Author(s)
  Kiyonori Nagasaki
- Organizer
  DATA CURATION AND DIGITAL HUMANITIES IN ASIA, KAIST
- Related Report
  2023 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Teaching Encoding In and Out of the Classroom2023
- Author(s)
  Jakacki, Diane / Croxall, Brian / Jenstad, Janelle / Crompton, Constance / del rio Riande, Gimena / Nguテェ Um, Emmanuel / Cummings, James / Duguid, Timothy / Nagasaki, Kiyonori / Scholger, Martina / Viglianti, Raffaele
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Digital Representation of 'A Match of Crickets in Ten Rounds of Verse and Image': Text Encoding and Viewer Implementation for Japanese Poetry Match2023
- Author(s)
  Fujiwara, Shizuka / Nagasaki, Kiyonori / Ikuura, Hiroyuki / Morita, Teiko / Ikura, Yoichi / Matumoto, Ohki / Yumie, Kato
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] 3D Text Encoding and TEI: Text, Editions, and Spatiality2023
- Author(s)
  Ogawa, Jun / Nagasaki, Kiyonori / Kitamoto, Asanobu
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] n AI-assisted Digital Scholarly Editing System for Buddhist Studies2023
- Author(s)
  Nagasaki, Kiyonori / Shimoda, Masahiro
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] A Preliminary Proposal for Digital Scholarly Editing that Uses Modern Japanese autograph manuscripts: How to markup autograph manuscripts of Rampo Edogawa2023
- Author(s)
  Shioi, Sachiko / Nagasaki, Kiyonori
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Digitizing Buddhist Genealogy: Encoding the Shinran Shonin Montei Kyomyo-cho2023
- Author(s)
  Sato, Yoshihiro / Nagasaki, Kiyonori / Shimoda, Masahiro
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Toward a TEI/RDF Encoding for Semantic Annotations: Concept and Implementation as LOD Editor2023
- Author(s)
  Ogawa, Jun / Nagasaki, Kiyonori / Nakamura, Satoru / Ohmukai, Ikki / Kitamoto, Asanobu
- Organizer
  Joint MEC TEI conference 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Legal Issues in Digital Humanities: Analysis of Recent Advocacy and Continuing and Emerging Issues2023
- Author(s)
  Ketzan, Erik; Nayyer, Kim; Dombrowski, Quinn; Tilton, Lauren; de Smedt, Koenraad; Kamocki, Paweナ?; Trollip, Benito; Nagasaki, Kiyonori
- Organizer
  Digital Humanities 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] It Takes a Village: Building an Infrastructure for 3D Scholarly Editions2023
- Author(s)
  Papadopoulos, Costas; Schreibman, Susan; Gillikin Schoueri, Kelly; Cope, Jamie; Blundell, Jon; Ogawa, Jun; Nagasaki, Kiyonori
- Organizer
  Digital Humanities 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Multilingual taxonomy initiative - TaDiRAH as community of practice2023
- Author(s)
  Borek, Luise; Hastik, Canan; Dombrowski, Quinn; Broeder, Daan; Rockenberger, Annika; Nagasaki, Kiyonori; Mochizuki, Ryo; Katakura, Shumpei; Cupar, Drahomira; Ohmukai, Ikki
- Organizer
  Digital Humanities 2023
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Revisiting Text Encoding for Buddhist Studies2023
- Author(s)
  Kiyonori Nagasaki
- Organizer
  International Symposium: Advanced Computational Methods for Studying Buddhist Texts, University of Vienna
- Related Report
  2023 Annual Research Report
- Int'l Joint Research / Invited
[Book] 古典の再生（分担執筆：永崎研宣, 幾浦裕之, 藤原静香「古典本文をWebに載せるーTEIガイドラインに準拠したテキストデータ構築」, pp. 416-438.）2024
- Author(s)
  盛田帝子編
- Total Pages
  448
- Publisher
  文学通信
- ISBN
  9784867660423
- Related Report
  2023 Annual Research Report
[Remarks] TEI研究会
- URL
  https://tei.dhii.jp/
- Related Report
  2023 Annual Research Report
[Remarks] TEI古典籍ビューワ
- URL
  https://tei.dhii.jp/teiviewer4eaj
- Related Report
  2023 Annual Research Report
[Funded Workshop] Digitization of Historical Lexicography and TEI-Lex02023
- Related Report
  2023 Annual Research Report
[Funded Workshop] 国際シンポジウムデジタル・ヒューマニティーズと研究基盤欧州と日本の最新トレンド2023
- Related Report
  2023 Annual Research Report

Develpment of a Method of Text Encoding for Japanese Historical Texts Accoding to Methdological Commons

Principal Investigator

永崎 研宣 一般財団法人人文情報学研究所, 人文情報学研究部門, 主席研究員 (30343429)

¥18,720,000 (Direct Cost: ¥14,400,000、Indirect Cost: ¥4,320,000)

Current Status of Research Progress

Reason

Report

Research Products

[Journal Article] 仏典研究とテキスト構造化2024

Author(s)

Journal Title

Related Report

[Journal Article] TEI古典籍ビューワによる構造化テキストの可視化2024

Author(s)

Journal Title

Related Report

[Journal Article] 勅撰和歌集の構造化と提示手法に関する試み ―嘉禄二年本『古今和歌集』を事例として―2023

Author(s)

Journal Title

Related Report

[Journal Article] 日本語方言談話資料のTEIによる構造化の試み2023

Author(s)

Journal Title

Related Report

[Journal Article] TEIに準拠した近代短歌テキストのマークアップ手法の提案,2023

Author(s)

Journal Title

Related Report

[Journal Article] OCR の高精度化を踏まえたデジタル学術編集版の新展開2023

Author(s)

Journal Title

Related Report

[Journal Article] 東アジア古典籍のための動的なデジタル学術編集版の構築2023

Author(s)

Journal Title

Related Report

[Journal Article] 教の伝統的知識体系の構造化に向けて―日本の漢訳経典注釈書に対するマークアップについての一試論―2023

Author(s)

Journal Title

Related Report

[Presentation] Lessons from the Journey to Unicode: Standardizing Character Encoding in the SAT Daizokyo Text Database2024

Author(s)

Organizer

Related Report

[Presentation] The Formation of Digital Humanities in Japan: From Global and Local Contexts2023

Author(s)

Organizer

Related Report

[Presentation] Teaching Encoding In and Out of the Classroom2023

Author(s)

Organizer

Related Report

[Presentation] Digital Representation of 'A Match of Crickets in Ten Rounds of Verse and Image': Text Encoding and Viewer Implementation for Japanese Poetry Match2023

Author(s)

Organizer

Related Report

[Presentation] 3D Text Encoding and TEI: Text, Editions, and Spatiality2023

Author(s)

Organizer

Related Report

[Presentation] n AI-assisted Digital Scholarly Editing System for Buddhist Studies2023

Author(s)

Organizer

Related Report

[Presentation] A Preliminary Proposal for Digital Scholarly Editing that Uses Modern Japanese autograph manuscripts: How to markup autograph manuscripts of Rampo Edogawa2023

Author(s)

Organizer

Related Report

[Presentation] Digitizing Buddhist Genealogy: Encoding the Shinran Shonin Montei Kyomyo-cho2023

Author(s)

Organizer

Related Report

[Presentation] Toward a TEI/RDF Encoding for Semantic Annotations: Concept and Implementation as LOD Editor2023

Author(s)

Organizer

Related Report

[Presentation] Legal Issues in Digital Humanities: Analysis of Recent Advocacy and Continuing and Emerging Issues2023

Author(s)

Organizer

Related Report

永崎研宣一般財団法人人文情報学研究所, 人文情報学研究部門, 主席研究員 (30343429)

[Funded Workshop] 国際シンポジウムデジタル・ヒューマニティーズと研究基盤欧州と日本の最新トレンド2023