2019 Fiscal Year Annual Research Report

Construction of archives of Indian classics with phrase index by means of corpus based extraction of formulaic sequences

Research Project

Project/Area Number	16K12544
Research Institution	Ryukoku University
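Principal Investigator	Hideaki Nakatani (中谷英明), Ryukoku University, Public University Departments etc., Researcher (20140395)
Co-Investigator(Kenkyū-buntansha)	Kohji Shibano (芝野耕司), Tokyo University of Foreign Studies, Other Departments etc., Professor Emeritus (50216024)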
Project Period (FY)	2016-04-01 – 2020-03-31
Keywords	Indian classics / phrase extraction / layers of textual formation / Rigveda / Ngram / MapReduce / Pali Buddhist canon / archive
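Outline of Annual Research Achievements	Conventional language data processing has been confined to the analysis of small texts, because the sheer volume of intermediate data imposed computational limits. The automatic phrase extraction method that Shibano developed as a new technique on top of MapReduce, Google's big-data processing technique (2004), has made it possible for the first time to extract all "phrases" (sequences of consecutive words). Whereas ordinary Ngram analysis works with one specific Ngram size, such as 4-grams, Shibano's integrated Ngram analysis generates every Ngram contained in each sentence, builds for each Ngram a deduplicated list of the sentences in which it occurs, and, where several Ngrams share an identical occurrence list, deletes all but the longest. What remains are the formulaic sequences. This algorithm makes it possible to extract the phrase expressions peculiar to each text. The output consists of: 1. unique phrases (Ngram, frequency, list of sentences of occurrence); 2. a duplicate list (Ngram, deleted Ngrams). In the Rigveda, for example, the stock phrase nRtamaM vAjasAtau ("most valiant in the winning of spoils"), an epithet of the thunder god Indra, is peculiar to the hymn collection of the Viśvāmitra family (that is, Book 3), while Book 10, a later addition, contains imitations of it. The index built with the phrase extraction method is full of such data, which evoke at a glance the historical process by which the Rigveda took shape. For Indian classics, the Pali Buddhist canon, and similar texts, almost all of which can be dated only roughly and many of which mix layers composed at different times within a single text, phrase analysis is expected to become an indispensable tool, and the resulting clarification of how each text was formed should greatly advance the understanding of its content.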
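The extraction steps described in the outline can be sketched as follows. This is a minimal in-memory illustration using plain Python dictionaries, not the MapReduce implementation the report refers to, and it is not the authors' code. The function names, the whitespace tokenization, the `min_sentences` threshold, the alphabetical tie-break between equally long Ngrams, and the choice to report frequency as the number of distinct sentences are all assumptions made for the example.

```python
from collections import defaultdict


def all_ngrams(words):
    """Yield every contiguous word sequence, from 1-grams up to the full sentence."""
    for start in range(len(words)):
        for end in range(start + 1, len(words) + 1):
            yield tuple(words[start:end])


def extract_formulaic_sequences(sentences, min_sentences=2):
    """Return (unique_phrases, duplicates) for a list of sentence strings.

    min_sentences is a hypothetical filter (not in the report) that keeps
    only Ngrams occurring in at least that many sentences.
    """
    # Step 1: map every Ngram of every sentence to the set of sentence ids
    # in which it occurs. The set removes repeats within one sentence.
    occurrences = defaultdict(set)
    for sent_id, sentence in enumerate(sentences):
        for ngram in all_ngrams(sentence.split()):
            occurrences[ngram].add(sent_id)

    # Step 2: group Ngrams whose occurrence lists are identical.
    groups = defaultdict(list)
    for ngram, sent_ids in occurrences.items():
        if len(sent_ids) >= min_sentences:
            groups[tuple(sorted(sent_ids))].append(ngram)

    # Step 3: in each group keep the longest Ngram and record the rest
    # as deleted. Ties in length are broken alphabetically (an assumption).
    unique_phrases = []  # (ngram, frequency, occurrence list)
    duplicates = []      # (kept ngram, deleted ngrams)
    for sent_ids, ngrams in groups.items():
        ngrams.sort(key=lambda g: (-len(g), g))
        kept, deleted = ngrams[0], ngrams[1:]
        unique_phrases.append((" ".join(kept), len(sent_ids), list(sent_ids)))
        if deleted:
            duplicates.append((" ".join(kept), [" ".join(g) for g in deleted]))
    return unique_phrases, duplicates
```

For the toy corpus `["p a b c", "a b c q", "r b c"]`, the sketch keeps `a b c` (sentences 0 and 1) and `b c` (sentences 0, 1 and 2), and lists `a b`, `a` and `b`, `c` as the deleted Ngrams. Because every sentence contributes a number of Ngrams quadratic in its length, the intermediate table grows quickly, which is presumably why a distributed approach is needed for large corpora.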

Research Products
(8 results)

Journal Article (2 results; of which Int'l Joint Research: 1, Peer Reviewed: 1, Open Access: 1) Presentation (5 results; of which Int'l Joint Research: 2, Invited: 3) Remarks (1 result)

[Journal Article] Mining formulaic sequences from a huge corpus of Japanese TV closed caption (2020)
- Author(s)
  Minako Nakamura, Kohji Shibano
- Journal Title
  
  DH(Digital Humanities), Budapest 2019
  
  Volume: 1 Pages: to be determined
- Peer Reviewed
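[Journal Article] Indo-Tibetan Classical Studies and the Franco-Japanese Society of Oriental Studies (インド・チベット古典学と日仏東洋学会) (2020)
- Author(s)
  Hideaki Nakatani
- Journal Title
  
  日仏東洋学会通信 (Bulletin of the Franco-Japanese Society of Oriental Studies)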
  
  Volume: 43 Pages: 29-46
- Open Access / Int'l Joint Research
[Presentation] The Buddha's "Introspective Altruism" (ブッダの「自省利他」) (2019)
- Author(s)
  Hideaki Nakatani
- Organizer
  Ryukoku University 380th Anniversary Commemorative Lecture
- Invited
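[Presentation] Research on "Introspective Altruism" as an Identity for the People of the World: With a View to Social Implementation (世界の人々のアイデンティティとしての「自省利他」の研究－社会実装を視野に入れて) (2019)
- Author(s)
  Hideaki Nakatani
- Organizer
  First workshop of the KAKENHI Challenging Research (Pioneering) project "Research on 'Introspective Altruism' as an Identity for the People of the World: With a View to Social Implementation"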
[Presentation] Mining formulaic sequences from a huge corpus of Japanese TV closed caption (2019)
- Author(s)
  Minako Nakamura, Kohji Shibano
- Organizer
  DH(Digital Humanities), Budapest 2019, Hungary
- Int'l Joint Research
[Presentation] Introspective Altruism: Philosophy of the Buddha (自省利他ー仏陀の哲学) (2019)
- Author(s)
  Hideaki Nakatani
- Organizer
  Fo Guang Shan University Presidents' Forum, Taiwan
- Int'l Joint Research / Invited
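[Presentation] The Thought of Introspective Altruism: The Teaching of Śākyamuni in the Aṭṭhakavagga of the Suttanipāta (自省利他の思想 ―『スッタニパータ』八頌品における釈尊の教え) (2019)
- Author(s)
  Hideaki Nakatani
- Organizer
  Komazawa University Jōdō-e (Bodhi Day) Service Commemorative Lecture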
- Invited
[Remarks] Franco-Japanese Society of Oriental Studies (日仏東洋学会)
- URL
  http://www.classics.jp/sofjeo/
