Construction of archives of Indian classics with phrase index by means of corpus based extraction of formulaic sequences

Research Project

Project/Area Number	16K12544
Research Category	Grant-in-Aid for Challenging Exploratory Research
Allocation Type	Multi-year Fund
Research Field	Library and information science/Humanistic social informatics
Research Institution	Ryukoku University (2018-2019) Kansai Gaidai University (2016-2017)
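Principal Investigator	Nakatani Hideaki, Ryukoku University, public/private university department, Researcher (20140395)
Co-Investigator(Kenkyū-buntansha)	Shibano Kohji, Tokyo University of Foreign Studies, other departments, Professor Emeritus (50216024)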
Project Period (FY)	2016-04-01 – 2020-03-31
Project Status	Completed (Fiscal Year 2019)
Budget Amount	¥2,990,000 (Direct Cost: ¥2,300,000, Indirect Cost: ¥690,000); Fiscal Year 2018: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000); Fiscal Year 2017: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000); Fiscal Year 2016: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
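Keywords	Indian classics / phrase extraction / intra-textual layers / Rigveda / Ngram / MapReduce / Formulaic Sequence / text database / layers of textual formation / Pali Buddhist texts / archive / Mahabharata / phrase index / integrated Ngram / stratification / archive of Indian classics / relative chronology of Indian classics / Shatapatha Brahmana / Pali canon / stratification of Indian classics / integrated archive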
Outline of Final Research Achievements	A new automatic phrase extraction algorithm, developed by Kohji Shibano using Google's MapReduce, has made it possible to extract automatically all "phrases" (groups of consecutive words) from large-scale databases that previously could not be processed. The Indian classics include many works of unknown authorship and date, and a single work often contains several hidden layers of different dates. Automatic phrase extraction, by indicating every location of each unique "phrase", will clarify the relationships among multiple works as well as the existence of layers of different dates within the same work. Phrase-analytic research on the Indian classics will thus demonstrate the accuracy of oral textual transmission from ancient India and allow a far more precise understanding of the Indian classics.
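Academic Significance and Societal Importance of the Research Achievements	The two distinguishing features of the Indian classics are an unerringly accurate oral transmission spanning several thousand years and uncertain dates of composition, which stem from a lack of historical consciousness. It is also not rare for a single work to consist of several layers composed at different dates. For Indian classical studies, therefore, establishing the relative chronology of the texts and the layers within each text is an indispensable precondition, but unfortunately this has not yet been achieved. The automatic phrase extraction method devised by Shibano is effective in determining the relationships among texts and the layers within a text, and has made it possible to gain in a few months an understanding that previously took researchers many years. Understanding the Indian classics is essential to understanding the traditional mentality of India, which will become one of the world's most important countries in the 21st century, and this is why a breakthrough in the understanding of the classics through phrase analysis is anticipated.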
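
The record above describes extracting every "phrase" (group of consecutive words) from a corpus and listing all the places where each one occurs. As a rough illustration only, the sketch below imitates a map step and a reduce step in a single Python process. It is not Shibano's algorithm, whose details are not given in this record. The function names (map_ngrams, reduce_phrases, build_phrase_index), the parameters (min_n, max_n, min_count), and the two toy strings are all assumptions made for the example.

```python
from collections import defaultdict
from itertools import chain


def map_ngrams(doc_id, text, min_n=2, max_n=4):
    """Map step (hypothetical): emit (phrase, (doc_id, word_position))
    for every run of min_n..max_n consecutive words."""
    words = text.split()
    for n in range(min_n, max_n + 1):
        for i in range(len(words) - n + 1):
            yield " ".join(words[i:i + n]), (doc_id, i)


def reduce_phrases(pairs, min_count=2):
    """Reduce step (hypothetical): group locations by phrase and keep
    only phrases that occur at least min_count times."""
    locations = defaultdict(list)
    for phrase, loc in pairs:
        locations[phrase].append(loc)
    return {p: locs for p, locs in locations.items() if len(locs) >= min_count}


def build_phrase_index(corpus, min_n=2, max_n=4, min_count=2):
    """Run the map step over every document, then the reduce step."""
    pairs = chain.from_iterable(
        map_ngrams(doc_id, text, min_n, max_n) for doc_id, text in corpus.items()
    )
    return reduce_phrases(pairs, min_count)


# Toy corpus: two invented, ASCII-simplified strings, not real data.
corpus = {
    "text_a": "evam me sutam ekam samayam bhagava",
    "text_b": "evam me sutam ekam samayam buddho",
}
index = build_phrase_index(corpus)
for phrase, locs in sorted(index.items()):
    print(phrase, locs)
```

A phrase shared by both toy texts appears in the index with one location per text, while a phrase found only once is dropped. In a real MapReduce setting the two steps would run in parallel across many machines, which is what makes very large corpora tractable.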

Report

(5 results)

2019 Annual Research Report; Final Research Report (PDF)
2018 Research-status Report
2017 Research-status Report
2016 Research-status Report

Research Products
(29 results)

Journal Article (14 results) (of which Int'l Joint Research: 13 results, Peer Reviewed: 10 results, Open Access: 2 results, Acknowledgement Compliant: 1 result); Presentation (14 results) (of which Int'l Joint Research: 8 results, Invited: 4 results); Remarks (1 result)

[Journal Article] Mining formulaic sequences from a huge corpus of Japanese TV closed caption (2020)
- Author(s)
  Minako Nakamura, Kohji Shibano
- Journal Title
  
  DH(Digital Humanities), Budapest 2019
  
  Volume: 1
- Related Report
  2019 Annual Research Report
- Peer Reviewed
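[Journal Article] Indian and Tibetan Classical Studies and the Société franco-japonaise des études orientales (インド・チベット古典学と日仏東洋学会) (2020)
- Author(s)
  Nakatani Hideaki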
- Journal Title
  
  日仏東洋学会通信
  
  Volume: 43 Pages: 29-46
- Related Report
  2019 Annual Research Report
- Open Access / Int'l Joint Research
[Journal Article] Analyzing Usefulness of Dialogues from Closed Caption TV Corpus as an Example of Can-do Statements for Language Learning (2018)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  2018 Hawaii University Conference, Arts, Humanities, Social Sciences & Education (AHSE), Hawaii, USA
  
  Volume: 1
- Related Report
  2017 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Analyzing formulaic sequences in spoken Japanese from a large Japanese TV closed caption corpus (2017)
- Author(s)
  Kohji Shibano
- Journal Title
  
  The 18th World Congress of Applied Linguistics (AILA 2017), 23-28 July 2017
  
  Volume: 1
- Related Report
  2017 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Developing Intimacy by Style-shifting in Japanese: A TV Subtitle Corpus-based Study, XIAO Tingting (2017)
- Author(s)
  Kohji Shibano
- Journal Title
  
  The 2017 conference of the American Association for Applied Linguistics (AAAL 2017), 18-21 March, 2017
  
  Volume: 1
- Related Report
  2017 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Searching Discourse Segments for Formulaic Sequences in a Closed Caption TV Corpus for Language Learning (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education
  
  Volume: 1 Pages: 19-27
- Related Report
  2017 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Discourse Segment Clustering with Word Embedding based on Formulaic Sequences for Language Education (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  International Conference on Education and Multimedia Technology (ICEMT 2017)
  
  Volume: 1
- Related Report
  2017 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] The Acquisition of a Japanese Practical Formulaic Sequences List from a Closed Caption TV Corpus (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  Hawaii University Conferences, STAM/STEAM Education Conference
  
  Volume: 1
- Related Report
  2017 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Buddha’s denial of the universality of mind (2017)
- Author(s)
  Hideaki NAKATANI
- Journal Title
  
  Bouddhisme et universalisme : de l’Inde au Japon
  
  Volume: 1
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Journal Article] Le Pali, langue de la realisation de l’enseignement du Buddha (2017)
- Author(s)
  Hideaki NAKATANI
- Journal Title
  
  Symposium Hieroglossie I, College de France
  
  Volume: 1
- Related Report
  2016 Research-status Report
- Int'l Joint Research / Acknowledgement Compliant
[Journal Article] Modification of word2vec by Formulaic Sequences and Extraction of Useful Expressions for Language Learning from Closed Caption TV Corpus (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  The IAFOR International Conference on Language Learning, Hawaii 2017, IICLLHawaii 2017, Honolulu, USA
  
  Volume: 1
- Related Report
  2016 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Developing Intimacy by Style-shifting in Japanese: A TV Subtitle Corpus-based Study, XIAO Tingting (2017)
- Author(s)
  Kohji Shibano
- Journal Title
  
  The 2017 conference of the American Association for Applied Linguistics (AAAL 2017)
  
  Volume: 1
- Related Report
  2016 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Analyzing formulaic sequences in spoken Japanese from a large Japanese TV closed caption corpus (2017)
- Author(s)
  Kohji Shibano
- Journal Title
  
  The 18th World Congress of Applied Linguistics (AILA 2017), 23-28 July 2017, Rio de Janeiro, Brazil
  
  Volume: 1
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Journal Article] Extracting Formulaic Sequences Containing Useful Expressions for Language Learning from Closed Caption TV Corpus (2016)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Journal Title
  
  World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, E-Learn 2016, Alexandria, USA
  
  Volume: 1 Pages: 29-37
- Related Report
  2016 Research-status Report
- Peer Reviewed / Int'l Joint Research
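[Presentation] The Buddha's "Introspective Altruism" (ブッダの「自省利他」) (2019)
- Author(s)
  Nakatani Hideaki
- Organizer
  Ryukoku University 380th Anniversary Commemorative Lecture (龍谷大学創立380周年記念講演会)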
- Related Report
  2019 Annual Research Report
- Invited
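[Presentation] Research on "Introspective Altruism" as an Identity for the People of the World: With a View to Social Implementation (世界の人々のアイデンティティとしての「自省利他」の研究－社会実装を視野に入れて) (2019)
- Author(s)
  Nakatani Hideaki
- Organizer
  First research meeting of the Grant-in-Aid for Challenging Research (Pioneering) project "Research on 'Introspective Altruism' as an Identity for the People of the World: With a View to Social Implementation"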
- Related Report
  2019 Annual Research Report
[Presentation] Mining formulaic sequences from a huge corpus of Japanese TV closed caption (2019)
- Author(s)
  Minako Nakamura, Kohji Shibano
- Organizer
  DH(Digital Humanities), Budapest 2019, Hungary
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Introspective Altruism: Philosophy of the Buddha (自省利他ー仏陀の哲学) (2019)
- Author(s)
  Nakatani Hideaki
- Organizer
  Fo Guang Shan University President's Forum, Taiwan (仏光山大学校長論壇、台湾)
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
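[Presentation] The Thought of Introspective Altruism: The Teaching of the Buddha in the Aṭṭhakavagga of the Suttanipāta (自省利他の思想 ―『スッタニパータ』八頌品における釈尊の教え) (2019)
- Author(s)
  Nakatani Hideaki
- Organizer
  Komazawa University Jōdō-e Service Commemorative Lecture (駒澤大学成道会法要記念講演)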
- Related Report
  2019 Annual Research Report
- Invited
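[Presentation] Indian Classical Studies, Tibetan Classical Studies, and the Société franco-japonaise des études orientales (インド古典学・チベット古典学と日仏東洋学会) (2019)
- Author(s)
  Nakatani Hideaki
- Organizer
  Société franco-japonaise des études orientales (日仏東洋学会)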
- Related Report
  2018 Research-status Report
- Invited
[Presentation] Analyzing formulaic sequences in spoken Japanese from a large Japanese TV closed caption corpus (2018)
- Author(s)
  Kohji Shibano
- Organizer
  The 18th World Congress of Applied Linguistics
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Presentation] Analyzing formulaic sequences in spoken Japanese from a large Japanese TV closed caption corpus (2017)
- Author(s)
  Kohji Shibano
- Organizer
  The 18th World Congress of Applied Linguistics (AILA 2017), 23-28 July 2017, Rio de Janeiro, Brazil
- Place of Presentation
  Rio de Janeiro, Brazil
- Year and Date
  2017-07-23
- Related Report
  2016 Research-status Report
- Int'l Joint Research
[Presentation] Developing Intimacy by Style-shifting in Japanese: A TV Subtitle Corpus-based Study, XIAO Tingting (2017)
- Author(s)
  Kohji Shibano
- Organizer
  The 2017 conference of the American Association for Applied Linguistics (AAAL 2017)
- Place of Presentation
  Portland, USA
- Year and Date
  2017-03-18
- Related Report
  2016 Research-status Report
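[Presentation] Metre and Thought of the Aṭṭhakavagga (八頌品（アッタカ・ヴァッガ）の韻律と思想) (2017)
- Author(s)
  Nakatani Hideaki
- Organizer
  68th Annual Conference of the Japanese Association of Indian and Buddhist Studies (日本印度学仏教学会第68回学術大会)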
- Related Report
  2017 Research-status Report
[Presentation] Developing Intimacy by Style-shifting in Japanese: A TV Subtitle Corpus-based Study, XIAO Tingting (2017)
- Author(s)
  Kohji Shibano
- Organizer
  The 2017 conference of the American Association for Applied Linguistics
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Presentation] Analyzing Usefulness of Dialogues from Closed Caption TV Corpus as an Example of Can-do Statements for Language Learning (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Organizer
  2018 Hawaii University Conference, Arts, Humanities, Social Sciences & Education
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Presentation] Discourse Segment Clustering with Word Embedding based on Formulaic Sequences for Language Education (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Organizer
  International Conference on Education and Multimedia Technology
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Presentation] The Acquisition of a Japanese Practical Formulaic Sequences List from a Closed Caption TV Corpus (2017)
- Author(s)
  Hajime Mochizuki and Kohji Shibano
- Organizer
  2017 Hawaii University Conferences, STAM/STEAM Education Conference
- Related Report
  2017 Research-status Report
- Int'l Joint Research
[Remarks] Société franco-japonaise des études orientales (日仏東洋学会)
- URL
  http://www.classics.jp/sofjeo/
- Related Report
  2019 Annual Research Report
