Basic research concerning adjacency probabilities in the development of a morphological analysis dictionary for classical Japanese poetry

Research Project

Project/Area Number	22520458
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Japanese linguistics
Research Institution	Tokyo Institute of Technology
Principal Investigator	YAMAMOTO Hilofumi Tokyo Institute of Technology, International Student Center, Associate Professor (30241756)
Project Period (FY)	2010-04-01 – 2013-03-31
Project Status	Completed (Fiscal Year 2013)
Budget Amount	¥2,990,000 (Direct Cost: ¥2,300,000, Indirect Cost: ¥690,000) Fiscal Year 2012: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000) Fiscal Year 2011: ¥650,000 (Direct Cost: ¥500,000, Indirect Cost: ¥150,000) Fiscal Year 2010: ¥1,300,000 (Direct Cost: ¥1,000,000, Indirect Cost: ¥300,000)
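Keywords	waka / diachronic analysis / classical Japanese dictionary / morpheme / network analysis / lexicology / adjacency rules / machine learning / dictionary / adjacency / Japanese language / Heian period / analysis system / classical Japanese / morphological analysis / ancient-language dictionary / diachronic change / part-of-speech system / Hachidaishu / thesaurus / diachronic language / lexicological topology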
Research Abstract	In 2007 the principal investigator developed a tool for the morphological processing of waka poems, but its applicability was limited to the Hachidaishu. The goal of the present research is to automatically segment the Nijuichidaishu and annotate it with part-of-speech tags, using the previously annotated segmentation data and the token adjacency probabilities of the Hachidaishu. A model was trained with KyTea (the Kyoto Text Analysis Toolkit), a morpheme segmentation toolkit, using its default L2-regularized SVM learning algorithm; training took less than a minute. The model achieved a segmentation accuracy of around 96% on the Nijuichidaishu. Some work remains on adding unknown tokens and on learning adjacency probabilities around unknown words, but the development of a dictionary that can segment the Nijuichidaishu with high accuracy can be considered complete.
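As a rough illustration of what "token adjacency probabilities" can mean, the sketch below estimates P(next token | previous token) from already-segmented lines by simple bigram counting. This is an assumption made for illustration only: it is not the project's code and not how KyTea works internally, and the function name `adjacency_probabilities`, the boundary markers `<s>` and `</s>`, and the romanized toy lines are all hypothetical.

```python
from collections import Counter, defaultdict


def adjacency_probabilities(segmented_lines):
    """Estimate P(next | prev) from lines of space-separated tokens.

    Hypothetical helper: plain maximum-likelihood bigram estimates,
    with <s> and </s> marking the start and end of each line.
    """
    pair_counts = defaultdict(Counter)
    for line in segmented_lines:
        tokens = ["<s>"] + line.split() + ["</s>"]
        # Count every adjacent (prev, next) token pair in the line.
        for prev, nxt in zip(tokens, tokens[1:]):
            pair_counts[prev][nxt] += 1

    probs = {}
    for prev, followers in pair_counts.items():
        total = sum(followers.values())
        # Normalize counts so the followers of each token sum to 1.
        probs[prev] = {nxt: n / total for nxt, n in followers.items()}
    return probs


# Made-up, romanized toy lines standing in for segmented poem text.
corpus = ["haru no yo", "haru no hana", "aki no yo"]
probs = adjacency_probabilities(corpus)
for follower, p in sorted(probs["no"].items()):
    print(f"P({follower} | no) = {p:.3f}")
```

In this toy corpus "no" is followed by "yo" twice and by "hana" once, so the estimates for those two pairs are 2/3 and 1/3. A real dictionary would need smoothing for unseen pairs, which is the kind of gap the abstract mentions for unknown words.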

Report

(4 results)

Research Products
(48 results)

Journal Article (24 results, of which Peer Reviewed: 20 results), Presentation (21 results), Remarks (3 results)

[Journal Article] Word correspondences in parallel tales shared by the Konjaku Monogatarishu and the Uji Shui Monogatari (2014)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Studies in the Japanese Language (Nihongo no Kenkyu) (The Society for Japanese Linguistics)
  
  Volume: Vol. 10, no. 1
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Word correspondences in parallel tales shared by the Konjaku Monogatarishu and the Uji Shui Monogatari (2014)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Studies in the Japanese Language (Nihongo no Kenkyu), The Society for Japanese Linguistics
  
  Volume: Vol. 10, no. 1 Pages: 16-31
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Forms of sentences containing uncertain information (2013)
- Author(s)
  Bor Hodoscek, Hilofumi Yamamoto
- Journal Title
  
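  Economic and Social Research Project: Risk Solutions for an Advanced Science and Technology Society 2012 (Graduate School of Decision Science and Technology, Tokyo Institute of Technology)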
  
  Volume: Vol. 2012
- Related Report
  2013 Final Research Report
[Journal Article] Analysis and Application of Mid-Rank Lexicons of Modern Japanese (2013)
- Author(s)
  Bor Hodoscek, Hilofumi Yamamoto
- Journal Title
  
  IPSJ Symposium 2013 Sig-CH, Pacific Neighborhood Consortium
  
  Volume: Vol. 2013, No. 4
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] A Corpus Study of Emotive Adjectives and Verbs of the Heian Japanese (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  SNPD2012, Proceedings 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, IEEE
  
  Volume: Vol. SNPD.2012, No. 101
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Diachronic Corpus and Linguistic Space: New Methods for the Analysis of Language Change (2012)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka, Yasuhiro Kondo
- Journal Title
  
  SNPD2012, Proceedings 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, IEEE
  
  Volume: Vol. SNPD2012, No. 101
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] A Corpus Study of Emotive Adjectives and Verbs of the Heian Japanese (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  SNPD2012, Proceedings 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, IEEE
  
  Volume: Vol. SNPD.2012, No. 101 Pages: 377-380
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Diachronic Corpus and Linguistic Space: New Methods for the Analysis of Language Change (2012)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka, Yasuhiro Kondo
- Journal Title
  
  SNPD2012, Proceedings 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, IEEE
  
  Volume: Vol. SNPD2012, No. 101 Pages: 381-384
- Related Report
  2012 Annual Research Report
- Peer Reviewed
[Journal Article] Analysis of waka vocabulary by set operations on graphs (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Goi Kenkyu, Goi Kenkyukai
  
  Volume: Vol. 9
- NAID
  40020106907
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Quantitative Analysis of Loanwords of Eight Literary Works in the Heian Period (794–1185) (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Osaka symposium on digital humanities 2011
  
  Volume: Vol. 1, No. 1
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Development of the thesaurus of classical Japanese poetic vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka
- Journal Title
  
  Asialex 2011, Lexicography : Theoretical and Practical Perspectives
  
  Volume: Vol. 2011
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] An analysis of Sino-Japanese words of the Heian period for the development of the historical Japanese dictionary (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Asialex 2011, Lexicography : Theoretical and Practical Perspectives
  
  Volume: Vol. 2011
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] The space of waka vocabulary surrounding yamabuki (Japanese kerria) (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Jinmonkon Symposium 2011, Proceedings of the Computers and the Humanities Symposium (Information Processing Society of Japan)
  
  Volume: Vol. 2011, No. 8
- NAID
  120006702478
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] Analysis of waka vocabulary by set operations on graphs (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Goi Kenkyu
  
  Volume: 9 Pages: 86-94
- NAID
  40020106907
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Quantitative Analysis of Loanwords of Eight Literary Works in the Heian Period (794-1185) (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Osaka symposium on digital humanities 2011
  
  Volume: 1 Pages: 51-52
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Graph Representation of the Connotations of Classical Japanese Poetic Vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Osaka symposium on digital humanities 2011
  
  Volume: 1 Pages: 42-42
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] An analysis of Sino-Japanese words of the Heian period for the development of the historical Japanese dictionary (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Journal Title
  
  Asialex 2011, Lexicography : Theoretical and Practical Perspectives
  
  Pages: 496-505
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Development of the thesaurus of classical Japanese poetic vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka
- Journal Title
  
  Asialex 2011, Lexicography : Theoretical and Practical Perspectives
  
  Pages: 576-585
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] The space of waka vocabulary surrounding yamabuki (Japanese kerria) (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Jinmonkon Symposium 2011, Proceedings of the Computers and the Humanities Symposium, Information Processing Society of Japan
  
  Volume: 8 Pages: 141-146
- NAID
  120006702478
- Related Report
  2011 Annual Research Report
- Peer Reviewed
[Journal Article] Specification, development, and evaluation of the BCCWJ compound functional expression dictionary (2011)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Proceedings of the FY2010 Public Workshop (Research Results Meeting) of the Priority Area Research Project "Japanese Corpus"
  
  Pages: 535-544
- Related Report
  2011 Annual Research Report
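[Journal Article] Specification, development, and evaluation of the BCCWJ compound functional expression dictionary (2011)
- Author(s)
  Yasuhiro Kondo, 坂野収, 多田知子, 岡田純子, Hilofumi Yamamoto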
- Journal Title
  
  Proceedings of the FY2010 Public Workshop (Research Results Meeting) of the Priority Area Research Project "Japanese Corpus"
  
  Pages: 534-544
- Related Report
  2010 Annual Research Report
[Journal Article] A modeling system for the vocabulary of the Hachidaishu (2010)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Jinmonkon 2010, Computers and the Humanities Symposium (Information Processing Society of Japan)
  
  Volume: Vol. 2010, No. 15
- NAID
  170000075244
- Related Report
  2013 Final Research Report
- Peer Reviewed
[Journal Article] A modeling system for the vocabulary of the Hachidaishu (2010)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Jinmonkon 2010, Computers and the Humanities Symposium, Information Processing Society of Japan
  
  Volume: 15 Pages: 247-254
- NAID
  170000075244
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] Analysis of poetic-word models by Boolean operations (2010)
- Author(s)
  Hilofumi Yamamoto
- Journal Title
  
  Proceedings of the 16th Public Symposium "Humanities and Databases", Council for Humanities Databases
  
  Volume: 16 Pages: 37-44
- Related Report
  2010 Annual Research Report
[Presentation] Lexical Modeling of Yamabuki (Japanese Kerria) in Classical Japanese Poetry (2013)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  JADH2013 DH-JAC2013 Conference, JADH2013 DH-JAC2013 Conference Abstracts, Vol. 2013
- Place of Presentation
  Kyoto : Ritsumeikan University
- Related Report
  2013 Final Research Report
[Presentation] A Diachronic and Synchronic Investigation into the Properties of Mid-Rank Words in Modern Japanese (2013)
- Author(s)
  Bor Hodoscek, Hilofumi Yamamoto
- Organizer
  JADH2013 DH-JAC2013 Conference, JADH2013 DH-JAC2013 Conference Abstracts, Vol. 2013
- Place of Presentation
  Kyoto : Ritsumeikan University
- Related Report
  2013 Final Research Report
[Presentation] Lexical Modeling of Yamabuki (Japanese Kerria) in Classical Japanese Poetry (2013)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  JADH2013 DH-JAC2013 Conference
- Place of Presentation
  Kyoto University Clock Tower Centennial Hall
- Related Report
  2012 Annual Research Report
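[Presentation] Japanese language analysis using IT, hosted by the Informatics Research Facility, Osaka Electro-Communication University (2012)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  Public workshop "Goal-oriented Japanese language education and usage support using IT"
- Place of Presentation
  Osaka : Electro-Communication University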
- Year and Date
  2012-03-20
- Related Report
  2013 Final Research Report
[Presentation] Design of Serial Comparison Model for the Diachronic Corpus Study of Japanese (2012)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka, Yasuhiro Kondo
- Organizer
  JADH 2012, JADH 2012 conference abstracts, Vol. 2012
- Place of Presentation
  Tokyo : University of Tokyo
- Related Report
  2013 Final Research Report
[Presentation] Emotive Adjectives and Verbs of the Heian Japanese (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  JADH 2012, JADH 2012 conference abstracts, Vol. 2012
- Place of Presentation
  Tokyo : University of Tokyo
- Related Report
  2013 Final Research Report
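[Presentation] Emotive adjectives and emotive verbs in Heian-period Japanese: through corpus analysis of the Genji Monogatari and the Konjaku Monogatarishu (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  NINJAL International Symposium "Transitivity and Argument Alternation in Japanese"
- Place of Presentation
  Tokyo : National Institute for Japanese Language and Linguistics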
- Related Report
  2013 Final Research Report
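[Presentation] Diachronic corpora and linguistic space theory (2012)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka, Yasuhiro Kondo
- Organizer
  Workshop on Corpus-based Japanese Linguistics, Proceedings of the Workshop on Corpus-based Japanese Linguistics, NINJAL Department of Corpus Studies and Center for Corpus Development, Vol. 1, No. 1
- Place of Presentation
  Tokyo : National Institute for Japanese Language and Linguistics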
- Related Report
  2013 Final Research Report
[Presentation] Design of Serial Comparison Model for the Diachronic Corpus Study of Japanese (2012)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka, Yasuhiro Kondo
- Organizer
  JADH 2012
- Place of Presentation
  University of Tokyo, Hongo Campus, Faculty of Engineering Building 2
- Related Report
  2012 Annual Research Report
[Presentation] Emotive Adjectives and Verbs of the Heian Japanese (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  JADH 2012
- Place of Presentation
  University of Tokyo, Hongo Campus, Faculty of Engineering Building 2
- Related Report
  2012 Annual Research Report
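[Presentation] Emotive adjectives and emotive verbs in Heian-period Japanese: through corpus analysis of the Genji Monogatari and the Konjaku Monogatarishu (2012)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  NINJAL International Symposium "Transitivity and Argument Alternation in Japanese"
- Place of Presentation
  National Institute for Japanese Language and Linguistics, Tachikawa, Tokyo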
- Related Report
  2012 Annual Research Report
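[Presentation] The space of waka vocabulary surrounding yamabuki (Japanese kerria) (2011)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  Jinmonkon Symposium 2011, Information Processing Society of Japan
- Place of Presentation
  Kyoto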
- Year and Date
  2011-12-10
- Related Report
  2011 Annual Research Report
[Presentation] Quantitative Analysis of Loanwords of Eight Literary Works in the Heian Period (794-1185) (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  Osaka symposium on digital humanities 2011
- Place of Presentation
  Osaka
- Year and Date
  2011-09-13
- Related Report
  2011 Annual Research Report
[Presentation] Graph Representation of the Connotations of Classical Japanese Poetic Vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  Osaka symposium on digital humanities 2011
- Place of Presentation
  Osaka
- Year and Date
  2011-09-13
- Related Report
  2011 Annual Research Report
[Presentation] An analysis of Sino-Japanese words of the Heian period for the development of the historical Japanese dictionary (2011)
- Author(s)
  Makiro Tanaka, Hilofumi Yamamoto
- Organizer
  Asialex 2011
- Place of Presentation
  Kyoto
- Year and Date
  2011-08-22
- Related Report
  2011 Annual Research Report
[Presentation] Development of the thesaurus of classical Japanese poetic vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto, Makiro Tanaka
- Organizer
  Asialex 2011
- Place of Presentation
  Kyoto
- Year and Date
  2011-08-22
- Related Report
  2011 Annual Research Report
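[Presentation] Specification, development, and evaluation of the BCCWJ compound functional expression dictionary (2011)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  FY2010 Public Workshop (Research Results Meeting) of the Priority Area Research Project "Japanese Corpus"
- Place of Presentation
  Tokyo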
- Year and Date
  2011-03-16
- Related Report
  2010 Annual Research Report
[Presentation] Graph Representation of the Connotations of Classical Japanese Poetic Vocabulary (2011)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  Osaka symposium on digital humanities 2011, Vol. 1, No. 1
- Place of Presentation
  Osaka : Osaka University
- Related Report
  2013 Final Research Report
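[Presentation] Specification, development, and evaluation of the BCCWJ compound functional expression dictionary (2011)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  FY2010 Public Workshop (Research Results Meeting) of the Priority Area Research Project "Japanese Corpus", Proceedings of the same workshop, Steering Group of the MEXT Grant-in-Aid Priority Area Research Project "Japanese Corpus"
- Place of Presentation
  Tokyo : National Institute for Japanese Language and Linguistics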
- Related Report
  2013 Final Research Report
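[Presentation] Analysis of poetic-word models by Boolean operations, 16th Public Symposium "Humanities and Databases" (2010)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  Proceedings of the 16th Public Symposium "Humanities and Databases", Executive Committee of the 16th Public Symposium "Humanities and Databases"
- Place of Presentation
  Kyoto : Hanazono University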
- Year and Date
  2010-11-27
- Related Report
  2013 Final Research Report
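[Presentation] Lexicological topology and transition as seen through a diachronic corpus (2010)
- Author(s)
  Hilofumi Yamamoto
- Organizer
  NINJAL Collaborative Research Meeting and Symposium "Design of a Diachronic Corpus", research presentation meeting
- Place of Presentation
  Tokyo : National Institute for Japanese Language and Linguistics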
- Year and Date
  2010-03-03
- Related Report
  2013 Final Research Report
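[Remarks] Project web page funded by Grants-in-Aid for Scientific Research: "Basic research concerning adjacency probabilities in the development of a morphological analysis dictionary for classical Japanese poetry"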
- URL
  http://warbler.ryu.titech.ac.jp/~yamagen/waka/kaken2010.html
- Related Report
  2013 Final Research Report
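[Remarks] Basic research concerning adjacency probabilities in the development of a morphological analysis dictionary for classical Japanese poetry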
- URL
  http://warbler.ryu.titech.ac.jp/~yamagen/waka/kaken2010.html
- Related Report
  2012 Annual Research Report
[Remarks]
- URL
  http://warbler.ryu.titech.ac.jp/~yamagen/waka/kaken2010.html
- Related Report
  2011 Annual Research Report
