The development of a multi-purpose electric dictionary for morphological analyzers

Planned Research

Project Area	Compilation of a balanced corpus of written Japanese: Infrastructure for the coming Japanese linguistics
Project/Area Number	18061002
Research Category	Grant-in-Aid for Scientific Research on Priority Areas
Allocation Type	Single-year Grants
Review Section	Humanities and Social Sciences
Research Institution	Chiba University
Principal Investigator	DEN Yasuharu 千葉大学, 文学部, 教授 (70291458)
Co-Investigator(Kenkyū-buntansha)	YAMADA Atsushi 京都高度技術研究所, 研究部, 主席研究員 (20240004) MINEMATSU Nobuaki 東京大学, 大学院・新領域創成科学研究科, 准教授 (90273333) UCHIMOTO Kiyotaka 情報通信研究機構, 総合企画部, プランニングマネージャー (60358885) OGISO Tomonobu 国立国語研究所, 言語・資源研究系, 准教授 (20337489) KOISO Hanae 国立国語研究所, 理論・構造研究系, 准教授 (30312200)
Project Period (FY)	2006 – 2010
Project Status	Completed (Fiscal Year 2010)
Budget Amount *help	¥91,900,000 (Direct Cost: ¥91,900,000) Fiscal Year 2010: ¥17,700,000 (Direct Cost: ¥17,700,000) Fiscal Year 2009: ¥19,000,000 (Direct Cost: ¥19,000,000) Fiscal Year 2008: ¥19,000,000 (Direct Cost: ¥19,000,000) Fiscal Year 2007: ¥19,000,000 (Direct Cost: ¥19,000,000) Fiscal Year 2006: ¥17,200,000 (Direct Cost: ¥17,200,000)
Keywords	電子化辞書 / 形態素解析 / 書き言葉コーパス / 音変化 / アクセント / アクセント変化 / 辞書データベース / 単位の自動構成
Research Abstract	(1) An electric dictionary for morphological analyzers with the following characteristics has been developed. ・ Lexical entries with uniform unit-size based on Short-Unit Words ・ Hierarchical representation of lexical entries, consisting of lemma, form, orthography, and pronunciation, which enables us to deal with variations in orthography and word form ・ Rich information including features for phonological and accentual sandhi (2) A version for morphological analyzer MeCab has been derived from the dictionary database, with several updates, which amounts to 210K lemma and 330K orthographic entries and which achieves an accuracy of 98.9% in part-of-speech tagging and an accuracy of 98.6% in lemma identification. 3) A version of the dictionary database represented by XML files has also been developed, which enables users to build customized dictionaries for morphological analyzers according to the user’s preference and purpose. (4) Post-processing tools, including Middle- and Long-Unit-Word analyzers, have been developed for advanced use of the dictionary, such as syntactic analysis and text-to-speech application.

Report

(7 results)

2010 Annual Research Report Final Research Report ( PDF )
2009 Annual Research Report
2008 Annual Research Report Self-evaluation Report ( PDF )
2007 Annual Research Report
2006 Annual Research Report

Research Products
(108 results)

All 2011 2010 2009 2008 2007 2006 Other

All Journal Article (92 results) (of which Peer Reviewed: 24 results) Presentation (3 results) Book (8 results) Remarks (5 results)

[Journal Article] 「中古和文UniDic」における言語単位の設計2011
- Author(s)
  小椋秀樹・須永哲矢・小木曽智信・近藤明日子・田中牧郎
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 312-315
- Related Report
  2010 Annual Research Report
[Journal Article] Web版コーパス検索アプリケーション「中納言」の公開2011
- Author(s)
  中村壮範・小木曽智信
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 344-347
- Related Report
  2010 Annual Research Report
[Journal Article] 『現代日本語書き言葉均衡コーパス』における形態論情報付きXMLフォーマット2011
- Author(s)
  小木曽智信・間淵洋子・前川喜久雄
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 352-355
- Related Report
  2010 Annual Research Report
[Journal Article] 『現代日本語書き言葉均衡コーパス』に基づくオノマトペの分析-品詞性の検討を中心に-2011
- Author(s)
  宮内佐夜香・小木曽智信・小磯花絵・小椋秀樹
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 651-654
- Related Report
  2010 Annual Research Report
[Journal Article] 長単位に基づく『現代日本語書き言葉均衡コーパス』の品詞比率に関する分析2011
- Author(s)
  冨士池優美・小西光・小椋秀樹・小木曽智信・小磯花絵
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 663-666
- Related Report
  2010 Annual Research Report
[Journal Article] テキストの多様性をとらえる分類指標の体系化の試み2011
- Author(s)
  小磯花絵・田中弥生・小木曽智信・近藤明日子
- Journal Title
  
  言語処理学会第17回年次大会発表論文集
  
  Pages: 683-686
- Related Report
  2010 Annual Research Report
[Journal Article] 複合名詞内アクセント句境界を用いたアクセント結合予測の高精度化に関する実験的検討2011
- Author(s)
  高野克弥・清水信哉・峯松信明・広瀬啓吉
- Journal Title
  
  日本音響学会2011年春季研究発表会講演論文集
  
  Pages: 363-364
- Related Report
  2010 Annual Research Report
[Journal Article] 長単位に基づく媒体・カテゴリ間の品詞比率に関する分析2011
- Author(s)
  冨士池優美・小西光・小椋秀樹・小木曽智信・小磯花絵
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 273-280
- Related Report
  2010 Annual Research Report
[Journal Article] BCCWJに基づくオノマトペの品詞と意味についての分析2011
- Author(s)
  宮内佐夜香・小木曽智信・小磯花絵・小椋秀樹
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 281-288
- Related Report
  2010 Annual Research Report
[Journal Article] Web版コーパス検索アプリケーション「中納言」のデモンストレーション2011
- Author(s)
  中村壮範・小木曽智信
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 289-292
- Related Report
  2010 Annual Research Report
[Journal Article] 階層的形態論情報を考慮した『現代日本語書き言葉均衡コーパス』の公開用XMLフォーマット2011
- Author(s)
  小木曽智信・間淵洋子・前川喜久雄
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 293-300
- Related Report
  2010 Annual Research Report
[Journal Article] BCCWJに基づく中・長単位解析ツール2011
- Author(s)
  小澤俊介・内元清貴・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 331-338
- Related Report
  2010 Annual Research Report
[Journal Article] UniDicを用いた音声認識用言語モデルの作成2011
- Author(s)
  山田篤
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 339-342
- Related Report
  2010 Annual Research Report
[Journal Article] UniDic2:設計と実装2011
- Author(s)
  小木曽智信・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 411-418
- Related Report
  2010 Annual Research Report
[Journal Article] テキストの多様性をとらえる分類指標の構築を目指して2011
- Author(s)
  小磯花絵・田中弥生・小木曽智信・近藤明日子
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 431-438
- Related Report
  2010 Annual Research Report
[Journal Article] Design, compilation, and preliminary analyses of Balanced Corpus of Contemporary Written Japanese2010
- Author(s)
  K. Maekawa, M. Yamazaki, T. Maruyama, M. Yamaguchi, H. Ogura, W. Kashino, T. Ogiso, H. Koiso, and Y. Den
- Journal Title
  
  Proceedings of LREC2010
  
  Pages: 1483-1486
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] 中古和文を対象とした形態素解析辞書の開発2010
- Author(s)
  小木曽智信・小椋秀樹・田中牧郎・近藤明日子・伝康晴
- Journal Title
  
  情報処理学会研究報告
  
  Volume: 2010-CH-85 Pages: 49-64
- NAID
  110008003480
- Related Report
  2010 Final Research Report
[Journal Article] Design, compilation, and preliminary analyses of Balanced Corpus of Contemporary Written Japanese2010
- Author(s)
  K.Maekawa, M.Yamazaki, T.Maruyama, M.Yamaguchi, H.Ogura, W.Kashino, T.Ogiso, H.Koiso, Y.Den
- Journal Title
  
  Proceedings of LREC2010
  
  Pages: 1483-1486
- Related Report
  2010 Annual Research Report
- Peer Reviewed
[Journal Article] 『現代日本語書き言葉均衡コーパス』長単位解析に基づく予備的分析2010
- Author(s)
  冨士池優美・小椋秀樹・小西光・小木曽智信・小磯花絵
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度全体会議予稿集
  
  Pages: 101-108
- Related Report
  2010 Annual Research Report
[Journal Article] 汎用後処理ツールを用いた短単位解析結果の再解析2010
- Author(s)
  アブドレイムアブドハリリ・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度全体会議予稿集
  
  Pages: 141-144
- Related Report
  2010 Annual Research Report
[Journal Article] 汎用後処理ツールを用いた音変化処理の評価2010
- Author(s)
  山田篤・渡部涼子・小木曽智信
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度全体会議予稿集
  
  Pages: 145-150
- Related Report
  2010 Annual Research Report
[Journal Article] BCCWJに基づく長単位解析ツール2010
- Author(s)
  小澤俊介・内元清貴・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成22年度全体会議予稿集
  
  Pages: 151-156
- Related Report
  2010 Annual Research Report
[Journal Article] 中古和文を対象とした形態素解析辞書の開発2010
- Author(s)
  小木曽智信・小椋秀樹・田中牧郎・近藤明日子・伝康晴
- Journal Title
  
  情報処理学会研究報告 2010-CH-85
  
  Pages: 49-64
- NAID
  110008003480
- Related Report
  2009 Annual Research Report
[Journal Article] 機械翻訳に適した短単位に基づく中国語単語分割について2010
- Author(s)
  王軼謳・内元清貴・風間淳一・Kruengkrai Canasai・鳥澤健太郎
- Journal Title
  
  言語処理学会第16回年次大会発表論文集
- Related Report
  2009 Annual Research Report
[Journal Article] 形態素解析辞書のベンチマークテスト―IPAdic・NAIST-jdic・UniDicのジャンル別精度比較2010
- Author(s)
  小木曽智信・小椋秀樹・小磯花絵・宮内佐夜香・渡部涼子・伝康晴
- Journal Title
  
  言語処理学会第16回年次大会発表論文集
- Related Report
  2009 Annual Research Report
[Journal Article] 形態素解析辞書UniDicにおける同語異語判別について2010
- Author(s)
  小椋秀樹・原裕・小木曽智信・小磯花絵・宮内佐夜香
- Journal Title
  
  言語処理学会第16回年次大会発表論文集
- Related Report
  2009 Annual Research Report
[Journal Article] 修辞ユニットを用いた書き言葉の分析―「書き言葉・話し言葉」と(脱)文脈化の関係―2010
- Author(s)
  佐野大樹・小磯花絵
- Journal Title
  
  社会言語科学会第23回研究大会発表論文集
  
  Pages: 182-185
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] UniDic汎用後処理ツールの設計と実装2010
- Author(s)
  山田篤・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ予稿集
  
  Pages: 23-28
- Related Report
  2009 Annual Research Report
[Journal Article] 形態素解析辞書UniDicにおける語彙素見出しの立項方針2010
- Author(s)
  小椋秀樹・原裕・小木曽智信・小磯花絵・宮内佐夜香
- Journal Title
  
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ予稿集
  
  Pages: 85-92
- Related Report
  2009 Annual Research Report
[Journal Article] 『現代日本語書き言葉均衡コーパス』における長単位解析の進捗状況2010
- Author(s)
  冨士池優美・小椋秀樹・小西光・小木曽智信・小磯花絵・内元清貴・小澤俊介
- Journal Title
  
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ予稿集
  
  Pages: 93-100
- Related Report
  2009 Annual Research Report
[Journal Article] MeCab版形態素解析辞書4種のジャンル別解析精度比較―UniDicとIPAdic, NAIST-jdic, JUMANdic―2010
- Author(s)
  小木曽智信・小椋秀樹・小磯花絵・宮内佐夜香・渡部涼子・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ予稿集
  
  Pages: 175-182
- Related Report
  2009 Annual Research Report
[Journal Article] 長単位情報に基づくジャンル間の文体に関する分析2010
- Author(s)
  小磯花絵・小木曽智信・小椋秀樹・宮内佐夜香
- Journal Title
  
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ予稿集
  
  Pages: 183-190
- Related Report
  2009 Annual Research Report
[Journal Article] Design, compilation, and preliminary analyses of balanced corpus of contemporary written Japanese2010
- Author(s)
  K.Maekawa, M.Yamazaki, T.Maruyama, M.Yamaguchi, H.Ogura, W.Kashino, T.Ogiso, H.Koiso, Y.Den
- Journal Title
  
  Proceedings of the 7th International Conference on Language Resources and Evaluation (掲載確定)
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] Development of an on-line word accent dictionary of Japanese2009
- Author(s)
  H. Hirano, M. Suzuki, K. Innami, N. Minematsu, and K. Hirose
- Journal Title
  
  Proceedings of JSAA-ICJLE 2009
  
  Volume: 24 Pages: 640-646
- Related Report
  2010 Final Research Report
[Journal Article] 多様な目的に適した形態素解析システム用電子化辞書2009
- Author(s)
  伝康晴
- Journal Title
  
  人工知能学会誌
  
  Volume: 24 Pages: 640-646
- Related Report
  2010 Final Research Report
[Journal Article] 話し言葉における引用節・挿入節の自動認定および係り受け解析への応用2009
- Author(s)
  浜辺良二・内元清貴・河原達也・井佐原均
- Journal Title
  
  自然言語処理
  
  Volume: 16(1) Pages: 3-23
- NAID
  10024758516
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] 形態論情報の自動付与とその問題点2009
- Author(s)
  小木曽智信
- Journal Title
  
  国文学解釈と鑑賞
  
  Volume: 74(1) Pages: 35-43
- Related Report
  2010 Final Research Report
[Journal Article] 多様な目的に適した形態素解析システム用電子化辞書2009
- Author(s)
  伝康晴
- Journal Title
  
  人工知能学会誌 24
  
  Pages: 640-646
- Related Report
  2009 Annual Research Report
[Journal Article] Development of an on-line word accent dictionary of Japanese2009
- Author(s)
  H.Hirano, M.Suzuki, K.Innami, N.Minematsu, K.Hirose
- Journal Title
  
  Proceedings of JSAA-ICJLE 2009
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] An error-driven word-character hybrid model for joint Chinese word segmentation and POS tag2009
- Author(s)
  C.Kruengkrai, K.Uchimoto, J.Kazama, Y.Wang, K.Torisawa, H.Isahara
- Journal Title
  
  Proceedings of ACL-IJCNLP 2009
  
  Pages: 513-521
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] Improving dependency parsing with subtrees from auto-parsed data2009
- Author(s)
  W.Chen, J.Kazama, K.Uchimoto, K.Torisawa
- Journal Title
  
  Proceedings of EMNLP 2009
  
  Pages: 570-579
- Related Report
  2009 Annual Research Report
- Peer Reviewed
[Journal Article] ジャンル別UniDic作成の試み, 特定領域研究「日本語コーパス」2009
- Author(s)
  小木曽智信・伝康晴・渡部涼子
- Journal Title
  
  平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 17-22
- Related Report
  2008 Self-evaluation Report
[Journal Article] 形態論情報の自動付与とその問題点2009
- Author(s)
  小木曽智信
- Journal Title
  
  国文学解釈と鑑賞 74(1)
  
  Pages: 35-43
- Related Report
  2008 Annual Research Report
[Journal Article] 話し言葉における引用節・挿入節の自動認定および係り受け解析への応用2009
- Author(s)
  浜辺良二・内元清貴・河原達也・井佐原均
- Journal Title
  
  自然言語処理 16(1)
  
  Pages: 3-23
- NAID
  10024758516
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] CRFを用いたアクセント変形予測モデルの規則処理に基づく改良2009
- Author(s)
  印南圭祐・渡辺美知子・峯松信明・広瀬啓吉
- Journal Title
  
  言語処理学会第15回年次大会発表論文集
  
  Pages: 574-577
- Related Report
  2008 Annual Research Report
[Journal Article] コーパスに基づく多様なジャンルの文体比較-短単位情報に着目して-2009
- Author(s)
  小磯花絵・小木曽智信・小椋秀樹・宮内佐夜香
- Journal Title
  
  言語処理学会第15回年次大会発表論文集
  
  Pages: 594-597
- Related Report
  2008 Annual Research Report
[Journal Article] 語種を観点とした近代語と現代語の語彙の比較-形態素解析辞書「近代文語UniDic」「UniDiclを用いて-2009
- Author(s)
  近藤明日子・小木曽智信
- Journal Title
  
  言語処理学会第15回年次大会発表論文集
  
  Pages: 741-744
- Related Report
  2008 Annual Research Report
[Journal Article] 現代語コーパスの利用による近代語形態素解析の精度向上2009
- Author(s)
  小木曽智信・伝康晴・渡部涼子・近藤明日子
- Journal Title
  
  言語処理学会第15回年次大会発表論文集
  
  Pages: 801-804
- Related Report
  2008 Annual Research Report
[Journal Article] ジャンル別UniDic作成の試み2009
- Author(s)
  小木曽智信・伝康晴・渡部涼子
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 17-22
- Related Report
  2008 Annual Research Report
[Journal Article] 『現代日本語書き言葉均衡コーパス』における形態論情報付与作業の進捗状況2009
- Author(s)
  小椋秀樹・小木曽智信・小磯花絵・冨士池優美・宮内佐夜香・渡部涼子・竹内ゆかり・小川志乃・小西光・原裕・中村壮範
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 57-64
- Related Report
  2008 Annual Research Report
[Journal Article] 形態論情報データベースの構成2009
- Author(s)
  小木曽智信・小椋秀樹・小磯花絵・冨士池優美・宮内佐夜香・渡部涼子・竹内ゆかり・小川志乃・小西光・原裕・中村壮範
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 65-70
- Related Report
  2008 Annual Research Report
[Journal Article] 短単位を対象とした連濁の処理について2009
- Author(s)
  山田篤
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 93-98
- Related Report
  2008 Annual Research Report
[Journal Article] 規則処理のアクセント属性を導入したCRFによるアクセント結合処理2009
- Author(s)
  印南圭祐・峯松信明
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 175-181
- Related Report
  2008 Annual Research Report
[Journal Article] 規則処理を参考にしたCRFによるアクセント結合モデル2009
- Author(s)
  印南圭祐・渡辺美知子・峯松信明・広瀬啓吉
- Journal Title
  
  日本音響学会春季講演論文集
  
  Pages: 473-476
- Related Report
  2008 Annual Research Report
[Journal Article] コンピュータの辞書2009
- Author(s)
  小木曽智信
- Journal Title
  
  新「ことば」シリーズ22「辞書を知る」
  
  Pages: 114-117
- Related Report
  2008 Annual Research Report
[Journal Article] Word-level dependency-structure annotation to Corpus of Spontaneous Japanese and its application2008
- Author(s)
  K. Uchimoto and Y. Den
- Journal Title
  
  Proceedings of LREC2008
  
  Pages: 3118-3122
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] A proper approach to Japanese morphological analysis： Dictionary, model, and evaluation2008
- Author(s)
  Y. Den, J. Nakamura, T. Ogiso, and H. Ogura
- Journal Title
  
  Proceedings of LREC2008
  
  Pages: 1019-1024
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] Word-level dependency-structure annotation to Corpus of Spontaneous Japanese and its application2008
- Author(s)
  Uchimoto, K., Den, Y.
- Journal Title
  
  Proceedings of the 6th International Conference on Language Resources and Evaluation
  
  Pages: 3118-3122
- Related Report
  2008 Self-evaluation Report
- Peer Reviewed
[Journal Article] A proper approach to Japanese morphological analysis: Dictionary, model, and evaluation2008
- Author(s)
  Den, Y., Nakamura, J., Ogiso, T., Ogura, H.
- Journal Title
  
  Proceedings of the 6th International Conference on Language Resources and Evaluation
  
  Pages: 1019-1024
- Related Report
  2008 Self-evaluation Report
- Peer Reviewed
[Journal Article] 近代文語文を対象とした形態素解析辞書・近代文語UniDic2008
- Author(s)
  小木曽智信・小椋秀樹・近藤明日子
- Journal Title
  
  日本語学会2008年度春季大会予稿集
  
  Pages: 211-218
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] 話し言葉の整形作業における削除箇所の自動同定2008
- Author(s)
  尾嶋憲治・河原達也・秋田祐哉・内元清貴
- Journal Title
  
  情報処理学会研究報告 2008-NL-185
  
  Pages: 85-91
- NAID
  110006793717
- Related Report
  2008 Annual Research Report
[Journal Article] A proper approach to Japanese morphological analysis : Dictionary, model, and evaluation2008
- Author(s)
  Den. Y., Nakamura, J., Ogiso, T., and Ogura. H
- Journal Title
  
  Proc. of LREC2008
  
  Pages: 1019-1024
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] Word-level dependency-structure annotation to Corpus of Spontaneous Japanese and its application2008
- Author(s)
  Uchimoto, K. and Den. Y
- Journal Title
  
  Proc. of LREC2008
  
  Pages: 3118-3122
- Related Report
  2008 Annual Research Report
- Peer Reviewed
[Journal Article] 『現代日本語書き言葉均衡コーパス』にもとづくジャンル間の文体差に関わる要因の分析2008
- Author(s)
  小磯花絵・小木曽智信・小椋秀樹・冨士池優美・宮内佐夜香
- Journal Title
  
  社会言語科学会第22回研究大会発表論文集
  
  Pages: 192-195
- Related Report
  2008 Annual Research Report
[Journal Article] 平成19年度進捗状況報告 : 電子化辞書班(多様な目的に適した形態素解析システム用電子化辞書の開発)2008
- Author(s)
  伝康晴・峯松信明・小木曽智信・小磯花絵・山田篤・内元清貴
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度全体会議予稿集
  
  Pages: 15-18
- Related Report
  2008 Annual Research Report
[Journal Article] 短単位情報に基づくジャンル間の文体に関する分析2008
- Author(s)
  小磯花絵・小木曽智信・小椋秀樹
- Journal Title
  
  特定領域研究「日本語コーパス」平成20年度全体会議予稿集
  
  Pages: 99-106
- Related Report
  2008 Annual Research Report
[Journal Article] 平成19年度進捗状況報告:電子化辞書班(多様な目的に適した形態素解析システム用電子化辞書の開発)2008
- Author(s)
  伝康晴・山田篤・峯松信明・内元清貴・小木曽智信・小磯花絵
- Journal Title
  
  特定領域研究「日本語コーパス」平成19年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 79-84
- Related Report
  2007 Annual Research Report
[Journal Article] 中・長単位解析システムの開発2008
- Author(s)
  内元清貴・伝康晴
- Journal Title
  
  特定領域研究「日本語コーパス」平成19年度公開ワークショップ(研究成果報告会)予稿集
  
  Pages: 159-166
- Related Report
  2007 Annual Research Report
[Journal Article] CRFを用いたアクセント結合処理に対する誤り分析とその改良に向けた考察2008
- Author(s)
  印南圭祐・渡辺美知子・峯松信明・広瀬啓吉
- Journal Title
  
  日本音響学会春季講演論文集
  
  Pages: 413-414
- Related Report
  2007 Annual Research Report
[Journal Article] 語種情報を用いた同表記異音語の解消2008
- Author(s)
  伝康晴・中村純平・小木曽智信・小椋秀樹
- Journal Title
  
  言語処理学会第14回年次大会発表論文集
  
  Pages: 69-72
- Related Report
  2007 Annual Research Report
[Journal Article] 形態素解析誤りの多い助詞・助動詞の再解析2008
- Author(s)
  中村純平・伝康晴
- Journal Title
  
  言語処理学会第14回年次大会発表論文集
  
  Pages: 73-76
- Related Report
  2007 Annual Research Report
[Journal Article] 「現代日本語書き言葉均衡コーパス」の長単位認定基準について2008
- Author(s)
  冨士池優美・小椋秀樹・小木曽智信・小磯花絵・内元清貴・相馬さつき・中村壮範
- Journal Title
  
  言語処理学会第14回年次大会発表論文集
  
  Pages: 931-934
- Related Report
  2007 Annual Research Report
[Journal Article] 形態素解析用辞書UniDicへの語種情報の実装と政府刊行白書の語種比率の分析2008
- Author(s)
  小椋秀樹・小木曽智信・原裕・小磯花絵・冨士池優美
- Journal Title
  
  言語処理学会第14回年次大会発表論文集
  
  Pages: 935-938
- Related Report
  2007 Annual Research Report
[Journal Article] CRFに基づくアクセント変形予測モデルにおけるエラー解析2008
- Author(s)
  印南圭祐・渡辺美知子・峯松信明・広瀬啓吉
- Journal Title
  
  言語処理学会第14回年次大会発表論文集
  
  Pages: 969-972
- Related Report
  2007 Annual Research Report
[Journal Article] CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems2007
- Author(s)
  N. Minematsu, R. Kuroiwa, K. Hirose, and M. Watanabe
- Journal Title
  
  Proceedings of ISCA Workshop on Speech Synthesis
  
  Pages: 148-153
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] Morphological annotation of a large spontaneous speech corpus in Japanese2007
- Author(s)
  K. Uchimoto, and H. Isahara
- Journal Title
  
  Proceedings of IJCAI2007
  
  Pages: 1731-1737
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] コーパス日本語学のための言語資源：形態素解析用電子化辞書の開発とその応用2007
- Author(s)
  伝康晴・小木曽智信・小椋秀樹・山田篤・峯松信明・内元清貴・小磯花絵
- Journal Title
  
  日本語科学
  
  Volume: 22 Pages: 101-122
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] コーパス日本語学のための言語資源:形態素解析用電子化辞書の開発とその応用2007
- Author(s)
  伝康晴・小木曽智信・小椋秀樹・山田篤・峯松信明・内元清貴・小磯花絵
- Journal Title
  
  日本語科学 22
  
  Pages: 101-122
- Related Report
  2008 Self-evaluation Report 2007 Annual Research Report
- Peer Reviewed
[Journal Article] Morphological annotation of a large spontaneous speech corpus in Japanese2007
- Author(s)
  Uchimoto, K., & Isahara, H
- Journal Title
  
  Proc. of IJCAI2007
  
  Pages: 1731-1737
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems2007
- Author(s)
  Minematsu, N., Kuroiwa, R., Hirose, K., & Watanabe, M
- Journal Title
  
  Proc. of ISCA Workshop on Speech Synthesis
  
  Pages: 148-153
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] 日本語研究に適した形態素解析ソフトウェア-「unidic」と「茶まめ」-2007
- Author(s)
  小木曽智信・小椋秀樹・伝康晴
- Journal Title
  
  日本語学会2007年度秋季大会予稿集
  
  Pages: 255-262
- NAID
  110006946075
- Related Report
  2007 Annual Research Report
- Peer Reviewed
[Journal Article] Morphological annotation of a large spontaneous speech corpus in Japanese2007
- Author(s)
  K.Uchimoto, H.Isahara
- Journal Title
  
  Proceedings of the Twentieth International Joint Conference on Artificial Intelligence
  
  Pages: 1731-1737
- Related Report
  2006 Annual Research Report
[Journal Article] 特定話者による大規模アクセントラベリングとそのデータベース化2007
- Author(s)
  黒岩龍, 峯松信明, 伝康晴, 広瀬啓吉
- Journal Title
  
  日本音響学会2007年春季研究発表会講演論文集
  
  Pages: 299-300
- Related Report
  2006 Annual Research Report
[Journal Article] 日本語音声合成を目的としたアクセント処理のための規則と統計的学習2007
- Author(s)
  黒岩龍, 峯松信明, 広瀬啓吉
- Journal Title
  
  日本音響学会春季講演論文集
  
  Pages: 301-302
- Related Report
  2006 Annual Research Report
[Journal Article] 平成18年度進捗状況報告 : 電子化辞書班(多様な目的に適した形態素解析システム用電子化辞書の開発)2007
- Author(s)
  伝康晴, 山田篤, 峯松信明, 内元清貴, 小木曽智信
- Journal Title
  
  日本語コーパス(特定領域研究)(平成18年度公開ワークショップ予稿集)
  
  Pages: 37-46
- Related Report
  2006 Annual Research Report
[Journal Article] 「現代日本語書き言葉均衡コーパス」における短単位の概要2007
- Author(s)
  小椋秀樹, 小木曽智信, 小磯花絵, 冨士池優美, 相馬さつき, 渡部涼子, 服部龍太郎
- Journal Title
  
  日本語コーパス(特定領域研究)(平成18年度公開ワークショップ予稿集)
  
  Pages: 101-108
- Related Report
  2006 Annual Research Report
[Journal Article] 単独ラベラによる大規模アクセントラベリングとそれを用いた統計的アクセント結合処理の実装2007
- Author(s)
  峯松信明, 黒岩龍
- Journal Title
  
  日本語コーパス(特定領域研究)(平成18年度公開ワークショップ予稿集)
  
  Pages: 143-152
- Related Report
  2006 Annual Research Report
[Journal Article] 「現代日本語書き言葉均衡コーパス」の短単位解析について2007
- Author(s)
  小椋秀樹, 小木曽智信, 小磯花絵, 冨士池優美, 相馬さつき
- Journal Title
  
  言語処理学会第13回年次大会発表論文集
  
  Pages: 720-723
- Related Report
  2006 Annual Research Report
[Journal Article] 大規模アクセントラベリングコーパスの構築とそれに基づくハイブリッド型アクセント結合処理2007
- Author(s)
  黒岩龍, 峯松信明, 伝康晴, 広瀬啓吉
- Journal Title
  
  言語処理学会第13回年次大会発表論文集
  
  Pages: 724-727
- Related Report
  2006 Annual Research Report
[Journal Article] 単独ラベラによる大規模アクセントデータベースの構築およびそれを利用した統計的アクセント結合処理の検討2007
- Author(s)
  黒岩龍, 峯松信明, 伝康晴, 広瀬啓吉
- Journal Title
  
  電子情報通信学会技術研究報告 SP2006-174
  
  Pages: 31-36
- NAID
  110006248462
- Related Report
  2006 Annual Research Report
[Journal Article] Dependency-structure annotation to Corpus of Spontaneous Japanese2006
- Author(s)
  K. Uchimoto, R. Hamabe, T. Maruyama, K. Takanashi, T. Kawahara, and H. Isahara
- Journal Title
  
  Proceedings of LREC2006
  
  Pages: 635-638
- Related Report
  2010 Final Research Report
- Peer Reviewed
[Journal Article] Dependency-structure annotation to Corpus of Spontaneous Japanese2006
- Author(s)
  K.Uchimoto, R.Hamabe, T.Maruyama, K.Takanashi, T.Kawahara, H.Isahara
- Journal Title
  
  Proceedings of the Fifth International Conference on Language Resources and Evaluation
  
  Pages: 635-638
- Related Report
  2006 Annual Research Report
[Presentation] テキストの多様性をとらえる分類指標の体系化の試み2011
- Author(s)
  小磯花絵・田中弥生・小木曽智信・近藤明日子
- Organizer
  言語処理学会第17回年次大会
- Place of Presentation
  豊橋技術科学大学（愛知）
- Year and Date
  2011-03-09
- Related Report
  2010 Final Research Report
[Presentation] 形態素解析辞書のベンチマークテスト―IPAdic・NAIST-jdic・UniDic のジャンル別精度比較―2010
- Author(s)
  小木曽智信・小椋秀樹・小磯花絵・宮内佐夜香・渡部涼子・伝康晴
- Organizer
  言語処理学会第16回年次大会
- Place of Presentation
  東京大学（東京）
- Year and Date
  2010-03-10
- Related Report
  2010 Final Research Report
[Presentation] UniDic汎用後処理ツールの設計と実装2010
- Author(s)
  山田篤・伝康晴
- Organizer
  特定領域研究「日本語コーパス」平成21年度公開ワークショップ
- Place of Presentation
  東京工業大学（東京）
- Related Report
  2010 Final Research Report
[Book] 特定領域研究「日本語コーパス」平成22年度研究成果報告書『現代日本語書き言葉均衡コーパス』形態論情報規定集第4版（上・下）2011
- Author(s)
  小椋秀樹・小磯花絵・冨士池優美・宮内佐夜香・小西光・原裕
- Total Pages
  359
- Publisher
  国立国語研究所
- Related Report
  2010 Final Research Report
[Book] 特定領域研究「日本語コーパス」平成 22年度研究成果報告書『現代日本語書き言葉均衡コーパス』形態論情報データベースの設計と実装改訂版2011
- Author(s)
  小木曽智信・中村壮範
- Total Pages
  145
- Publisher
  国立国語研究所
- Related Report
  2010 Final Research Report
[Book] 『現代日本語書き言葉均衡コーパス』形態論情報規定集第4版(上・下)2011
- Author(s)
  小椋秀樹・小磯花絵・冨士池優美・宮内佐夜香・小西光・原裕
- Total Pages
  359
- Publisher
  特定領域研究「日本語コーパス」平成22年度研究成果報告書
- Related Report
  2010 Annual Research Report
[Book] 『現代日本語書き言葉均衡コーパス』形態論情報データベースの設計と実装改訂版2011
- Author(s)
  小木曽智信・中村壮範
- Total Pages
  145
- Publisher
  特定領域研究「日本語コーパス」平成22年度研究成果報告書
- Related Report
  2010 Annual Research Report
[Book] 『現代日本語書き言葉均衡コーパス』形態論情報規程集(特定領域研究「日本語コーパス」特定領域研究「日本語コーパス」平成21年度研究成果報告書, 第3版)2010
- Author(s)
  小椋秀樹・小磯花絵・冨士池優美・宮内佐夜香・原裕
- Total Pages
  295
- Related Report
  2009 Annual Research Report
[Book] 平成20年度研究成果報告書『現代日本語書き言葉均衡コーパス』形態論情報データベースの設計と実装, 特定領域研究「日本語コーパス」特定領域研究「日本語コーパス」2009
- Author(s)
  小木曽智信・中村壮範
- Total Pages
  141
- Related Report
  2008 Self-evaluation Report
[Book] 『現代日本語書き言葉均衡コーパス』形態論情報規程集2009
- Author(s)
  小椋秀樹・小磯花絵・冨士池優美・原裕
- Total Pages
  250
- Publisher
  特定領域研究「日本語コーパス」平成20年度研究成果報告書
- Related Report
  2008 Annual Research Report
[Book] 『現代日本語書き言葉均衡コーパス』形態論情報データベースの設計と実装2009
- Author(s)
  小木曽智信・中村壮範
- Total Pages
  141
- Publisher
  特定領域研究「日本語コーパス」特定領域研究「日本語コーパス」平成20年度研究成果報告書
- Related Report
  2008 Annual Research Report
[Remarks]
- URL
  http://download.unidic.org/
- Related Report
  2010 Final Research Report
[Remarks]
- URL
  http://download.unidic.org/
- Related Report
  2010 Annual Research Report
[Remarks]
- URL
  http://unidic.download.org/
- Related Report
  2009 Annual Research Report
[Remarks] 形態素解析システム用辞書UniDic公開ホームページ
- URL
  http://unidic.download.org/
- Related Report
  2008 Self-evaluation Report
[Remarks] 形態素解析辞書UniDic
- URL
  http://unidic.download.org/
- Related Report
  2008 Annual Research Report

The development of a multi-purpose electric dictionary for morphological analyzers

Principal Investigator

DEN Yasuharu 千葉大学, 文学部, 教授 (70291458)

¥91,900,000 (Direct Cost: ¥91,900,000)

Report

Research Products

[Journal Article] 「中古和文UniDic」における言語単位の設計2011

Author(s)

Journal Title

Related Report

[Journal Article] Web版コーパス検索アプリケーション「中納言」の公開2011

Author(s)

Journal Title

Related Report

[Journal Article] 『現代日本語書き言葉均衡コーパス』における形態論情報付きXMLフォーマット2011

Author(s)

Journal Title

Related Report

[Journal Article] 『現代日本語書き言葉均衡コーパス』に基づくオノマトペの分析-品詞性の検討を中心に-2011

Author(s)

Journal Title

Related Report

[Journal Article] 長単位に基づく『現代日本語書き言葉均衡コーパス』の品詞比率に関する分析2011

Author(s)

Journal Title

Related Report

[Journal Article] テキストの多様性をとらえる分類指標の体系化の試み2011

Author(s)

Journal Title

Related Report

[Journal Article] 複合名詞内アクセント句境界を用いたアクセント結合予測の高精度化に関する実験的検討2011

Author(s)

Journal Title

Related Report

[Journal Article] 長単位に基づく媒体・カテゴリ間の品詞比率に関する分析2011

Author(s)

Journal Title

Related Report

[Journal Article] BCCWJに基づくオノマトペの品詞と意味についての分析2011

Author(s)

Journal Title

Related Report

[Journal Article] Web版コーパス検索アプリケーション「中納言」のデモンストレーション2011

Author(s)

Journal Title

Related Report

[Journal Article] 階層的形態論情報を考慮した『現代日本語書き言葉均衡コーパス』の公開用XMLフォーマット2011

Author(s)

Journal Title

Related Report

[Journal Article] BCCWJに基づく中・長単位解析ツール2011

Author(s)

Journal Title

Related Report

[Journal Article] UniDicを用いた音声認識用言語モデルの作成2011

Author(s)

Journal Title

Related Report

[Journal Article] UniDic2:設計と実装2011

Author(s)

Journal Title

Related Report

[Journal Article] テキストの多様性をとらえる分類指標の構築を目指して2011

Author(s)

Journal Title

Related Report

[Journal Article] Design, compilation, and preliminary analyses of Balanced Corpus of Contemporary Written Japanese2010

Author(s)

Journal Title

Related Report

[Journal Article] 中古和文を対象とした形態素解析辞書の開発2010

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Design, compilation, and preliminary analyses of Balanced Corpus of Contemporary Written Japanese2010

Author(s)

Journal Title

Related Report

[Journal Article] 『現代日本語書き言葉均衡コーパス』長単位解析に基づく予備的分析2010