2011 Fiscal Year Annual Research Report

法令文作成・英訳支援環境の構築:ターミノロジーと翻訳メモリの利用

Research Project

Project/Area Number	23300094
Research Institution	Nagoya University
Principal Investigator	外山勝彦名古屋大学, 大学院・情報科学研究科, 准教授 (70217561)
Keywords	法令文作成支援 / 法令英訳支援 / 法令文コーパス / 語彙知識獲得
Research Abstract	本研究は,政策法務業務の増大に対応するために必要な日本語法令文の作成と,国際社会のグローバル化に対応するために必要な日本語法令文の英訳に対して,計算機を用いて作業支援することを目的としている.本年度は,主に以下の成果を得た. 1.浅い構文情報付き日英法令文対訳コーパスの構築日本語法令10本(13,569文)に対して,SVMに基づいたチャンカYamChaを用いて,名詞句・動詞句を抽出する浅い構文解析(チャンキング)を施した.その結果,チャンキング済の日本語法令コーパスは法令17本(28,393文)となった. 2.法令翻訳メモリの構築文対応付き日英対訳法令文276,597文(法令259本,ことなり原文129,120文,ことなり対訳文147,119文)を用いて,翻訳メモリを構築した.また,この翻訳メモリに対する検索システムも構築した.この翻訳メモリは標準的なTMX形式に変換して,出力できるものである.また,法令文の間の距離関数を実験的に比較し,翻訳対象の法令文に類似した文を翻訳メモリから選択する手法を検討した. 3.構文情報付き日英法令文対訳コーパスからの対訳パターン自動抽出ブートストラップに基づく対訳文からの対訳語彙意味カテゴリ自動抽出手法b-Monakaを開発した.この手法は,語彙意味カテゴリ抽出と対訳表現獲得の2段階の処理を統合するものである.文対応付き日英対訳法令193本(90,273文)を用いて実験したところ,特に繰り返し30回において精度82.0%で対訳表現を抽出できた.また,従来手法では困難であった複数の語からなる長い専門用語の抽出が可能であった.さらに,対訳文を語彙の文脈情報として使う方法の有効性を示すことができた.
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 本年度研究実施計画のうち,英訳法令文に対するチャンキングは未実施である.その実施の前に,対訳語彙意味カテゴリの自動抽出手法を開発して利用する方が効率的であり,それを優先したためである.
Strategy for Future Research Activity	引き続き,浅い構文情報付き日英法令文対訳コーパスの構築を推進する.また,チャンキングによって抽出した名詞句・動詞句に対して,今年度開発した対訳語彙意味カテゴリ自動抽出技術などを用いて,それらの対訳表現を自動抽出する.さらに,法令ターミノロジーの構築について,それに蓄積する情報の表現形式の策定,および,それに収録する語彙やそれに関する情報の抽出・獲得手法を検討する.

Research Products
(8 results)

All 2012 2011

All Journal Article (4 results) (of which Peer Reviewed: 4 results) Presentation (4 results)

[Journal Article] グラフカーネルを用いた非分かち書き文からの漸次的語彙知識獲得2011
- Author(s)
  萩原正人, 小川泰弘, 外山勝彦
- Journal Title
  
  人工知能学会論文誌
  
  Volume: Vol.26,No.3 Pages: 440-450
- Peer Reviewed
[Journal Article] Design and Compilation of Syntactically Tagged Corpus of Japanese Statutory Sentences2011
- Author(s)
  Y.Ogawa, M.Yamada, R.Kato, K.Toyama
- Journal Title
  
  Lecture Notes in Computer Science
  
  Volume: 6797 Pages: 141-152
- Peer Reviewed
[Journal Article] Recall-Oriented Evaluation Metrics for Consistent Translation of Japanese Legal Sentences2011
- Author(s)
  Y.Ogawa, M.Mori, K.Toyama
- Journal Title
  
  Proc.5th Int.Workshop on Juris-informatics
  
  Pages: 62-73
- Peer Reviewed
[Journal Article] Bootstrapping-based Extraction of Bilingual Dictionary Terms from Parallel Corpus2011
- Author(s)
  J.Renzhe, Y.Ogawa, A.Rachmatullah, K.Toyama
- Journal Title
  
  Proc.Joint Int.Symp.on Natural Language Processing and Agricultural Ontology Service 2011
  
  Pages: 95-99
- Peer Reviewed
[Presentation] 名古屋大学法情報研究センターの最近の活動2012
- Author(s)
  外山勝彦
- Organizer
  法情報学の今後の展開を考える会
- Place of Presentation
  ホテルアローレ(石川県)
- Year and Date
  2012-01-27
[Presentation] 法令対訳表現に対する再現率指向の統一性評価指標2011
- Author(s)
  森雅紀, 小川泰弘, 外山勝彦
- Organizer
  平成23年度電気関係学会東海支部連合大会
- Place of Presentation
  三重大学(三重県)
- Year and Date
  2011-09-27
[Presentation] Design and Development of Japanese Law Translation Database System2011
- Author(s)
  K.Toyama, D.Saito, Y.Sekine, Y.Ogawa, T.Kakuta, T.Kimiura, Y.Matsuura
- Organizer
  Law via the Internet 2011
- Place of Presentation
  香港大学(香港)
- Year and Date
  2011-06-09
[Presentation] Brief Introduction to Bilingual KWIC for Taiwan Laws2011
- Author(s)
  K.Toyama
- Organizer
  TaiwanLII Opening Ceremony and Round Table Forum
- Place of Presentation
  中正大学(台湾)(招待講演)
- Year and Date
  2011-06-07

2011 Fiscal Year Annual Research Report

法令文作成・英訳支援環境の構築:ターミノロジーと翻訳メモリの利用

Principal Investigator

外山 勝彦 名古屋大学, 大学院・情報科学研究科, 准教授 (70217561)

Current Status of Research Progress

Reason

Research Products

[Journal Article] グラフカーネルを用いた非分かち書き文からの漸次的語彙知識獲得2011

Author(s)

Journal Title

[Journal Article] Design and Compilation of Syntactically Tagged Corpus of Japanese Statutory Sentences2011

Author(s)

Journal Title

[Journal Article] Recall-Oriented Evaluation Metrics for Consistent Translation of Japanese Legal Sentences2011

Author(s)

Journal Title

[Journal Article] Bootstrapping-based Extraction of Bilingual Dictionary Terms from Parallel Corpus2011

Author(s)

Journal Title

[Presentation] 名古屋大学法情報研究センターの最近の活動2012

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] 法令対訳表現に対する再現率指向の統一性評価指標2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Design and Development of Japanese Law Translation Database System2011

Author(s)

Organizer

Place of Presentation

Year and Date

[Presentation] Brief Introduction to Bilingual KWIC for Taiwan Laws2011

Author(s)

Organizer

Place of Presentation

Year and Date

外山勝彦名古屋大学, 大学院・情報科学研究科, 准教授 (70217561)