Budget Amount
¥1,100,000 (Direct Cost: ¥1,100,000)
Fiscal Year 2002: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 2001: ¥500,000 (Direct Cost: ¥500,000)
1. Development of Sentence Compression Algorithm
The sentence compression problem was formulated as the problem of selecting an optimal subsequence of phrases from a given sentence. An efficient algorithm based on our dependency analysis technique was then developed to solve this problem.
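The idea of compressing a sentence by selecting a phrase subsequence that respects the dependency structure can be illustrated with a minimal sketch. This is not the paper's actual algorithm; it is a simplified greedy variant, assuming hypothetical inputs: each phrase has a precomputed significance score and a dependency head index (-1 for the root), and phrases are dropped lowest-score-first, leaves of the dependency tree only, until a length budget is met.

```python
def compress(phrases, heads, scores, max_len):
    """Greedy dependency-tree pruning (illustrative, not the original algorithm).

    phrases: list of phrase strings
    heads:   heads[i] = index of the phrase that phrase i modifies (-1 = root)
    scores:  significance value of each phrase
    max_len: character budget for the compressed sentence
    """
    kept = set(range(len(phrases)))

    def length():
        return sum(len(phrases[i]) for i in kept)

    while length() > max_len:
        # Only leaves (phrases no kept phrase modifies) may be removed,
        # so every kept phrase always keeps its dependency head.
        removable = [i for i in kept
                     if not any(heads[j] == i for j in kept if j != i)]
        kept.discard(min(removable, key=lambda i: scores[i]))

    return [phrases[i] for i in sorted(kept)]
```

Pruning only leaves guarantees the compressed output is still a well-formed dependency subtree, which mirrors the constraint that a selected phrase's modified phrase must also be selected.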
2. Estimation of inter-phrase dependency strength and phrase significance
Using about 34,000 sentences from the Kyoto University Corpus, the inter-phrase dependency strength was estimated for each pair of modifying and modified phrase classes, based on the statistical frequency of the inter-phrase dependency distance. In addition, a sentence compression experiment was conducted in which human subjects compressed 200 sentences. The results were analyzed statistically, and the retention rate for each phrase class was calculated. Based on this result, a phrase significance value was estimated for each phrase class.
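Both estimates reduce to relative-frequency counting over annotated data. The following is a minimal sketch, assuming hypothetical input formats: the treebank is given as (modifier class, head class, distance) tuples, and the human compression data as per-class retained/total counts. The function names are illustrative, not from the original work.

```python
from collections import Counter

def estimate_dependency_strength(dependencies):
    """Relative frequency P(distance | modifier class, head class),
    used as the inter-phrase dependency strength.

    dependencies: iterable of (modifier_class, head_class, distance) tuples
    observed in the corpus.
    """
    deps = list(dependencies)
    pair_total = Counter((m, h) for m, h, _ in deps)
    triple = Counter(deps)
    return {(m, h, d): c / pair_total[(m, h)]
            for (m, h, d), c in triple.items()}

def estimate_significance(retained, total):
    """Retention rate per phrase class from the human compression
    experiment, used as the phrase significance value."""
    return {cls: retained[cls] / total[cls] for cls in total}
```

In this formulation a phrase class whose phrases are almost always kept by human compressors receives a significance value near 1, and a dependency whose distance is typical for its class pair receives a high strength.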
3. Subjective Evaluation of Compressed Sentences
A subjective evaluation experiment was performed for sentences automatically compressed using the above algorithm together with the estimated inter-phrase dependency strengths and phrase significance values. In this experiment, 200 test sentences, different from those used in 2, were used. Five subjects evaluated the quality of the compressed sentences from the following points of view: (a) total impression, (b) retention of information, and (c) grammatical correctness. For comparison, the same kind of evaluation experiment was performed for sentences compressed by humans, and also by a random method. It was found that the quality of sentences compressed by the proposed method lies between that of human compression and that of random compression.
4. Segmentation of Long Sentences
Because long sentences are difficult to analyze syntactically, it is desirable to segment long sentences into shorter ones. In this work, a support vector machine (SVM) technique was applied to the problem. Vectors consisting of surface attribute values of relevant phrases were input to the SVM, and segmentation points were automatically estimated. As a result, a precision of 77% and a recall of 84% were obtained, and the correct sentence segmentation rate was 72%.