• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

1999 Fiscal Year Final Research Report Summary

Studies on Speech Recognition, Closed Caption and Summarization of Broadcast News

Research Project

Project/Area Number 09480064
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionToyohashi University of Technology

Principal Investigator

NAKAGAWA Seiichi  Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (20115893)

Co-Investigator(Kenkyū-buntansha) KAI Atsuhiko  Shizuoka University, Faculty of Engineering, Assitant Professor, 工学部, 講師 (60283496)
MINEMATSU Nobuaki  Toyohashi University of Technology, Faculty of Engineering, Research Assistant, 工学部, 助手 (90273333)
MASUYAMA Sigeru  Toyohashi University of Technology, Faculty of Engineering, Professor, 工学部, 教授 (60173762)
ANDO Akio  NHK Laboratory, Sub-Head of Human-Interface Department, 放送技術研究所, 副部長
Project Period (FY) 1997 – 1999
Keywordsspeech recognition / acoustic model / closed caption / dictation / language model / summarization / broadcast news
Research Abstract

It is well-known that HMMs only of the basic structure can not capture the correlation among successive frames adequately. In our previous work, to solve this problem, segmental unit HMMs were introduced and their effectiveness was shown. And the integration of Δ cepstrum and ΔΔ cepstrum into the segmental unit HMMs was also found to improve the recognition performance in the work. Firstly, we compared frame-based models and segment-based models. Results showed the effectiveness of the use of segmental features as input vectors. Secondly, we compared syllable-based HMMs and triphone-based HMMs. Recognition experiments showed that syllable-based HMMs are suitable for Japanese.
Next, we developed a method that constructs language models using a task adaptation strategy and idiomatic expressions of news articles. First, we investigated the effect of a task adaptation method of N-gram language model using a limited amount of target articles. Second, we investigated the effect of the language model adaptation method using the latest articles. Third, we investigated the effect of the use of idiomatic expressions as morpheme units, since some specific expressions and idiomatic expressions are frequently observed in news articles. We showed that our proposed three methods were effective for constructing N-gram language models.
Finally, we proposed and evaluated a method for summarizing each sentence in TV news texts written in Japanese. It is not appropriate to select important sentences for abstracting news text, because a news text consists of only a few and long sentences. Then, we tried to reduce redundant parts, which consisted of modifier etc., of each sentence. We used a simple parsing method specialized for news texts so that the syntactical structure was not destroyed. We evaluated this summarizing method by obtaining information by means of questionnaires to 32 examinees.

  • Research Products

    (16 results)

All Other

All Publications (16 results)

  • [Publications] K. Hanai, K. Yamamoto, N. Minematsu and S. Nakagawa: "Continuous speech recognition using segmental unit input HMMS with mixture of probability density functions and context dependency"Proc. 5th Int. Conf. Spoken Language Processing. 2935-2938 (1998)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一、赤松裕隆、西崎博光: "音声認識用言語モデルのためのタスク適応化と定型表現の利用"自然言語処理. 6・2. 97-115 (1999)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 甲斐充彦、廣瀬良文、中川聖一: "単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語の処理"情報処理学会論文誌. 40・4. 1385-1394 (1999)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 三上真、増山繁、中川聖一: "ニュース番組における字幕生成のための文内短縮による要約"自然言語処理. 6・6. 65-81 (1999)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] H. Nishizaki and S. Nakagawa: "A Retrieval System of Broadcast News Speech Documents Through Key Word and Voice"Proc. Int. Workshop on Text, Speech and Dialogue, in Lecture Notes in Artificial Intelligence. 286-289 (1999)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "音声認識研究の動向"電子情報通信学会論文誌. J83-DII・2. 433-457 (2000)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "岩波書店"5章音声認識「音声」(田窪,前川,本多,白井,中川). 177〜229 (1998)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 中川聖一: "丸善"パターン情報処理. 310 (1999)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] K.Hanai, K.Yamamoto, N.Minematsu and S.Nakagawa: "Continuous Speech Recognition Using Segmental Unit Input HMMs with a Mixture of Probability Density Functions and Context Dependency"Proc. 5th Int. Conf. Spoken Language Processing. 2935-2938 (1998)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] S.Nakagawa, H.Akamatsu and H.Nishizaki: "A Task Adaptation Method and Use of Idiomatic Expression of Stochastic Language Model for Speech recognition"Journal of Natural Language Processing. Vol.6, No.6 (in Japanese). 89-107 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] A.Kai, Y.Hirose and S.Nakagawa: "Dealing with Out-of-vocabulary Words and Filled Pauses in Word N-gram based Speech Recognition System"Trans. IPSJ. Vol.40, No.4 (in Japanese). 1385-1394 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Minami, S.Masuyama and S.Nakagawa: "A Summarization Method by Reducing Redundancy of Each Sentence for Making Captions of Newscasting"Journal of Natural Language Processing. Vol.6, No.6 (in Japanese). 65-81 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] H.Nishizaki and S.Nakagawa: "A Retrieval System of Broadcast News Speech Documents Through Key Word and Voice"Proc. Int. Workshop on Text, Speech and Dialogue, in Lecture Notes in Artificial Intelligence, V.Matousek, P.Mautner, J.Ocelikova and P.Sojka, Springer. 286-289 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] S.Nakagawa: "A Survey on Automatic Speech Recognition"IEICE Trans. Vol.83-D II No.2 (in Japanese). 433-457 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Speech Recognition, Speech"Iwanami-Shoten (in Japanese). (1998)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] Seiichi Nakagawa: "Pattern Information Processing"Maruzen (in Japanese). (1999)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2001-10-23  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi