
A Study on Hands-Free Speech Recognition Using Microphone Array

Research Project

Project/Area Number 11480077
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution NARA INSTITUTE OF SCIENCE AND TECHNOLOGY

Principal Investigator

SARUWATARI Hiroshi (2000-2002)  Nara Institute of Science and Technology, Graduate School of Information Science, Associate Professor (30324974)

NAKAMURA Satoshi (中村 哲) (1999)  Nara Institute of Science and Technology, Graduate School of Information Science, Associate Professor (30263429)

Co-Investigator (Kenkyū-buntansha) LEE Akinobu  Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (80332766)
SHIKANO Kiyohiro  Nara Institute of Science and Technology, Graduate School of Information Science, Professor (00263426)
LU Jinlin (陸 金林)  Nara Institute of Science and Technology, Graduate School of Information Science, Assistant Professor (50230868)
Project Period (FY) 1999 – 2002
Project Status Completed (Fiscal Year 2002)
Budget Amount
¥11,600,000 (Direct Cost: ¥11,600,000)
Fiscal Year 2002: ¥2,600,000 (Direct Cost: ¥2,600,000)
Fiscal Year 2001: ¥2,700,000 (Direct Cost: ¥2,700,000)
Fiscal Year 2000: ¥2,700,000 (Direct Cost: ¥2,700,000)
Fiscal Year 1999: ¥3,600,000 (Direct Cost: ¥3,600,000)
Keywords Microphone array / Speech recognition / Hands-free / Source localization / Super directivity / Noise reduction / Real environments / Beamforming / Model adaptation
Research Abstract

In recent years, the accuracy of speech recognition systems has improved remarkably through the use of hidden Markov models and neural networks. In real environments, however, a significant problem remains: recognition performance degrades because of additive noise and room reverberation.
In this study, we introduce microphone array technology with which sound sources can be accurately detected and identified in a three-dimensional acoustic field. The study produced the following final results.
(1) A real acoustic database was constructed using a 56-channel microphone array system, and the database has been widely distributed.
(2) An accurate DOA (direction of arrival) estimation technique based on the CSP method was realized. In addition, we applied the technique to a mobile robot navigation problem.
(3) As new array signal processing methods, we proposed a multi-beamforming technique and blind source separation by ICA. Their effectiveness was demonstrated through experiments in real environments.
(4) We proposed a new approach that combines a three-dimensional Viterbi search for speech recognition with DOA estimation.
The research results have been published as follows:
Journal papers: 12; international conference papers: 30; invited talks: 4; technical reports: 11; domestic workshop papers: 15.
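
As an illustration of result (2), the following is a minimal two-microphone sketch of delay estimation by CSP (cross-power spectrum phase) analysis, followed by conversion of the delay to a far-field arrival angle. It is not the project's implementation: the function names csp_delay and doa_degrees, the 16 kHz sampling rate, the 10 cm microphone spacing, and the synthetic white-noise signal are all assumptions chosen for the example.

```python
import numpy as np


def csp_delay(x1, x2, max_lag):
    """Delay of x2 relative to x1, in samples, from the CSP peak (hypothetical helper)."""
    if max_lag < 1:
        raise ValueError("max_lag must be at least 1")
    n = len(x1) + len(x2)                 # zero-pad so the correlation is linear, not circular
    X1 = np.fft.rfft(x1, n)
    X2 = np.fft.rfft(x2, n)
    cross = X2 * np.conj(X1)              # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # discard magnitude, keep phase only
    csp = np.fft.irfft(cross, n)          # CSP coefficients over all lags
    # Reorder so that index 0 corresponds to lag -max_lag and the last index to +max_lag.
    window = np.concatenate((csp[-max_lag:], csp[:max_lag + 1]))
    return int(np.argmax(window)) - max_lag


def doa_degrees(delay_samples, fs, mic_distance, c=343.0):
    """Far-field arrival angle (degrees from broadside) for a two-microphone pair."""
    s = np.clip(c * delay_samples / (fs * mic_distance), -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))


# Assumed demo: white noise reaching the second microphone 3 samples late.
rng = np.random.default_rng(0)
source = rng.standard_normal(1024)
delayed = np.concatenate((np.zeros(3), source[:-3]))

delay = csp_delay(source, delayed, max_lag=10)
angle = doa_degrees(delay, fs=16000, mic_distance=0.1)
print(delay, round(angle, 1))
```

The phase-only normalization keeps the correlation peak sharp regardless of the source spectrum. A reverberant room would typically also need short-time framing and averaging across frames and microphone pairs, which this sketch omits.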

Report

(5 results)
  • 2002 Annual Research Report   Final Research Report Summary
  • 2001 Annual Research Report
  • 2000 Annual Research Report
  • 1999 Annual Research Report

Research Products

(67 results)

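  • [Publications] Takanobu Nishiura: "Multiple Beamforming with Source Localization Based on CSP Analysis, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.7. 1610-1619 (2000)

    • Description
      From the Summary of Research Results Report (Japanese)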
    • Related Report
      2002 Final Research Report Summary
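  • [Publications] Takanobu Nishiura: "Localization of Multiple Sound Sources Based on CSP Analysis with a Microphone Array, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.8. 1713-1721 (2000)

    • Description
      From the Summary of Research Results Report (Japanese)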
    • Related Report
      2002 Final Research Report Summary
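  • [Publications] Takanobu Nishiura: "Speech Recognition by Multiple Beamforming Utilizing Reflection Signals, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.11. 2198-2205 (2000)

    • Description
      From the Summary of Research Results Report (Japanese)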
    • Related Report
      2002 Final Research Report Summary
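  • [Publications] Kazuhiro Miki: "Speech Recognition Based on HMM Decomposition and Composition Method with a Microphone Array in Noisy Reverberant Environments, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.11. 2206-2214 (2000)

    • Description
      From the Summary of Research Results Report (Japanese)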
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuya Takiguchi: "HMM-Separation-Based Speech Recognition for a Distant Moving Speaker"IEEE Transactions on Speech and Audio Processing. Vol.9, No.2. 127-140 (2001)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Hidekazu Kamiyanagida: "Direction of Arrival Estimation Using Nonlinear Microphone Array"IEICE Transactions Fundamentals. Vol.E84-A, No.4. 999-1010 (2001)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takeshi Yamada: "Distant-Talking Speech Recognition Based on a 3-D Viterbi search using a microphone array"IEEE Transactions on Speech and Audio Processing. Vol.10, No.2. 48-56 (2002)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Panikos Heracleous: "A Microphone Array-based 3-D N-best Search for Simultaneous Recognition of Multiple Sound Sources"IEICE Trans. Information and Systems. Vol.E85-D, No.6. 994-1002 (2002)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Yuko Okada: "A design of adaptive beamformer based on average speech spectrum for noisy speech recognition"Acoustical Science and Technology. Vol.23, No.6. 323-327 (2002)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Hiroshi Saruwatari: "Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing"IEICE Trans.Fundamentals. Vol.E86-A, No.3. 286-291 (2003)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tsuyoki Nishikawa: "Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA"IEICE Trans.Fundamentals. Vol.E86-A, No.4. 846-858 (2003)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tsuyoki Nishikawa: "Stable learning algorithm for blind separation of temporally correlated acoustic signals combining multistage ICA and Linear Prediction"IEICE Transactions Fundamentals. Vol.E86-A, No.8(in printing). (2003)

    • Description
      From the Summary of Research Results Report (Japanese)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takanobu Nishiura: "Multiple Beamforming with Source Localization Based on CSP Analysis, (in Japanese)"IEICE Trans. on Information and Systems. Vol.J83-D-II, No.7. 1610-1619 (2000)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takanobu Nishiura: "Localization of Multiple Sound Sources Based on CSP Analysis with a Microphone Array, (in Japanese)"IEICE Trans. on Information and Systems. Vol.J83-D-II, No.8. 1713-1721 (2000)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takanobu Nishiura: "Speech Recognition by Multiple Beamforming Utilizing Reflection Signals, (in Japanese)"IEICE Trans. on Information and Systems. Vol.J83-D-II, No.11. 2198-2205 (2000)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Kazuhiro Miki: "Speech Recognition Based on HMM Decomposition and Composition Method with a Microphone Array in Noisy Reverberant Environments, (in Japanese)"IEICE Trans. on Information and Systems. Vol.J83-D-II, No.11. 2206-2214 (2000)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tetsuya Takiguchi: "HMM-Separation-Based Speech Recognition for a Distant Moving Speaker"IEEE Transactions on Speech and Audio Processing. Vol.9, No.2. 127-140 (2001)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Hidekazu Kamiyanagida: "Direction of Arrival Estimation Using Nonlinear Microphone Array"IEICE Transactions Fundamentals. Vol.E84-A, No.4. 999-1010 (2001)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Takeshi Yamada: "Distant-Talking Speech Recognition Based on a 3-D Viterbi search using a microphone array"IEEE Transactions on Speech and Audio Processing. Vol.10, No.2. 48-56 (2002)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Panikos Heracleous: "A Microphone Array-based 3-D N-best Search for Simultaneous Recognition of Multiple Sound Sources"IEICE Trans.Information and Systems. Vol.E85-D, No.6. 994-1002 (2002)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Yuko Okada: "A design of adaptive beamformer based on average speech spectrum for noisy speech recognition"Acoustical Science and Technology. Vol.23, No.6. 323-327 (2002)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Hiroshi Saruwatari: "Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing"IEICE Trans.Fundamentals. Vol.E86-A, No.3. 286-291 (2003)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tsuyoki Nishikawa: "Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA"IEICE Trans.Fundamentals. Vol.E86-A, No.4. 846-858 (2003)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Tsuyoki Nishikawa: "Stable learning algorithm for blind separation of temporally correlated acoustic signals combining multistage ICA and Linear Prediction"IEICE Trans.Fundamentals. Vol.E86-A, No.8 (in printing). (2003)

    • Description
      From the Summary of Research Results Report (English)
    • Related Report
      2002 Final Research Report Summary
  • [Publications] Panikos Heracleous: "A Microphone Array-based 3-D N-best Search for Simultaneous Recognition of Multiple Sound Sources"IEICE Trans. Information and Systems. Vol.E85-D, No.6. 994-1002 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Yuko Okada: "A design of adaptive beamformer based on average speech spectrum for noisy speech recognition"Acoustical Science and Technology. Vol.23, No.6. 323-327 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing"IEICE Trans. Fundamentals. Vol.E86-A, No.3. 286-291 (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA"IEICE Trans. Fundamentals. Vol.E86-A, No.4(in printing). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Stable learning algorithm for blind separation of temporally correlated acoustic signals combining multistage ICA and Linear Prediction"IEICE Trans. Fundamentals. Vol.E86-A, No.8(in printing). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Takanobu Nishiura: "Talker localization in a Real Acoustic Environment based on DOA Estimation and Statistical Sound Source Identification"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2002). 2892-2895 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Blind Source Separation Based on Multi-Stage ICA Combining Frequency-Domain ICA and Time-Domain ICA"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2002). 2938-2941 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Blind Source Separation Based on Fast-Convergence Algorithm Using ICA and Beamforming for Real Convolutive Mixture"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2002). 3097-3100 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Takanobu Nishiura: "Talker Tracking Display on Autonomous Mobile Robot with a Moving Microphone Array"Proceedings of the Eighth International Conference on Auditory Display (ICAD2002). 244-247 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Evaluation of Fast-Convergence Algorithm for Blind Source Separation of Real Convolutive Mixture"Proc. of 6th International Conference on Signal Processing (ICSP'02). 346-349 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Satoshi Nakamura: "Design and Collection of Acoustic Sound Data for Hands-Free Speech Recognition and Sound Scene Understanding"Proc. of IEEE International Conference on Multimedia and Expo (ICME2002). 161-164 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Comparison of Time-Domain ICA, Frequency-Domain ICA and Multistage ICA"Proc. the 2002 European Signal Processing Conference (EUSIPCO2002). Vol.II. 15-18 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Takanobu Nishiura: "Suitable Design of Adaptive Beamformer Based on Average Speech Spectrum for Noisy Speech Recognition"Proc. of 7th International Conference on Spoken Language Processing (ICSLP2002). 1789-1792 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Stable Learning Algorithm for Blind Separation of Temporally Correlated Signals Combining Multistage ICA and Linear Prediction"Proc. of Fourth International Symposium on Independent Component Analysis and Blind Signal Separation. No.P2A-05(in printing). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Rajkishore Prasad: "A Fixed-Point ICA Algorithm for Convoluted Speech Signal Separation"Proc. of Fourth International Symposium on Independent Component Analysis and Blind Signal Separation. No.P3A-07(in printing). (2003)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "(Invited Paper) Blind Source Separation of Acoustic Signals Based on Multistage Independent Component Analysis"Proc. of Summer Meeting of Acoustical Society of Korea. 9-14 (2002)

    • Related Report
      2002 Annual Research Report
  • [Publications] Hidekazu Kamiyanagida: "Direction of Arrival Estimation Using Nonlinear Microphone Array"IEICE Transactions Fundamentals. Vol.E84-A, No.4. 999-1010 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Blind Source Separation Combining Frequency-Domain ICA and Beamforming"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP2001). No. MULT-P2. 2733-2736 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hidekazu Kamiyanagida: "Direction of Arrival Estimation Based on Nonlinear Microphone Array"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP2001). No. SAM-P7. 3033-3036 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Takanobu Nishiura: "Speech Enhancement by Multiple Beamforming with Reflection Signal Equalization"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2001). No. SPEECH-L11. 189-192 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Panikos Heracleous: "A Microphone Array-Based 3-D N-Best Search Algorithm for the Simultaneous Recognition of Multiple Sound Sources in Real Environments"Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2001). No. SPEECH-L11. 193-196 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Blind Source Separation for Speech Based on Fast-Convergence Algorithm with ICA and Beamforming"Proc. of 7th European Conference on Speech Communication and Technology (EUROSPEECH2001). 2603-2606 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Fast-Convergence Algorithm for ICA-Based Blind Source Separation Using Array Signal Processing"Proc. of 11th IEEE Workshop on Statistical Signal Processing (SSP2001). 464-467 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "Blind Source Separation Based on Fast-Convergence Algorithm Using ICA and Array Signal Processing"Proc. of 3rd International Conference on Independent Component Analysis and Blind Signal Separation. 412-417 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Blind Source Separation Based on Multi-Stage ICA Using Frequency-Domain ICA and Time-Domain ICA"Proc. of 3rd International Conference on Fundamentals of Electronics, Communications and Computer Science. (accepted; presentation scheduled for March). (2002)

    • Related Report
      2001 Annual Research Report
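  • [Publications] Hiroshi Saruwatari: "(Invited Paper) Fundamentals of Blind Signal Separation Using Array Signal Processing, (in Japanese)"IEICE Technical Report, Electroacoustics. Vol.EA2001-7. 49-56 (2001)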

    • Related Report
      2001 Annual Research Report
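  • [Publications] Hiroshi Saruwatari: "(Invited Paper) Speech Enhancement Using a Nonlinear Microphone Array Based on Noise-Adaptive Complementary Beamforming, (in Japanese)"IEICE Technical Report, Speech. Vol.SP2001-68. 43-44 (2001)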

    • Related Report
      2001 Annual Research Report
  • [Publications] Hiroshi Saruwatari: "(Invited Paper) Blind Source Separation for Speech and Acoustic Signals, (in Japanese)"IEICE Technical Report, DSP. Vol.DSP2001-194. 59-66 (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] Toshiya Kawamura: "Blind Source Separation Based on Fast-Convergence Algorithm with ICA and Beamforming"IEICE Technical Report. Vol.EA2001-2. 9-16 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Masaya Nakamura (中村 雅也): "Talker Localization on an Autonomous Mobile Robot Using a Microphone Array, (in Japanese)"IEICE Technical Report, Electroacoustics. Vol.EA2001-4. 25-32 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] Tsuyoki Nishikawa: "Comparison of Blind Source Separation Methods Based on Time-Domain ICA Using Nonstationarity and Multistage ICA"IEICE Technical Report. Vol.EA2001-112. 45-52 (2001)

    • Related Report
      2001 Annual Research Report
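  • [Publications] Hidekazu Kamiyanagida: "Direction Estimation of a Moving Sound Source Using a Nonlinear Microphone Array Based on an Online Algorithm, (in Japanese)"IEICE Technical Report, Electroacoustics. (presentation scheduled for March). (2002)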

    • Related Report
      2001 Annual Research Report
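  • [Publications] Takanobu Nishiura: "Multiple Beamforming with Source Localization Based on CSP Analysis, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.7. 1610-1619 (2000)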

    • Related Report
      2000 Annual Research Report
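  • [Publications] Takanobu Nishiura: "Localization of Multiple Sound Sources Based on CSP Analysis with a Microphone Array, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.8. 1713-1721 (2000)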

    • Related Report
      2000 Annual Research Report
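  • [Publications] Takanobu Nishiura: "Speech Recognition by Multiple Beamforming Utilizing Reflection Signals, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.11. 2198-2205 (2000)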

    • Related Report
      2000 Annual Research Report
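  • [Publications] Kazuhiro Miki: "Speech Recognition Based on HMM Decomposition and Composition Method with a Microphone Array in Noisy Reverberant Environments, (in Japanese)"IEICE Transactions on Information and Systems. Vol.J83-D-II, No.11. 2206-2214 (2000)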

    • Related Report
      2000 Annual Research Report
  • [Publications] Hidekazu Kamiyanagida: "Direction of Arrival Estimation Using Nonlinear Microphone Array"IEICE Transactions Fundamentals. Vol.E84-A, No.4 (to appear). (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] Kazuhiro Miki: "A Study on Environmental Sound Identification Using HMM, (in Japanese)"IEICE Technical Report. SP99-106. 79-84 (1999)

    • Related Report
      1999 Annual Research Report
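  • [Publications] Yuko Okada (岡田有加): "Speech Recognition with an Adaptive Microphone Array Considering the Long-Term Spectral Characteristics of Speech, (in Japanese)"IEICE Technical Report. SP99-71. (1999)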

    • Related Report
      1999 Annual Research Report
  • [Publications] Takanobu Nishiura: "A Study on Multiple Beamforming Utilizing Reflected Sound in Real Environments, (in Japanese)"IEICE Technical Report. SP99-1333. (2000)

    • Related Report
      1999 Annual Research Report
  • [Publications] Satoshi Nakamura: "Hands-Free Speech Recognition in Real Environments, (in Japanese)"IEICE Technical Report. SP99-1112. 115-120 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Panikos Heracleous: "Simultaneous Recognition of Multiple Sound Sources based on 3-D N-best Search Using a Microphone Array"Eurospeech 99. September issue. (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Satoshi Nakamura: "Recognition of Distant Talking Speech based on 3-D Trellis Search using a Microphone Array and Adaptive Beamforming"Robust 99. May issue. 219-222 (1999)

    • Related Report
      1999 Annual Research Report

Published: 1999-04-01   Modified: 2016-04-21  
