
A study on signal extraction in noisy and reverberant environment

Research Project

Project/Area Number 10680374
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution School of Information Science, Japan Advanced Institute of Science and Technology

Principal Investigator

AKAGI Masato  Japan Advanced Institute of Science and Technology, School of Information Science, Professor (20242571)

Co-Investigator (Kenkyū-buntansha) IWAKI Mamoru  Niigata University, Graduate School of Science and Technology, Associate Professor (20262595)
Project Period (FY) 1998 – 2000
Project Status Completed (Fiscal Year 2000)
Budget Amount
¥3,100,000 (Direct Cost: ¥3,100,000)
Fiscal Year 2000: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 1999: ¥1,200,000 (Direct Cost: ¥1,200,000)
Fiscal Year 1998: ¥1,400,000 (Direct Cost: ¥1,400,000)
Keywords noise / reverberation / auditory mechanism / nerve firing / interaural time difference (ITD) / signal direction estimation / fundamental frequency / cancellation / noise suppression / peripheral auditory system
Research Abstract

This research discusses models of speech enhancement and segregation based on knowledge of human psychoacoustics and auditory physiology. The cancellation model is used for enhancing speech. Special attention is paid to reducing noise with a spatial filtering technique and to increasing the robustness of fundamental frequency estimation with a frequency filtering technique. Both techniques adopt concepts from the cancellation model. In addition, constraints related to the heuristic regularities proposed by Bregman are used to overcome the problem of segregating two acoustic sources. Simulation results show that both spatial and frequency filtering are useful for enhancing speech. These filtering methods can therefore be used effectively at the front end of automatic speech recognition systems and for speech feature extraction. The sound segregation model can precisely extract a desired signal from a noisy signal, even at the waveform level.
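As a rough illustration of how cancellation can support fundamental frequency estimation, the sketch below subtracts a delayed copy of the signal from itself and looks for the delay at which the residual nearly vanishes. It is a generic delay-and-subtract estimator written for this page, not the project's published method; the function name, the 60 to 400 Hz search range, and the tolerance value are all assumptions.

```python
import math


def estimate_f0_by_cancellation(signal, sample_rate, f0_min=60.0, f0_max=400.0,
                                tolerance=0.01):
    """Return an F0 estimate in Hz from the lag that best cancels the signal.

    For each candidate lag, the signal is subtracted from a delayed copy of
    itself; a periodic signal cancels almost completely at its period.
    All default parameter values are illustrative assumptions.
    """
    lag_min = int(sample_rate / f0_max)
    lag_max = int(sample_rate / f0_min)
    n = len(signal) - lag_max  # samples usable at every candidate lag
    if lag_min < 1 or n <= 0:
        raise ValueError("signal too short or F0 range invalid")

    # Residual energy after delay-and-subtract, for each candidate lag.
    residuals = [
        sum((signal[t] - signal[t + lag]) ** 2 for t in range(n))
        for lag in range(lag_min, lag_max + 1)
    ]
    energy = sum(signal[t] ** 2 for t in range(n))

    # Multiples of the period cancel equally well, so take the first lag whose
    # residual is near the global minimum, then slide to the local minimum.
    threshold = min(residuals) + tolerance * energy
    i = next(k for k, r in enumerate(residuals) if r <= threshold)
    while i + 1 < len(residuals) and residuals[i + 1] < residuals[i]:
        i += 1
    return sample_rate / (lag_min + i)


if __name__ == "__main__":
    # A 200 Hz tone sampled at 8 kHz, used only as a synthetic check.
    tone = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(400)]
    print(estimate_f0_by_cancellation(tone, 8000))
```

Picking the first near-minimal lag rather than the global minimum is a design choice that avoids reporting a sub-multiple of the true frequency; real noisy speech would likely need additional smoothing across frames.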
Additionally, this research discusses models of sound source direction estimation based on physiological data on mammalian audition. The model can explain the relationship between the transmission of temporal and phase information by nerve firing and the accuracy of interaural time differences.
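To make the link between interaural time difference (ITD) and source direction concrete, here is a minimal sketch that estimates the ITD as the lag maximizing the cross-correlation of two channels and converts it to an azimuth. It is a signal-level stand-in, not the nerve-firing model studied in the project; the function names, the 1 ms search window, the 0.18 m sensor spacing, the far-field geometry, and the sign convention are assumptions.

```python
import math


def estimate_itd(left, right, sample_rate, max_itd_s=0.001):
    """Return the ITD in seconds; positive means the right channel lags."""
    n = min(len(left), len(right))
    max_lag = round(max_itd_s * sample_rate)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Correlate left[t] with right[t + lag] over the overlapping samples.
        start, stop = max(0, -lag), min(n, n - lag)
        score = sum(left[t] * right[t + lag] for t in range(start, stop))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag / sample_rate


def itd_to_azimuth_deg(itd_s, sensor_distance_m=0.18, speed_of_sound=343.0):
    """Map an ITD to an azimuth in degrees with a far-field, two-sensor model.

    The spacing and speed of sound are assumed values. With the sign
    convention above, a positive angle points toward the left sensor.
    """
    ratio = max(-1.0, min(1.0, itd_s * speed_of_sound / sensor_distance_m))
    return math.degrees(math.asin(ratio))
```

For example, calling `estimate_itd(left, right, 16000)` on two equal-length sample lists and passing the result to `itd_to_azimuth_deg` yields a direction estimate under these assumptions. The lag search plays a role loosely analogous to a bank of coincidence detectors, each tuned to one delay.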

Report

(4 results)
  • 2000 Annual Research Report   Final Research Report Summary
  • 1999 Annual Research Report
  • 1998 Annual Research Report
  • Research Products

    (32 results)

All Publications (32 results)

  • [Publications] Mizumachi and Akagi: "The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noise"Journal of the Acoustical Society of Japan (E). 21-5. 251-258 (2000)

    • Description
      From the Final Research Report Summary (Japanese)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ito and Akagi: "A computational model of auditory sound localization based on ITD"Recent Developments in Auditory Mechanics. 483-489 (2000)

    • Description
      From the Final Research Report Summary (Japanese)
    • Related Report
      2000 Final Research Report Summary
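  • [Publications] Ishimoto, Unoki and Akagi: "Fundamental frequency estimation in noisy environments considering periodicity and harmonicity" (in Japanese)Acoustical Society of Japan, Hearing Research Meeting Materials. H-2000-81. (2000)

    • Description
      From the Final Research Report Summary (Japanese)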
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Mizumachi,Akagi and Nakamura: "Design of robust subtractive beamformer for noisy speech recognition"Proc.ICSLP2000, Beijing. IV. 57-60 (2000)

    • Description
      From the Final Research Report Summary (Japanese)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Akagi, Mizumachi, Ishimoto and Unoki: "Speech enhancement and segregation based on human auditory mechanisms"Proc.IS2000,Aizu. 246-253 (2000)

    • Description
      From the Final Research Report Summary (Japanese)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ito and Akagi: "A computational model of binaural coincidence detection using impulses based on synchronization index"Proc.ISA2000(BIS2000). (2000)

    • Description
      From the Final Research Report Summary (Japanese)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Mizumachi, M.and Akagi, M.: "The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises"J.Acoust.Soc.Jpn. (E). 21, 5. 251-258 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ito, K.and Akagi, M.: "A computational model of auditory sound localization based on ITD"In Recent Developments in Auditory Mechanics. World Scientific Publishing. 483-489 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ishimoto, Y.and Akagi, M.: "A fundamental frequency estimation method for noisy speech"Proc.WESTPRAC7. 161-164 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ito, K.and Akagi, M.: "A study on temporal information based on the synchronization index using a computational model"Proc.WESTPRAC7. 263-266 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Mizumachi, M.and Akagi, M.: "Noise reduction using a small-scale microphone array under non-stationary signal conditions"Proc.WESTPRAC7. 421-424 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Mizumachi, M., Akagi, M.and Nakamura, S.: "Design of robust subtractive beamformer for noisy speech recognition"Proc.ICSLP2000, Beijing. IV-57-60 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Akagi, M., Mizumachi, M., Ishimoto, Y., and Unoki, M.: "Speech enhancement and segregation based on human auditory mechanisms"Proc.IS2000, Aizu. 246-253 (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Ito, K.and Akagi, M.: "A computational model of binaural coincidence detection using impulses based on synchronization index."Proc. ISA2000 (BIS2000), Wollongong, Australia. (2000)

    • Description
      From the Final Research Report Summary (English)
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Mizumachi,M.and Akagi,M.: "The auditory-oriented spectral distortion for evaluating speech signals distorted by additive noises"J.Acoust.Soc.Jpn.(E). 21,5. 251-258 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Ito,K.and Akagi,M.: "A computational model of auditory sound localization based on ITD"Recent Developments in Auditory Mechanics, World Scientific Publishing. 483-489 (2000)

    • Related Report
      2000 Annual Research Report
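  • [Publications] Ishimoto, Unoki and Akagi: "Fundamental frequency estimation in noisy environments considering periodicity and harmonicity" (in Japanese)Acoustical Society of Japan, Hearing Research Meeting Materials. H-2000-81. (2000)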

    • Related Report
      2000 Annual Research Report
  • [Publications] Mizumachi,M.,Akagi,M.and Nakamura,S.: "Design of robust subtractive beamformer for noisy speech recognition"Proc.ICSLP2000,Beijing. IV. 57-60 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Akagi,M.,Mizumachi,M.,Ishimoto,Y.,and Unoki,M.: "Speech enhancement and segregation based on human auditory mechanisms"Proc.IS2000,Aizu. 246-253 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Ito,K.and Akagi,M.: "A computational model of binaural coincidence detection using impulses based on synchronization index"Proc. ISA2000(BIS2000),Wollongong,Australia. (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Unoki,M.and Akagi,M.: "A method of signal extraction from noisy signal based on auditory scene analysis"Speech Communication. 27,3-4. 261-279 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Mizumachi and Akagi: "Noise reduction method by spectral subtraction using a microphone pair" (in Japanese)Transactions of the IEICE. J82-A,4. 503-512 (1999)

    • Related Report
      1999 Annual Research Report
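  • [Publications] Unoki and Akagi: "A method of extracting a harmonic complex tone from noise based on auditory scene analysis" (in Japanese)Transactions of the IEICE. J82-A,10. 1497-1507 (1999)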

    • Related Report
      1999 Annual Research Report
  • [Publications] Unoki,M. and Akagi,M.: "Segregation of vowel in background noise using the method of segregating two acoustic sources based on auditory scene analysis"Proc.CASA99, IJCAI-99, Stockholm. 51-60 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Mizumachi,M. and Akagi,M.: "An objective distortion estimator for hearing aids and its application to noise reduction"Proc.EUROSPEECH99. 2619-2622 (1999)

    • Related Report
      1999 Annual Research Report
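  • [Publications] Ishimoto and Akagi: "Fundamental frequency estimation and noise suppression for speech with additive noise" (in Japanese)IEICE Speech Technical Meeting. (to be presented in March 2000). 2000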

    • Related Report
      1999 Annual Research Report
  • [Publications] Mizumachi and Akagi: "Noise reduction by paired-microphones using spectral subtraction" Proc.ICASSP98. II. 1001-1004 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Akagi, Masato: "Waveform analysis considering auditory characteristics" (in Japanese) Journal of the Acoustical Society of Japan. 54,8. 575-581 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Unoki and Akagi: "Signal extraction from noisy signal based on auditory scene analysis" Proc.ICSLP98. 5. 2115-2118 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Akagi, Iwaki and Sakaguchi: "Spectral sequence compensation based on continuity of spectral sequence" Proc.ICSLP98. 4. 1407-1410 (1998)

    • Related Report
      1998 Annual Research Report
  • [Publications] Unoki and Akagi: "A method of signal extraction from noisy signal based on auditory scene analysis" Speech Communication. (accepted, in press).

    • Related Report
      1998 Annual Research Report
  • [Publications] Mizumachi and Akagi: "Noise reduction method by spectral subtraction using a microphone pair" (in Japanese) Transactions of the IEICE. (accepted).

    • Related Report
      1998 Annual Research Report

Published: 1998-04-01   Modified: 2016-04-21  
