• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2010 Fiscal Year Final Research Report

New Approaches for Speech Universal Communications

Research Project

  • PDF
Project/Area Number 19200009
Research Category

Grant-in-Aid for Scientific Research (A)

Allocation TypeSingle-year Grants
Section一般
Research Field Media informatics/Database
Research InstitutionNara Institute of Science and Technology

Principal Investigator

SHIKANO Kiyoshiro  Nara Institute of Science and Technology, 情報科学研究科, 教授 (00263426)

Co-Investigator(Kenkyū-buntansha) SARUWATARI Hiroshi  奈良先端科学技術大学院大学, 情報科学研究科, 准教授 (30324974)
TODA Tomoki  奈良先端科学技術大学院大学, 情報科学研究科, 准教授 (90403328)
KAWANAMI Hiromichi  奈良先端科学技術大学院大学, 情報科学研究科, 助教 (80335489)
Co-Investigator(Renkei-kenkyūsha) NAKAJIMA Yoshitaka  奈良先端科学技術大学院大学, 情報科学研究科, 客員研究員 (40448189)
Project Period (FY) 2007 – 2010
Keywordsヒューマンインターフェイス / 音声コミュニケーション / 非可聴つぶやき / ブラインド音源分離 / 音声情報案内システム / ハンズフリー音声認識
Research Abstract

Two invented new speech media, Non-Audible Murmur (NAM) and High fidelity Blind Source Separation (SIMO-ICA), have been theoretically and practically developed. These new media technologies have been transferred to industries. As for speech recognition, speech guidance system, Takemaru-kun, has been installed and successfully operated in public facilities.

  • Research Products

    (38 results)

All 2010 2009 2008 2007 Other

All Journal Article (12 results) Presentation (25 results) Remarks (1 results)

  • [Journal Article] Esophageal speech enhancement based on statistical voice conversion with gaussian mixture models2010

    • Author(s)
      Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEICE Trans. Information and Systems vol.E93-D, no.9

      Pages: 2472-2482

  • [Journal Article] Evaluation of extremely small sound source signals used in speaking-aid system with statistical voice conversion2010

    • Author(s)
      Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEICE Trans. Information and Systems vol.E93-D, no.7

      Pages: 1909-1917

  • [Journal Article] Adaptive training for voice conversion based on eigenvoices2010

    • Author(s)
      Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEICE Trans. Information and Systems vol.E93-D, no.6

      Pages: 1589-1598

  • [Journal Article] Musicalnoise analysis in methods of integrating microphone array and spectral subtraction based on higher-order statistics2010

    • Author(s)
      Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Journal Title

      EURASIP Journal on Advances in Signal Processing vol.2010, Article ID 431347

      Pages: 25

  • [Journal Article] Silent-speech enhancement using body-conducted vocal-tract resonance signals2010

    • Author(s)
      Tatsuya Hirahara, Makoto Otani, Shota Shimizu, Tomoki Toda, Keigo Nakamura, Yoshitaka Nakajima, Kiyohiro Shikano
    • Journal Title

      Speech Communication vol.52, no.4

      Pages: 301-313

  • [Journal Article] 実環境向け音声対話ロボット「キタちゃん」の開発2010

    • Author(s)
      猿渡洋, 川波弘道, 鹿野清宏
    • Journal Title

      日本ロボット学会誌 Vol.28, No.1

      Pages: 31-34

  • [Journal Article] 独立成分分析を導入した空間的サブトラクションアレーによるハンズフリー音声認識システムの開発2010

    • Author(s)
      高橋祐, 猿渡洋, 鹿野清宏
    • Journal Title

      電子情報通信学会論文誌D vol.J93-D, no.3

      Pages: 312-325

  • [Journal Article] 解析型二次統計量ICAとkurtosisに基づく学習区間判定を用いたリアルタイムブラインド音源抽出2009

    • Author(s)
      藤原裕樹, 高橋祐, 橘健太郎, 宮部滋樹, 猿渡洋, 鹿野清宏, 田中章
    • Journal Title

      電子情報通信学会論文誌A vol.J92-A, no.5

      Pages: 314-326

  • [Journal Article] Blind spatial subtraction array for speech enhancement in noisy environment2009

    • Author(s)
      Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEEE Transactions on Audio, Speech and Language Processing vol.17, no.4

      Pages: 650-664

  • [Journal Article] Techniques in Rapid Unsupervised Speaker Adaptation based on HMM-Sufficient Statistics2009

    • Author(s)
      Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      Speech Communication Vol.51, No.1

      Pages: 42-57

  • [Journal Article] Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming2008

    • Author(s)
      Keiichi Osako, Yoshimitsu Mori, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEICE Trans. Fundamentals vol.E91-A, no.6

      Pages: 1357-1361

  • [Journal Article] STRAIGHT混合励振源を用いた混合正規分布モデルに基づく最ゆう声質変換法2008

    • Author(s)
      大谷大和, 戸田智基, 猿渡洋, 鹿野清宏
    • Journal Title

      電子情報通信学会論文誌 Vol.J91-D, No.4

      Pages: 1082-1091

  • [Presentation] Blind speech extraction combining ICA-based noise estimation and less-musical-noise nonlinear post processing2010

    • Author(s)
      Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      Asilomar Conference on Signals, Systems, and Computers
    • Place of Presentation
      California, USA
    • Year and Date
      2010-11-09
  • [Presentation] Improvement of speech recognition performance for spoken- oriented robot dialog system using end-fire array2010

    • Author(s)
      Hiroshi Sawada, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
    • Organizer
      Proc. of 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2010)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2010-10-19
  • [Presentation] Adaptive voice-quality control based on one-to-many eigenvoice conversion2010

    • Author(s)
      Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proc. of INTERSPEECH, 2158-2161
    • Place of Presentation
      Chiba, Japan
    • Year and Date
      2010-09-29
  • [Presentation] The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion2010

    • Author(s)
      Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proc. of INTERSPEECH, 1628-1631
    • Place of Presentation
      Chiba, Japan
    • Year and Date
      2010-09-29
  • [Presentation] Comparison of Methods for Topic Classification in a Speech-Oriented Guidance System2010

    • Author(s)
      Rafael Torres, Shota Takeuchi, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proc. of INTERSPEECH, 1261-1264
    • Place of Presentation
      Chiba, Japan
    • Year and Date
      2010-09-28
  • [Presentation] Blind speech extraction combining generalized MMSE STSA estimator and ICA-based noise and speech probability density function estimations2010

    • Author(s)
      Hiroshi Saruwatari, Ryoi Okamoto, Yu Takahashi, Kiyohiro Shikano
    • Organizer
      Proc. of the 9th International Conference on Latent Variable Analysis and Signal Separation (LVA2010), 49-56
    • Place of Presentation
      St.Malo, France
    • Year and Date
      2010-09-27
  • [Presentation] Linear transformation approaches to many-to-one voice conversion2010

    • Author(s)
      Chie Hayashida, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proc. of the 7th ISCA Speech Synthesis Workshop (SSW7), 74-79
    • Place of Presentation
      Kyoto, Japan
    • Year and Date
      2010-09-22
  • [Presentation] Theoretical analysis of iterative weak spectral subtraction via higher- order statistics2010

    • Author(s)
      Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      Proc. of 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP2010), 220-225
    • Place of Presentation
      Kittila, Finland
    • Year and Date
      2010-09-01
  • [Presentation] Binaural hearing aid using sound-localization-preserved MMSE STSA estimator with ICA-based noise estimation2010

    • Author(s)
      Hiroshi Saruwatari, Masanobu Go, Ryoi Okamoto, Kiyohiro Shikano
    • Organizer
      Proc. of International Workshop on Acoustic Echo and Noise Control (IWAENC2010)
    • Place of Presentation
      Tel Aviv, Israel
    • Year and Date
      2010-08-31
  • [Presentation] Blind signal extraction based joint suppression of diffuse background noise and late reverberation2010

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
    • Organizer
      Proc. of European Signal Processing Conference (EUSIPCO2010), 1534-1538
    • Place of Presentation
      Aalborg, Denmark
    • Year and Date
      2010-08-26
  • [Presentation] Theoretical analysis of musical noise in generalized spectral subtraction: why should not use power/amplitude subtraction?,2010

    • Author(s)
      Takayuki Inoue, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      Proc. of European Signal Processing Conference (EUSIPCO2010), 994-998
    • Place of Presentation
      Aalborg, Denmark
    • Year and Date
      2010-08-26
  • [Presentation] Musical noise controllable algorithm of channelwise spectral subtraction and beamforming based on higher-order statistics criterion2010

    • Author(s)
      Yohei Ishikawa, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      Proc. of the 2nd International Workshop on Cognitive Information Processing (CIP2010), 81-86
    • Place of Presentation
      Elba, Italy
    • Year and Date
      2010-06-14
  • [Presentation] NON-PARALLEL TRAINING FOR MANY-TO-MANY EIGENVOICE CONVERSION2010

    • Author(s)
      Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proc.ICASSP 2010, pp.4822-4825
    • Place of Presentation
      Dallas, U.S.A.
    • Year and Date
      2010-03-19
  • [Presentation] MMSE STSA ESTIMATOR WITH NONSTATIONARY NOISE ESTIMATION BASED ON ICA FOR HIGH-QUALITY SPEECH ENHANCEMENT2010

    • Author(s)
      Ryoi Okamoto, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      ICASSP 2010, 4778-4781
    • Place of Presentation
      Dallas, U.S.A.
    • Year and Date
      2010-03-19
  • [Presentation] SPEECH ENHANCEMENT IN PRESENCE OF DIFFUSE BACKGROUND NOISE : WHY USING BLIND SIGNAL EXTRACTION?2010

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
    • Organizer
      ICASSP 2010, 4770-4773
    • Place of Presentation
      Dallas, U.S.A.
    • Year and Date
      2010-03-19
  • [Presentation] STATISTICAL APPROACH TO ENHANCING ESOPHAGEAL SPEECH BASED ON GAUSSIAN MIXTURE MODELS2010

    • Author(s)
      Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      ICASSP2010, 4250-4253
    • Place of Presentation
      Dallas, U.S.A.
    • Year and Date
      2010-03-19
  • [Presentation] COMPLEX NEWTON ALGORITHM FOR BLIND SIGNAL EXTRACTION OF SPEECH IN DIFFUSE NOISE2010

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
    • Organizer
      ICASSP2010, 213-216
    • Place of Presentation
      Dallas, U.S.A
    • Year and Date
      2010-03-16
  • [Presentation] THEORETICAL MUSICAL-NOISE ANALYSIS AND ITS GENERALIZATION FOR METHODS OF INTEGRATING BEAMFORMING AND SPECTRAL SUBTRACTION BASED ON HIGHER-ORDER STATISTICS2010

    • Author(s)
      Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      Proc.ICASSP 2010, 93-96
    • Place of Presentation
      Dallas, U.S.A.
    • Year and Date
      2010-03-16
  • [Presentation] Semi-blind suppression of internal noise for hands-free robot spoken dialog system2009

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani
    • Organizer
      IEEE/RSJ International Conference on Intelligent Robotics and Systems (IROS2009),--, St.
    • Place of Presentation
      St Louis, USA
    • Year and Date
      2009-10-02
  • [Presentation] Blind Signal Extraction Based Speech Enhancement in Presence of Diffuse Background Noise2009

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      2009 International Workshop on Statistical Signal Processing (SSP2009), 513-517
    • Place of Presentation
      Cardiff UK
    • Year and Date
      2009-09-02
  • [Presentation] Directivity- Dependency-Reduced Blind Source Separation Integrating ICA, Beamforming and Binary Masking2007

    • Author(s)
      Yoshimitsu Mori, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita
    • Organizer
      2007 IEEE International Conference on Intelligent Robots and Systems (IROS2007)
    • Place of Presentation
      San Diego
    • Year and Date
      2007-10-31
  • [Presentation] Impact of Various Small Sound Source Signals on Voice Conversion Accuracy in Speech Communication Aid for Laryngectomees2007

    • Author(s)
      Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proceedings of Interspeech 2007 - Eurospeech, 2517-2520
    • Place of Presentation
      San Diego
    • Year and Date
      2007-08-31
  • [Presentation] Study on Speaker Verification with Non-Audible Murmur Segments2007

    • Author(s)
      Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proceedings of Interspeech, 2017-2020
    • Place of Presentation
      Antwerp Belgium
    • Year and Date
      2007-08-30
  • [Presentation] Rapid Unsupervised Speaker Adaptation Using Single Utterance Based on MLLR and Speaker Selection2007

    • Author(s)
      Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Proceedings of Interspeech, 262-265
    • Place of Presentation
      Antwerp Belgium
    • Year and Date
      2007-08-29
  • [Presentation] MLSP 2007 Data Analysis Competition : Two-Stage Blind Source Separation Combining SIMO-Model-Based ICA and Binary Masking2007

    • Author(s)
      Yoshimitsu Mori, Keiichi Osako, Shigeki Miyabe, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      2007 IEEE International workshops on Machine Learning for Signal Processing (MLSP2007)
    • Place of Presentation
      Thessaloniki Greece
    • Year and Date
      2007-08-27
  • [Remarks] ホームページ等

    • URL

      http://spalab.naist.jp/

URL: 

Published: 2012-01-26   Modified: 2014-04-14  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi