• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2009 Fiscal Year Annual Research Report

新しい音声メディアによるユニバーサルコミュニケーションの研究

Research Project

Project/Area Number 19200009
Research InstitutionNara Institute of Science and Technology

Principal Investigator

鹿野 清宏  Nara Institute of Science and Technology, 情報科学研究科, 教授 (00263426)

Co-Investigator(Kenkyū-buntansha) 猿渡 洋  奈良先端科学技術大学院大学, 情報科学研究科, 准教授 (30324974)
戸田 智基  奈良先端科学技術大学院大学, 情報科学研究科, 助教 (90403328)
川波 弘道  奈良先端科学技術大学院大学, 情報科学研究科, 助教 (80335489)
Keywords音声ユニバーサルインタフェース / 非可聴つぶやき(NAM) / ブラインド音源分離(BSS) / ハンズフリー音声認識
Research Abstract

新しい静かな音声メディアである非可聴つぶやき(NAM)による無音声認識と無音声電話の研究、歪みなしの音の分離技術SIMO-ICAを用いたハンズフリー音声認識システムの研究、実環境での音声対話システムの研究を推進した。
(1) 非可聴つぶやき(NAM)による音声コミュニケーション手段の研究
(a) NAM発声はなじみのない発声方法であるので、計算機から適切な発声方法の適切な指示を行なう方法について検討して、NAM音声認識(無音声認識)によって評価を行った。
(b) 無音声電話に向けて、話者間で同じ発声を必要としない教師なし適応アルゴリズムを検討した。
(2) 歪なし音源分離SIMO-ICAによる音声コミュニケーション手段の研究
(a) SIMO-ICAを利用した背景雑音の除去に強いBSSA方式の実時間処理を改良して、人にも聞きやすくするために、ミュージカルノイズの低減手法について検討した。
(b) ハンズフリーロボット対話では、ロボットの内部雑音の処理が必要である。この内部雑音分離アルゴリズムを半教師あり独立成分分析によるブラインド音源分離の観点から適応処理手法の研究を行った。
(3) 実環境音声対話システムの研究
(a) 音声対話システムに音声検索の機能を追加の研究をさらに進める。とくに、音声検索のための言語モデルの構築方法について、最新の検索語情報、ローカル情報、グーグルNグラムを活用して検討した。
(b) 機械学習による音声と非音声の識別、タスク内発話とタスク外発話の識別に、BOW (Bag of Words)も利用した手法を考案して、性能を改善した。

  • Research Products

    (24 results)

All 2010 2009 Other

All Journal Article (4 results) (of which Peer Reviewed: 2 results) Presentation (19 results) Remarks (1 results)

  • [Journal Article] 実環境向け音声対話ロボット「キタちゃん」の開発2010

    • Author(s)
      猿渡洋, 川波弘道, 鹿野清宏
    • Journal Title

      日本ロボット学会誌 Vol.28, No.1

      Pages: 31-34

  • [Journal Article] 独立成分分析を導入した空間的サブトラクションアレーによるハンズフリー音声認識システムの開発2010

    • Author(s)
      高橋祐, 猿渡洋, 鹿野清宏
    • Journal Title

      電手情報通信学会論文誌D vol.J93-D, no.3

      Pages: 312-325

    • Peer Reviewed
  • [Journal Article] 解析型二次統計量ICAとkurtosisに基づく学習区間判定を用いたリアルタイムブラインド音源抽出2009

    • Author(s)
      藤原裕樹, 高橋祐, 橘健太郎, 宮部滋樹, 猿渡洋, 鹿野清宏, 田中章
    • Journal Title

      電子情報通信学会論文誌A vol.J92-A, no.5

      Pages: 314-326

  • [Journal Article] Blind spatial subtraction array for speech enhancement innoisy environment2009

    • Author(s)
      Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano
    • Journal Title

      IEEE Transactions on Audio, Speech and Language Processing vol.17, no.4

      Pages: 650--664

    • Peer Reviewed
  • [Presentation] NON-PARALLEL TRAINING FOR MANY-TO-MANY EIGENVOICE CONVERSION2010

    • Author(s)
      Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Dallas, U.S.A
    • Year and Date
      2010-03-18
  • [Presentation] SPEECH ENHANCEMENT IN PRESENCE OF DIFFUSE BACKGROUND NOISE : WHY USING BLIND SIGNAL EXTRACTION?2010

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyorhiro Shikano, Tomoya Takatani
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Dallas, U.S.A
    • Year and Date
      2010-03-17
  • [Presentation] MMSE STSA ESTIMATOR WITH NONSTATIONARY NOISEESTIMATION BASED ON ICA FOR HIGH-QUALITY SPEECH ENHANCEMENT2010

    • Author(s)
      Ryoi Okamoto, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Dallas, U.S.A
    • Year and Date
      2010-03-16
  • [Presentation] Technologies for Processing Body Conductive Speech Detected with Non-Audible Murmur Microphone2009

    • Author(s)
      Tomoki Toda, Keigo Nakamura, Takayuki Nagai, Tomomi Kaino, Yoshitaka Nakajima, Kiyohiro Shikano
    • Organizer
      INTERSPEECH
    • Place of Presentation
      Brighton, UK
    • Year and Date
      20090900
  • [Presentation] Many-to-Many Eigenvoice Conversion with Reference Voice2009

    • Author(s)
      Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      INTERSPEECH
    • Place of Presentation
      Brighton, UE
    • Year and Date
      20090900
  • [Presentation] Structure selection algorithm for less musical-noise generation in integration systems of beamforming and spectral subtraction2009

    • Author(s)
      Yu Takahashi, Yoshihisa Uemura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      2009 International Workshop on Statistical Signal Processing (SSP2009)
    • Place of Presentation
      Cardiff UK
    • Year and Date
      20090900
  • [Presentation] Blind Signal Extraction Based Speech Enhancement in Presence of Diffuse Background Noise2009

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      2009 International Workshop on Statistical Signal Processing (SSP2009)
    • Place of Presentation
      Cardiff UK
    • Year and Date
      20090900
  • [Presentation] Enhanced Wiener Post-Processing Based on Partial Projection Back of the Blind Signal Separation Noise Estimate2009

    • Author(s)
      Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      17th European Signal Processing Conference (EUSIPCO2009)
    • Place of Presentation
      Glasgow, Scotland
    • Year and Date
      20090800
  • [Presentation] THEORETICAL MUSICAL-NOISE ANALYSIS AND ITS GENERALIZATION FOR METHODS OF INTEGRATING BEAMFORMING AND SPECTRAL SUBTRACTION BASED ON HIGHER-ORDER STATISTICS2009

    • Author(s)
      Yu Takahashi, Hiroshi Saruwatari, Kiyorhiro Shikano, Kazunobu Kondo
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Dallas, U.S.A
    • Year and Date
      20090300
  • [Presentation] Fast and Versatile Blind Separation of Diverse Sounds Using Closed-Form Estimation of Probability Density Functions of Sources2009

    • Author(s)
      Hiroshi Saruwatari, Yu Takahashi, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka
    • Organizer
      IEEE/RSJ International Conference on Intelligent Robotics and Systems (IROS2009)
    • Place of Presentation
      St Louis, USA
    • Year and Date
      2009-10-12
  • [Presentation] Emphasized Speech Synthesis Based on Hidden Markov Models2009

    • Author(s)
      Kumiko Morizane, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Oriental COCOSDA 2009
    • Place of Presentation
      Urumqi, China
    • Year and Date
      2009-08-10
  • [Presentation] Unknown Example Detection for Example-based Spoken Dialog System2009

    • Author(s)
      Shota Takeuchi, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      Oriental COCOSDA 2009
    • Place of Presentation
      Urumqi, China
    • Year and Date
      2009-08-10
  • [Presentation] Eigenvoice-Based Approach to Voice Conversion and Voice Quality Control2009

    • Author(s)
      Tomoki Toda
    • Organizer
      National Conference on Man-Machine Speech Communication (NCMMSC)
    • Place of Presentation
      Urumqi, China
    • Year and Date
      2009-08-10
  • [Presentation] MUSICAL NOISE GENERATION ANALYSIS FOR NOISE REDUCTION METHODS BASED ON SPECTRAL SUBTRACTION AND MMSE STSA ESTIMATION2009

    • Author(s)
      Yoshihisa Uemura, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-23
  • [Presentation] ACOUSTIC COMPENSATION METHODS FOR BODY TRANSMITTED SPEECH CONVERSION2009

    • Author(s)
      Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-23
  • [Presentation] HANDS-FREE SPEECH RECOGNITION CHALLENGE FOR REAL-WORLD SPEECH DIALOGUE SYSTEMS2009

    • Author(s)
      Hiroshi Saruwatari, Hiromichi Kawanami, Shota Takeuchi, Yu Takahashi, Tobias Cincarek, Kiyohiro Shikano
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-23
  • [Presentation] SOURCE ADAPTIVE BLIND SIGNAL EXTRACTION USING CLOSED-FORM ICA FOR HANDS-FREE ROBOT SPOKEN DIALOGUE SYSTEM2009

    • Author(s)
      Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-23
  • [Presentation] MUSICAL NOISE ANALYSIS BASED ON HIGHER ORDER STATISTICS FOR MICROPHONE ARRAY AND NONLINEAR SIGNAL PROCESSING2009

    • Author(s)
      Yu Takahashi, Yoshihisa Uemura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-22
  • [Presentation] VOICE CONVERSION FOR VARIOUS TYPES OF BODY TRANSMITTED SPEECH2009

    • Author(s)
      Tomoki Toda, Keigo Nakamura, Hidehiko Sekimoto, Kiyohiro Shikano
    • Organizer
      IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Place of Presentation
      Taipei, Taiwan
    • Year and Date
      2009-04-21
  • [Remarks]

    • URL

      http://spalab.naist.jp/database/library/paper_09.html

URL: 

Published: 2011-06-16   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi