2010 Fiscal Year Annual Research Report

Research on Speech Processing for Concealing Privacy Information

Research Project

Project/Area Number	21700192
Research Institution	Toyohashi University of Technology
Principal Investigator	Kazumasa Yamamoto, Toyohashi University of Technology, Graduate School of Engineering, Assistant Professor (40324230)
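Keywords	Privacy protection / Sensor information / Speech signal processing / Distant-talking speech / Sound source separation / Non-negative matrix factorization / Speaker recognition / Phase information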
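Research Abstract	Sound and speech recorded in public spaces could be put to good use in many situations, but because many people have privacy concerns, such recordings are currently little used. With the aim of concealing privacy information (information from which the speaker could be inferred) in sound and speech data, this project has investigated three techniques: "speech removal", which protects privacy by separating speech from background sound in an audio signal; "voice conversion", which protects privacy by converting the speech signal into another person's voice; and a technique that protects linguistic privacy information by applying speech recognition. Because audio available on the Internet is already recorded as a mixture of speech and background sound, separating the two requires a single-channel technique. Our earlier work developed and evaluated speech removal methods for the case where the background is noise. This fiscal year, targeting the case where the background is music, we compared our previously developed speech removal method based on vector quantization (VQ) with a source separation method based on non-negative matrix factorization (NMF). For outputs with comparable spectral distortion values, the NMF-based method distorted the speech and the background music to a similar degree, whereas the VQ-based method left a small amount of residual speech but did not distort the background music. When background music was removed before speech recognition, the VQ-based method outperformed the NMF-based method, and this difference in behavior is thought to be the reason. In addition, because protecting privacy requires knowing who is speaking, we worked on improving speaker recognition in real environments. This fiscal year we tried to raise the recognition rate by using phase information and by using only voiced segments. We found that using phase is generally difficult in reverberant environments, but that combining phase information is useful when only voiced segments are used.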
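The NMF-based separation mentioned in the abstract can be illustrated with a minimal sketch. This is not the project's implementation: it assumes that non-negative basis matrices for speech and music have already been learned from training data, it uses the standard multiplicative update for squared Euclidean error, and it runs on synthetic random data standing in for a magnitude spectrogram. The names `estimate_activations`, `separate`, `W_speech`, and `W_music` are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def estimate_activations(V, W, n_iter=200, seed=0):
    """Estimate H >= 0 such that V ~= W @ H, keeping the bases W fixed.

    Uses the multiplicative update rule for squared Euclidean error.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
    return H


def separate(V, W_speech, W_music, n_iter=200):
    """Split a non-negative mixture V (frequency x time) into two parts.

    W_speech and W_music are assumed to be pre-trained basis matrices.
    A soft mask built from the two model reconstructions is applied to V.
    """
    W = np.hstack([W_speech, W_music])
    H = estimate_activations(V, W, n_iter)
    k = W_speech.shape[1]
    S = W_speech @ H[:k]   # model of the speech part
    M = W_music @ H[k:]    # model of the music part
    mask = S / (S + M + EPS)
    return mask * V, (1.0 - mask) * V


# Synthetic stand-in for a magnitude spectrogram (assumption: random data).
rng = np.random.default_rng(1)
W_speech = rng.random((64, 8))
W_music = rng.random((64, 8))
V = W_speech @ rng.random((8, 50)) + W_music @ rng.random((8, 50))
speech_est, music_est = separate(V, W_speech, W_music)
```

Because the two estimates come from a soft mask on the mixture, they always sum back to `V`; how cleanly each source is recovered depends on how distinct the two sets of bases are.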

Research Products
(9 results)

Journal Article (3 results, of which 3 peer reviewed) / Presentation (5 results) / Book (1 result)

[Journal Article] Auditory perception versus automatic estimation of location and orientation of an acoustic source in a real environment (2010)
- Author(s)
  Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto
- Journal Title
  Acoustical Science and Technology
  Volume: Vol.31 Pages: 309-319
- Peer Reviewed
[Journal Article] Speaker recognition by combining MFCC and phase information in noisy conditions (2010)
- Author(s)
  Longbiao Wang, Kazue Minami, Kazumasa Yamamoto, Seiichi Nakagawa
- Journal Title
  IEICE Transactions on Information and Systems
  Volume: Vol.93 Pages: 2397-2406
- Peer Reviewed
[Journal Article] Distant speech recognition using a microphone array network (2010)
- Author(s)
  Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto
- Journal Title
  IEICE Transactions on Information and Systems
  Volume: Vol.93 Pages: 2451-2462
- Peer Reviewed
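[Presentation] Improvement of speaker recognition using phase information of voiced segments (2011)
- Author(s)
  嶋田晃太, Kazumasa Yamamoto, Seiichi Nakagawa
- Organizer
  Acoustical Society of Japan
- Place of Presentation
  Waseda University (Tokyo)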
- Year and Date
  2011-03-10
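[Presentation] Music removal and speech recognition for music-superimposed speech using NMF and VQ methods (2011)
- Author(s)
  仲野翔一, Kazumasa Yamamoto, Seiichi Nakagawa
- Organizer
  Acoustical Society of Japan
- Place of Presentation
  Waseda University (Tokyo)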
- Year and Date
  2011-03-10
[Presentation] Large vocabulary speech recognition system: SPOJUS++ (2011)
- Author(s)
  Yasuhisa Fujii, Kazumasa Yamamoto, Seiichi Nakagawa
- Organizer
  11th WSEAS International Conference on Multimedia Systems & Signal Processing (MUSP'11)
- Place of Presentation
  Venice, Italy
- Year and Date
  2011-03-08
[Presentation] Speech recognition using long-term phase information (2010)
- Author(s)
  Kazumasa Yamamoto, Eiichi Sueyoshi, Seiichi Nakagawa
- Organizer
  INTERSPEECH 2010
- Place of Presentation
  Makuhari Messe (Chiba)
- Year and Date
  2010-09-28
[Presentation] Evaluation of privacy protection techniques for speech signals (2010)
- Author(s)
  Kazumasa Yamamoto, Seiichi Nakagawa
- Organizer
  International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU2010)
- Place of Presentation
  Dortmund, Germany
- Year and Date
  2010-06-29
[Book] Evaluation of privacy protection technique for speech signals (chapter in "Information Processing and Management of Uncertainty in Knowledge-Based Systems: Applications" (Communications in Computer and Information Science), Eyke Hüllermeier, Rudolf Kruse, Frank Hoffmann (Eds.)) (2010)
- Author(s)
  Kazumasa Yamamoto, Seiichi Nakagawa
- Total Pages
  10
- Publisher
  Springer
