2015 Fiscal Year Annual Research Report

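Pronunciation map and articulatory movement animation display methods based on extraction of articulatory movements from speech, and their application to pronunciation correction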

Research Project

Project/Area Number	25280128
Research Institution	Waseda University
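Principal Investigator	Tsuneo Nitta, Waseda University, Green Computing Systems Research Organization, Professor (70314101)
Co-Investigator(Kenkyū-buntansha)	Ryoko Hayashi, Kobe University, other graduate school, Professor (20347785); Yurie Iribe, Aichi Prefectural University, Faculty of Information Science, Lecturer (40397500); Goh Kawai, Hokkaido University, other graduate school, Associate Professor (70312981); Kouichi Katsurada, Toyohashi University of Technology, campus joint-use facilities, Associate Professor (80324490)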
Project Period (FY)	2013-04-01 – 2016-03-31
Keywords	pronunciation learning / articulatory feature extraction / pronunciation map / articulatory gesture / HTML5
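Outline of Annual Research Achievements	In FY2015 (H27), research proceeded along the following four items. (1) For "developing a speech-to-articulatory-feature conversion engine," in addition to the speech spectrum sequence used so far as input to the DNN (deep neural network), we examined a method that uses features in tensor space and confirmed that it substantially improves English phoneme recognition performance. (2) For "developing technology that selects phonemes within a word from the articulatory feature sequence and displays them on a pronunciation map," we replaced the Flash Player used so far with a display implemented in HTML5, which many browsers now support. (3) For "developing technology that estimates articulatory movements from the articulatory feature sequence and displays them as animation," we implemented in HTML5 a function that highlights how the key articulators differ between teacher and learner. (4) For "building a learning system that runs in real time based on the technologies of (2) and (3), and evaluating the effect of introducing it in educational settings," we developed a web-based learning system and confirmed that both the pronunciation map subsystem and the articulatory gesture animation subsystem provide the functions needed for pronunciation correction. However, two experiments could not be completed within the research period: connecting the new articulatory feature conversion engine developed in (1) to the subsystems of (2) and (3), and evaluating the HTML5-based learning system of (4) in educational settings. We intend to carry out both within the KAKENHI project "Research on speech training support based on technology for visualizing speech movements from speech" (Grant-in-Aid for Scientific Research (C), H27 – H29; principal investigator: Iribe), which succeeds and extends this project.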
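Research Progress Status	Not entered because FY2015 (H27) is the final fiscal year.
Strategy for Future Research Activity	Not entered because FY2015 (H27) is the final fiscal year.
Causes of Carryover	Not entered because FY2015 (H27) is the final fiscal year.
Expenditure Plan for Carryover Budget	Not entered because FY2015 (H27) is the final fiscal year.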

Research Products
(24 results)

All Journal Article (2 results) (of which Peer Reviewed: 2 results, Open Access: 1 result, Acknowledgement Compliant: 1 result) Presentation (20 results) (of which Int'l Joint Research: 11 results, Invited: 2 results) Book (2 results)

[Journal Article] Designing XML schema for classroom discourse visual representation through XSLT (2016)
- Author(s)
  Noriaki Katagiri and Goh Kawai
- Journal Title
  Journal of the Hokkaido University of Education
  Volume: 66 (2) Pages: 1-16
- Peer Reviewed
[Journal Article] Using Reversed Sequences and Grapheme Generation Rules to Extend the Feasibility of a Phoneme Transition Network-based Grapheme-to-Phoneme Conversion (2015)
- Author(s)
  Seng Kheang, Kouichi Katsurada, Yurie Iribe and Tsuneo Nitta
- Journal Title
  IEICE Transactions on Information and Systems
  Volume: E99-D (4) Pages: 1182-1192
- DOI
  10.1587/transinf.2015EDP7349
- Peer Reviewed / Open Access / Acknowledgement Compliant
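[Presentation] Proposal of a voice conversion method using autoencoders and speaker-identity conversion units (2016)
- Author(s)
  入澤浩太郎, Kouichi Katsurada, Tsuneo Nitta, Yurie Iribe
- Organizer
  Acoustical Society of Japan 2016 Spring Meeting
- Place of Presentation
  Toin University of Yokohama, Yokohama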
- Year and Date
  2016-03-09 – 2016-03-11
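[Presentation] A study of rescoring methods for a fast STD system using a suffix array (2016)
- Author(s)
  石原元気, Kouichi Katsurada, Tsuneo Nitta, Yurie Iribe
- Organizer
  Acoustical Society of Japan 2016 Spring Meeting
- Place of Presentation
  Toin University of Yokohama, Yokohama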
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] Audio-visual speech recognition using deep bottleneck features and high-performance lipreading (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda and Satoru Hayamizu
- Organizer
  APSIPA ASC 2015
- Place of Presentation
  Hong Kong
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Noise reduction from EEG using denoising autoencoder (2015)
- Author(s)
  Kota Nakazawa, Junsei Horikawa, Shunji Sugimoto, Tsuneo Nitta, and Kouichi Katsurada
- Organizer
  The 2nd Annual Meeting of the Society for Bioacoustics
- Place of Presentation
  Fukuoka, Japan
- Year and Date
  2015-12-12 – 2015-12-13
- Int'l Joint Research
[Presentation] EEG during Japanese vowel recall task (2015)
- Author(s)
  Kohei Asahara, Shunji Sugimoto, Kouichi Katsurada, Tsuneo Nitta, and Junsei Horikawa
- Organizer
  The 2nd Annual Meeting of the Society for Bioacoustics
- Place of Presentation
  Fukuoka, Japan
- Year and Date
  2015-12-12 – 2015-12-13
- Int'l Joint Research
[Presentation] Syllable recognition from EEG based on higher-order subspace method (2015)
- Author(s)
  Takumaru Kanzaki, Tetsunori Kobayashi, Shunji Sugimoto, Kouichi Katsurada, Junsei Horikawa, and Tsuneo Nitta
- Organizer
  The 2nd Annual Meeting of the Society for Bioacoustics
- Place of Presentation
  Fukuoka, Japan
- Year and Date
  2015-12-12 – 2015-12-13
- Int'l Joint Research
[Presentation] Active learning of language using hand-written comments (2015)
- Author(s)
  Akio Ohnishi and Goh Kawai
- Organizer
  Association for the Advancement of Information and Communication Technology at Universities Conference (AXIES 2015)
- Place of Presentation
  Aichi Industry and Labor Center, Nagoya
- Year and Date
  2015-12-02 – 2015-12-04
- Invited
[Presentation] Developing a college freshman course for spoken language analysis (2015)
- Author(s)
  Goh Kawai
- Organizer
  Sapporo Gakuin University CALL-Plus Workshop 2015
- Place of Presentation
  Sapporo Gakuin University, Sapporo
- Year and Date
  2015-11-08 – 2015-11-08
[Presentation] Active learning of English language (2015)
- Author(s)
  Goh Kawai
- Organizer
  Training Session for Hokkaido High School Teachers of English Language
- Place of Presentation
  Hakodate
- Year and Date
  2015-10-30 – 2015-10-30
- Invited
[Presentation] Development of new speech corpus for elderly Japanese speech recognition (2015)
- Author(s)
  Yurie Iribe, Norihide Kitaoka, and Shuhei Segawa
- Organizer
  Oriental COCOSDA 2015
- Place of Presentation
  Shanghai, China
- Year and Date
  2015-10-28 – 2015-10-30
- Int'l Joint Research
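[Presentation] Multimodal speech recognition using bottleneck features obtained by deep learning (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu
- Organizer
  IEICE Technical Report, SP2015-69
- Place of Presentation
  Kobe University, Kobe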
- Year and Date
  2015-10-16 – 2015-10-17
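[Presentation] Multimodal speech recognition using deep learning: improving visual features (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu
- Organizer
  2nd Silent Speech Recognition Workshop
- Place of Presentation
  Kobe University, Kobe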
- Year and Date
  2015-10-16 – 2015-10-17
[Presentation] Audio-visual processing toward robust speech recognition in cars (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, and Satoru Hayamizu
- Organizer
  DSP in vehicle 2015
- Place of Presentation
  San Francisco, USA
- Year and Date
  2015-10-14 – 2015-10-16
- Int'l Joint Research
[Presentation] Investigation of DNN-based modeling for audio-visual speech recognition (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, and Satoru Hayamizu
- Organizer
  MLSLP2015
- Place of Presentation
  Aizuwakamatsu, Japan
- Year and Date
  2015-09-19 – 2015-09-20
- Int'l Joint Research
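[Presentation] Multimodal speech recognition using acoustic and visual features obtained by deep learning (2015)
- Author(s)
  Satoshi Tamura, Hiroshi Ninomiya, Norihide Kitaoka, Shin Osuga, Yurie Iribe, Kazuya Takeda, Satoru Hayamizu
- Organizer
  Acoustical Society of Japan 2015 Autumn Meeting
- Place of Presentation
  University of Aizu, Aizuwakamatsu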
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] Performance evaluation of Active Appearance Models using an autoencoder (2015)
- Author(s)
  渡辺拓也, Kouichi Katsurada, Tsuneo Nitta, Yurie Iribe
- Organizer
  IEICE Technical Report, PRMU2015-85
- Place of Presentation
  Ehime University, Matsuyama
- Year and Date
  2015-09-15 – 2015-09-15
[Presentation] Bilinear map of filter-bank outputs for DNN-based speech recognition (2015)
- Author(s)
  Tetsuji Ogawa, Kenshiro Ueda, Kouichi Katsurada, Tetsunori Kobayashi and Tsuneo Nitta
- Organizer
  Interspeech2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-06 – 2015-09-10
- Int'l Joint Research
[Presentation] Integration of deep bottleneck features for audio-visual speech recognition (2015)
- Author(s)
  Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe and Kazuya Takeda
- Organizer
  Interspeech2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-06 – 2015-09-10
- Int'l Joint Research
[Presentation] Perception of syllable-final consonants by Chinese speakers and Japanese speakers, in Proceedings of the 18th International Congress of Phonetic Sciences (2015)
- Author(s)
  Yaming Zhang and Ryoko Hayashi
- Organizer
  18th International Congress of Phonetic Sciences
- Place of Presentation
  Glasgow, UK
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] Model Prioritization Voting Schemes for Phoneme Transition Network-based Grapheme-to-Phoneme Conversion (2015)
- Author(s)
  Seng Kheang, Kouichi Katsurada, Yurie Iribe and Tsuneo Nitta
- Organizer
  Computer and Information Science and Technology CIST'15
- Place of Presentation
  Ottawa, Canada
- Year and Date
  2015-05-11 – 2015-05-12
- Int'l Joint Research
[Book] Mündliche Kommunikation im DaF-Unterricht: Phonetik, Gespräch und Rhetorik (2015)
- Author(s)
  Mayako Niikura, Ryoko Hayashi, Markus Rude and Gabriela Schmidt
- Total Pages
  169
- Publisher
  Iudicium
[Book] Identification of Word Boundaries and Accented Syllables in German by German and Non-German Listeners (2015)
- Author(s)
  Hansjoerg Mixdorff, Ryoko Hayashi and Saori Ushiyama
- Total Pages
  406
- Publisher
  Trends in Phonetics and Phonology (Peter Lang)
