2015 Fiscal Year Annual Research Report

ヒト発話シミュレータによるStory Teller Systemの構築

Research Project

Project/Area Number	25240026
Research Institution	Japan Advanced Institute of Science and Technology
Principal Investigator	赤木正人北陸先端科学技術大学院大学, 情報科学研究科, 教授 (20242571)
Co-Investigator(Kenkyū-buntansha)	田中宏和北陸先端科学技術大学院大学, 情報科学研究科, 准教授 (00332320) 鵜木祐史北陸先端科学技術大学院大学, 情報科学研究科, 准教授 (00343187) 末光厚夫北陸先端科学技術大学院大学, 情報科学研究科, 助教 (20422199) 宮内良太北陸先端科学技術大学院大学, 情報科学研究科, 助教 (30455852) 北村達也甲南大学, 知能情報学部, 教授 (60293594) 川本真一群馬工業高等専門学校, 電子情報工学科, 講師 (70418507) 齋藤毅金沢大学, 電子情報学系, 助教 (70446962) 森川大輔北陸先端科学技術大学院大学, 情報科学研究科, 助教 (70709146) Erickson Donna 金沢医科大学, 教養部, 非常勤講師 (80331586) 党建武北陸先端科学技術大学院大学, 情報科学研究科, 教授 (80334796) 榊原健一北海道医療大学, 心理科学部, 准教授 (80396168)
Project Period (FY)	2013-04-01 – 2017-03-31
Keywords	音声情報処理 / 音声合成 / 音声知覚 / 音声生成
Outline of Annual Research Achievements	本研究では，非言語情報に関する音声知覚モデルと音声生成モデルを，知覚と生成の相互作用を記述した脳モデルにより結合することで，合成音声へのパラ言語・非言語情報付加が可能なStory Teller Systemの構築を行う。このために，(A) 生成モデルのコントロール手法の確立，(B) 知覚モデルで扱える発話スタイルの拡張，(C) これらのモデルを統合したシステムの構築ついて研究を実施した。１．知覚モデルの逆モデルによる感情音声合成　27年度は，提案している知覚モデルを入出力逆に用いることにより，感情音声合成システムを構築し，音声合成を試みた。このシステムの入力は，Valence-Activation (V-A) 空間内での任意の位置情報と平静に発話された音声である。また出力は，平静音声からV-A空間の位置情報を用いて変形された感情音声波形である。合成した音声の聴取実験による評価を行った結果，Joy, Angry, Sadそれぞれの感情をもつ音声が合成されたことが明らかとなった。２．知覚モデルの拡張　27年度は，前年度に行った日，米，中，独，越5か国語を用いた知覚印象採取のための聴取実験の結果を踏まえて，知覚モデルの再構築を行った。そして，再構築したモデルを用いて，複数言語（日，中，独3か国語）をカバーするV-A空間の位置情報推定を行った。この結果，従来の手法よりも高精度な位置推定が可能となった。推定された位置情報をもとに感情認識を行った結果，各国語向けに調整された従来の認識器とほぼ同等の性能を得た。３．統合システムの構築　27年度は，V-A感情空間上ですべてのモデルを統合するために，知覚モデル，知覚モデルの逆モデル，平静音声への意図した感情の付加システム，すべてをV-A感情空間の位置情報をもとにして構築した。これにより，これらのモデルおよびシステムの統合の枠組みがそろった。
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason (1) 知覚モデルの拡張については，大規模に実施した聴取実験結果をもとに，複数言語のための合成感情音声評価モデルの構築を進めたこと，また，複数言語の知覚にもとづいた音響特徴推定モデルを構築し，意図した知覚印象を与える音響特徴を推定することに成功したこと。 (2) 生成モデルの精緻化については，感情音声データベースの整備，このデータを用いたLFモデルによる声帯音源波形推定を実施し，声帯音源モデルのパラメータを適切に制御することにより感情音声合成が行える可能性を示したこと，また，知覚モデルの逆モデルを用いて，意図した感情付加が可能となったこと。 (3) 統合システムの構築については，すべてのモデル，システムについて Valence-Activation空間での感情表現を採用し，生成モデルと知覚モデルの一体化を図ったこと，これにより，統合システムの構築が容易になったこと。これらを考慮し，ほぼ当初予定通りの進捗状況であると考える。
Strategy for Future Research Activity	本年度は最終年度であるため，過去3年間に実施した内容も考慮の上，以下に示す研究実施計画の概要にもとづいて研究を実施する。Story Teller System 構築に向けて，生成・知覚モデルを統合するために，脳モデルとしてValence-Activation (V-A)空間での感情表現を用いる。そして，このもとで，生成モデル，知覚モデルを結合し，デモシステムを構築する。これを実現するための解くべき課題は，(A) 生成モデルのコントロール手法の確立（生成モデルの課題），(B) 知覚モデルで扱える発話スタイルの拡張（知覚モデルの課題），(C) これらのモデルを統合したシステムの構築，である．具体的には， (1) 統合システム：　ストーリーテラーシステムへの入力，この値にもとづいた合成音声の作成，合成音声の客観評価値，すべてをV-A空間上での位置情報をもとに表現することで，これらのシステムを統合する。そして，デモシステム（入力は，物語テキストから合成した平静な音声と物語のそれぞれの文に与えたV-A空間上での位置。この情報から，表現豊から物語朗読音声を出力）を構築する。(2) 生成モデル：　昨年度提案した三層構造の音声変形システム（入力は平静な発話音声とV-A空間上での位置。出力はV-A空間上での位置に対応した感情音声）をデモシステムとして用い，表現豊から物語朗読音声の合成を試みる。また，より生理学的な音声生成機構を模擬したARX-LFモデルによる表現豊かな音声の生成も試みる。(3) 知覚モデル：　生成モデルで合成された表現豊かな音声は，まだ完全ではない。このため，知覚モデル（入力は合成された表現豊かな音声。出力はV-A空間上での位置）を表現豊かな音声の客観的評価システムとして用い，生成モデルと合成音の客観的評価により自動で目標の知覚印象（V-A空間上での目標位置）に近づくようにモデルパラメータを制御するための方法を提案する。を実施する。

Research Products
(89 results)

All 2016 2015

All Journal Article (21 results) (of which Int'l Joint Research: 10 results, Peer Reviewed: 20 results, Acknowledgement Compliant: 10 results, Open Access: 19 results) Presentation (67 results) (of which Int'l Joint Research: 38 results, Invited: 4 results) Book (1 results)

[Journal Article] A study on transvelar coupling for non-nasalized sounds2016
- Author(s)
  Jianwu Dang, Jianguo Wei, Kiyoshi Honda, and Takayoshi Nakai
- Journal Title
  
  J. Acoust. Soc. Am.
  
  Volume: 139, 1 Pages: 441-454
- DOI
  http://doi.org/10.1121/1.4939964
- Peer Reviewed / Int'l Joint Research
[Journal Article] Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments2016
- Author(s)
  Shota Morita, Masashi Unoki, Xugang Lu, and Masato Akagi
- Journal Title
  
  Journal of Signal Processing Systems
  
  Volume: Vol. 82, No. 2 Pages: 163-173
- DOI
  10.1007/s11265-015-1014-4
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] MTF-based Kalman filtering with linear prediction for power envelope restoration in noisy reverberant environments2016
- Author(s)
  Yang Liu, Shota Morita, and Masashi Unoki
- Journal Title
  
  IEICE Trans. on Fundamentals of Electronics, Communications and Computer Sciences
  
  Volume: Vol. E99-A, No.2 Pages: 560-569
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Coordinate Systems in the Motor System: Computational Modeling and EEG Experiment2016
- Author(s)
  Tanaka, H., Miyakoshi, M., & Makeig, S.
- Journal Title
  
  Advances in Cognitive Neurodynamics (V)
  
  Volume: V Pages: 85-92
- DOI
  10.1007/978-981-10-0207-6_14
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Modeling the motor cortex: Optimality, recurrent neural networks, and spatial dynamics2016
- Author(s)
  Tanaka, H.
- Journal Title
  
  Neuroscience Research
  
  Volume: 104 Pages: 64-71
- DOI
  10.1016/j.neures.2015.10.012
- Peer Reviewed / Open Access
[Journal Article] Articulatory correlates of metrical structure: Studying jaw displacement patterns2016
- Author(s)
  Erickson, D. and Kawahara, S.
- Journal Title
  
  Linguistic Vanguard
  
  Volume: 2 Pages: 1-16
- DOI
  10.1515/lingvan-2015-0025
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity2016
- Author(s)
  H. Yokonishi, H. Imagawa, K.-I. Sakakibara, A. Yamauchi, T. Nito, T. Yamasoba, N. Tayama
- Journal Title
  
  J. Voice
  
  Volume: 30, 2 Pages: 145-157
- DOI
  10.1016/j.jvoice.2015.01.009
- Peer Reviewed / Open Access
[Journal Article] Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging2016
- Author(s)
  i, H. Imagawa, K.-I. Sakakibara, T. Nito, N. Tayama, T. Yamasoba
- Journal Title
  
  J. Voice
  
  Volume: 30, 2 Pages: 205--214
- DOI
  10.1016/j.jvoice.2015.04.016
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] A control strategy of a physiological articulatory model for speech production2015
- Author(s)
  X. Wu, J. Dang,
- Journal Title
  
  Journal of Chinese Linguistics
  
  Volume: VOL.43, NO.1B Pages: 337-363
- DOI
  10.1353/jcl.2015.0038
- Peer Reviewed / Int'l Joint Research / Acknowledgement Compliant
[Journal Article] Mapping Ultrasound-based Articulatory Images and Vowel Sounds with a DNN Framework2015
- Author(s)
  Jianguo Wei, Wenhuan Lu, Xinyuan Zheng, Qingzhi Hou, Qiang Fang, Jianwu Dang
- Journal Title
  
  Multimedia Tools and Applications
  
  Volume: Vol 75 Pages: 1-23
- DOI
  10.1007/s11042-015-3038-y
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
[Journal Article] Strength of syllabic influences on articulation in Mandarin Chinese and French: Insights from a motor control approach2015
- Author(s)
  Liang Ma, Pascal Perrier, and Jianwu Dang
- Journal Title
  
  Journal of Phonetics
  
  Volume: 53 Pages: 101-124
- DOI
  10.1016/j.wocn.2015.09.005
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] A real-time articulatory visual feedback approach with target presentation for second language pronunciation learning2015
- Author(s)
  Suemitsu, A., Dang, J., Ito, T., Tiede, M.
- Journal Title
  
  J. Acoust. Soc. Am. JASA Express Letters
  
  Volume: 138, EL382 Pages: EL382-EL387
- DOI
  10.1121/1.4931827
- Peer Reviewed / Open Access / Int'l Joint Research / Acknowledgement Compliant
[Journal Article] Generalized Finite Difference Time Domain Method and Its Application to Acoustics2015
- Author(s)
  Jianguo Wei, Song Wang, Qingzhi Hou, and Jianwu Dang
- Journal Title
  
  Mathematical Problems in Engineering
  
  Volume: Volume 2015, Article ID 640305 Pages: 1-13
- DOI
  10.1155/2015/640305
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] 振幅変調音のピッチ知覚に基づいた調波複合音の基本周波数推定法2015
- Author(s)
  三輪賢一郎，鵜木祐史
- Journal Title
  
  電子情報通信学会論文誌
  
  Volume: Vol. J98-A, No.12 Pages: 668-679
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Edge prominence and declination in Japanese jaw displacement patterns: a view from the C/D model2015
- Author(s)
  Kawahara, S., Erickson, D., Suemitsu, A.
- Journal Title
  
  Journal of Phonetic Society of Japan
  
  Volume: 19, 2 Pages: 33-43
- Peer Reviewed / Open Access
[Journal Article] VoiceDub：複数タイミング情報をともなう映像エンタテイメント向け音声同期収録支援システム2015
- Author(s)
  川本真一,森島繁生,中村哲
- Journal Title
  
  情報処理学会論文誌
  
  Volume: Vol. 56, No. 4 Pages: 1142-1151
- Peer Reviewed / Open Access
[Journal Article] ATR音声データベース内の文音声における知覚的話者間類似度の計測2015
- Author(s)
  北村達也, 中間隆正, 大村宙, 川元広樹
- Journal Title
  
  日本音響学会誌
  
  Volume: 71, 10 Pages: 516-525
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] Improvement of five-degree-of-freedom sensors for Northern Digital Incorporated's Wave speech research system2015
- Author(s)
  Tatsuya Kitamura, Yukiko Nota, Michiko Hashi, Hiroaki Hatano
- Journal Title
  
  Acoustical Science and Technology
  
  Volume: 36, 4 Pages: 347-350
- Peer Reviewed / Open Access / Acknowledgement Compliant
[Journal Article] 磁気センサシステムによる調音運動のリアルタイム観測2015
- Author(s)
  北村達也
- Journal Title
  
  日本音響学会誌
  
  Volume: 71, 10 Pages: 526-531
- Open Access / Acknowledgement Compliant
[Journal Article] A practical guide to calculating syllable prominence, timing and boundaries in the C/D model. Special Issue on the C/D model2015
- Author(s)
  Erickson, D. and Kawahara, S.
- Journal Title
  
  Journal of Phonetic Society of Japan
  
  Volume: 19.2 Pages: 16-21
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] More about contrastive emphasis and the C/D model. Special Issue on the C/D model2015
- Author(s)
  Kim, J., Erickson, D., and Lee, S.
- Journal Title
  
  Journal of Phonetic Society of Japan
  
  Volume: 19.2 Pages: 44-54
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization2016
- Author(s)
  Dinh, T. A, and Akagi, M.
- Organizer
  IEICE Tech. Report, SP2015-141
- Place of Presentation
  ビーコンプラザ（大分県別府市）
- Year and Date
  2016-03-29
[Presentation] INVESTIGATIONS INTO VOWEL AND CONSONANT STRUCTURES IN ARTICULATORY AND AUDITORY SPACES USING LAPLACIAN EIGENMAPS2016
- Author(s)
  Jianwu Dang, Shengbei Wang, Masashi Unoki
- Organizer
  the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016)
- Place of Presentation
  Shanghai, China
- Year and Date
  2016-03-20 – 2016-03-25
- Int'l Joint Research
[Presentation] Analysis of spatial characteristics of the larynx using high-speed digital imaging2016
- Author(s)
  K.-I. Sakakibara, H. Imagawa, I.T.Tokuda, A. Yamauchi, H. Yokonishi, N. Tayama
- Organizer
  10th International Conference on Voice Physiology and Biomechanics 2016
- Place of Presentation
  Vina del Mar, Chile
- Year and Date
  2016-03-14 – 2016-03-17
- Int'l Joint Research / Invited
[Presentation] 変調フィルタバンクを用いた感情音声の変調スペクトル分析の検討2016
- Author(s)
  朱治，宮内良太，鵜木祐史
- Organizer
  日本音響学会2016年度春季研究発表会
- Place of Presentation
  桐蔭横浜大（神奈川県横浜市）
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] 頭部運動による両耳間音圧差の変化が音像の知覚に与える影響2016
- Author(s)
  森川大輔
- Organizer
  日本音響学会2016春期研究発表会
- Place of Presentation
  桐蔭横浜大（神奈川県横浜市）
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] 舌断面形状のモデリングに関する予備的検討2016
- Author(s)
  北村達也, 蒔苗久則, 伊藤仁
- Organizer
  日本音響学会研究発表会
- Place of Presentation
  桐蔭横浜大（神奈川県横浜市）
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] 姿勢変化に伴う母音調音の変化: NDI Wave dataを用いて2016
- Author(s)
  吐師道子, 北村達也, 能田由紀子
- Organizer
  日本音響学会2016春期研究発表会
- Place of Presentation
  桐蔭横浜大（神奈川県横浜市）
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] 聴覚フィードバックの遮断が歌声の音高変化に与える影響2016
- Author(s)
  青木政陽・齋藤毅・三好正人
- Organizer
  日本音響学会2016春期研究発表会
- Place of Presentation
  桐蔭横浜大（神奈川県横浜市）
- Year and Date
  2016-03-09 – 2016-03-11
[Presentation] 仮説検証による特定話者音声の音素アライメント2016
- Author(s)
  園田浩之介、川本真一、赤木正人
- Organizer
  平成27年度北陸地区学生による研究発表会
- Place of Presentation
  石川工業高等専門学校（石川県河北郡津幡町）
- Year and Date
  2016-03-08
[Presentation] Study on Effects of Speech Production during Delayed Auditory Feedback for Air-conducted and Bone-Conducted Speech2016
- Author(s)
  Teruki Toya, Daisuke Ishikawa, Ryota Miyauchi, Kazushi Nishimoto, and Masashi Unoki
- Organizer
  Proc. 2016 RISP International workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP16)
- Place of Presentation
  Honolulu, USA
- Year and Date
  2016-03-07 – 2016-03-09
- Int'l Joint Research
[Presentation] Study on IIR Implementation for Modulation Transfer Function of Room Impulse Response2016
- Author(s)
  Yuta Kashihara and Masashi Unoki
- Organizer
  Proc. 2016 RISP International workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP16)
- Place of Presentation
  Honolulu, USA
- Year and Date
  2016-03-07 – 2016-03-09
- Int'l Joint Research
[Presentation] A study on quality improvement of HMM-based synthesized voices using asymmetric bilinear model2016
- Author(s)
  Dinh, T. A, Morikawa, D., and Akagi, M.
- Organizer
  Proc. NCSP2016
- Place of Presentation
  Honolulu, HW, USA
- Year and Date
  2016-03-07
- Int'l Joint Research
[Presentation] Automatic Speech Emotion Recognition in Chinese Using a Three-layered Model in Dimensional Approach2016
- Author(s)
  Li, X. and Akagi, M.
- Organizer
  Proc. NCSP2016
- Place of Presentation
  Honolulu, HW, USA
- Year and Date
  2016-03-07
- Int'l Joint Research
[Presentation] A study on applying target prediction model to parameterize power envelope of emotional speech2016
- Author(s)
  Xue, Y. and Akagi, M.
- Organizer
  Proc. NCSP2016
- Place of Presentation
  Honolulu, HW, USA
- Year and Date
  2016-03-07
- Int'l Joint Research
[Presentation] 雑音駆動音声の感情知覚と振幅包絡線情報の関係にする検討2016
- Author(s)
  朱治, 宮内良太, 荒木友希子, 鵜木祐史
- Organizer
  日本音響学会聴覚研究会
- Place of Presentation
  九州大学(福岡県福岡市）
- Year and Date
  2016-03-04 – 2016-03-05
[Presentation] コーンビームCTで計測した鼻腔・副鼻腔の3次元音響解析2016
- Author(s)
  竹本浩典, 北村達也, 蒔苗久則, 山口徹太郎, 槇宏太郎
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  サンピアンかわさき（神奈川県川崎市）
- Year and Date
  2016-01-14
[Presentation] 発話に関するデータの計測と利用2016
- Author(s)
  北村達也
- Organizer
  電子情報通信学会音声研究会
- Place of Presentation
  サンピアンかわさき（神奈川県川崎市）
- Year and Date
  2016-01-14
[Presentation] Emotional speech synthesis system based on a three-layered model using a dimensional approach2015
- Author(s)
  Xue, Y., Hamada, Y., and Akagi, M.
- Organizer
  Proc. APSIPA2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Chinese Opera Genre Classification Based on Multi-feature Fusion and Extreme Learning Machine2015
- Author(s)
  JianRong Wang, ChenLiang Wang, JianGuo Wei and Jianwu Dang
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Influences of Auditory and Vibrotactile Information on Vocal F0 Responses2015
- Author(s)
  Xiaozhen Wang, Kiyoshi Honda, Jianwu Dang, Hongcui Wang and Jianguo Wei
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Investigation of Learning Trajectory of Mandarin for Tibetan Speakers2015
- Author(s)
  Huixia Wang, Jianwu Dang, Hui Feng, Hongcui Wang, Yang Yu, Kiyoshi Honda
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Automatic Tongue Contour Tracking in Ultrasound Sequences without Manual Initialization2015
- Author(s)
  Hongcui Wang, Siyu Wang, Bruce Denby, Jianwu Dang
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Vowel Normalization by Articulatory Information2015
- Author(s)
  Jingshu Zhang, Jianguo Wei, Wenhuan Lu, Qiang Fang, Kiyoshi Honda and Jianwu Dang
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Investigation of relation between speech perception and production based on EEG source reconstruction2015
- Author(s)
  Guancheng Li, Jianwu Dang, Gaoyan Zhang, Zhilei Liu, Hongcui Wang
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] An Audio Watermarking Scheme Based on Automatic Parameterized Singular-Spectrum Analysis Using Differential Evolution2015
- Author(s)
  Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki, and Chai Wuthiwiwatchai
- Organizer
  Proc. APSIPA2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor2015
- Author(s)
  H. Kawahara, K.-I. Sakakibara, H. Banno, M. Morise, T. Toda, T. Irino
- Organizer
  Proceedings of APSIPA Annual Summit and Conference 2015
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-16 – 2015-12-19
- Int'l Joint Research
[Presentation] 静的および動的両耳間差が音像の分離知覚に与える影響2015
- Author(s)
  森川大輔
- Organizer
  電子情報通信学会応用音響研究会
- Place of Presentation
  金沢大学サテライトプラザ　（石川県金沢市）
- Year and Date
  2015-12-11 – 2015-12-12
- Invited
[Presentation] 歌唱における聴覚フィードバック遮断の影響2015
- Author(s)
  青木政陽・齋藤毅・三好正人
- Organizer
  電子情報通信学会応用音響研究会
- Place of Presentation
  金沢大学サテライトプラザ（石川県金沢市）
- Year and Date
  2015-12-11 – 2015-12-12
[Presentation] Study on method to control fundamental frequencycontour related to a position on Valence-Activation space2015
- Author(s)
  Hamada, Y., Elbarougy, R., Xue, Y., and Akagi, M.
- Organizer
  Proc. WESPAC2015
- Place of Presentation
  Singapore, Singapore
- Year and Date
  2015-12-09
- Int'l Joint Research
[Presentation] Effect of a fixed ultrasound probe on jaw movement during speech2015
- Author(s)
  Villegas, J., Wilson, I., Iguro, Y., and Erickson, D.
- Organizer
  Ultrafest VII
- Place of Presentation
  Hong Kong, China
- Year and Date
  2015-12-08 – 2015-12-10
- Int'l Joint Research
[Presentation] Preliminary Study on Blind Estimation of Room Acoustic Parameters in Noisy Reverberant Environments2015
- Author(s)
  Masashi Unoki, Shota Morita, Akikazu Miyazaki, and Masato Akagi
- Organizer
  Proc. 12th Western Pacific Acoustics Conference 2015 (WESPAC2015)
- Place of Presentation
  Singapore, SIngapore
- Year and Date
  2015-12-06 – 2015-12-09
- Int'l Joint Research
[Presentation] Is movement duration predetermined in visually guided reaching? A comparison of finite- and infinite-horizon optimal feedback control2015
- Author(s)
  Li L., Imamizu H. and Tanaka H.
- Organizer
  International Conference on Advanced Mechatronics (ICAM) 2015
- Place of Presentation
  Waseda Univ. (Tokyo, Japan)
- Year and Date
  2015-12-06
- Int'l Joint Research
[Presentation] Formation of internal forward model with sensory and reward prediction errors: A behavioral confirmation2015
- Author(s)
  Satou H., Sasaki A., Nozaki D. and Tanaka H.
- Organizer
  International Conference on Advanced Mechatronics (ICAM) 2015
- Place of Presentation
  Waseda Univ. (Tokyo, Japan)
- Year and Date
  2015-12-06
- Int'l Joint Research
[Presentation] 振幅変調音のピッチ知覚に基づいた基本周波数推定法の検討2015
- Author(s)
  三輪賢一郎，鵜木祐史
- Organizer
  日本音響学会聴覚研究会
- Place of Presentation
  勝沼ぶどうの丘（山梨県甲州市）
- Year and Date
  2015-11-13 – 2015-11-14
[Presentation] これからの歌声合成について2015
- Author(s)
  齋藤毅・河原英紀・徳田恵一・石川克己・中野倫靖
- Organizer
  情報処理学会MUS研究会
- Place of Presentation
  金沢大学サテライトプラザ（石川県金沢市）
- Year and Date
  2015-11-07 – 2015-11-08
[Presentation] Toward Improving Estimation Accuracy of Emotion Dimensions in Bilingual Scenario Based on Three-layered Model2015
- Author(s)
  Li, X. and Akagi, M.
- Organizer
  Proc. O-COCOSDA2015
- Place of Presentation
  Shanghai, China
- Year and Date
  2015-10-28
- Int'l Joint Research
[Presentation] Analysis of modulation-spectral features extracted from Japanese emotional speech2015
- Author(s)
  hu, Ryota Miyouchi, and Masashi Unoki
- Organizer
  日本音響学会聴覚研究会
- Place of Presentation
  国立清華大学（新竹市，台湾）
- Year and Date
  2015-10-23 – 2015-10-24
[Presentation] Improving estimation accuracy of dimension values for speech emotion in bilingual cases using a three-layered model2015
- Author(s)
  Li, X. and Akagi, M.
- Organizer
  Proc. Auditory Res. Meeting, The Acoustical Society of Japan
- Place of Presentation
  国立清華大学（新竹市，台湾）
- Year and Date
  2015-10-23
[Presentation] Rule-based emotional voice conversion utilizing three-layered model for dimensional approach2015
- Author(s)
  Xue, Y. and Akagi, M.
- Organizer
  Proc. Auditory Res. Meeting, The Acoustical Society of Japan
- Place of Presentation
  国立清華大学（新竹市，台湾）
- Year and Date
  2015-10-23
[Presentation] 磁気センサシステムに基づく調音運動と口蓋形状の関係の観測2015
- Author(s)
  北村達也, 能田由紀子, 吐師道子, 波多野博顕, 梅谷智弘
- Organizer
  第60回日本音声言語医学会総会
- Place of Presentation
  愛知県産業労働センター（愛知県名古屋市）
- Year and Date
  2015-10-15 – 2015-10-16
[Presentation] Feasibility of estimating direction of arrival based on monaural modulation spectrum2015
- Author(s)
  Daisuke Morikawa, Masaru Ando, Masashi Unoki
- Organizer
  The Eleventh International Conference on Intelligent Information Hiding and Multimedia Signal Processing
- Place of Presentation
  Adelaide, Australia
- Year and Date
  2015-09-23 – 2015-09-25
- Int'l Joint Research / Invited
[Presentation] 頭部運動による両耳間差の変化が音像の分離知覚に与える影響2015
- Author(s)
  森川大輔
- Organizer
  日本音響学会2015秋期研究発表会
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] Articulation of phrasal stress in Mandarin Chinese2015
- Author(s)
  岩田礼，エリクソンドナ，澁谷良穂，末光厚夫
- Organizer
  音響学会2015年秋季研究発表会
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] 磁気センサシステムのセンサ装着が発話に及ぼす影響:センサワイヤ交換の効果2015
- Author(s)
  北村達也, 能田由紀子, 吐師道子, 波多野博顕
- Organizer
  日本音響学会研究発表
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] Spanish articulatory rhythm2015
- Author(s)
  Erickson, D., Villegas, J., Wilson, I., and Iguro, Y.
- Organizer
  Proc. ASJ '2015 Fall Meeting
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] Articulation of phrasal stress in Mandarin Chinese2015
- Author(s)
  Iwata, R., Erickson, D., Shibuya, Y., Suemitsu, A.
- Organizer
  Proc. ASJ '2015 Fall Meeting
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16 – 2015-09-18
[Presentation] Study on estimation of bilingual speech emotion dimensions using a three-layered model2015
- Author(s)
  Li, X., Akagi, M.
- Organizer
  Proc. ASJ '2015 Fall Meeting
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16
[Presentation] A method for synthesizing emotional speech using the three-layered model based on a dimensional approach2015
- Author(s)
  Xue, Y., Hamada, Y., and Akagi, M.
- Organizer
  Proc. ASJ '2015 Fall Meeting
- Place of Presentation
  会津大学（福島県会津若松市）
- Year and Date
  2015-09-16
[Presentation] Combined Cine- and Tagged-MRI for Tracking Landmarks on the Tongue Surface2015
- Author(s)
  H. Bao, W. Lu, K. Honda, J. Wei, Q. Fang, J. Dang
- Organizer
  INTERSPEECH2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-07 – 2015-09-10
- Int'l Joint Research
[Presentation] Perception of Mandarin Tones by Native Tibetan Speakers2015
- Author(s)
  W. Bao, H. Feng, J. Dang, Z. Liu, Y. Yu, S. Wang
- Organizer
  INTERSPEECH2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-07 – 2015-09-10
- Int'l Joint Research
[Presentation] Measuring Oral and Nasal Airflow in Production of Chinese Plosive2015
- Author(s)
  Y. Chi, K. Honda, J. Wei, H. Feng, J. Dang
- Organizer
  INTERSPEECH2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-07 – 2015-09-10
- Int'l Joint Research
[Presentation] Complex tensor factorization in modulation frequency domain for single-channel speech enhancement2015
- Author(s)
  Shogo Masaya and Masashi Unoki
- Organizer
  Proc. Interspeech2015
- Place of Presentation
  Dresden, Germany
- Year and Date
  2015-09-07 – 2015-09-10
- Int'l Joint Research
[Presentation] Restoration of instantaneous amplitude and phase of speech signal in noisy reverberant environments2015
- Author(s)
  Yang Liu, Naushin Nower, Yonghong Yan, Masashi Unoki
- Organizer
  Proc. EUSIPCO2015
- Place of Presentation
  Nice Cote d'Azur, France
- Year and Date
  2015-08-31 – 2015-09-04
- Int'l Joint Research
[Presentation] How motor cortex represents body movements: Optimality, recurrent neural networks and spatial dynamics2015
- Author(s)
  Tanaka, H., Miyakoshi, M. and Makeig, S.
- Organizer
  IEEE The 24th International Symposium on Robot and Human Interactive Communication (RO-MAN)
- Place of Presentation
  Kobe International Conference Center (Kobe, Japan)
- Year and Date
  2015-08-31
- Int'l Joint Research
[Presentation] A lip protrusion mechanism examined by magnetic resonance imaging and finite element modeling2015
- Author(s)
  T. Li, K. Honda, J. Wei, J. Dang
- Organizer
  18th ICPhS
- Place of Presentation
  Glasgow, UK
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] The perception of English vowel contrasts by Chinese EFL learners and native English speakers2015
- Author(s)
  A. Zhang, H. Feng, X. Zheng, Z. Xu, J. Dang
- Organizer
  18th ICPhS
- Place of Presentation
  Glasgow, UK
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] Bridging articulation and perception: The C/D model and contrastive emphasis2015
- Author(s)
  Erickson, D., Kim, J., Kawahara, S., Wilson, I., Menezes, C., Suemitsu, A., Moore, J.
- Organizer
  18th International Congress of Phonetic Sciences (ICPhS 2015)
- Place of Presentation
  Glasgow, UK
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] Perception of prosodic social affects in French: A free-labeling study2015
- Author(s)
  Guerry, M., Shochi, T., Rilliard, A., Erickson, D.
- Organizer
  International Congress of Phonetic Sciences
- Place of Presentation
  Glasgow, Scotland
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] Articulatory movement in non-native consonant clusters2015
- Author(s)
  Funatsu, S., Fujimoto, M., Imaizumi, S., Erickson, D.
- Organizer
  International Congress of Phonetic Sciences
- Place of Presentation
  Glasgow, Scotland
- Year and Date
  2015-08-10 – 2015-08-14
- Int'l Joint Research
[Presentation] Study on restoration of instantaneous amplitude and phase of speech signal in noisy reverberant environments2015
- Author(s)
  Yang Liu，Naushin Nower，Yonghong Yan，Masashi Unoki
- Organizer
  Proc. IEICE Technical Report, EA2015
- Place of Presentation
  東北大学（宮城県仙台市）
- Year and Date
  2015-08-03 – 2015-08-04
[Presentation] Movement Representation in the Motor System: Computational Modeling and EEG experiment2015
- Author(s)
  Tanaka, H., Miyakoshi, M. and Makeig, S.
- Organizer
  Brain Connectivity Workshop
- Place of Presentation
  San Diego, U.S.A.
- Year and Date
  2015-06-12
- Int'l Joint Research
[Presentation] Coordinate Systems in the Motor System: Computational Modeling and EEG experiment2015
- Author(s)
  Tanaka, H., Miyakoshi, M. and Makeig, S.
- Organizer
  The 5th International Conference on Cognitive Neurodynamics
- Place of Presentation
  Sanya, China
- Year and Date
  2015-06-04
- Int'l Joint Research
[Presentation] Prosodic strategies of L1 and L2 speakers for attitudinal expressivity in USA English2015
- Author(s)
  Erickson, D., Rilliard, Shochi, T., de Moraes, J.
- Organizer
  Experimental and Theoretical Advances in Prosody (ETAP3)
- Place of Presentation
  Urbana-Champaign, Illinois, USA
- Year and Date
  2015-05-28 – 2015-05-30
- Int'l Joint Research
[Presentation] Analysis of glottal source waves for emotional speech using ARX-LF model2015
- Author(s)
  Li, Y., Hamada, Y., and Akagi, M.
- Organizer
  2015 Otogaku Symposium
- Place of Presentation
  電気通信大学（東京都調布市）
- Year and Date
  2015-05-24
[Presentation] 頭部運動に追従した両耳間時間差の変化による音像の分離知覚2015
- Author(s)
  森川大輔
- Organizer
  音学シンポジウム2015
- Place of Presentation
  電気通信大（東京都調布市）
- Year and Date
  2015-05-23 – 2015-05-24
[Presentation] 表現豊かな音声の認識・合成とAffective Speech-to-Speech Translationへの応用2015
- Author(s)
  赤木正人
- Organizer
  2015音学シンポジウム
- Place of Presentation
  電気通信大学（東京都調布市）
- Year and Date
  2015-05-23
- Invited
[Presentation] VOCAL RESPONSES TO FREQUENCY MODULATED COMPOSITE SINEWAVES VIA AUDITORY AND VIBROTACTILE PATHWAYS2015
- Author(s)
  X. Wang, K. Honda, J. Dang, J. Wei
- Organizer
  40th ICASSP
- Place of Presentation
  Brisbane, Australia
- Year and Date
  2015-04-19 – 2015-04-24
- Int'l Joint Research
[Book] 音声合成技術の現状と展望（総説）, 進化するヒトと機械の音声コミュニケーション2015
- Author(s)
  党　建武
- Total Pages
  340 (125-140)
- Publisher
  株式会社ニッケイ印刷

2015 Fiscal Year Annual Research Report

ヒト発話シミュレータによるStory Teller Systemの構築

Principal Investigator

赤木 正人 北陸先端科学技術大学院大学, 情報科学研究科, 教授 (20242571)

Current Status of Research Progress

Reason

Research Products

[Journal Article] A study on transvelar coupling for non-nasalized sounds2016

Author(s)

Journal Title

DOI

[Journal Article] Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments2016

Author(s)

Journal Title

DOI

[Journal Article] MTF-based Kalman filtering with linear prediction for power envelope restoration in noisy reverberant environments2016

Author(s)

Journal Title

[Journal Article] Coordinate Systems in the Motor System: Computational Modeling and EEG Experiment2016

Author(s)

Journal Title

DOI

[Journal Article] Modeling the motor cortex: Optimality, recurrent neural networks, and spatial dynamics2016

Author(s)

Journal Title

DOI

[Journal Article] Articulatory correlates of metrical structure: Studying jaw displacement patterns2016

Author(s)

Journal Title

DOI

[Journal Article] Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity2016

Author(s)

Journal Title

DOI

[Journal Article] Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging2016

Author(s)

Journal Title

DOI

[Journal Article] A control strategy of a physiological articulatory model for speech production2015

Author(s)

Journal Title

DOI

[Journal Article] Mapping Ultrasound-based Articulatory Images and Vowel Sounds with a DNN Framework2015

Author(s)

Journal Title

DOI

[Journal Article] Strength of syllabic influences on articulation in Mandarin Chinese and French: Insights from a motor control approach2015

Author(s)

Journal Title

DOI

[Journal Article] A real-time articulatory visual feedback approach with target presentation for second language pronunciation learning2015

Author(s)

Journal Title

DOI

[Journal Article] Generalized Finite Difference Time Domain Method and Its Application to Acoustics2015

Author(s)

Journal Title

DOI

[Journal Article] 振幅変調音のピッチ知覚に基づいた調波複合音の基本周波数推定法2015

Author(s)

Journal Title

[Journal Article] Edge prominence and declination in Japanese jaw displacement patterns: a view from the C/D model2015

Author(s)

Journal Title

[Journal Article] VoiceDub：複数タイミング情報をともなう映像エンタテイメント向け音声同期収録支援システム2015

Author(s)

Journal Title

[Journal Article] ATR音声データベース内の文音声における知覚的話者間類似度の計測2015

Author(s)

Journal Title

[Journal Article] Improvement of five-degree-of-freedom sensors for Northern Digital Incorporated's Wave speech research system2015

Author(s)

Journal Title

[Journal Article] 磁気センサシステムによる調音運動のリアルタイム観測2015

Author(s)

Journal Title

[Journal Article] A practical guide to calculating syllable prominence, timing and boundaries in the C/D model. Special Issue on the C/D model2015

Author(s)

Journal Title

[Journal Article] More about contrastive emphasis and the C/D model. Special Issue on the C/D model2015

赤木正人北陸先端科学技術大学院大学, 情報科学研究科, 教授 (20242571)