Phonetic classification of voice qualities based on production mechanisms

Research Project

Project/Area Number	15H03207
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Research Field	Linguistics
Research Institution	Health Sciences University of Hokkaido
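Principal Investigator	SAKAKIBARA Ken-Ichi, Health Sciences University of Hokkaido, School of Rehabilitation Sciences, Associate Professor (80396168)
Co-Investigator(Kenkyū-buntansha)	林良子, Kobe University, Graduate School of Intercultural Studies, Professor (20347785); 後藤多嘉緒, The University of Tokyo, University Hospital, Assistant Professor (20735930); 河原英紀, Wakayama University, Joint-Use Facilities, Professor Emeritus (40294300); 牧勝弘, Aichi Shukutoku University, Faculty of Human Informatics, Professor (50447033); 齋藤毅, Kanazawa University, Faculty of Electrical, Information and Communication Engineering, Assistant Professor (70446962); 山川仁子, Shokei University, Faculty of Culture and Language, Associate Professor (80455196); 天野成昭, Aichi Shukutoku University, Faculty of Human Informatics, Professor (90396119); 山内彰人, National Center for Global Health and Medicine, Other Departments, Otolaryngologist (90612507)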
Project Period (FY)	2015-04-01 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount	¥16,380,000 (Direct Cost: ¥12,600,000, Indirect Cost: ¥3,780,000); Fiscal Year 2018: ¥3,120,000 (Direct Cost: ¥2,400,000, Indirect Cost: ¥720,000); Fiscal Year 2017: ¥3,510,000 (Direct Cost: ¥2,700,000, Indirect Cost: ¥810,000); Fiscal Year 2016: ¥3,900,000 (Direct Cost: ¥3,000,000, Indirect Cost: ¥900,000); Fiscal Year 2015: ¥5,850,000 (Direct Cost: ¥4,500,000, Indirect Cost: ¥1,350,000)
Keywords	voice quality / acoustic analysis / phonetic symbols / phonetics / physiological measurement / vocal fold vibration / sound source model / acoustic features / physiological measurement of speech / speech perception
Outline of Final Research Achievements	As an objective classification of the voice quality of voiced sounds with respect to the laryngeal source, we proposed four categories defined by the vibrating portion of the larynx, including (i) vocal fold phonation, (ii) vocal-ventricular phonation, and (iii) vocal-aryepiglottic phonation. We described the unvoiced sound source as glottal turbulence. Voiced sounds can further be characterized by physical and acoustic features as periodic, subharmonic, or aperiodic. We also clarified the relationship between the tense-lax (tension-relaxation) dimension and the open quotient of vocal fold vibration as an expression of voice quality, and showed that this dimension is suitable as an objective descriptive term for voice quality. To analyze the open quotient (glottal open time ratio), we developed a simple method for detecting glottal closure instants (GCI) and glottal opening instants (GOI) based on complex wavelet analysis of a cosine series envelope.
Academic Significance and Societal Importance of the Research Achievements	Voice quality has long been described with subjective, vaguely defined terms. By realizing voice quality descriptions that can be interpreted and understood objectively, this research has academic significance in establishing terminology that allows a common understanding across voice quality research. In addition, the various voice-quality-related analysis methods proposed in the course of this research can be used for future speech analysis and synthesis, and concrete applications such as emotion analysis and emotional speech synthesis are expected.
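
As a small illustration of the open quotient referred to in the outline, the sketch below computes a per-cycle open quotient from glottal closure and opening instants that are assumed to have been detected already. It does not implement the project's wavelet-based detector; the function name `open_quotients` and the input format (sorted times in seconds) are assumptions made for this example only.

```python
from bisect import bisect_right


def open_quotients(gci, goi):
    """Per-cycle open quotient from sorted GCI and GOI times (seconds).

    A cycle runs from one glottal closure instant to the next. Its open
    quotient is the open-phase duration (opening instant to the next
    closure) divided by the cycle length. Cycles that contain no
    opening instant are skipped.
    """
    result = []
    for start, end in zip(gci, gci[1:]):
        i = bisect_right(goi, start)  # first opening instant after this closure
        if i < len(goi) and goi[i] < end:
            result.append((end - goi[i]) / (end - start))
    return result


# Hypothetical instants for two 10 ms cycles (a 100 Hz voice).
print(open_quotients([0.000, 0.010, 0.020], [0.004, 0.015]))
```

With these made-up instants the two cycles give open quotients of roughly 0.6 and 0.5.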

Report

(5 results)

2018 Annual Research Report, Final Research Report (PDF)
2017 Annual Research Report
2016 Annual Research Report
2015 Annual Research Report

Research Products
(28 results)

Journal Article (11 results) (of which Int'l Joint Research: 2 results, Peer Reviewed: 10 results, Open Access: 3 results, Acknowledgement Compliant: 1 result) Presentation (17 results) (of which Int'l Joint Research: 4 results, Invited: 1 result)

[Journal Article] Commonalities of glottal sources and vocal tract shape among speakers in emotional speech (2018)
- Author(s)
  Yongwei Li, Ken-Ichi Sakakibara, Daisuke Morikawa, Masato Akagi
- Journal Title
  
  Studies on Speech Production, Lecture Notes in Computer Sci.
  
  Volume: LNAI10733 Pages: 24-34
- DOI
  10.1007/978-3-030-00126-1_3
- ISBN
  9783030001254, 9783030001261
- Related Report
  2018 Annual Research Report
- Peer Reviewed
[Journal Article] Frequency domain variants of velvet noise and their application to speech processing and synthesis (2018)
- Author(s)
  H. Kawahara, K.-I. Sakakibara, M. Morise, H. Banno, T. Toda, T. Irino
- Journal Title
  
  Proc. Interspeech 2018
  
  Volume: 2018 Pages: 2027-2031
- Related Report
  2018 Annual Research Report
- Peer Reviewed
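[Journal Article] A simple GCI and GOI detection method based on complex wavelet analysis of the cosine series envelope [余弦級数包絡の複素wavelet分析に基づく簡易なGCI, GOIの検出法について] (2018)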
- Author(s)
  河原英紀, 榊原
- Journal Title
  
  IEICE Technical Report SP2018-22
  
  Volume: 118(198) Pages: 1-5
- Related Report
  2018 Annual Research Report
[Journal Article] Accurate estimation of fo and aperiodicity based on periodicity detector residuals and deviations of phase derivatives (2017)
- Author(s)
  H. Kawahara, K.-I. Sakakibara, M. Morise, H. Banno, T. Toda
- Journal Title
  
  APSIPA ASC 2017
  
  Volume: -
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation (2017)
- Author(s)
  H. Kawahara, K.-I. Sakakibara, M. Morise, H. Banno, T. Toda
- Journal Title
  
  Interspeech 2017
  
  Volume: - Pages: 424-428
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] A new cosine series antialiasing function and its application to aliasing-free glottal source models for speech and singing synthesis (2017)
- Author(s)
  Hideki Kawahara, K. Sakakibara, H. Banno, M. Morise, T. Toda, T. Irino
- Journal Title
  
  Interspeech 2017
  
  Volume: - Pages: 1358-1362
- Related Report
  2017 Annual Research Report
- Peer Reviewed
[Journal Article] High-speed Videolaryngoscopy: Quantitative Parameters of Glottal Area Waveforms and High-speed Kymography in Healthy Individuals (2016)
- Author(s)
  Tsutsumi, M., Isotani, S., Pimenta, R.A., Daier, J.E., Hachiya, A., Tsuji, D.H., Tayama, N., Yokonishi, H., Imagawa, H., Yamauchi, A., Takano, S., Sakakibara, K.-I., Montagnoli, A.N.
- Journal Title
  
  J. Voice
  
  Volume: In press Issue: 3 Pages: 1-9
- DOI
  10.1016/j.jvoice.2016.09.026
- Related Report
  2017 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Characterization of Vocal Fold Vibration in Sulcus Vocalis Using High-Speed Digital Imaging (2016)
- Author(s)
  A. Yamauchi, H. Yokonishi, H. Imagawa, K.-I. Sakakibara, T. Nito, N. Tayama, T. Yamasoba
- Journal Title
  
  J. Speech Lang. Hear. Res.
  
  Volume: 60 Issue: 1 Pages: 24-37
- DOI
  10.1044/2016_jslhr-s-14-0285
- Related Report
  2016 Annual Research Report
- Peer Reviewed
[Journal Article] Validating Stereo-Endoscopy with a Synthetic Vocal Fold Model (2016)
- Author(s)
  K.A. Stevens, R. Shimamura, H. Imagawa, K.-I. Sakakibara, I.T. Tokuda
- Journal Title
  
  Acta Acustica united with Acustica
  
  Volume: 102 Issue: 4 Pages: 745-751
- DOI
  10.3813/aaa.918990
- Related Report
  2016 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity (2016)
- Author(s)
  H. Yokonishi, H. Imagawa, K.-I. Sakakibara, A. Yamauchi, T. Nito, T. Yamasoba, N. Tayama
- Journal Title
  
  J. Voice
  
  Volume: 30 Issue: 2 Pages: 145-157
- DOI
  10.1016/j.jvoice.2015.01.009
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging (2016)
- Author(s)
  i, H. Imagawa, K.-I. Sakakibara, T. Nito, N. Tayama, T. Yamasoba
- Journal Title
  
  J. Voice
  
  Volume: 30 Issue: 2 Pages: 205-214
- DOI
  10.1016/j.jvoice.2015.04.016
- Related Report
  2015 Annual Research Report
- Peer Reviewed / Open Access / Acknowledgement Compliant
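[Presentation] A comparative study of operatic and choral singing focusing on the open quotient [Open quotient に着目したオペラ歌唱と合唱歌唱の比較検討] (2018)
- Author(s)
  若狭健太，榊原健一，河原英紀，寺澤洋子
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting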
- Related Report
  2018 Annual Research Report
[Presentation] Baseline design of a VOCODER based on an interference-free spectral representation and a generalized excitation source representation (2018)
- Author(s)
  河原英紀，榊原健一，森勢将雅
- Organizer
  Acoustical Society of Japan 2019 Spring Meeting
- Related Report
  2018 Annual Research Report
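[Presentation] A comparative study of operatic and choral singing through analysis of physiological and acoustic features [生理・音響的特徴量分析によるオペラ歌唱と合唱歌唱の比較検討] (2018)
- Author(s)
  若狭健太，榊原健一，河原英紀，寺澤洋子
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting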
- Related Report
  2018 Annual Research Report
[Presentation] Simultaneous estimation of glottal source waveform and vocal tract shape from speech signal based on ARX-LF model (2018)
- Author(s)
  李永偉，榊原健一，赤木正人
- Organizer
  Acoustical Society of Japan 2018 Autumn Meeting
- Related Report
  2018 Annual Research Report
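[Presentation] A comparative study of operatic and choral singing focusing on vocal fold vibration [声帯振動に着目したオペラ歌唱と合唱歌唱の比較研究] (2018)
- Author(s)
  若狭健太，榊原健一，平賀譲，寺澤洋子
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting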
- Related Report
  2017 Annual Research Report
[Presentation] On extending glottal source models using log-domain pulses [対数領域パルスによる声帯音源モデルの拡張について] (2018)
- Author(s)
  河原英紀，榊原健一
- Organizer
  Acoustical Society of Japan 2018 Spring Meeting
- Related Report
  2017 Annual Research Report
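[Presentation] On extensions of aliasing-free glottal source models and an interactive speech production simulator [エリアシングの無い声帯音源モデルおよび対話的音声生成シミュレータの拡張について] (2017)
- Author(s)
  河原英紀, 榊原健一
- Organizer
  Acoustical Society of Japan 2017 Spring Meeting
- Place of Presentation
  Meiji University Ikuta Campus (Kawasaki, Kanagawa), Japan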
- Year and Date
  2017-03-15
- Related Report
  2016 Annual Research Report
[Presentation] Commonalities and differences of glottal sources and vocal tract shapes among speakers in emotional speech (2017)
- Author(s)
  李永偉, 榊原健一, 森川大輔, 赤木正人
- Organizer
  Acoustical Society of Japan 2017 Spring Meeting
- Place of Presentation
  Meiji University Ikuta Campus (Kawasaki, Kanagawa), Japan
- Year and Date
  2017-03-15
- Related Report
  2016 Annual Research Report
[Presentation] Fundamental frequency revisited [基本周波数再訪] (2017)
- Author(s)
  河原英紀, 榊原健一
- Organizer
  Auditory Research Meeting (聴覚研究会)
- Related Report
  2017 Annual Research Report
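[Presentation] On extending the interactive speech production simulator to time-varying models [対話音声生成シミュレータの時変モデルへの拡張について] (2017)
- Author(s)
  河原英紀，榊原健一
- Organizer
  Acoustical Society of Japan 2017 Autumn Meeting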
- Related Report
  2017 Annual Research Report
[Presentation] Relationships between features of glottal sources and vocal tract shapes and perceived positions on valence and activation in emotional speech (2017)
- Author(s)
  李永偉，榊原健一，赤木正人
- Organizer
  Acoustical Society of Japan 2017 Autumn Meeting
- Related Report
  2017 Annual Research Report
[Presentation] Commonalities of glottal sources and vocal tract shapes among speakers in emotional speech (2017)
- Author(s)
  Y. Li, K.-I. Sakakibara, D. Morikawa, M. Akagi
- Organizer
  The 11th International seminar on speech production
- Related Report
  2017 Annual Research Report
- Int'l Joint Research
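[Presentation] Sex and age differences among healthy individuals in the relationship between the open quotient (OQ) and sound quality [声帯開閉時間率 (open quotient: OQ) と音質の関連における健常者の性差・年齢差] (2016)
- Author(s)
  山内彰人, 横西久幸, 今川博, 榊原健一, 二藤隆春, 山岨達也, 田山二朗
- Organizer
  Annual Meeting of the Japan Society of Logopedics and Phoniatrics
- Place of Presentation
  Pacifico Yokohama (Yokohama, Kanagawa), Japan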
- Year and Date
  2016-11-03
- Related Report
  2016 Annual Research Report
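[Presentation] A study on generating stroboscopic images using laryngeal high-speed digital imaging [喉頭高速度デジタル撮像法によるストロボ画像化に関する研究] (2016)
- Author(s)
  堤内亮博, 山内彰人, 今川博, 榊原健一, 横西久幸, 田山二朗
- Organizer
  Annual Meeting of the Japan Society of Logopedics and Phoniatrics
- Place of Presentation
  Pacifico Yokohama (Yokohama, Kanagawa), Japan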
- Year and Date
  2016-11-03
- Related Report
  2016 Annual Research Report
[Presentation] SparkNG: Interactive Matlab tools for introduction to speech production, perception and processing fundamentals and application of the aliasing-free L-F model component (2016)
- Author(s)
  H. Kawahara
- Organizer
  Interspeech 2016
- Place of Presentation
  San Francisco, USA
- Year and Date
  2016-09-08
- Related Report
  2016 Annual Research Report
- Int'l Joint Research
[Presentation] Analysis of spatial characteristics of the larynx using high-speed digital imaging (2016)
- Author(s)
  K.-I. Sakakibara, H. Imagawa, I.T.Tokuda, A. Yamauchi, H. Yokonishi, N. Tayama
- Organizer
  International Conference on Voice Physiology and Biomechanics
- Place of Presentation
  Viña del Mar, Chile
- Year and Date
  2016-03-14
- Related Report
  2015 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation (2015)
- Author(s)
  H. Kawahara, K.-I. Sakakibara, H. Banno, M. Morise, T. Toda and T. Irino
- Organizer
  APSIPA ASC 2015
- Place of Presentation
  Hong Kong
- Year and Date
  2015-12-16
- Related Report
  2015 Annual Research Report
- Int'l Joint Research
