• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Phonetic classification of voice qualities based on production mechanisms

Research Project

Project/Area Number 15H03207
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Linguistics
Research InstitutionHealth Sciences University of Hokkaido

Principal Investigator

SAKAKIBARA Ken-Ichi  北海道医療大学, リハビリテーション科学部, 准教授 (80396168)

Co-Investigator(Kenkyū-buntansha) 林 良子  神戸大学, 国際文化学研究科, 教授 (20347785)
後藤 多嘉緒  東京大学, 医学部附属病院, 助教 (20735930)
河原 英紀  和歌山大学, 学内共同利用施設等, 名誉教授 (40294300)
牧 勝弘  愛知淑徳大学, 人間情報学部, 教授 (50447033)
齋藤 毅  金沢大学, 電子情報通信学系, 助教 (70446962)
山川 仁子  尚絅大学, 文化言語学部, 准教授 (80455196)
天野 成昭  愛知淑徳大学, 人間情報学部, 教授 (90396119)
山内 彰人  国立研究開発法人国立国際医療研究センター, その他部局等, 耳鼻咽喉科医師 (90612507)
Project Period (FY) 2015-04-01 – 2019-03-31
Project Status Completed (Fiscal Year 2018)
Budget Amount *help
¥16,380,000 (Direct Cost: ¥12,600,000、Indirect Cost: ¥3,780,000)
Fiscal Year 2018: ¥3,120,000 (Direct Cost: ¥2,400,000、Indirect Cost: ¥720,000)
Fiscal Year 2017: ¥3,510,000 (Direct Cost: ¥2,700,000、Indirect Cost: ¥810,000)
Fiscal Year 2016: ¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000)
Fiscal Year 2015: ¥5,850,000 (Direct Cost: ¥4,500,000、Indirect Cost: ¥1,350,000)
Keywords声質 / 音響分析 / 音声記号 / 音声学 / 生理計測 / 声帯振動 / 音源モデル / 音響特徴量 / 音声生理計測 / 音声知覚
Outline of Final Research Achievements

As the objective classification of voice quality of voiced sounds related to the laryngeal source, we proposed four different categories concerning the vibration portion in the larynx: (i) vocal fold phonation; (ii) vocal-ventricular phonation; (iii) vocal-aryepiglottic phonation. Also, the we described unvoiced sound source as glottal turbulence. For voiced sounds, it is possible to use physical and acoustic features, and periodic, subharmonic, aperiodic. Besides, we clarified the relationship between tension-relaxation and open quotient of vocal fold vibration as voice quality expression, and clarified that it was suitable as an objective expression word of voice quality.
We developed a simple GCI and GOI detection method based on complex wavelet analysis of cosine series envelope as an analysis method of glottal open time rate.

Academic Significance and Societal Importance of the Research Achievements

声質に関しては主観的な定義の曖昧な表現語が用いられてきたが、本研究成果により、客観的に解釈、理解可能な声質表現が実現されたことは、声質関連研究において共通理解が可能な用語が確立されたという学術意義がある。また本研究を遂行する過程で、提案された様々な声質に関連した分析方法は、今後の音声分析・合成のために用いることが可能であり、感情分析、感情音声合成など具体的な応用が期待される。

Report

(5 results)
  • 2018 Annual Research Report   Final Research Report ( PDF )
  • 2017 Annual Research Report
  • 2016 Annual Research Report
  • 2015 Annual Research Report
  • Research Products

    (28 results)

All 2018 2017 2016 2015

All Journal Article (11 results) (of which Int'l Joint Research: 2 results,  Peer Reviewed: 10 results,  Open Access: 3 results,  Acknowledgement Compliant: 1 results) Presentation (17 results) (of which Int'l Joint Research: 4 results,  Invited: 1 results)

  • [Journal Article] Commonalities of glottal sources and vocal tract shape among speakers in emotional speech2018

    • Author(s)
      Yongwei Li, Ken-Ichi Sakakibara, Daisuke Morikawa, Masato Akagi
    • Journal Title

      Studies on Speech Production, Lecture Notes in Computer Sci.

      Volume: LNAI10733 Pages: 24-34

    • DOI

      10.1007/978-3-030-00126-1_3

    • ISBN
      9783030001254, 9783030001261
    • Related Report
      2018 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Frequency domain variants of velvet noise and their application to speech processing and synthesis2018

    • Author(s)
      H. Kawahara, K.-I. Sakakibara, M. Morise, H. Banno, T. Toda, T. Irino
    • Journal Title

      Proc. Interspeech 2018

      Volume: 2018 Pages: 2027-2031

    • Related Report
      2018 Annual Research Report
    • Peer Reviewed
  • [Journal Article] 余弦級数包絡の複素wavelet分析に基づく簡易なGCI, GOIの検出法について2018

    • Author(s)
      河原 英紀, 榊原
    • Journal Title

      信学技報 SP2018-22

      Volume: 118(198) Pages: 1-5

    • Related Report
      2018 Annual Research Report
  • [Journal Article] Accurate estimation of fo and aperiodicity based on periodicity detector residuals and deviations of phase derivatives2017

    • Author(s)
      H. Kawahara, K.-I. Sakakibara, M. Morise, H.Banno, T. Toda
    • Journal Title

      APSIPA ASC 2017

      Volume: -

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation2017

    • Author(s)
      H. Kawahara, K.-I. Sakakibara, M. Morise, H.Banno, T. Toda
    • Journal Title

      Interspeech 2017

      Volume: - Pages: 424-428

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A new cosine series antialiasing function and its application to aliasing-free glottal source models for speech and singing synthesis2017

    • Author(s)
      Hideki Kawahara, K. Sakakibara, H. Banno, M. Morise, T. Toda, T. Irino
    • Journal Title

      Interspeech 2017

      Volume: - Pages: 1358-1362

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] High-speed Videolaryngoscopy: Quantitative Parameters of Glottal Area Waveforms and High-speed Kymography in Healthy Individuals2016

    • Author(s)
      Tsutsumi, M., Isotani, S., Pimenta, R.A.., Daier, J.E., Hachiya, A., Tsuji, D.H., Tayama, N., Yokonishi, H., Imagawa. H., Yamauchi, A., Takano, S., Sakakibara, K.-I., Montagnoli, A.N.
    • Journal Title

      J. Voice

      Volume: In press Issue: 3 Pages: 1-9

    • DOI

      10.1016/j.jvoice.2016.09.026

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Characterization of Vocal Fold Vibration in Sulcus Vocalis Using High-Speed Digital Imaging.2016

    • Author(s)
      A. Yamauchi, H. Yokonishi, H. Imagawa, K.-I. Sakakibara, T. Nito, N. Tayama, T. Yamasoba
    • Journal Title

      J. Speech Lang. Hear. Res.

      Volume: 60 Issue: 1 Pages: 24-37

    • DOI

      10.1044/2016_jslhr-s-14-0285

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Validating Stereo-Endoscopy with a Synthetic Vocal Fold Model2016

    • Author(s)
      K.A.Stevens, R. Shimamura, H. Imagawa, K.-I. Sakakibara, I.T.Tokuda
    • Journal Title

      Acta Acustica united with Acustica

      Volume: 102 Issue: 4 Pages: 745-751

    • DOI

      10.3813/aaa.918990

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Relationship of various open quotients with acoustic property, phonation types, fundamental frequency, and intensity2016

    • Author(s)
      H. Yokonishi, H. Imagawa, K.-I. Sakakibara, A. Yamauchi, T. Nito, T. Yamasoba, N. Tayama
    • Journal Title

      J. Voice

      Volume: 30, 2 Issue: 2 Pages: 145-157

    • DOI

      10.1016/j.jvoice.2015.01.009

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging2016

    • Author(s)
      i, H. Imagawa, K.-I. Sakakibara, T. Nito, N. Tayama, T. Yamasoba
    • Journal Title

      J. Voice

      Volume: 30, 2 Issue: 2 Pages: 205-214

    • DOI

      10.1016/j.jvoice.2015.04.016

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access / Acknowledgement Compliant
  • [Presentation] Open quotient に着目したオペラ歌唱と合唱歌唱の比較検討2018

    • Author(s)
      若狭 健太,榊原 健一,河原 英紀,寺澤 洋子
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] Baseline design of a VOCODER based on an interference-free spectral representation and a generalized excitation source representation2018

    • Author(s)
      河原 英紀,榊原 健一,森勢 将雅
    • Organizer
      日本音響学会2019年春季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 生理・音響的特徴量分析によるオペラ歌唱と合唱歌唱の比較検討2018

    • Author(s)
      若狭 健太,榊原 健一,河原 英紀,寺澤 洋子
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] Simultaneous estimation of glottal source waveform and vocal tract shape from speech signal based on ARX-LF model2018

    • Author(s)
      李 永偉,榊原 健一,赤木 正人
    • Organizer
      日本音響学会2018年秋季研究発表会
    • Related Report
      2018 Annual Research Report
  • [Presentation] 声帯振動に着目したオペラ歌唱と合唱歌唱の比較研究2018

    • Author(s)
      若狭健太,榊原健一,平賀譲,寺澤洋子
    • Organizer
      2018年日本音響学会春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 対数領域パルスによる声帯音源モデルの拡張について2018

    • Author(s)
      河原英紀,榊原健一
    • Organizer
      2018年日本音響学会春季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] エリアシングの無い声帯音源モデルおよび対話的音声生成シミュレータの拡張について2017

    • Author(s)
      河原 英紀, 榊原 健一
    • Organizer
      日本音響学会2017年春季研究発表会
    • Place of Presentation
      明治大学生田キャンパス(神奈川県,川崎市), 日本
    • Year and Date
      2017-03-15
    • Related Report
      2016 Annual Research Report
  • [Presentation] Commonalities and differences of glottal sources and vocal tract shapes among speakers in emotional speech2017

    • Author(s)
      李永偉, 榊原健一, 森川大輔, 赤木正人
    • Organizer
      日本音響学会2017年春季研究発表会
    • Place of Presentation
      明治大学生田キャンパス(神奈川県,川崎市), 日本
    • Year and Date
      2017-03-15
    • Related Report
      2016 Annual Research Report
  • [Presentation] 基本周波数再訪2017

    • Author(s)
      河原英紀, 榊原健一
    • Organizer
      聴覚研究会
    • Related Report
      2017 Annual Research Report
  • [Presentation] 対話音声生成シミュレータの時変モデルへの拡張について2017

    • Author(s)
      河原英紀,榊原健一
    • Organizer
      2017年日本音響学会秋季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] Relationships between features of glottal sources and vocal tract shapes and perceived positions on valence and activation in emotional speech2017

    • Author(s)
      李 永偉,榊原 健一,赤木 正人
    • Organizer
      2017年日本音響学会秋季研究発表会
    • Related Report
      2017 Annual Research Report
  • [Presentation] Commonalities of glottal sources and vocal tract shapes among speakers in emotional speech2017

    • Author(s)
      Y. Li, K.-I. Sakakibara, D. Morikawa, M. Akagi
    • Organizer
      The 11th International seminar on speech production
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research
  • [Presentation] 声帯開閉時間率 (open quotient: OQ) と音質の関連における健常者の性差・年齢差2016

    • Author(s)
      山内彰人, 横西久幸, 今川博, 榊原健一, 二藤隆春, 山岨達也, 田山二朗
    • Organizer
      日本音声言語医学会学術講演会
    • Place of Presentation
      パシフィコ横浜(神奈川県 横浜市), 日本
    • Year and Date
      2016-11-03
    • Related Report
      2016 Annual Research Report
  • [Presentation] 喉頭高速度デジタル撮像法によるストロボ画像化に関する研究2016

    • Author(s)
      堤内亮博, 山内彰人, 今川博, 榊原健一, 横西久幸, 田山二朗
    • Organizer
      日本音声言語医学会学術講演会
    • Place of Presentation
      パシフィコ横浜(神奈川県 横浜市), 日本
    • Year and Date
      2016-11-03
    • Related Report
      2016 Annual Research Report
  • [Presentation] SparkNG: Interactive Matlab tools for introduction to speech production, perception and processing fundamentals and application of the aliasing-free L-F model component2016

    • Author(s)
      H. Kawahara
    • Organizer
      Interspeech 2016
    • Place of Presentation
      San Francisco, USA
    • Year and Date
      2016-09-08
    • Related Report
      2016 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Analysis of spatial characteristics of the larynx using high-speed digital imaging2016

    • Author(s)
      K.-I. Sakakibara, H. Imagawa, I.T.Tokuda, A. Yamauchi, H. Yokonishi, N. Tayama
    • Organizer
      International Conference on Voice Physiology and Biomechanics
    • Place of Presentation
      Viña del Mar, Chile
    • Year and Date
      2016-03-14
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation2015

    • Author(s)
      H. Kawahara, K.-I. Sakakibara, H. Banno, M. Morise, T. Toda and T. Irino
    • Organizer
      APSIPA ASC 2015
    • Place of Presentation
      Hong Kong
    • Year and Date
      2015-12-16
    • Related Report
      2015 Annual Research Report
    • Int'l Joint Research

URL: 

Published: 2015-04-16   Modified: 2023-03-16  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi