• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

PSYPHON: Psychoacoustic features for Phonation prediction

Research Project

Project/Area Number 20K11956
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeMulti-year Fund
Section一般
Review Section Basic Section 61030:Intelligent informatics-related
Research InstitutionThe University of Aizu

Principal Investigator

Villegas Julian  会津大学, コンピュータ理工学部, 上級准教授 (50706281)

Co-Investigator(Kenkyū-buntansha) 李 勝勲  国際基督教大学, 教養学部, 上級准教授 (20770134)
MARKOV K  会津大学, コンピュータ理工学部, 教授 (80394998)
Project Period (FY) 2020-04-01 – 2023-03-31
Project Status Completed (Fiscal Year 2022)
Budget Amount *help
¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)
Fiscal Year 2022: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
Fiscal Year 2021: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000)
Fiscal Year 2020: ¥2,340,000 (Direct Cost: ¥1,800,000、Indirect Cost: ¥540,000)
KeywordsPhonation prediction / Machine Learning / Psychoacoustics / Phonation prediction / Machine Learning / Corpus acquisition / Machine learning
Outline of Research at the Start

Many of languages use voice qualities such as modal, breathy, creaky, etc. for distinguishing between units of sound. This project (PSYPHON) aims at the creation of predictors for those voice qualities by using models of sound perception instead of looking at the speech recordings directly.

Outline of Final Research Achievements

We were able to document and acquire new corpora of words from the Zapotec and Mixe languages. These languages are characterized by a three-way phonemic contrast (modal, creaky, and breathy). These corpora are valuable for training phonation prediction systems based on machine learning. By subjective experimentation, we found that the sensitivity to creakiness observed in classifications made by experts and machine learning systems based on these classifications surpassed that of native and naive listeners. This finding supports our hypothesis that psychoacoustic features, which are universal, are better predictors of perceived phonation compared to existing methods. In addition, we found that falsetto was associated with pitch, whispering with sharpness, and creakiness with loudness and roughness. Lastly, by re-analyzing previous subjective studies, we were able to develop a psychoacoustic roughness model based on machine learning techniques.

Academic Significance and Societal Importance of the Research Achievements

This project is significant because it helps to close the gap between languages that are under-resourced and those that have sufficient resources, contributing directly to the Sustainable Development Goal SDG-10 (reducing inequality within and among countries).

Report

(4 results)
  • 2022 Annual Research Report   Final Research Report ( PDF )
  • 2021 Research-status Report
  • 2020 Research-status Report
  • Research Products

    (8 results)

All 2023 2022 Other

All Journal Article (4 results) (of which Int'l Joint Research: 2 results,  Peer Reviewed: 2 results) Presentation (1 results) (of which Int'l Joint Research: 1 results) Remarks (3 results)

  • [Journal Article] Psychoacoustic features explain creakiness classifications made by naive and non-naive listeners2023

    • Author(s)
      Julian Villegas, Seunghun J. Lee, Jeremy Perkins & Konstantin Markov
    • Journal Title

      Speech Communication

      Volume: 147 Pages: 74-81

    • DOI

      10.1016/j.specom.2023.01.006

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Creakiness Judgments by Burmese and Vietnamese Speakers2022

    • Author(s)
      Villegas Julian、Lee Seunghun J.
    • Journal Title

      Proc. 25 Conf. of the Oriental chapter of the Int. Committee for the Coordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)

      Volume: NA Pages: 1-5

    • DOI

      10.1109/o-cocosda202257103.2022.9997908

    • Related Report
      2022 Annual Research Report
  • [Journal Article] The non-coalescence of /h/ and Incomplete Neutralization in South Jeolla Korean2022

    • Author(s)
      Lee, Seunghun J. & Villegas, Julin & Oh, Mira
    • Journal Title

      Language and Speech

      Volume: 66(2) Issue: 2 Pages: 442-473

    • DOI

      10.1177/00238309221116130

    • Related Report
      2022 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] ICU language database series 8: Comparative Zapotec2022

    • Author(s)
      Y. Baldoria, B. P. Fleming, M. W. R. Liu, J. Villegas, and S. J. Lee
    • Journal Title

      ICU Working Papers in Linguistics (ICUWPL)

      Volume: 22 Pages: 225-349

    • Related Report
      2022 Annual Research Report
  • [Presentation] Creakiness Judgments by Burmese and Vietnamese Speakers2022

    • Author(s)
      Villegas Julian
    • Organizer
      25 Conf. of the Oriental chapter of the Int. Committee for the Coordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)
    • Related Report
      2022 Annual Research Report
    • Int'l Joint Research
  • [Remarks] PSYPHON Website

    • URL

      https://onkyo.u-aizu.ac.jp/psyphon

    • Related Report
      2022 Annual Research Report
  • [Remarks] https://onkyo.u-aizu.ac.jp/psyphon/

    • Related Report
      2021 Research-status Report
  • [Remarks] PSYPHON website

    • URL

      http://onkyo.u-aizu.ac.jp/psyphon

    • Related Report
      2020 Research-status Report

URL: 

Published: 2020-04-28   Modified: 2024-12-25  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi