VOICE 2.0: towards augmentation of enriched speech communication

Research Project

Project/Area Number	20KK0233
Research Category	Fund for the Promotion of Joint International Research (Fostering Joint International Research (B))
Allocation Type	Multi-year Fund
Review Section	Medium-sized Section 61: Human informatics and related fields
Research Institution	Japan Advanced Institute of Science and Technology
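Principal Investigator	鵜木祐史, Japan Advanced Institute of Science and Technology, Graduate School of Advanced Science and Technology, Professor (00343187)
Co-Investigator(Kenkyū-buntansha)	赤木正人, Japan Advanced Institute of Science and Technology, Graduate School of Advanced Science and Technology, Professor Emeritus (20242571); 木谷俊介, Japan Advanced Institute of Science and Technology, Graduate School of Advanced Science and Technology, Lecturer (70635367); 森田翔太, Fukuyama University, Faculty of Engineering, Lecturer (70780378)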
Project Period (FY)	2020-10-27 – 2025-03-31
Project Status	Granted (Fiscal Year 2023)
Budget Amount	¥18,720,000 (Direct Cost: ¥14,400,000, Indirect Cost: ¥4,320,000)
Fiscal Year 2024: ¥3,900,000 (Direct Cost: ¥3,000,000, Indirect Cost: ¥900,000)
Fiscal Year 2023: ¥3,900,000 (Direct Cost: ¥3,000,000, Indirect Cost: ¥900,000)
Fiscal Year 2022: ¥3,640,000 (Direct Cost: ¥2,800,000, Indirect Cost: ¥840,000)
Fiscal Year 2021: ¥3,770,000 (Direct Cost: ¥2,900,000, Indirect Cost: ¥870,000)
Fiscal Year 2020: ¥3,510,000 (Direct Cost: ¥2,700,000, Indirect Cost: ¥810,000)
Keywords	Speech communication / Enrichment / Nonlinguistic information perception / Speech intelligibility / voice 2.0 / Nonlinguistic information / Auditory enrichment
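Outline of Research at the Start	Speech information processing aimed at Society 5.0 is advancing rapidly thanks to innovations in AI technology. The sound quality of synthetic speech now approaches that of human speech, but expressive speech synthesis that conveys the speaker's emotions, intentions, and attitudes, which are the essence of speech communication, has not yet been achieved. This project focuses on the mechanisms of human speech perception and production and combines them organically with data science, aiming at the augmentation of enriched speech communication (VOICE 2.0). The expected outcome is to raise the added value of speech communication by controlling, as humans do, the degree of pitch and intensity of voice qualities such as speech clarity, emotion, speaker individuality, and changes in speaking style.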
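Outline of Annual Research Achievements	To realize expressive speech communication, this project separates the enriched information in speech, within a mathematical-engineering representation of speech, into linguistic, nonlinguistic, and paralinguistic information and controls each individually, thereby realizing speech enrichment (VOICE 2.0) in the following five respects. Task 1: enrichment of linguistic information. Task 2: enrichment of nonlinguistic information (speaker identity). Task 3: enrichment of nonlinguistic information (emotion). Task 4: enrichment of nonlinguistic information (vocal texture). Task 5: enrichment of paralinguistic information (such as urgency). The expected outcome is to raise the added value of speech communication by controlling, as humans do, the degree of pitch and intensity of voice qualities such as speech clarity, emotion, speaker individuality, and changes in speaking style. In FY2023, work continued on Tasks 3 to 5, which had been examined in the previous fiscal year. For Task 3, the relationship between emotion perception and the modulation components in the modulation spectrum was investigated. The results showed that modulation components (0 to 16 Hz) in the amplitude envelope are important for emotion perception, and that the slope of the temporal change of the instantaneous modulation frequency within them is also involved in emotion perception. For Task 4, the relationship between vocal texture and features related to timbral attributes (sound quality metrics: sharpness, roughness, and fluctuation strength), as well as temporal features of the fundamental frequency component (jitter and shimmer), was examined. The results showed that the sound quality metrics and jitter and shimmer can be used to evaluate the naturalness of a voice. For Task 5, the relationship between urgency perception and the modulation components in the modulation spectrum was investigated. As in Task 3, the results showed that perceived urgency can be reduced by adjusting the slope of the temporal change of the instantaneous modulation frequency components in the amplitude envelope of speech.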
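As an illustrative aside, jitter and shimmer, the temporal features named in the Task 4 summary, are commonly computed as cycle-to-cycle perturbation measures. The sketch below assumes the widely used "local" definitions (mean absolute difference between consecutive glottal periods or peak amplitudes, divided by the mean value). This record does not state which definitions the project used, and the function names and sample values are hypothetical.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference of consecutive values, relative to the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


def jitter(periods):
    """Local jitter: cycle-to-cycle variation of the glottal period (assumed definition)."""
    return local_perturbation(periods)


def shimmer(amplitudes):
    """Local shimmer: cycle-to-cycle variation of the peak amplitude (assumed definition)."""
    return local_perturbation(amplitudes)


# Hypothetical per-cycle measurements, for illustration only.
periods_s = [0.0100, 0.0102, 0.0099, 0.0101]
peak_amplitudes = [0.80, 0.82, 0.79, 0.81]

print(f"jitter  = {jitter(periods_s):.4f}")
print(f"shimmer = {shimmer(peak_amplitudes):.4f}")
```

Both measures are dimensionless ratios, so a perfectly periodic, constant-amplitude signal yields zero for each.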
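Current Status of Research Progress	2: Research has progressed on the whole more than it was originally planned. Reason: In FY2023, as planned, the relationship between the nonlinguistic and paralinguistic information in speech and modulation perception, examined in the previous fiscal year, was investigated. As described above, the results showed that manipulating the temporal change of the instantaneous modulation frequency is directly linked to the enrichment of nonlinguistic and paralinguistic information. It can therefore readily be expected that nonlinguistic and paralinguistic information can be enriched by manipulating the temporal change of the instantaneous modulation frequency. In the next fiscal year, the project can proceed to the final studies aimed at improving emotion perception and urgency perception. For these reasons, the progress was judged to fall under category (2).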
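Strategy for Future Research Activity	After the COVID-19 pandemic subsided, the principal investigator alone was able to visit Technische Universität Dresden for the first time. Research progress at both institutions over the past three years was reported in person, and future research directions were discussed. Because the research itself has advanced considerably, research exchange will be deepened further, including through online means. Long-term stays by the co-investigators remain difficult, but in the final year all members, including the co-investigators, plan to visit Technische Universität Dresden for in-depth discussions so that the research goals can be reached.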

Report

(4 results)

Research Products
(83 results)


All Journal Article (24 results) (of which Int'l Joint Research: 5 results, Peer Reviewed: 24 results, Open Access: 12 results) Presentation (59 results) (of which Int'l Joint Research: 30 results)

[Journal Article] Computational models of auditory sensation important for sound quality on basis of either gammatone or gammachirp auditory filterbank (2024)
- Author(s)
  Isoyama Takuto、Kidani Shunsuke、Unoki Masashi
- Journal Title
  
  Applied Acoustics
  
  Volume: 218 Pages: 109914-109914
- DOI
  10.1016/j.apacoust.2024.109914
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Linear Model Approach to Investigate the Comprehensive Entrainment in Conversation (2024)
- Author(s)
  Yuning Liu, Masashi Unoki
- Journal Title
  
  Proc. NCSP24
  
  Volume: - Pages: 51-54
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Phase-Aware Speech Enhancement With Complex Wiener Filter (2023)
- Author(s)
  Nguyen Huy、Ho Tuan Vu、Akagi Masato、Unoki Masashi
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 141573-141584
- DOI
  10.1109/access.2023.3341919
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Contributions of Jitter and Shimmer in the Voice for Fake Audio Detection (2023)
- Author(s)
  Li Kai、Lu Xugang、Akagi Masato、Unoki Masashi
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 84689-84698
- DOI
  10.1109/access.2023.3301616
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Anomalous Sound Detection for Industrial Machines Using Acoustical Features Related to Timbral Metrics (2023)
- Author(s)
  Ota Yasuji、Unoki Masashi
- Journal Title
  
  IEEE Access
  
  Volume: 11 Pages: 70884-70897
- DOI
  10.1109/access.2023.3294334
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Contributions of Temporal Modulation Cues in Temporal Amplitude Envelope of Speech to Urgency Perception (2023)
- Author(s)
  Unoki Masashi、Kawamura Miho、Kobayashi Maori、Kidani Shunsuke、Li Junfeng、Akagi Masato
- Journal Title
  
  Applied Sciences
  
  Volume: 13 Issue: 10 Pages: 6239-6239
- DOI
  10.3390/app13106239
- Related Report
  2023 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Analysis of Spectro-Temporal Modulation Representation for Deep-Fake Speech Detection (2023)
- Author(s)
  Cheng Haowei、Mawalim Candy Olivia、Li Kai、Wang Lijun、Unoki Masashi
- Journal Title
  
  Proc. APSIPA2023
  
  Volume: - Pages: 1822-1829
- DOI
  10.1109/apsipaasc58517.2023.10317309
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Deepfake-speech Detection with Pathological Features and Multilayer Perceptron Neural Network (2023)
- Author(s)
  Chaiwongyen Anuwat、Duangpummet Suradej、Karnjana Jessada、Kongprawechnon Waree、Unoki Masashi
- Journal Title
  
  Proc. APSIPA2023
  
  Volume: - Pages: 2182-2188
- DOI
  10.1109/apsipaasc58517.2023.10317331
- Related Report
  2023 Research-status Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditions (2023)
- Author(s)
  Guo Taiyang、Li Sixia、Kidani Shunsuke、Okada Shogo、Unoki Masashi
- Journal Title
  
  Proc. APSIPA2023
  
  Volume: - Pages: 2221-2227
- DOI
  10.1109/apsipaasc58517.2023.10317449
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection (2023)
- Author(s)
  Li Kai、Tran Dung Kim、Lu Xugang、Akagi Masato、Unoki Masashi
- Journal Title
  
  Proc. EUSIPCO2023
  
  Volume: - Pages: 201-205
- DOI
  10.23919/eusipco58844.2023.10289922
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Computational model for predicting sound quality metrics using loudness model based on gammatone/gammachirp auditory filterbank and its applications (2023)
- Author(s)
  Isoyama Takuto、Kidani Shunsuke、Unoki Masashi
- Journal Title
  
  Proc. INTER-NOISE2023
  
  Volume: 268 Issue: 3 Pages: 5955-5964
- DOI
  10.3397/in_2023_0861
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Study on suppression effect of air-conducted sound by bone-conducted sound (2023)
- Author(s)
  Inoue Shunsuke、Toya Teruki、Uezu Yasufumi、Unoki Masashi
- Journal Title
  
  Proc. INTER-NOISE
  
  Volume: 268 Issue: 3 Pages: 5479-5489
- DOI
  10.3397/in_2023_0778
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Consonant-emphasis Method Incorporating Robust Consonant-section Detection to Improve Intelligibility of Bone-conducted speech (2023)
- Author(s)
  Uezu Yasufumi、Wang Sicheng、Toya Teruki、Unoki Masashi
- Journal Title
  
  Proc. Interspeech2023
  
  Volume: - Pages: 849-853
- DOI
  10.21437/interspeech.2023-2568
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Vowel production changes under noise with consideration of low-order formant masking (2023)
- Author(s)
  Yasufumi Uezu, Masato Akagi, Masashi Unoki
- Journal Title
  
  Proc. 20th International Congress of Phonetic Sciences
  
  Volume: - Pages: 673-677
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification (2023)
- Author(s)
  Yuan Weitao、Bian Yuren、Wang Shengbei、Unoki Masashi、Wang Wenwu
- Journal Title
  
  Proc. ICASSP2023
  
  Volume: - Pages: 849-853
- DOI
  10.1109/icassp49357.2023.10096651
- Related Report
  2023 Research-status Report
- Peer Reviewed
[Journal Article] Method of estimating three-dimensional direction-of-arrival based on monaural modulation spectrum (2023)
- Author(s)
  Wang Rui、Bui Nguyen Khanh、Morikawa Daisuke、Unoki Masashi
- Journal Title
  
  Applied Acoustics
  
  Volume: 203 Pages: 109215-109215
- DOI
  10.1016/j.apacoust.2023.109215
- Related Report
  2022 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Contribution of Common Modulation Spectral Features to Vocal-Emotion Recognition of Noise-Vocoded Speech in Noisy Reverberant Environments (2022)
- Author(s)
  Guo Taiyang、Zhu Zhi、Kidani Shunsuke、Unoki Masashi
- Journal Title
  
  Applied Sciences
  
  Volume: 12 Issue: 19 Pages: 9979-9979
- DOI
  10.3390/app12199979
- Related Report
  2022 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Detection of Brain Network Communities During Natural Speech Comprehension From Functionally Aligned EEG Sources (2022)
- Author(s)
  Zhou Di、Zhang Gaoyan、Dang Jianwu、Unoki Masashi、Liu Xin
- Journal Title
  
  Frontiers in Computational Neuroscience
  
  Volume: 16
- DOI
  10.3389/fncom.2022.919215
- Related Report
  2022 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Speaker anonymization by modifying fundamental frequency and x-vector singular value (2022)
- Author(s)
  Mawalim Candy Olivia、Galajit Kasorn、Karnjana Jessada、Kidani Shunsuke、Unoki Masashi
- Journal Title
  
  Computer Speech & Language
  
  Volume: 73 Pages: 101326-101326
- DOI
  10.1016/j.csl.2021.101326
- Related Report
  2022 Research-status Report 2021 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network (2022)
- Author(s)
  Li Kai、Lu Xugang、Akagi Masato、Dang Jianwu、Li Sheng、Unoki Masashi
- Journal Title
  
  Proc. EUSIPCO2022
  
  Volume: - Pages: 379-383
- DOI
  10.23919/eusipco55093.2022.9909649
- Related Report
  2022 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients (2022)
- Author(s)
  Nguyen Quoc-Huy、Unoki Masashi
- Journal Title
  
  Proc. EUSIPCO2022
  
  Volume: - Pages: 21-25
- DOI
  10.23919/eusipco55093.2022.9909731
- Related Report
  2022 Research-status Report
- Peer Reviewed
[Journal Article] Speech Watermarking Method Using McAdams Coefficient Based on Random Forest Learning (2021)
- Author(s)
  Mawalim Candy Olivia、Unoki Masashi
- Journal Title
  
  Entropy
  
  Volume: 23 Issue: 10 Pages: 1246-1246
- DOI
  10.3390/e23101246
- Related Report
  2021 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation (2021)
- Author(s)
  Yuan Weitao、Dong Bofei、Wang Shengbei、Unoki Masashi、Wang Wenwu
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 29 Pages: 807-822
- DOI
  10.1109/taslp.2021.3051331
- Related Report
  2020 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept (2020)
- Author(s)
  Thuan Van Ngo, Tuan Vu Ho, Masashi Unoki, Rieko Kuboy, and Masato Akagi
- Journal Title
  
  Proc. APSIPA2020
  
  Volume: -
- Related Report
  2020 Research-status Report
- Peer Reviewed
[Presentation] Contributions of Instantaneous Modulation Components in Temporal Amplitude Envelope to Vocal Emotion Perception (2024)
- Author(s)
  Taiyang Guo, Takuto Isoyama, Shunsuke Kidani, Masashi Unoki
- Organizer
  Acoustical Society of Japan 2024 Spring Meeting (Takushoku University)
- Related Report
  2023 Research-status Report
[Presentation] Conversation Scenario Classification Based on Conversation Entrainment (2024)
- Author(s)
  Yuning Liu, Di Zhou, Jianwu Dang, Aijun Li, Masashi Unoki
- Organizer
  Acoustical Society of Japan 2024 Spring Meeting (Takushoku University)
- Related Report
  2023 Research-status Report
[Presentation] Leveraging Equalization-Cancellation Model in Speech Intelligibility Prediction for Hearing Aids (2024)
- Author(s)
  Xiajie Zhou, Candy Olivia Mawalim, Masashi Unoki
- Organizer
  Acoustical Society of Japan 2024 Spring Meeting (Takushoku University)
- Related Report
  2023 Research-status Report
[Presentation] Linear Model Approach to Investigate the Comprehensive Entrainment in Conversation (2024)
- Author(s)
  Yuning Liu, Masashi Unoki
- Organizer
  2024 RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (Hawaii)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
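[Presentation] Construction of a loudness calculation method for time-varying sounds using an auditory filterbank (2023)
- Author(s)
  磯山拓都, 木谷俊介, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2023 Autumn Meeting (Nagoya Institute of Technology)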
- Related Report
  2023 Research-status Report
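[Presentation] Study of timbre-related features for anomalous sound detection in industrial machines (2023)
- Author(s)
  大田恭士, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2023 Autumn Meeting (Nagoya Institute of Technology)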
- Related Report
  2023 Research-status Report
[Presentation] A study of anomalous sound detection using signal processing methods related to timbre (2023)
- Author(s)
  大田恭士, 鵜木祐史
- Organizer
  38th Signal Processing Symposium (Kyoto Terrsa)
- Related Report
  2023 Research-status Report
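[Presentation] Study of spectro-temporal modulation information contributing to the goodness of Yokyoku (Noh chant) (2023)
- Author(s)
  木谷俊介，磯山拓人，鵜木祐史
- Organizer
  Acoustical Society of Japan 2023 Autumn Meeting (Nagoya Institute of Technology)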
- Related Report
  2023 Research-status Report
[Presentation] Emotion Prediction based on Conversation Entrainments (2023)
- Author(s)
  Liu Yuning, Unoki Masashi
- Organizer
  2023 Joint Conference of Hokuriku Chapters of Electrical and Information Societies (Kanazawa Institute of Technology, online)
- Related Report
  2023 Research-status Report
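[Presentation] Study of a loudness calculation method for time-varying sounds using an auditory filterbank (2023)
- Author(s)
  磯山拓都, 木谷俊介, 鵜木祐史
- Organizer
  Acoustical Society of Japan, Auditory Research Meeting (Tohoku Gakuin University)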
- Related Report
  2023 Research-status Report
[Presentation] Study on suppression effect of air-conducted sound by bone-conducted sound (2023)
- Author(s)
  Inoue Shunsuke、Toya Teruki、Uezu Yasufumi、Unoki Masashi
- Organizer
  InterNoise2023 (Makuhari)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Computational model for predicting sound quality metrics using loudness model based on gammatone/gammachirp auditory filterbank and its applications (2023)
- Author(s)
  Isoyama Takuto、Kidani Shunsuke、Unoki Masashi
- Organizer
  InterNoise2023 (Makuhari)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Vowel production changes under noise with consideration of low-order formant masking (2023)
- Author(s)
  Yasufumi Uezu, Masato Akagi, Masashi Unoki
- Organizer
  20th International Congress of Phonetic Sciences
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Consonant-emphasis Method Incorporating Robust Consonant-section Detection to Improve Intelligibility of Bone-conducted speech (2023)
- Author(s)
  Uezu Yasufumi、Wang Sicheng、Toya Teruki、Unoki Masashi
- Organizer
  24th INTERSPEECH Conference
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification (2023)
- Author(s)
  Yuan Weitao、Bian Yuren、Wang Shengbei、Unoki Masashi、Wang Wenwu
- Organizer
  2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (Greek island of Rhodes)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Analysis of Spectro-Temporal Modulation Representation for Deep-Fake Speech Detection (2023)
- Author(s)
  Cheng Haowei、Mawalim Candy Olivia、Li Kai、Wang Lijun、Unoki Masashi
- Organizer
  15th annual conference organized by Asia-Pacific Signal and Information Processing Association (Taipei)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Deepfake-speech Detection with Pathological Features and Multilayer Perceptron Neural Network (2023)
- Author(s)
  Chaiwongyen Anuwat、Duangpummet Suradej、Karnjana Jessada、Kongprawechnon Waree、Unoki Masashi
- Organizer
  15th annual conference organized by Asia-Pacific Signal and Information Processing Association (Taipei)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditions (2023)
- Author(s)
  Guo Taiyang、Li Sixia、Kidani Shunsuke、Okada Shogo、Unoki Masashi
- Organizer
  15th annual conference organized by Asia-Pacific Signal and Information Processing Association (Taipei)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection (2023)
- Author(s)
  Li Kai、Tran Dung Kim、Lu Xugang、Akagi Masato、Unoki Masashi
- Organizer
  31st European Signal Processing Conference (Helsinki, Finland)
- Related Report
  2023 Research-status Report
- Int'l Joint Research
[Presentation] Suppression effect of bone-conducted sound on air-conducted sound (2023)
- Author(s)
  井上隼輔, 鳥谷輝樹, 上江洲安史, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2023 Spring Meeting
- Related Report
  2022 Research-status Report
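[Presentation] Study of the relationship between selective listening ability and otoacoustic emission characteristics (2023)
- Author(s)
  宮家一真, 木谷俊介, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2023 Spring Meeting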
- Related Report
  2022 Research-status Report
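[Presentation] Blind estimation methods for the modulation transfer function, speech transmission index, and room acoustic parameters from reverberant speech (2023)
- Author(s)
  鵜木祐史
- Organizer
  Acoustical Society of Japan, Speech Research Meeting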
- Related Report
  2022 Research-status Report
[Presentation] Study of the modulation frequency bands of urgency in speech (2023)
- Author(s)
  木谷俊介，劉小テイ，郭太陽，磯山拓都，李軍鋒，赤木正人，鵜木祐史
- Organizer
  Acoustical Society of Japan, Speech Research Meeting
- Related Report
  2022 Research-status Report
[Presentation] Reconstruction of speech spectrogram based on non-invasive EEG signal (2022)
- Author(s)
  Di Zhou, Masashi Unoki, Gaoyan Zhang, Jianwu Dang
- Organizer
  ISCSLP2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement (2022)
- Author(s)
  Tuan Vu Ho, Quoc Huy Nguyen, Masato Akagi, Masashi Unoki
- Organizer
  Interspeech2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection (2022)
- Author(s)
  Kai Li, Sheng Li, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang, Masashi Unoki
- Organizer
  Interspeech2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment (2022)
- Author(s)
  Quoc-Huy Nguyen, Kai Li, Masashi Unoki
- Organizer
  Interspeech2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Deep Hashing for Speaker Identification and Retrieval Based on Auditory Sparse Representation (2022)
- Author(s)
  Dung Kim Tran, Masato Akagi, and Masashi Unoki
- Organizer
  APSIPA2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection (2022)
- Author(s)
  Kai Li, Yao Wang, Minh Le Nguyen, Masato Akagi and Masashi Unoki
- Organizer
  APSIPA2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] F0 Modification via PV-TSM Algorithm for Speaker Anonymization Across Gender (2022)
- Author(s)
  Candy Olivia Mawalim, Shogo Okada, and Masashi Unoki
- Organizer
  APSIPA2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Contribution of Timbre and Shimmer Features to Deepfake Speech Detection (2022)
- Author(s)
  Anuwat Chaiwongyen, Norranat Songsriboonsit, Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, and Masashi Unoki
- Organizer
  APSIPA2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Using Temporal Modulation Features on Gammatone Auditory Filterbank (2022)
- Author(s)
  Kai Li, Quoc-Huy Nguyen, Yasuji Ota, and Masashi Unoki
- Organizer
  DCASE2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Study on the modulation frequency range that contributes to the perception of urgency (2022)
- Author(s)
  Shunsuke Kidani, Xiaoting Liu, Taiyang Guo, Takuto Isoyama, Junfeng Li, Masashi Unoki
- Organizer
  International Congress of Acoustics 2022 (ICA2022)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Anomalous sound detection using objective metrics related to timbral attributes (2022)
- Author(s)
  Yasuji Ota, Seigo Kura, Masashi Unoki
- Organizer
  International Congress of Acoustics 2022 (ICA2022)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Subjective evaluation regarding mixing ratio of bone-conducted to air-conducted speech for own-voice perception (2022)
- Author(s)
  Teruki Toya, Peter Birkholz, and Masashi Unoki
- Organizer
  International Congress of Acoustics 2022 (ICA2022)
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Construction of a roughness model using an auditory filterbank (2022)
- Author(s)
  磯山拓都, 木谷俊介, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2022 Autumn Meeting
- Related Report
  2022 Research-status Report
[Presentation] Study of the modulation frequency bands contributing to urgency perception (2022)
- Author(s)
  木谷俊介, 劉小テイ, 郭太陽, 磯山拓都, 李軍鋒, 赤木正人, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2022 Autumn Meeting
- Related Report
  2022 Research-status Report
[Presentation] Study of anomalous sound detection using objective evaluation metrics of timbral attributes (2022)
- Author(s)
  大田恭士, 倉誠吾, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2022 Autumn Meeting
- Related Report
  2022 Research-status Report
[Presentation] Investigation of impressions of timbre and pitch in the self-heard voice (2022)
- Author(s)
  森田翔太, 鳥谷輝樹, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2022 Autumn Meeting
- Related Report
  2022 Research-status Report
[Presentation] Study on Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients (2022)
- Author(s)
  Quoc-Huy Nguyen, Masashi Unoki
- Organizer
  IEICE Signal Processing Research Meeting
- Related Report
  2022 Research-status Report
[Presentation] Study on Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network (2022)
- Author(s)
  Kai Li, Xugang Lu, Masato Akagi, Jianwu Dang, Sheng Li, and Masashi Unoki
- Organizer
  IEICE Signal Processing Research Meeting
- Related Report
  2022 Research-status Report
[Presentation] Study of spectro-temporal modulation analysis for predicting auditory saliency (2022)
- Author(s)
  田中聡一郎, 堀口遼太郎, 木谷俊介, 鵜木祐史
- Organizer
  IEICE Signal Processing Research Meeting
- Related Report
  2022 Research-status Report
[Presentation] Dialogue scenario classification based on social factors (2022)
- Author(s)
  Yuning Liu, Di Zhou, Masashi Unoki, Jianwu Dang, Aijun Li
- Organizer
  ISCSLP2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] Investigation of speech emotion recognition using spectro-temporal modulation (2022)
- Author(s)
  村上正悟，森田翔太
- Organizer
  IEICE Information and Systems Society Special Session: Junior & Student Poster Session
- Related Report
  2022 Research-status Report
[Presentation] A study of anomalous sound detection combining objective timbre metrics and signal analysis (2022)
- Author(s)
  大田恭士，鵜木祐史
- Organizer
  37th Signal Processing Symposium
- Related Report
  2022 Research-status Report
[Presentation] Study on Expressiveness of Speech Synthesis Using Multi-resolution Modulation-filtered Cochleagram (2022)
- Author(s)
  Kaili Zhang, Masashi Unoki
- Organizer
  NCSP22
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Replay Attack Detection using MFCC and ResNeWt18 for Automatic Speaker Verification (2022)
- Author(s)
  Anuwat Chaiwongyen, Waree Kongprawechnon, Suradej Duangpummet, Jessada Karnjana, Masashi Unoki
- Organizer
  NCSP22
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method (2021)
- Author(s)
  Candy Olivia Mawalim, Masashi Unoki
- Organizer
  APSIPA2021
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Tampering Detection for Speech Signals Using Synchronization Code and LSF based Watermarks (2021)
- Author(s)
  Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki
- Organizer
  APSIPA2021
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Hybridization of Speech Information Hiding and Encryption for Double-layer Security in Speech Communication (2021)
- Author(s)
  Kasorn Galajit, Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki
- Organizer
  APSIPA2021
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction (2021)
- Author(s)
  Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki and Wenwu Wang
- Organizer
  Interspeech2021
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Synchronous multi-bit audio watermarking based on phase shifting (2021)
- Author(s)
  Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki
- Organizer
  ICASSP2021
- Related Report
  2021 Research-status Report
- Int'l Joint Research
[Presentation] Speech Watermarking Approach for Securing Speaker Anonymization using McAdams Coefficients (2021)
- Author(s)
  Candy Olivia Mawalim and Masashi Unoki
- Organizer
  IEICE EMM Research Meeting
- Related Report
  2021 Research-status Report
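[Presentation] Preliminary study of the relationship between self-awareness of speaking difficulty and oral reading latency and syllable repetition (2021)
- Author(s)
  古田尚久，北村達也，林良子，能田由紀子，鵜木祐史
- Organizer
  Acoustical Society of Japan, Speech Research Meeting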
- Related Report
  2021 Research-status Report
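[Presentation] Study of blocking air-conducted sound in measuring bone-conduction transfer characteristics during speech (2021)
- Author(s)
  鳥谷輝樹，Peter Birkholz，鵜木祐史
- Organizer
  Acoustical Society of Japan Autumn Meeting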
- Related Report
  2021 Research-status Report
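[Presentation] Measurement of transfer characteristics of bone-conducted speech with air-conducted sound blocked by a sound insulation wall (2021)
- Author(s)
  鳥谷輝樹，Peter Birkholz，鵜木祐史
- Organizer
  Acoustical Society of Japan, Auditory Research Meeting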
- Related Report
  2021 Research-status Report
[Presentation] Investigation of individuality in impressions of the self-heard voice (2021)
- Author(s)
  森田翔太, 鳥谷輝樹, 鵜木祐史
- Organizer
  Acoustical Society of Japan 2021 Spring Meeting
- Related Report
  2020 Research-status Report
[Presentation] Audio Information Hiding in Sub-signals by deploying Singular Spectrum Analysis and Psychoacoustic Model (2021)
- Author(s)
  Kasorn Galajit, Jessada Karnjana, Masashi Unoki
- Organizer
  IEICE EMM Research Meeting
- Related Report
  2020 Research-status Report
[Presentation] X-vector anonymization using regression modeling with statistical and singular value (2021)
- Author(s)
  Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana, Masashi Unoki
- Organizer
  IEICE EMM Research Meeting
- Related Report
  2020 Research-status Report
