Speech privacy protection by high-quality, invertible, and extendable speech anonymization and de-anonymization

研究課題

研究課題/領域番号	21K17775
研究種目	若手研究
配分区分	基金
審査区分	小区分61010:知覚情報処理関連
研究機関	国立情報学研究所
研究代表者	Wang Xin 国立情報学研究所, コンテンツ科学研究系, 特任助教 (60843141)
研究期間 (年度)	2021-04-01 – 2024-03-31
研究課題ステータス	交付 (2022年度)
配分額 *注記	4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円) 2023年度: 1,560千円 (直接経費: 1,200千円、間接経費: 360千円) 2022年度: 1,560千円 (直接経費: 1,200千円、間接経費: 360千円) 2021年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
キーワード	speech privacy / speaker anonymization / speech waveform modeling / neural network / deep learning
研究開始時の研究の概要	Human speech contains not only verbal contents but also private information about the speaker such as the speaker identity. This proposal is on protecting the speaker’s privacy in speech data for two scenarios: 1) Speech anonymization: when the speaker shares the speech data in untrusted cyberspace, this speech data should be protected so that the audience can understand the speech but cannot infer who the speaker is; 2) Speech de-anonymization: when the speaker further shares the speech data to trusted audience, the original natural speech can be reconstructed from protected version.
研究実績の概要	The second year's work consists of three parts: Part 1) Based on the previous year's work, the second VoicePrivacy challenge was organized by us and other universities. We defined new evaluation frameworks and conducted solid evaluations. In addition to many findings, we found that the new baseline, which was the research outcome of the previous year, outperformed the legacy baseline. We also saw submissions that outperformed the new baseline, which indicates the advancement of the research field brought by the VoicePrivacy challenge. Part 2) Based on the framework of the voice privacy challenge, we did a deep analysis of the common approaches to generating anonymized speaker identity representation (i.e., pseudo speaker embedding). Through a large-scale experiment, we identified good strategies to choose and assign the pseudo-speaker, including random gender selection and utterance-level anonymization. We also found that a simple percentile-based pitch conversion reduced the risk against the strongest (Semi-Informed) attacker. These findings were published in a top IEEE journal. Part 3) We followed the research plan and extended the language-independent speaker anonymization framework. Although the framework is language-independent, its performance degrades when processing unseen languages. We found that using multilingual training data for the waveform generator was helpful. We also proposed a correlation-alignment-based strategy to alleviate channel mismatch. Additionally, we extended the framework to hide gender information. Both works were published in top conferences.
現在までの達成度 (区分)	現在までの達成度 (区分) 2: おおむね順調に進展している理由 The efforts of the VoicePrivacy Challenge 2022 produced good outcomes. The challenge attracted 43 registered teams from 17 countries, which led to 16 successful submissions. We also organized a special session in the Interspeech 2022 satellite workshop and had presentations from participants and ourselves. The results are released on VoicePrivacy Challenge's official website: https://www.voiceprivacychallenge.org/results-2022/. The experimental study analyzing the shortcomings and optimal strategy for speaker anonymization under (Part 2 of the research outcome) was published in a top IEEE journal. We followed the research plan and investigated the language-independent speaker anonymization framework (Part 3 of the research outcome), and the work was accepted by the Interspeech 2022 conference (CORE rank A) and ICASSP 2023 conference (CORE rank B).
今後の研究の推進方策	Following the research plan made in the previous year, we will work on the language-independent speaker anonymization framework. Although it performs well in different languages (research outcome of 1st year) and other speaker attributes (Part 3 of the research outcome), there are issues left: 1) The quality of the anonymized voice is still inferior to the natural voice. Findings from the research outcome (Part 2) indicate that the selection-based generate pseudo speaker embedding is one bottleneck. We plan to investigate generative approaches for better performance. 2) The optimization of the speaker anonymization framework lacks a solid mathematical description. We plan to derive a unified mathematical description to consider multiple goals of the optimization and improve the current framework accordingly. The final year research plan also includes work on the VoicePrivacy Challenge series: 1) post-challenge analysis on VoicePrivacy Challenge 2022 and how the progress of the research field has been made since the previous challenge. 2) whether stronger attacker models can recognize the speaker identity in the anonymized speech waveforms.

報告書

(2件)

2022 実施状況報告書
2021 実施状況報告書

研究成果

(20件)

すべて 2023 2022 2021 その他

すべて国際共同研究 (3件) 雑誌論文 (2件) (うち国際共著 2件、査読あり 2件、オープンアクセス 2件) 学会発表 (10件) (うち国際学会 10件、招待講演 2件) 備考 (5件)

[国際共同研究] Avignon University/Inria/University of Lorraine(フランス)
- 関連する報告書
  2022 実施状況報告書
[国際共同研究] University of Avignon/EURECOM/Universite de Lorraine(フランス)
- 関連する報告書
  2021 実施状況報告書
[国際共同研究] Naver Corporation(韓国)
- 関連する報告書
  2021 実施状況報告書
[雑誌論文] Privacy and Utility of X-Vector Based Speaker Anonymization2022
- 著者名/発表者名
  Srivastava Brij Mohan Lal、Maouche Mohamed、Sahidullah Md、Vincent Emmanuel、Bellet Aurelien、Tommasi Marc、Tomashenko Natalia、Wang Xin、Yamagishi Junichi
- 雑誌名
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  巻: 30 ページ: 2383-2395
- DOI
  10.1109/taslp.2022.3190741
- 関連する報告書
  2022 実施状況報告書
- 査読あり / オープンアクセス / 国際共著
[雑誌論文] The VoicePrivacy 2020 Challenge: Results and findings2022
- 著者名/発表者名
  Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier No?, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O’Brien, Ana?s Chanclu, Jean-Fran?ois Bonastre, Massimiliano Todisco, Mohamed Maouche
- 雑誌名
  
  Computer Speech & Language
  
  巻: 74 ページ: 101362-101362
- DOI
  10.1016/j.csl.2022.101362
- 関連する報告書
  2021 実施状況報告書
- 査読あり / オープンアクセス / 国際共著
[学会発表] Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline2023
- 著者名/発表者名
  Paul-Gauthier Noe, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-Francois Bonastre, and Driss Matrouf
- 学会等名
  ICASSP 2023
- 関連する報告書
  2022 実施状況報告書
- 国際学会
[学会発表] Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions2022
- 著者名/発表者名
  Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, and Natalia Tomashenko
- 学会等名
  Interspeech 2022
- 関連する報告書
  2022 実施状況報告書
- 国際学会
[学会発表] Tutorial on speaker anonymization (software part)2022
- 著者名/発表者名
  Xin Wang
- 学会等名
  2nd Symposium on Security and Privacy in Speech Communication joined with 2nd VoicePrivacy Challenge Workshop
- 関連する報告書
  2022 実施状況報告書
- 国際学会 / 招待講演
[学会発表] Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models2022
- 著者名/発表者名
  Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
- 学会等名
  Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Estimating the confidence of speech spoofing countermeasure2022
- 著者名/発表者名
  Wang Xin, Yamagishi Junichi
- 学会等名
  ICASSP 2022
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances2022
- 著者名/発表者名
  Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi
- 学会等名
  ICASSP 2022
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation2022
- 著者名/発表者名
  Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas Evans
- 学会等名
  Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Investigating self-supervised front ends for speech spoofing countermeasures2022
- 著者名/発表者名
  Xin Wang, Junichi Yamagishi
- 学会等名
  Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Benchmarking and challenges in security and privacy for voice biometrics2021
- 著者名/発表者名
  Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier NoE, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
- 学会等名
  2021 ISCA Symposium on Security and Privacy in Speech Communication
- 関連する報告書
  2021 実施状況報告書
- 国際学会
[学会発表] Two speech security issues after the speech synthesis boom2021
- 著者名/発表者名
  Wang Xin
- 学会等名
  Speech Synthesis Forum, China Computer Federation
- 関連する報告書
  2021 実施状況報告書
- 国際学会 / 招待講演
[備考] VoicePrivacy Challenge 2022 results and outcomes
- URL
  https://www.voiceprivacychallenge.org/results-2022/
- 関連する報告書
  2022 実施状況報告書
[備考] Tutorial on speaker anonymization (software)
- URL
  https://colab.research.google.com/drive/1_zRL_f9iyDvl_5Y2Rdakg0hYAl_5Rgyq
- 関連する報告書
  2022 実施状況報告書
[備考] Official page of VoicePrivacy
- URL
  https://www.voiceprivacychallenge.org/
- 関連する報告書
  2021 実施状況報告書
[備考] Open-source baseline of VoicePrivacy 2022
- URL
  https://github.com/Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
- 関連する報告書
  2021 実施状況報告書
[備考] Languange-independent speaker anonymization system
- URL
  https://github.com/nii-yamagishilab/SSL-SAS
- 関連する報告書
  2021 実施状況報告書

Speech privacy protection by high-quality, invertible, and extendable speech anonymization and de-anonymization

研究代表者

Wang Xin 国立情報学研究所, コンテンツ科学研究系, 特任助教 (60843141)

4,550千円 (直接経費: 3,500千円、間接経費: 1,050千円)

現在までの達成度 (区分)

理由

報告書

研究成果

[国際共同研究] Avignon University/Inria/University of Lorraine(フランス)

関連する報告書

[国際共同研究] University of Avignon/EURECOM/Universite de Lorraine(フランス)

関連する報告書

[国際共同研究] Naver Corporation(韓国)

関連する報告書

[雑誌論文] Privacy and Utility of X-Vector Based Speaker Anonymization2022

著者名/発表者名

雑誌名

DOI

関連する報告書

[雑誌論文] The VoicePrivacy 2020 Challenge: Results and findings2022

著者名/発表者名

雑誌名

DOI

関連する報告書

[学会発表] Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline2023

著者名/発表者名

学会等名

関連する報告書

[学会発表] Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Tutorial on speaker anonymization (software part)2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Estimating the confidence of speech spoofing countermeasure2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Investigating self-supervised front ends for speech spoofing countermeasures2022

著者名/発表者名

学会等名

関連する報告書

[学会発表] Benchmarking and challenges in security and privacy for voice biometrics2021

著者名/発表者名

学会等名

関連する報告書

[学会発表] Two speech security issues after the speech synthesis boom2021

著者名/発表者名

学会等名

関連する報告書

[備考] VoicePrivacy Challenge 2022 results and outcomes

URL

関連する報告書

[備考] Tutorial on speaker anonymization (software)

URL

関連する報告書

[備考] Official page of VoicePrivacy

URL

関連する報告書

[備考] Open-source baseline of VoicePrivacy 2022

URL

関連する報告書

[備考] Languange-independent speaker anonymization system

URL

関連する報告書