Language-independent speaker anonymization with multiple privacy-related attributes

Research Project

Project/Area Number	22K21319
Research Category	Grant-in-Aid for Research Activity Start-up
Allocation Type	Multi-year Fund
Review Section	1002:Human informatics, applied informatics and related fields
Research Institution	National Institute of Informatics
Principal Investigator	Miao Xiaoxiao 国立情報学研究所, コンテンツ科学研究系, 特任研究員 (10962508)
Project Period (FY)	2022-08-31 – 2024-03-31
Project Status	Discontinued (Fiscal Year 2023)
Budget Amount *help	¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000) Fiscal Year 2023: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000) Fiscal Year 2022: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Keywords	OHNN / VoicePAT / SynVox2 / speaker anonymization / language independent / gender netural / Speech processing / Speech privacy / Voice transformation / Anonymization / Deep learning
Outline of Research at the Start	Exposure of speech data without taking any measures would cause privacy issues. The goal of the project is to perform a user-centric approach to hide multiple privacy-related speech attributes including speaker identity, age, gender, and dialect information, leaving non-private attributes unchange.
Outline of Annual Research Achievements	Part 1) Improved anonymizer: we proposed orthogonal Householder neural network (OHNN)-based anonymizer that rotates the original speaker vectors to anonymized ones to maintain the diversity and strengthen privacy protection. The related work has been accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing. Part 2) User-friendly voice anonymization framework: Along with the popularity of speaker anonymization topic is increasing, the comparison and combination of different anonymization approaches remains challenging due to the complexity of evaluation and the absence of user-friendly research frameworks. We therefore propose an efficient speaker anonymization and evaluation framework based on a modular and easily extendable structure called VoicePAT. Our code is fully open source. Related work has been submitted to OJSP. Part 3) ASV speech dataset anonymization: The legal and ethical concerns has led to the withdrawal of the widely-used VoxCeleb2 dataset for speaker recognition, we employ the our proposed OHNN-based speaker anonymization technique to create a privacy-friendly VoxCeleb2 dataset called SynVox2. In addition, we define several metrics for evaluating the use of SynVox2 in terms of privacy, utility, and fairness. These metrics may serve as a protocol for future research, enabling researchers to assess whether a synthetic dataset is suitable for their ASV research. Furthermore, we discuss the challenges of using synthetic data for the downstream task of speaker verification. Related work has been submitted to ICASSP2024.

Report

(2 results)

2023 Annual Research Report
2022 Research-status Report

Research Products
(9 results)

All 2023 Other

All Int'l Joint Research (2 results) Journal Article (1 results) (of which Int'l Joint Research: 1 results, Peer Reviewed: 1 results, Open Access: 1 results) Presentation (1 results) (of which Int'l Joint Research: 1 results) Remarks (5 results)

[Int'l Joint Research] University of Stuttgart(ドイツ)
- Related Report
  2023 Annual Research Report
[Int'l Joint Research] University of Avignon/EURECOM/Universite de Lorraine(フランス)
- Related Report
  2022 Research-status Report
[Journal Article] Speaker Anonymization using Orthogonal Householder Neural Network2023
- Author(s)
  Xiaoxiao Miao , Xin Wang , Erica Cooper , Junichi Yamagishi , Natalia Tomashenko
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 31 Pages: 3681-3695
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline2023
- Author(s)
  Paul-Gauthier Noe, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-Francois Bonastre, Driss Matrouf
- Organizer
  ICASPP 2022
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Remarks] VoicePAT
- URL
  https://github.com/DigitalPhonetics/VoicePAT
- Related Report
  2023 Annual Research Report
[Remarks] Official page of VoicePrivacy
- URL
  https://www.voiceprivacychallenge.org/
- Related Report
  2022 Research-status Report
[Remarks] Open-source baseline of VoicePrivacy 2022
- URL
  https://github.com/Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
- Related Report
  2022 Research-status Report
[Remarks] Languange-independent speaker anonymization system
- URL
  https://github.com/nii-yamagishilab/SSL-SAS
- Related Report
  2022 Research-status Report
[Remarks] Speaker gender attribute privacy
- URL
  https://github.com/nii-yamagishilab/speaker_sex_attribute_privacy
- Related Report
  2022 Research-status Report

Language-independent speaker anonymization with multiple privacy-related attributes

Principal Investigator

Miao Xiaoxiao 国立情報学研究所, コンテンツ科学研究系, 特任研究員 (10962508)

¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)

Report

Research Products

[Int'l Joint Research] University of Stuttgart(ドイツ)

Related Report

[Int'l Joint Research] University of Avignon/EURECOM/Universite de Lorraine(フランス)

Related Report

[Journal Article] Speaker Anonymization using Orthogonal Householder Neural Network2023

Author(s)

Journal Title

Related Report

[Presentation] Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline2023

Author(s)

Organizer

Related Report

[Remarks] VoicePAT

URL

Related Report

[Remarks] Official page of VoicePrivacy

URL

Related Report

[Remarks] Open-source baseline of VoicePrivacy 2022

URL

Related Report

[Remarks] Languange-independent speaker anonymization system

URL

Related Report

[Remarks] Speaker gender attribute privacy

URL

Related Report