研究開始時の研究の概要 |
Exposure of speech data without taking any measures would cause privacy issues. The goal of the project is to perform a user-centric approach to hide multiple privacy-related speech attributes including speaker identity, age, gender, and dialect information, leaving non-private attributes unchange.
|
研究実績の概要 |
Part 1) Improved anonymizer: we proposed orthogonal Householder neural network (OHNN)-based anonymizer that rotates the original speaker vectors to anonymized ones to maintain the diversity and strengthen privacy protection. The related work has been accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing.
Part 2) User-friendly voice anonymization framework: Along with the popularity of speaker anonymization topic is increasing, the comparison and combination of different anonymization approaches remains challenging due to the complexity of evaluation and the absence of user-friendly research frameworks. We therefore propose an efficient speaker anonymization and evaluation framework based on a modular and easily extendable structure called VoicePAT. Our code is fully open source. Related work has been submitted to OJSP.
Part 3) ASV speech dataset anonymization: The legal and ethical concerns has led to the withdrawal of the widely-used VoxCeleb2 dataset for speaker recognition, we employ the our proposed OHNN-based speaker anonymization technique to create a privacy-friendly VoxCeleb2 dataset called SynVox2. In addition, we define several metrics for evaluating the use of SynVox2 in terms of privacy, utility, and fairness. These metrics may serve as a protocol for future research, enabling researchers to assess whether a synthetic dataset is suitable for their ASV research. Furthermore, we discuss the challenges of using synthetic data for the downstream task of speaker verification. Related work has been submitted to ICASSP2024.
|