• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Language-independent speaker anonymization with multiple privacy-related attributes

Research Project

Project/Area Number 22K21319
Research Category

Grant-in-Aid for Research Activity Start-up

Allocation TypeMulti-year Fund
Review Section 1002:Human informatics, applied informatics and related fields
Research InstitutionNational Institute of Informatics

Principal Investigator

Miao Xiaoxiao  国立情報学研究所, コンテンツ科学研究系, 特任研究員 (10962508)

Project Period (FY) 2022-08-31 – 2024-03-31
Project Status Discontinued (Fiscal Year 2023)
Budget Amount *help
¥2,860,000 (Direct Cost: ¥2,200,000、Indirect Cost: ¥660,000)
Fiscal Year 2023: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Fiscal Year 2022: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
KeywordsOHNN / VoicePAT / SynVox2 / speaker anonymization / language independent / gender netural / Speech processing / Speech privacy / Voice transformation / Anonymization / Deep learning
Outline of Research at the Start

Exposure of speech data without taking any measures would cause privacy issues.
The goal of the project is to perform a user-centric approach to hide multiple privacy-related speech attributes including speaker identity, age, gender, and dialect information, leaving non-private attributes unchange.

Outline of Annual Research Achievements

Part 1) Improved anonymizer: we proposed orthogonal Householder neural network (OHNN)-based anonymizer that rotates the original speaker vectors to anonymized ones to maintain the diversity and strengthen privacy protection. The related work has been accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Part 2) User-friendly voice anonymization framework: Along with the popularity of speaker anonymization topic is increasing, the comparison and combination of different anonymization approaches remains challenging due to the complexity of evaluation and the absence of user-friendly research frameworks. We therefore propose an efficient speaker anonymization and evaluation framework based on a modular and easily extendable structure called VoicePAT. Our code is fully open source. Related work has been submitted to OJSP.

Part 3) ASV speech dataset anonymization: The legal and ethical concerns has led to the withdrawal of the widely-used VoxCeleb2 dataset for speaker recognition, we employ the our proposed OHNN-based speaker anonymization technique to create a privacy-friendly VoxCeleb2 dataset called SynVox2. In addition, we define several metrics for evaluating the use of SynVox2 in terms of privacy, utility, and fairness. These metrics may serve as a protocol for future research, enabling researchers to assess whether a synthetic dataset is suitable for their ASV research. Furthermore, we discuss the challenges of using synthetic data for the downstream task of speaker verification. Related work has been submitted to ICASSP2024.

Report

(2 results)
  • 2023 Annual Research Report
  • 2022 Research-status Report
  • Research Products

    (9 results)

All 2023 Other

All Int'l Joint Research (2 results) Journal Article (1 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 1 results,  Open Access: 1 results) Presentation (1 results) (of which Int'l Joint Research: 1 results) Remarks (5 results)

  • [Int'l Joint Research] University of Stuttgart(ドイツ)

    • Related Report
      2023 Annual Research Report
  • [Int'l Joint Research] University of Avignon/EURECOM/Universite de Lorraine(フランス)

    • Related Report
      2022 Research-status Report
  • [Journal Article] Speaker Anonymization using Orthogonal Householder Neural Network2023

    • Author(s)
      Xiaoxiao Miao , Xin Wang , Erica Cooper , Junichi Yamagishi , Natalia Tomashenko
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 31 Pages: 3681-3695

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline2023

    • Author(s)
      Paul-Gauthier Noe, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-Francois Bonastre, Driss Matrouf
    • Organizer
      ICASPP 2022
    • Related Report
      2022 Research-status Report
    • Int'l Joint Research
  • [Remarks] VoicePAT

    • URL

      https://github.com/DigitalPhonetics/VoicePAT

    • Related Report
      2023 Annual Research Report
  • [Remarks] Official page of VoicePrivacy

    • URL

      https://www.voiceprivacychallenge.org/

    • Related Report
      2022 Research-status Report
  • [Remarks] Open-source baseline of VoicePrivacy 2022

    • URL

      https://github.com/Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

    • Related Report
      2022 Research-status Report
  • [Remarks] Languange-independent speaker anonymization system

    • URL

      https://github.com/nii-yamagishilab/SSL-SAS

    • Related Report
      2022 Research-status Report
  • [Remarks] Speaker gender attribute privacy

    • URL

      https://github.com/nii-yamagishilab/speaker_sex_attribute_privacy

    • Related Report
      2022 Research-status Report

URL: 

Published: 2022-09-01   Modified: 2024-12-25  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi