• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Speech privacy protection by high-quality, invertible, and extendable speech anonymization and de-anonymization

Research Project

Project/Area Number 21K17775
Research Category

Grant-in-Aid for Early-Career Scientists

Allocation TypeMulti-year Fund
Review Section Basic Section 61010:Perceptual information processing-related
Research InstitutionNational Institute of Informatics

Principal Investigator

Wang Xin  国立情報学研究所, コンテンツ科学研究系, 特任准教授 (60843141)

Project Period (FY) 2021-04-01 – 2024-03-31
Project Status Completed (Fiscal Year 2023)
Budget Amount *help
¥4,550,000 (Direct Cost: ¥3,500,000、Indirect Cost: ¥1,050,000)
Fiscal Year 2023: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2022: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2021: ¥1,430,000 (Direct Cost: ¥1,100,000、Indirect Cost: ¥330,000)
Keywordsプライバシー / 音声匿名化 / 話者識別 / 音声情報処理 / 深層学習 / speech privacy / speaker anonymization / speech waveform modeling / neural network / deep learning
Outline of Research at the Start

Human speech contains not only verbal contents but also private information about the speaker such as the speaker identity. This proposal is on protecting the speaker’s privacy in speech data for two scenarios: 1) Speech anonymization: when the speaker shares the speech data in untrusted cyberspace, this speech data should be protected so that the audience can understand the speech but cannot infer who the speaker is; 2) Speech de-anonymization: when the speaker further shares the speech data to trusted audience, the original natural speech can be reconstructed from protected version.

Outline of Final Research Achievements

Protecting the personally identifiable information encoded in speech waveform is urgent for many SNS applications. Although there are quite a few deep-learning-based methods trying to project or anonymize the speaker identity in speech data, their solutions are not satisfying. The main contributions of this project can be summarized in three aspects. First, this project proposed language-independent speaker identity anonymization using self-supervised learning speech models. The proposed system was applied to both English and Mandarin data. Second, this project proposed a new anonymization algorithm based on vector rotation. This alleviates the issue of the k-anonymity anonymization in existing methods. Third, this project took the initiative to anonymize a real large-scale speech database called VoxCeleb2 and investigated the utility and privacy protection performance. The research outcomes were published in top journals and conferences in the speech field.

Academic Significance and Societal Importance of the Research Achievements

学術的成果として、現存の深層学習に基づく話者匿名化技術の言語依存性を着目し、複数の言語にも適用できる話者匿名化技術を開発した。また、従来のk-匿名化手法より、話者ベクトルの全体の分布を維持しながら匿名化が可能な手法を提案した。最後に、音声分野において初めてデータベース全体の匿名化を行い、有用性とプライバシー保護性能を調査した。いずれもの成果は音声分野のトップジャーナルや学会で発表された。そのほか、国際的なVoicePrivacyChallengeの運営にも貢献した。提案された技術はテレビ放送に使われたこともあった。

Report

(4 results)
  • 2023 Annual Research Report   Final Research Report ( PDF )
  • 2022 Research-status Report
  • 2021 Research-status Report
  • Research Products

    (25 results)

All 2024 2023 2022 2021 Other

All Int'l Joint Research (4 results) Journal Article (3 results) (of which Int'l Joint Research: 3 results,  Peer Reviewed: 3 results,  Open Access: 3 results) Presentation (11 results) (of which Int'l Joint Research: 11 results,  Invited: 2 results) Remarks (7 results)

  • [Int'l Joint Research] University of Avignon/Inria/EURECOM(フランス)

    • Related Report
      2023 Annual Research Report
  • [Int'l Joint Research] Avignon University/Inria/University of Lorraine(フランス)

    • Related Report
      2022 Research-status Report
  • [Int'l Joint Research] University of Avignon/EURECOM/Universite de Lorraine(フランス)

    • Related Report
      2021 Research-status Report
  • [Int'l Joint Research] Naver Corporation(韓国)

    • Related Report
      2021 Research-status Report
  • [Journal Article] Speaker Anonymization Using Orthogonal Householder Neural Network2023

    • Author(s)
      Miao Xiaoxiao、Wang Xin、Cooper Erica、Yamagishi Junichi、Tomashenko Natalia
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 31 Pages: 3681-3695

    • DOI

      10.1109/taslp.2023.3313429

    • Related Report
      2023 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] Privacy and Utility of X-Vector Based Speaker Anonymization2022

    • Author(s)
      Srivastava Brij Mohan Lal、Maouche Mohamed、Sahidullah Md、Vincent Emmanuel、Bellet Aurelien、Tommasi Marc、Tomashenko Natalia、Wang Xin、Yamagishi Junichi
    • Journal Title

      IEEE/ACM Transactions on Audio, Speech, and Language Processing

      Volume: 30 Pages: 2383-2395

    • DOI

      10.1109/taslp.2022.3190741

    • Related Report
      2022 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] The VoicePrivacy 2020 Challenge: Results and findings2022

    • Author(s)
      Natalia Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier No?, Andreas Nautsch, Nicholas Evans, Junichi Yamagishi, Benjamin O’Brien, Ana?s Chanclu, Jean-Fran?ois Bonastre, Massimiliano Todisco, Mohamed Maouche
    • Journal Title

      Computer Speech & Language

      Volume: 74 Pages: 101362-101362

    • DOI

      10.1016/j.csl.2022.101362

    • Related Report
      2021 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Presentation] SynVox2: Towards a Privacy-Friendly VoxCeleb2 Dataset2024

    • Author(s)
      Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Nicholas Evans, Massimiliano Todisco, Jean-Francois Bonastre, and Mickael Rouvier
    • Organizer
      IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Related Report
      2023 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline2023

    • Author(s)
      Paul-Gauthier Noe, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-Francois Bonastre, and Driss Matrouf
    • Organizer
      ICASSP 2023
    • Related Report
      2022 Research-status Report
    • Int'l Joint Research
  • [Presentation] Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions2022

    • Author(s)
      Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, and Natalia Tomashenko
    • Organizer
      Interspeech 2022
    • Related Report
      2022 Research-status Report
    • Int'l Joint Research
  • [Presentation] Tutorial on speaker anonymization (software part)2022

    • Author(s)
      Xin Wang
    • Organizer
      2nd Symposium on Security and Privacy in Speech Communication joined with 2nd VoicePrivacy Challenge Workshop
    • Related Report
      2022 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models2022

    • Author(s)
      Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko
    • Organizer
      Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Estimating the confidence of speech spoofing countermeasure2022

    • Author(s)
      Wang Xin, Yamagishi Junichi
    • Organizer
      ICASSP 2022
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances2022

    • Author(s)
      Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi
    • Organizer
      ICASSP 2022
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation2022

    • Author(s)
      Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas Evans
    • Organizer
      Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Investigating self-supervised front ends for speech spoofing countermeasures2022

    • Author(s)
      Xin Wang, Junichi Yamagishi
    • Organizer
      Proc. Odyssey 2022 The Speaker and Language Recognition Workshop
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Benchmarking and challenges in security and privacy for voice biometrics2021

    • Author(s)
      Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier NoE, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
    • Organizer
      2021 ISCA Symposium on Security and Privacy in Speech Communication
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research
  • [Presentation] Two speech security issues after the speech synthesis boom2021

    • Author(s)
      Wang Xin
    • Organizer
      Speech Synthesis Forum, China Computer Federation
    • Related Report
      2021 Research-status Report
    • Int'l Joint Research / Invited
  • [Remarks] Voice Privacy Challenge 2024 official website

    • URL

      https://www.voiceprivacychallenge.org/

    • Related Report
      2023 Annual Research Report
  • [Remarks] Voice Privacy Challenge 2024 official toolkit

    • URL

      https://github.com/Voice-Privacy-Challenge/Voice-Privacy-Challenge-2024/

    • Related Report
      2023 Annual Research Report
  • [Remarks] VoicePrivacy Challenge 2022 results and outcomes

    • URL

      https://www.voiceprivacychallenge.org/results-2022/

    • Related Report
      2022 Research-status Report
  • [Remarks] Tutorial on speaker anonymization (software)

    • URL

      https://colab.research.google.com/drive/1_zRL_f9iyDvl_5Y2Rdakg0hYAl_5Rgyq

    • Related Report
      2022 Research-status Report
  • [Remarks] Official page of VoicePrivacy

    • URL

      https://www.voiceprivacychallenge.org/

    • Related Report
      2021 Research-status Report
  • [Remarks] Open-source baseline of VoicePrivacy 2022

    • URL

      https://github.com/Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

    • Related Report
      2021 Research-status Report
  • [Remarks] Languange-independent speaker anonymization system

    • URL

      https://github.com/nii-yamagishilab/SSL-SAS

    • Related Report
      2021 Research-status Report

URL: 

Published: 2021-04-28   Modified: 2025-01-30  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi