2021 Fiscal Year Annual Research Report
Study on Audio Information Hiding Based on Human Auditory Perception with Phase Modulation
Project/Area Number |
20J20580
|
Research Institution | Japan Advanced Institute of Science and Technology |
Principal Investigator |
MAWALIM CANDY OLIVIA 北陸先端科学技術大学院大学, 先端科学技術研究科, 特別研究員(DC1)
|
Project Period (FY) |
2020-04-24 – 2023-03-31
|
Keywords | information hiding / voice privacy / speaker anonymization / watermarking / authentication |
Outline of Annual Research Achievements |
The major milestone in FY2021 is developing a framework to improve the security of speaker anonymization. Speaker anonymization aims to address the voice privacy issue by suppressing the original speaker's personally identified information (PII). The output anonymized speech should be able to authenticate by the authorized parties. However, since the mapping between speaker and pseudo-speaker is not necessarily one-to-one correspondence, recognizing genuine anonymized speech is difficult. To deal with this issue, the proposed framework integrates the information hiding approach to simultaneously secure PII and verify the content via an embedded watermark. The related publications consist of one international conference and two journals.
|
Current Status of Research Progress |
Current Status of Research Progress
2: Research has progressed on the whole more than it was originally planned.
Reason
The progress of this study is going well as planned. At this stage, the proposed framework has been developed by integrating the information hiding approach to protecting content and securing speaker individuality information. It consists of an encoder and a decoder. The encoder aims to protect the speaker's identity by using an anonymization approach while embedding a parameter that represents a watermark. The decoder seeks to protect the authentication of the speech by accurately detecting the embedded watermarks. An extensive evaluation has been conducted to validate the proposed framework's performance compared to the existing methods. The results of this study in FY2021 were reported in APSIPA Proceeding 2021, MDPI Entropy Journal 2021, and Computer Speech and Language Journal 2022.
|
Strategy for Future Research Activity |
In future work, the remaining issues, especially those related to subjective and objective evaluations for intelligibility and naturalness requirements, will be addressed. The results obtained by using existing objective evaluations could give general information about a speaker anonymization method, but it is still inadequate to show the significance of each method. Besides, x-vector-based information hiding and the investigation of other prospective speech features will be considered. By controlling the less significant eigenstructure of the x-vector, we expect better protection for speech signals. Finally, the workflow for the real application will be investigated for speech tampering and spoofing countermeasure.
|