研究課題/領域番号 |
21K17837
|
研究機関 | 国立研究開発法人情報通信研究機構 |
研究代表者 |
李 勝 国立研究開発法人情報通信研究機構, ユニバーサルコミュニケーション研究所先進的音声翻訳研究開発推進センター, 研究員 (70840940)
|
研究期間 (年度) |
2021-04-01 – 2023-03-31
|
キーワード | adversarial attacks / speech recognition / speech enhancement |
研究実績の概要 |
Although COVID19, our project is fruitful and concrete as planned. We followed new powerful deep neural network-based models and new attack methods in the last two years. To protect the system from attacks, we are very interested in using existing technologies, e.g., speech enhancement or adaptation, to solve this problem. This year, my research focuses on investigating the potential of speech enhancement. Papers from Journals and top conferences have been accepted in our research. Next year, we will continue to focus on building concrete speech recognition systems with new popular models and attacking methods. Reliable and easy-implement methods, e.g., speech enhancement, will also be investigated to protect the system from adversarial attacks.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
1: 当初の計画以上に進展している
理由
This year, the progress is as follows: We construct speech recognition systems with recent popular training toolkits and neural network types (accepted in Journals and conferences, e.g., ICASSP2022) We did surveys for the current attack methods. We implement robust adversarial attacks using the Kaldi-based ASR systems. We are also happy to see that this framework can be used to protect sensitive speech content (accepted in LREC2022). To defend against attacks, we find that adversarial audios are very sensitive. Moreover, the feature of its spectrogram is very different from the human voice, and it can be treated as a special kind of noise. We construct speech enhancement systems and study their mechanism this year (accepted in Journals and conferences, e.g., ICASSP2022).
|
今後の研究の推進方策 |
Next year, we will continue to build concrete speech recognition systems with new popular models and attacking methods with state-of-the-art frameworks, e.g., transformer. To defend against the attacks, we are very interested in using existing technologies, e.g., speech enhancement or adaptation, to solve this problem. Papers from journals and conferences will be expected.
|
次年度使用額が生じた理由 |
Last year, because of COVID19, all international conferences and academic visiting were canceled. I did not spend the funding, and I mainly did online research activity.
This year, regarding business regularization, I will continue to limit business traveling. So, the funding will be spent on purchasing devices (e.g., spoken dialogue robot, database, musical instrument) and paper publication fees (e.g., books, conferences, and journal papers).
|