Safe and secure speech information processing based on liveness detection and ASVspoof challenge

Research Project

Project/Area Number	18KT0051
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Multi-year Fund
Section	特設分野
Research Field	The Information Society and Trust
Research Institution	National Institute of Informatics
Principal Investigator	Yamagishi Junichi 国立情報学研究所, コンテンツ科学研究系, 教授 (70709352)
Co-Investigator(Kenkyū-buntansha)	大木哲史静岡大学, 情報学部, 准教授 (80537407)
Project Period (FY)	2018-07-18 – 2021-03-31
Project Status	Completed (Fiscal Year 2020)
Budget Amount *help	¥18,460,000 (Direct Cost: ¥14,200,000、Indirect Cost: ¥4,260,000) Fiscal Year 2020: ¥5,590,000 (Direct Cost: ¥4,300,000、Indirect Cost: ¥1,290,000) Fiscal Year 2019: ¥5,720,000 (Direct Cost: ¥4,400,000、Indirect Cost: ¥1,320,000) Fiscal Year 2018: ¥7,150,000 (Direct Cost: ¥5,500,000、Indirect Cost: ¥1,650,000)
Keywords	音声情報処理 / 話者照合 / 生体検知 / 生体認証 / 音声インタフェース / なりすまし / ASVspoofチャレンジ
Outline of Final Research Achievements	As speech processing became more widespread in society, attacks on speaker verification and speech recognition began to occur. The purpose of this research is to improve the liveness detection technology of speech and to present a solution to the problem. Liveness detection is a machine learning technology that distinguishes between "voice obtained by another person without permission, processed, and reproduced by an external device" and "live voice uttered on the spot from the living body". Therefore, we built a DB containing a large amount of audio files synthesized by the latest speech synthesis and voice conversion technology, held a competition, and advanced the liveness detection technology in the field. It has become possible to detect artificial voices for which there is no perceived audible difference, and a solution has been obtained that realizes safe and secure voice information processing.
Academic Significance and Societal Importance of the Research Achievements	音声情報処理は多くのスマートデバイスで利用されており、社会を支える基盤技術である。音声の生体検知は音声インターフェースの手軽さとトラストの両方を同時に実現する技術であり、社会的意義は高い。実際、本研究を通して構築し、一般公開したDBは、世界のアカデミック組織のみならず、多くの企業にも利用されている。学術的意義も高く、多くの国際会議論文が本DBを利用している。現在AI技術により生成されたメディアが悪用される事が危惧され、deepfakeと呼ばれることもある。本研究は、音声を対象に研究を行ったが、その成果は映像や文字等にも応用可能であると考えられ、今後さらに発展させることが可能であると期待される。

Report

(4 results)

2020 Annual Research Report Final Research Report ( PDF )
2019 Research-status Report
2018 Research-status Report

Research Products
(40 results)

All 2021 2020 2019 2018 Other

All Int'l Joint Research (13 results) Journal Article (12 results) (of which Int'l Joint Research: 10 results, Peer Reviewed: 12 results, Open Access: 12 results) Presentation (11 results) (of which Int'l Joint Research: 4 results, Invited: 4 results) Book (1 results) Remarks (3 results)

[Int'l Joint Research] Eurecom/ロレーヌ大学/Avignon University(フランス)
- Related Report
  2020 Annual Research Report
[Int'l Joint Research] University of East Finland/Aalto University(フィンランド)
- Related Report
  2020 Annual Research Report
[Int'l Joint Research] 中国科学技術大学/iFlytek Research(中国)
- Related Report
  2020 Annual Research Report
[Int'l Joint Research] MIT/Google Inc(米国)
- Related Report
  2020 Annual Research Report
[Int'l Joint Research] University of Edinburgh(英国)
- Related Report
  2020 Annual Research Report
[Int'l Joint Research]
- Related Report
  2020 Annual Research Report
[Int'l Joint Research] Eurecom/INRIA(フランス)
- Related Report
  2019 Research-status Report
[Int'l Joint Research] University of East Finland(フィンランド)
- Related Report
  2019 Research-status Report
[Int'l Joint Research] University of Edinburgh(英国)
- Related Report
  2019 Research-status Report
[Int'l Joint Research] Johns Hopkins University(米国)
- Related Report
  2019 Research-status Report
[Int'l Joint Research] Eurecom/INRIA(フランス)
- Related Report
  2018 Research-status Report
[Int'l Joint Research] University of East Finland(フィンランド)
- Related Report
  2018 Research-status Report
[Int'l Joint Research] Johns Hopkins University(米国)
- Related Report
  2018 Research-status Report
[Journal Article] ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech2021
- Author(s)
  Nautsch Andreas、Wang Xin、Evans Nicholas、Kinnunen Tomi H.、Vestman Ville、Todisco Massimiliano、Delgado Hector、Sahidullah Md、Yamagishi Junichi、Lee Kong Aik
- Journal Title
  
  IEEE Transactions on Biometrics, Behavior, and Identity Science
  
  Volume: 3 Issue: 2 Pages: 252-265
- DOI
  10.1109/tbiom.2021.3059479
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] ASVspoof 2019: a large-scale public database of synthesized, converted and replayed speech2020
- Author(s)
  Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Hector Delgado, Andreas Nautsch, Nicholas Evans, Md Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sebastien Le Maguer, Markus Becker, Fergus Henderson他計40名
- Journal Title
  
  Computer Speech & Language
  
  Volume: 64 Pages: 101114-101114
- DOI
  10.1016/j.csl.2020.101114
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals2020
- Author(s)
  Kinnunen Tomi、Delgado Hector、Evans Nicholas、Lee Kong Aik、Vestman Ville、Nautsch Andreas、Todisco Massimiliano、Wang Xin、Sahidullah Md、Yamagishi Junichi、Reynolds Douglas A.
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 2195-2210
- DOI
  10.1109/taslp.2020.3009494
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning2020
- Author(s)
  Kanervisto Anssi、Hautamaki Ville、Kinnunen Tomi、Yamagishi Junichi
- Journal Title
  
  ISCA The Speaker and Language Recognition Workshop Odyssey 2020
  
  Volume: － Pages: 151-158
- DOI
  10.21437/odyssey.2020-22
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Study on Possibility of Estimating Smartphone Inputs from Tap Sounds2020
- Author(s)
  Yumo Ouchi, Ryosuke Okudera, Yuya Shiomi, Kota Uehara, Ayaka Sugimoto, Tetsushi Ohki, Masakatsu Nishigaki
- Journal Title
  
  2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  
  Volume: － Pages: 1425-1429
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning2020
- Author(s)
  Anssi Kanervisto, Ville Hautamaki, Tomi Kinnunen, Junichi Yamagishi
- Journal Title
  
  Proc. Odyssey 2020 The Speaker and Language Recognition Workshop
  
  Volume: - Pages: 1-8
- Related Report
  2019 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Attentive Filtering Networks for Audio Replay Attack Detection2019
- Author(s)
  C. Lai, A. Abad, K. Richmond, J. Yamagishi, N. Dehak and S. King
- Journal Title
  
  Proc. 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  
  Volume: - Pages: 6316-6320
- DOI
  10.1109/icassp.2019.8682640
- Related Report
  2019 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection2019
- Author(s)
  Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Hector Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Tomi H. Kinnunen, Kong Aik Lee
- Journal Title
  
  Proc. Interspeech 2019
  
  Volume: - Pages: 1008-1012
- DOI
  10.21437/interspeech.2019-2249
- Related Report
  2019 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Efficient Spoofing Attack Detection against Unknown Sample using End-to-End Anomaly Detection2019
- Author(s)
  Ohki Tetsushi、Gupta Vishu、Nishigaki Masakatsu
- Journal Title
  
  2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  
  Volume: - Pages: 224-230
- DOI
  10.1109/apsipaasc47483.2019.9023183
- Related Report
  2019 Research-status Report
- Peer Reviewed / Open Access
[Journal Article] Attentive Filtering Networks for Audio Replay Attack Detection2019
- Author(s)
  Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King
- Journal Title
  
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  
  Volume: 1 Pages: 1-5
- Related Report
  2018 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion2018
- Author(s)
  Massimiliano Todisco, Hector Delgado, Kong Aik Lee, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi
- Journal Title
  
  Proc. Interspeech 2018
  
  Volume: 1 Pages: 77-81
- DOI
  10.21437/interspeech.2018-2289
- Related Report
  2018 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems2018
- Author(s)
  Fuming Fang, Junichi Yamagishi, Isao Echizen, Md Sahidullah, Tomi Kinnunen
- Journal Title
  
  2018 IEEE International Workshop on Information Forensics and Security (WIFS)
  
  Volume: 1 Pages: 1-9
- DOI
  10.1109/wifs.2018.8630764
- Related Report
  2018 Research-status Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Presentation] ディープフェイク画像からの個人再識別化に関する検討2021
- Author(s)
  藤垣成汰朗, 成田惇, 塩見裕哉, 菅沼弥生, 西垣正勝, 大木哲史
- Organizer
  暗号と情報セキュリティシンポジウム2021
- Related Report
  2020 Annual Research Report
[Presentation] スマートフォンのタップ音からの入力内容推測可能性に関する研究(その2)2021
- Author(s)
  大内結雲, 奥寺瞭介, 塩見祐哉, 大木哲史, 西垣正勝
- Organizer
  暗号と情報セキュリティシンポジウム2021
- Related Report
  2020 Annual Research Report
[Presentation] 深層生成モデルによるメディア生成とフェイク検知2020
- Author(s)
  山岸順一
- Organizer
  第23回情報論的学習理論ワークショップ (IBIS2020)
- Related Report
  2020 Annual Research Report
- Invited
[Presentation] 生体認証を回避する物理的なAdversarial Exampleの検討2020
- Author(s)
  Vo Ngoc Khoi Nguyen, 西垣正勝, 大木哲史
- Organizer
  第82回情報処理学会全国大会
- Related Report
  2019 Research-status Report
[Presentation] スマートフォンのタップ音からの入力内容推測可能性に関する研究2020
- Author(s)
  大内結雲, 奥寺瞭介, 塩見裕哉, 上原航汰, 杉本彩歌, 大木哲史, 西垣正勝
- Organizer
  暗号と情報セキュリティシンポジウム2020,
- Related Report
  2019 Research-status Report
[Presentation] 話者照合の生体検知チャレンジ「ASVspoof 2019」の概要と今後の展望2019
- Author(s)
  山岸順一
- Organizer
  第9回バイオメトリクスと認識・認証シンポジウム
- Related Report
  2019 Research-status Report
- Invited
[Presentation] フェイク動画問題: メディア解析技術によるアプローチ2019
- Author(s)
  山岸順一
- Organizer
  JST/CRDS 公開ワークショップ「意思決定のための情報科学～情報氾濫・フェイク・分断に立ち向かうことは可能か～」 2019年7月25日
- Related Report
  2019 Research-status Report
- Invited
[Presentation] Speaker Identity Cloning and Protection2019
- Author(s)
  Junichi Yamagishi
- Organizer
  AFEKA SPEECH PROCESSING CONFERENCE 2019: 10-YEAR ANNIVERSARY CONFERENCE
- Related Report
  2019 Research-status Report
- Int'l Joint Research / Invited
[Presentation] Attentive Filtering Networks for Audio Replay Attack Detection2019
- Author(s)
  Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King
- Organizer
  2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Related Report
  2018 Research-status Report
- Int'l Joint Research
[Presentation] Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion2018
- Author(s)
  Massimiliano Todisco, Hector Delgado, Kong Aik Lee, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi
- Organizer
  Interspeech 2018
- Related Report
  2018 Research-status Report
- Int'l Joint Research
[Presentation] Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems2018
- Author(s)
  Fuming Fang, Junichi Yamagishi, Isao Echizen, Md Sahidullah, Tomi Kinnunen
- Organizer
  2018 IEEE International Workshop on Information Forensics and Security (WIFS)
- Related Report
  2018 Research-status Report
- Int'l Joint Research
[Book] Introduction to Voice Presentation Attack Detection and Recent Advances (Chapter 15, Handbook of Biometric Anti-Spoofing, 2nd edition)2019
- Author(s)
  Md Sahidullah, Hector Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas Evans, Junichi Yamagishi, and Kong-Aik Lee
- Total Pages
  41
- Publisher
  Springer
- ISBN
  9783319926261
- Related Report
  2018 Research-status Report
[Remarks] ASVSpoof challenge website
- URL
  https://www.asvspoof.org
- Related Report
  2020 Annual Research Report
[Remarks] ASVspoof website
- URL
  https://www.asvspoof.org
- Related Report
  2019 Research-status Report
[Remarks] ASVspoof 2019
- URL
  http://www.asvspoof.org
- Related Report
  2018 Research-status Report

Safe and secure speech information processing based on liveness detection and ASVspoof challenge

Principal Investigator

Yamagishi Junichi 国立情報学研究所, コンテンツ科学研究系, 教授 (70709352)

¥18,460,000 (Direct Cost: ¥14,200,000、Indirect Cost: ¥4,260,000)

Report

Research Products

[Int'l Joint Research] Eurecom/ロレーヌ大学/Avignon University(フランス)

Related Report

[Int'l Joint Research] University of East Finland/Aalto University(フィンランド)

Related Report

[Int'l Joint Research] 中国科学技術大学/iFlytek Research(中国)

Related Report

[Int'l Joint Research] MIT/Google Inc(米国)

Related Report

[Int'l Joint Research] University of Edinburgh(英国)

Related Report

[Int'l Joint Research]

Related Report

[Int'l Joint Research] Eurecom/INRIA(フランス)

Related Report

[Int'l Joint Research] University of East Finland(フィンランド)

Related Report

[Int'l Joint Research] University of Edinburgh(英国)

Related Report

[Int'l Joint Research] Johns Hopkins University(米国)

Related Report

[Int'l Joint Research] Eurecom/INRIA(フランス)

Related Report

[Int'l Joint Research] University of East Finland(フィンランド)

Related Report

[Int'l Joint Research] Johns Hopkins University(米国)

Related Report

[Journal Article] ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech2021

Author(s)

Journal Title

DOI

Related Report

[Journal Article] ASVspoof 2019: a large-scale public database of synthesized, converted and replayed speech2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning2020

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Study on Possibility of Estimating Smartphone Inputs from Tap Sounds2020

Author(s)

Journal Title

Related Report

[Journal Article] An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning2020

Author(s)

Journal Title

Related Report

[Journal Article] Attentive Filtering Networks for Audio Replay Attack Detection2019

Author(s)

Journal Title

DOI

Related Report

[Journal Article] ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection2019

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Efficient Spoofing Attack Detection against Unknown Sample using End-to-End Anomaly Detection2019

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Attentive Filtering Networks for Audio Replay Attack Detection2019

Author(s)

Journal Title

Related Report

[Journal Article] Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion2018