2020 Fiscal Year Annual Research Report

Safe and secure speech information processing based on liveness detection and ASVspoof challenge

Research Project

Project/Area Number	18KT0051
Research Institution	National Institute of Informatics
Principal Investigator	山岸順一国立情報学研究所, コンテンツ科学研究系, 教授 (70709352)
Co-Investigator(Kenkyū-buntansha)	大木哲史静岡大学, 情報学部, 准教授 (80537407)
Project Period (FY)	2018-07-18 – 2021-03-31
Keywords	音声情報処理 / なりすまし / 生体認証 / ASVspoofチャレンジ / 生体検知
Outline of Annual Research Achievements	音声アシスト等の音声情報処理端末が普及するにつれ、要素技術である話者照合や音声認識に対する攻撃が起き始め、セキュリティ上の課題となっている。本研究の目的は、提案者が成果を挙げて来た生体検知技術を先駆的に導入し、問題の一解決法を提示することである。生体検知技術とは「他人により許可なく取得され、加工され、外部デバイスにより再生された音声」と「生体からその場で発声した生の音声」を区別する機械学習技術である。提案者はこれまで話者照合の生体検知精度を競うASVSpoof challengeを開催するなど音声の生体検知技術の研究発展に大きく貢献して来た。 2018年度はASVspoof Challenge 2019用のデータベースとして、最先端の音声合成や声質変換技術により生成された様々な合成音声を含む大規模データベースを構築した。これはGoogleをはじめとする著名企業および分野をリードする大学合計17組織が保有する技術により生成された多様な合成音声を含み、音声の生体検知研究用ベンチマークとしては最大である。 2019年度では、本データベースを全世界の大学および企業合計150組織へ配布し、世界的なコンペティションを開催した。参加者は各自の技術により生体検知モデルを構築し、オーガナイザーが生体認証に対するエラーを考慮した上でランキングを行った。その結果、人間には聴覚上差がわからない様な加工音声に対しても、高精度に識別可能であることが判明し、安全安心な音声情報処理端末を実現する上で重要な結果が得られた。そのほか、国際会議Interspeech 2019およびASRU201におけるスペシャルセッションの開催、および、国際ジャーナルCSLにおける特集号も企画した。 2020年度ではより詳細な分析を行い、一連の研究活動を3本のジャーナル論文とてまとめ、国際ジャーナル誌において出版した。

Research Products
(15 results)

All 2021 2020 Other

All Int'l Joint Research (6 results) Journal Article (5 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 5 results, Open Access: 5 results) Presentation (3 results) (of which Invited: 1 results) Remarks (1 results)

[Int'l Joint Research] Eurecom/ロレーヌ大学/Avignon University(フランス)
- Country Name
  FRANCE
- Counterpart Institution
  Eurecom/ロレーヌ大学/Avignon University
[Int'l Joint Research] University of East Finland/Aalto University(フィンランド)
- Country Name
  FINLAND
- Counterpart Institution
  University of East Finland/Aalto University
[Int'l Joint Research] 中国科学技術大学/iFlytek Research(中国)
- Country Name
  CHINA
- Counterpart Institution
  中国科学技術大学/iFlytek Research
[Int'l Joint Research] MIT/Google Inc(米国)
- Country Name
  U.S.A.
- Counterpart Institution
  MIT/Google Inc
[Int'l Joint Research] University of Edinburgh(英国)
- Country Name
  UNITED KINGDOM
- Counterpart Institution
  University of Edinburgh
[Int'l Joint Research]
- # of Other Countries
  3
[Journal Article] ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech2021
- Author(s)
  Nautsch Andreas、Wang Xin、Evans Nicholas、Kinnunen Tomi H.、Vestman Ville、Todisco Massimiliano、Delgado Hector、Sahidullah Md、Yamagishi Junichi、Lee Kong Aik
- Journal Title
  
  IEEE Transactions on Biometrics, Behavior, and Identity Science
  
  Volume: 3 Pages: 252～265
- DOI
  10.1109/TBIOM.2021.3059479
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech2020
- Author(s)
  Wang Xin、Yamagishi Junichi、Todisco Massimiliano、Delgado Hector、Nautsch Andreas、Evans Nicholas、Sahidullah Md、Vestman Ville、Kinnunen Tomi、Lee Kong Aik、Juvela Lauri、Alku Paavo、Peng Yu-Huai、Hwang Hsin-Te、Tsao Yu、Wang Hsin-Min、Maguer Sebastien Le、Becker Markus、Henderson Fergus et al
- Journal Title
  
  Computer Speech & Language
  
  Volume: 64 Pages: 101114～101114
- DOI
  10.1016/j.csl.2020.101114
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals2020
- Author(s)
  Kinnunen Tomi、Delgado Hector、Evans Nicholas、Lee Kong Aik、Vestman Ville、Nautsch Andreas、Todisco Massimiliano、Wang Xin、Sahidullah Md、Yamagishi Junichi、Reynolds Douglas A.
- Journal Title
  
  IEEE/ACM Transactions on Audio, Speech, and Language Processing
  
  Volume: 28 Pages: 2195～2210
- DOI
  10.1109/taslp.2020.3009494
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning2020
- Author(s)
  Kanervisto Anssi、Hautamaki Ville、Kinnunen Tomi、Yamagishi Junichi
- Journal Title
  
  ISCA The Speaker and Language Recognition Workshop Odyssey 2020
  
  Volume: － Pages: 151～158
- DOI
  10.21437/Odyssey.2020-22
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Study on Possibility of Estimating Smartphone Inputs from Tap Sounds2020
- Author(s)
  Yumo Ouchi, Ryosuke Okudera, Yuya Shiomi, Kota Uehara, Ayaka Sugimoto, Tetsushi Ohki, Masakatsu Nishigaki
- Journal Title
  
  2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  
  Volume: － Pages: 1425-1429
- Peer Reviewed / Open Access
[Presentation] ディープフェイク画像からの個人再識別化に関する検討2021
- Author(s)
  藤垣成汰朗, 成田惇, 塩見裕哉, 菅沼弥生, 西垣正勝, 大木哲史
- Organizer
  暗号と情報セキュリティシンポジウム2021
[Presentation] スマートフォンのタップ音からの入力内容推測可能性に関する研究(その2)2021
- Author(s)
  大内結雲, 奥寺瞭介, 塩見祐哉, 大木哲史, 西垣正勝
- Organizer
  暗号と情報セキュリティシンポジウム2021
[Presentation] 深層生成モデルによるメディア生成とフェイク検知2020
- Author(s)
  山岸順一
- Organizer
  第23回情報論的学習理論ワークショップ (IBIS2020)
- Invited
[Remarks] ASVSpoof challenge website
- URL
  https://www.asvspoof.org

2020 Fiscal Year Annual Research Report

Safe and secure speech information processing based on liveness detection and ASVspoof challenge

Principal Investigator

山岸 順一 国立情報学研究所, コンテンツ科学研究系, 教授 (70709352)

Research Products

[Int'l Joint Research] Eurecom/ロレーヌ大学/Avignon University(フランス)

Country Name

Counterpart Institution

[Int'l Joint Research] University of East Finland/Aalto University(フィンランド)

Country Name

Counterpart Institution

[Int'l Joint Research] 中国科学技術大学/iFlytek Research(中国)

Country Name

Counterpart Institution

[Int'l Joint Research] MIT/Google Inc(米国)

Country Name

Counterpart Institution

[Int'l Joint Research] University of Edinburgh(英国)

Country Name

Counterpart Institution

[Int'l Joint Research]

# of Other Countries

[Journal Article] ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech2021

Author(s)

Journal Title

DOI

[Journal Article] ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech2020

Author(s)

Journal Title

DOI

[Journal Article] Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals2020

Author(s)

Journal Title

DOI

[Journal Article] An Initial Investigation on Optimizing Tandem Speaker Verification and Countermeasure Systems Using Reinforcement Learning2020

Author(s)

Journal Title

DOI

[Journal Article] Study on Possibility of Estimating Smartphone Inputs from Tap Sounds2020

Author(s)

Journal Title

[Presentation] ディープフェイク画像からの個人再識別化に関する検討2021

Author(s)

Organizer

[Presentation] スマートフォンのタップ音からの入力内容推測可能性に関する研究(その2)2021

Author(s)

Organizer

[Presentation] 深層生成モデルによるメディア生成とフェイク検知2020

Author(s)

Organizer

[Remarks] ASVSpoof challenge website

URL

山岸順一国立情報学研究所, コンテンツ科学研究系, 教授 (70709352)