Research on Silent interaction with deep neural networks

Research Project

Project/Area Number	19H04148
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Review Section	Basic Section 61020: Human interface and interaction-related
Research Institution	The University of Tokyo
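Principal Investigator	Rekimoto Jun The University of Tokyo, Interfaculty Initiative in Information Studies / Graduate School of Interdisciplinary Information Studies, Professor (20463896)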
Project Period (FY)	2019-04-01 – 2022-03-31
Project Status	Completed (Fiscal Year 2021)
Budget Amount	¥17,420,000 (Direct Cost: ¥13,400,000, Indirect Cost: ¥4,020,000); Fiscal Year 2021: ¥4,030,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥930,000); Fiscal Year 2020: ¥5,460,000 (Direct Cost: ¥4,200,000, Indirect Cost: ¥1,260,000); Fiscal Year 2019: ¥7,930,000 (Direct Cost: ¥6,100,000, Indirect Cost: ¥1,830,000)
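Keywords	silent speech / human augmentation / artificial intelligence / speech processing / deep learning / human-computer interaction / whisper speech / Human-AI-Integration / speech interaction / multimodal interaction / silent voice / human-AI integration / speech interface
Outline of Research at the Start	We study the recognition of silent speech (performing only the oral movements of speech, without vibrating the vocal cords) using deep learning. We build a neural network model that, from images of the oral cavity acquired by a probe, recognizes what the user has uttered without vibrating the vocal cords and generates acoustic features. This makes possible new wearable computer configurations that support a variety of interactions in which humans and computers cooperate closely. It also provides a technical foundation for helping people who have difficulty speaking because of pharyngeal disorders, vocal cord dysfunction, or advanced age regain voice communication.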
Outline of Final Research Achievements	Speech interfaces are rapidly becoming popular, but they have limitations, such as being unusable in public or noisy environments. In this project, we studied silent speech recognition using deep learning. We built three mechanisms: a deep learning model that recognizes speech content from intraoral images captured by an ultrasonic imaging probe attached to the underside of the jaw; a mechanism that estimates speech from skin movement measured by acceleration sensors attached to the jaw and throat; and a mechanism that recognizes speech using acceleration sensors attached to a mask. We confirmed that the system can drive a smart speaker or other spoken dialogue system. Furthermore, we succeeded in constructing a multimodal interface that combines eye gaze information with command recognition from lip images.
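Academic Significance and Societal Importance of the Research Achievements	These results open up the possibility of using speech interaction beyond its conventional limitations, such as being unusable in public or noisy environments. Speech interaction is fast compared with other input methods and leaves the hands free. Silent speech may come to be widely used as a means of interaction for future mobile and wearable interfaces. It is also significant as an assistive technology that helps people who have difficulty speaking because of pharyngeal disorders, vocal cord dysfunction, or advanced age regain voice communication.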

Report

(4 results)

2021 Annual Research Report
Final Research Report (PDF)
2020 Annual Research Report
2019 Annual Research Report

Research Products
(18 results)

All 2022 2021 2020 2019 Other

All Int'l Joint Research (1 results) Journal Article (8 results) (of which Int'l Joint Research: 3 results, Peer Reviewed: 8 results) Presentation (7 results) (of which Int'l Joint Research: 5 results, Invited: 4 results) Remarks (1 results) Funded Workshop (1 results)

[Int'l Joint Research] Georgia Institute of Technology/College of Computing (USA)
- Related Report
  2019 Annual Research Report
[Journal Article] SilentSpeller: Towards mobile, hands-free, silent speech text entry using electropalatography (2022)
- Author(s)
  Kimura Naoki, Gemicioglu Tan, Womack Jonathan, Li Richard, Zhao Yuhui, Bedri Abdelkareem, Su Zixiong, Olwal Alex, Rekimoto Jun, Starner Thad
- Journal Title
  
  Proceedings of the ACM on Human-Computer Interaction
  
  Volume: - Pages: 1-5
- DOI
  10.1145/3491102.3502015
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] JustSpeak: Automated, User-Configurable, Interactive Agents for Speech Tutoring (2021)
- Author(s)
  Xinlei Zhang, Takashi Miyaki, and Jun Rekimoto
- Journal Title
  
  Proc. ACM Hum.-Comput. Interact. 5, EICS
  
  Volume: Article 202 Issue: EICS Pages: 24-24
- DOI
  10.1145/3459744
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Gaze+Lip: Rapid, Precise and Expressive Interactions Combining Gaze Input and Silent Speech Commands for Hands-free Smart TV Control (2021)
- Author(s)
  Su Zixiong, Zhang Xinlei, Kimura Naoki, Rekimoto Jun
- Journal Title
  
  ETRA21 ACM Symposium on Eye Tracking Research and Applications
  
  Volume: - Pages: 1-6
- DOI
  10.1145/3448018.3458011
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Human Augmentation for Skill Acquisition and Skill Transfer (2021)
- Author(s)
  Hideki Koike, Jun Rekimoto, Junichi Ushiba, Shinichi Furuya, and Asa Ito
- Journal Title
  
  In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (CHI EA '21). Association for Computing Machinery
  
  Volume: Article 93 Pages: 1-3
- DOI
  10.1145/3411763.3441354
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] A Language Acquisition Support System that Presents Differences and Distances from Model Speech (2021)
- Author(s)
  Kazuki Kawamura and Jun Rekimoto
- Journal Title
  
  The Adjunct Publication of the 34th Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery
  
  Volume: - Pages: 44-46
- DOI
  10.1145/3474349.3480225
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] SilentMask: Mask-type Silent Speech Interface with Measurement of Mouth Movement (2021)
- Author(s)
  Hirotaka Hiraki, Jun Rekimoto
- Journal Title
  
  Augmented Humans 2020
  
  Volume: 2021 Pages: 1-8
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] Derma: Silent Speech Interaction Using Transcutaneous Motion Sensing (2021)
- Author(s)
  Jun Rekimoto, Yu Nishimura
- Journal Title
  
  Augmented Humans 2020
  
  Volume: 2021 Pages: 1-8
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] TieLent: A Casual Neck-Mounted Mouth Capturing Device for Silent Speech Interaction (2020)
- Author(s)
  Kimura Naoki, Hayashi Kentaro, Rekimoto Jun
- Journal Title
  
  AVI 2020
  
  Volume: 2020 Pages: 1-8
- DOI
  10.1145/3399715.3399852
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Presentation] Human Augmentation and the future of Human-Computer Integration (2020)
- Author(s)
  Jun Rekimoto
- Organizer
  IEEE InTech 2020
- Related Report
  2020 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Human Augmentation: Extending and Evolving Human Abilities (2020)
- Author(s)
  Jun Rekimoto
- Organizer
  MIRU2020
- Related Report
  2020 Annual Research Report
- Invited
[Presentation] SottoVoce: Silent Speech Interaction Using Ultrasound Images and Deep Learning (2019)
- Author(s)
  Jun Rekimoto, Naoki Kimura, Michinari Kono
- Organizer
  Interaction 2019
- Related Report
  2019 Annual Research Report
[Presentation] Homo Cyberneticus: The Era of Human-AI Integration (2019)
- Author(s)
  Jun Rekimoto
- Organizer
  ACM UIST 2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Presentation] Human Augmentation and the future of Human-Computer Interactions (2019)
- Author(s)
  Jun Rekimoto
- Organizer
  CHIuXID, 5th International ACM In-Cooperation HCI and UX Conference
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Human Augmentation (keynote) (2019)
- Author(s)
  Jun Rekimoto
- Organizer
  ACM MobileHCI2019
- Related Report
  2019 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] SottoVoce: An Ultrasound Imaging-Based Silent Speech Interaction Using Deep Neural Networks (2019)
- Author(s)
  Naoki Kimura, Michinari Kono, Jun Rekimoto
- Organizer
  CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
- Related Report
  2019 Annual Research Report
- Int'l Joint Research
[Remarks] SottoVoce
- URL
  https://lab.rekimoto.org/projects/sottovoce/
- Related Report
  2019 Annual Research Report
[Funded Workshop] Human Augmentation for Skill Acquisition and Skill Transfer (2021)
- Related Report
  2021 Annual Research Report
