Infrastructural spoken language technology to support smooth communication with hearing-impaired people in education

Research Project

Project/Area Number	20H01716
Research Category	Grant-in-Aid for Scientific Research (B)
Allocation Type	Single-year Grants
Section	General
Review Section	Basic Section 09070: Educational technology-related
Research Institution	Yamato University (2022-2023), Tsukuba University of Technology (2020-2021)
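Principal Investigator	Kobayashi Akio, Yamato University, Faculty of Informatics, Professor (10741168)
Co-Investigator (Kenkyū-buntansha)	Kitaoka Norihide (北岡教英), Toyohashi University of Technology, Graduate School of Engineering, Professor (10333501); Nishizaki Hiromitsu (西崎博光), University of Yamanashi, Graduate Faculty of Interdisciplinary Research, Professor (40362082); Yasu Keiichi (安啓一), Tsukuba University of Technology, Faculty of Industrial Technology, Associate Professor (70407352)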
Project Period (FY)	2020-04-01 – 2023-03-31
Project Status	Completed (Fiscal Year 2023)
Budget Amount	¥18,070,000 (Direct Cost: ¥13,900,000, Indirect Cost: ¥4,170,000); Fiscal Year 2022: ¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000); Fiscal Year 2021: ¥4,030,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥930,000); Fiscal Year 2020: ¥9,750,000 (Direct Cost: ¥7,500,000, Indirect Cost: ¥2,250,000)
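Keywords	deaf and hard of hearing / speech recognition / speech communication / people with hearing impairments / information accessibility / hearing impairment / neural network / data augmentation / spoken language processing / braille translation / people with visual impairments / deafblind / generative adversarial network / speech corpus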
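Outline of Research at the Start	Communication between hearing people and people with hearing impairments relies on written notes or sign language interpretation, but neither achieves smooth communication. This project therefore studies a foundational spoken language technology (a "communication bridge") that enables smooth communication for both hearing people and people with hearing impairments. It covers five research items: 1) collection of speech from people with hearing impairments; 2) acoustic features effective for communication between hearing people and people with hearing impairments; 3) speech recognition that reflects the acoustic characteristics of hearing-impaired speech; 4) construction of a communication bridge that integrates these; and 5) its evaluation. The research is expected to deepen mutual understanding between hearing people and people with hearing impairments and, in turn, to support the social participation of people with disabilities.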
Outline of Final Research Achievements	Speech interfaces based on automatic speech recognition (ASR) remain difficult to use for deaf and hard-of-hearing (DHH) individuals whose articulation is ambiguous, because ASR developed for speakers with normal hearing performs inadequately on their speech. To address this, the study built a speech corpus from 50 DHH individuals, compiling a large amount of read speech and spontaneous (conversational) speech. The corpus was also used to improve ASR performance for DHH speakers. The results showed a significant reduction in phoneme errors for DHH read speech, although not as significant as for speakers with normal hearing.
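Academic Significance and Societal Importance of the Research Achievements	Corpora have been built around the world from the standpoint of hearing-impairment research, but there is no precedent for a large corpus of about 50 speakers designed with applications to spoken language processing, including speech recognition, in mind. The corpus therefore contributes substantially to progress in spoken language processing technology for people with hearing impairments. The speech of people with severe hearing impairments is diverse: some speakers reach sufficient recognition performance with existing speech recognition software, whereas for others, unstable articulatory movements limit the improvement even with methods such as speaker adaptation. These findings are a step toward devices equipped with speech interfaces that support people with hearing impairments.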

Report

(4 results)

Research Products
(28 results)

Journal Article (11 results; of which Peer Reviewed: 11 results, Open Access: 2 results), Presentation (17 results)

[Journal Article] Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition (2023)
- Author(s)
  Kobayashi Akio, Yasu Keiichi
- Journal Title
  2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  Volume: 1 Pages: 1-6
- DOI
  10.1109/apsipaasc58517.2023.10317192
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] End-To-End Speech to Braille Translation in Japanese (2022)
- Author(s)
  Akio Kobayashi, Junji Onishi, Hiromitsu Nishizaki, Norihide Kitaoka
- Journal Title
  Conference Proceedings of 2022 IEEE International Conference on Consumer Electronics (ICCE)
  Volume: 2022 Pages: 1-2
- DOI
  10.1109/icce53296.2022.9730468
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] Dynamic out-of-vocabulary word registration to language model for speech recognition (2021)
- Author(s)
  Kitaoka Norihide, Chen Bohan, Obashi Yuya
- Journal Title
  EURASIP Journal on Audio, Speech, and Music Processing
  Volume: 2021 Issue: 1 Pages: 1-8
- DOI
  10.1186/s13636-020-00193-1
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Corpus Design and Automatic Speech Recognition for Deaf and Hard-of-Hearing People (2021)
- Author(s)
  Kobayashi Akio, Yasu Keiichi, Nishizaki Hiromitsu, Kitaoka Norihide
- Journal Title
  2021 IEEE 10th Global Conference on Consumer Electronics (GCCE)
  Volume: N.A. Pages: 17-18
- DOI
  10.1109/gcce53005.2021.9621959
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] Advanced language model fusion method for encoder-decoder model in Japanese speech (2021)
- Author(s)
  Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka
- Journal Title
  Proc. APSIPA ASC 2021
  Volume: - Pages: 503-510
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] End-to-end spontaneous speech recognition using hesitation labeling (2021)
- Author(s)
  Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka
- Journal Title
  Proc. APSIPA ASC 2021
  Volume: - Pages: 1077-1081
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi (2021)
- Author(s)
  Wang Yu, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki
- Journal Title
  Proceedings of 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE)
  Volume: - Pages: 346-350
- DOI
  10.1109/gcce53005.2021.9621992
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] Development of a Low-Latency and Real-Time Automatic Speech Recognition System (2020)
- Author(s)
  Leow Chee Siang, Hayakawa Tomoaki, Nishizaki Hiromitsu, Kitaoka Norihide
- Journal Title
  Proceedings of the 2020 IEEE 9th Global Conference on Consumer Electronics
  Volume: - Pages: 464-467
- DOI
  10.1109/gcce50665.2020.9291818
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] ExKaldi: A Python-based Extension Tool of Kaldi (2020)
- Author(s)
  Wang Yu, Leow Chee Siang, Nishizaki Hiromitsu, Kobayashi Akio, Utsuro Takehito
- Journal Title
  Proceedings of the 2020 IEEE 9th Global Conference on Consumer Electronics
  Volume: - Pages: 470-473
- DOI
  10.1109/gcce50665.2020.9291717
- Related Report
  2020 Annual Research Report
- Peer Reviewed
[Journal Article] End-to-end recognition of streaming Japanese speech using CTC and local attention (2020)
- Author(s)
  Chen Jiahao, Nishimura Ryota, Kitaoka Norihide
- Journal Title
  APSIPA Transactions on Signal and Information Processing
  Volume: 9 Issue: 1 Pages: 1-7
- DOI
  10.1017/atsip.2020.23
- NAID
  120007045056
- Related Report
  2020 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Disfluencies and Strategies Used by People Who Stutter During a Working Memory Task (2020)
- Author(s)
  Arongna, Naomi Sakai, Keiichi Yasu, Koichi Mori
- Journal Title
  Journal of Speech, Language, and Hearing Research
  Volume: 63
- Related Report
  2020 Annual Research Report
- Peer Reviewed
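[Presentation] A Study on the Use of Pseudo Training Data in Speech Enhancement for Radio Speech (2024)
- Author(s)
  小林彰夫, 安啓一
- Organizer
  86th National Convention of the Information Processing Society of Japan (IPSJ)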
- Related Report
  2022 Annual Research Report
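[Presentation] A Study Using Objective and Subjective Evaluation of the Intelligibility of Hard-of-Hearing Speakers' Speech Extracted from Mixed Speech (2024)
- Author(s)
  藤江匠汰, 安啓一, 小林彰夫
- Organizer
  Information Processing Society of Japan (IPSJ), Special Interest Group on Accessibility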
- Related Report
  2022 Annual Research Report
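[Presentation] Evaluation of Speech Recognition for Hearing-Impaired Speakers Using Speaker Adaptation (2024)
- Author(s)
  高橋快斗, 木内貴浩, 若林佑幸, 太田健吾, 小林彰夫, 北岡教英
- Organizer
  Information Processing Society of Japan (IPSJ), 151st Meeting of the Special Interest Group on Spoken Language Processing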
- Related Report
  2022 Annual Research Report
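[Presentation] Evaluation of Speech Recognition Based on Self-Supervised Learning for Hearing-Impaired Speech (2024)
- Author(s)
  高橋快斗, 木内貴浩, 若林佑幸, 太田健吾, 小林彰夫, 北岡教英
- Organizer
  Acoustical Society of Japan (ASJ) Spring Meeting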
- Related Report
  2022 Annual Research Report
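[Presentation] Design of an End-to-End Speech Recognition Model Augmented with Out-of-Task Acoustic Information (2022)
- Author(s)
  森大輝, 太田健吾, 西村良太, 小川厚徳, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting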
- Related Report
  2021 Annual Research Report
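[Presentation] End-to-End Speech Recognition with Hesitation Cleanup Using Disfluency Labels (2022)
- Author(s)
  堀井こはる, 福田芽衣子, 太田健吾, 西村良太, 小川厚徳, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting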
- Related Report
  2021 Annual Research Report
[Presentation] End-to-End Speech-to-Braille Translation for Read Sentences (2021)
- Author(s)
  小林彰夫, 大西淳児, 西崎博光, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting
- Related Report
  2021 Annual Research Report
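[Presentation] A Method for Replacing Implicit Linguistic Information in Encoder-Decoder Speech Recognition Models (2021)
- Author(s)
  森大輝, 太田健吾, 西村良太, 小川厚徳, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting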
- Related Report
  2021 Annual Research Report
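[Presentation] End-to-End Recognition of Spontaneous Speech Taking Hesitations into Account (2021)
- Author(s)
  堀井こはる, 福田芽衣子, 太田健吾, 西村良太, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting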
- Related Report
  2021 Annual Research Report
[Presentation] A Method for Replacing Implicit Linguistic Information in End-to-End Speech Recognition Models (2021)
- Author(s)
  森大輝, 太田健吾, 西村良太, 小川厚徳, 北岡教英
- Organizer
  Ongaku Symposium (音学シンポジウム)
- Related Report
  2021 Annual Research Report
[Presentation] Development of a Kaldi-Based Streaming Speech Recognition System (2021)
- Author(s)
  レオチーシャン, 王宇, 小林彰夫, 宇津呂武仁, 西崎博光
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) 2021 Autumn Meeting
- Related Report
  2021 Annual Research Report
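[Presentation] Collection of Speech Data from Hearing-Impaired Speakers and Evaluation by Phoneme Recognition (2021)
- Author(s)
  小林彰夫, 安啓一, 西崎博光, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting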
- Related Report
  2020 Annual Research Report
[Presentation] Streaming Speech Recognition Using Uni-directional LSTM and Local Attention (2020)
- Author(s)
  陳家浩, 西村良太, 北岡教英
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Meeting
- Related Report
  2020 Annual Research Report
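[Presentation] Relationship Between Speech Audiometry Results and Speakers' Own Speech Intelligibility in Young Hearing-Aid and Cochlear-Implant Users (2020)
- Author(s)
  安啓一, 種子田尚人, 石井悠貴
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Autumn Meeting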
- Related Report
  2020 Annual Research Report
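[Presentation] A Study of Changes in the Impression of Popular Songs with Age-Related Hearing Loss: Using Simulated Hearing Loss (2020)
- Author(s)
  寺澤洋子, 水野真由美, 山本雄也, 大中悠生, 石川嘉秀, 松井淑恵, 安啓一
- Organizer
  Proceedings of the Acoustical Society of Japan (ASJ) Autumn Meeting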
- Related Report
  2020 Annual Research Report
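[Presentation] Toward Research on the Perception of Auditory Signals by People with Age-Related Hearing Loss (2020)
- Author(s)
  大中悠生, 安啓一, 松井淑惠, 寺澤洋子
- Organizer
  Information Processing Society of Japan (IPSJ) SIG Technical Reports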
- Related Report
  2020 Annual Research Report
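[Presentation] A Study on Tonality Perception in People with Hearing Impairments (2020)
- Author(s)
  寺澤洋子, 相馬翔太, 安啓一, 平賀瑠美
- Organizer
  Information Processing Society of Japan (IPSJ) SIG Technical Reports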
- Related Report
  2020 Annual Research Report
