A Study on Enhancing Semantic Representation Learning and Its Application to Disease Name Estimation

Research Project

Project/Area Number	20K11833
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Review Section	Basic Section 60080:Database-related
Research Institution	Fukui University of Technology
Principal Investigator	Keshi Ikuo 福井工業大学, 工学部, 教授 (40815867)
Co-Investigator(Kenkyū-buntansha)	中川肇富山大学, 学術研究部医学系, 教授 (30135256) 林篤志富山大学, 学術研究部医学系, 教授 (20283773) 辻岡和孝金城大学, 社会福祉学部, 講師 (50724960)
Project Period (FY)	2020-04-01 – 2024-03-31
Project Status	Completed (Fiscal Year 2023)
Budget Amount *help	¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000) Fiscal Year 2022: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000) Fiscal Year 2021: ¥1,170,000 (Direct Cost: ¥900,000、Indirect Cost: ¥270,000) Fiscal Year 2020: ¥2,080,000 (Direct Cost: ¥1,600,000、Indirect Cost: ¥480,000)
Keywords	電子カルテ / 病名推定 / 意味表現学習 / 大規模言語モデル / BERT / GPT-4 / 病名シソーラス / 機械学習 / ICD10 / ニューラルネットワーク / 退院サマリ / 生成AI / 単語意味ベクトル辞書 / 医工連携 / ディープラーニング
Outline of Research at the Start	ニューラルネットワークは学習結果の解釈性の課題がある．また，大規模データが必須であり，レアな病名が多い電子カルテの病名推定に応用することは困難である．研究代表者は，ニューラルネットワークの隠れノードを有限個で意味を代表する特徴単語で表現し，特徴単語との関連性を記述した単語意味ベクトル辞書を単語と隠れノード間の重み初期値に導入することにより，学習結果の解釈性を向上させた（意味表現学習と呼ぶ）．本研究の目的は，病名推定を対象に特徴単語の新分野への対応，深層学習との融合による解釈性のある病名推定の性能向上，特徴単語空間での主訴分析のためのプラットフォーム化である．
Outline of Final Research Achievements	This study developed a technology for automatically estimating disease names from electronic medical records (discharge summaries and chief complaints) for potential diagnostic support in clinical settings. Using a disease name thesaurus, semantic representation learning achieved highly accurate and interpretable estimations. Further accuracy improvements were attained with BERT for discharge summaries. For chief complaints, semantic representation learning identified optimal conditions, and the use of the generative AI GPT-4 with retrieval augmented generation (RAG) achieved high accuracy, demonstrating the influence of reference data scope.
Academic Significance and Societal Importance of the Research Achievements	本研究は、電子カルテの経過要約や主訴から病名を自動推定する技術を開発することで、医療分野における診断の透明性と効率性を向上させる点で意義を持つ。学術的には、病名シソーラスを活用した意味表現学習により、高精度な推定と解釈可能性を両立できることを示した。さらに、大規模言語モデルBERTやGPT-4を用いることで、精度が飛躍的に向上することを実証した。社会的意義としては、診療情報管理士や医師の負担軽減、医療費削減、医療サービスの質向上への貢献が期待される。本研究の成果は、医療現場の業務効率化を推進し、より質の高い医療提供体制の構築に貢献するものである。

Report

(5 results)

2023 Annual Research Report Final Research Report ( PDF )
2022 Research-status Report
2021 Research-status Report
2020 Research-status Report

Research Products
(7 results)

All 2023 2022

All Journal Article (2 results) (of which Peer Reviewed: 1 results) Presentation (4 results) (of which Int'l Joint Research: 1 results) Patent(Industrial Property Rights) (1 results)

[Journal Article] Interpretable Disease Name Estimation based on Learned Models using Semantic Representation Learning of Medical Terms.2022
- Author(s)
  Ikuo Keshi, Ryota Daimon, Atsushi Hayashi
- Journal Title
  
  Proceedings of the 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
  
  Volume: 1 Pages: 265-272
- DOI
  10.5220/0011548900003335
- Related Report
  2022 Research-status Report
[Journal Article] 自然言語処理を利用した本邦版Computer Assisted Coding構築手法の研究2022
- Author(s)
  辻岡和孝，芥子育雄，中川肇，林篤志
- Journal Title
  
  日本診療情報管理学会誌「診療情報管理」
  
  Volume: -
- Related Report
  2021 Research-status Report
- Peer Reviewed
[Presentation] 意味表現学習による解釈可能な病名推定と大規模言語モデルによる高精度な病名推定2023
- Author(s)
  大門諒大, 芥子育雄, 林篤志
- Organizer
  2023年度電気・情報関係学会北陸支部連合大会
- Related Report
  2023 Annual Research Report
[Presentation] Interpretable Disease Name Estimation based on Learned Models using Semantic Representation Learning of Medical Terms.2022
- Author(s)
  Ikuo Keshi, Ryota Daimon, Atsushi Hayashi
- Organizer
  KDIR 2022 : 14th International Conference on Knowledge Discovery and Information Retrieval
- Related Report
  2022 Research-status Report
- Int'l Joint Research
[Presentation] 医療辞書を用いた意味表現学習による解釈性のある病名推定2022
- Author(s)
  大門諒大, 芥子育雄, 林篤志
- Organizer
  2022年度電気・情報関係学会北陸支部連合大会
- Related Report
  2022 Research-status Report
[Presentation] 意味表現学習とサポートベクタマシンを用いたCACシステムの評価2022
- Author(s)
  辻岡和孝，芥子育雄，中川肇，林篤志
- Organizer
  第17回医療情報学会中部支部会学術大会
- Related Report
  2021 Research-status Report
[Patent(Industrial Property Rights)] 医療用単語意味表現学習方法を用いた病名学習済みモデルとこれを用いた解釈性のある病名推定システム及びその推定方法2022
- Inventor(s)
  芥子育雄
- Industrial Property Rights Holder
  金井学園
- Industrial Property Rights Type
  特許
- Industrial Property Number
  2022-057301
- Filing Date
  2022
- Related Report
  2021 Research-status Report

A Study on Enhancing Semantic Representation Learning and Its Application to Disease Name Estimation

Principal Investigator

Keshi Ikuo 福井工業大学, 工学部, 教授 (40815867)

¥4,290,000 (Direct Cost: ¥3,300,000、Indirect Cost: ¥990,000)

Report

Research Products

[Journal Article] Interpretable Disease Name Estimation based on Learned Models using Semantic Representation Learning of Medical Terms.2022

Author(s)

Journal Title

DOI

Related Report

[Journal Article] 自然言語処理を利用した本邦版Computer Assisted Coding構築手法の研究2022

Author(s)

Journal Title

Related Report

[Presentation] 意味表現学習による解釈可能な病名推定と大規模言語モデルによる高精度な病名推定2023

Author(s)

Organizer

Related Report

[Presentation] Interpretable Disease Name Estimation based on Learned Models using Semantic Representation Learning of Medical Terms.2022

Author(s)

Organizer

Related Report

[Presentation] 医療辞書を用いた意味表現学習による解釈性のある病名推定2022

Author(s)

Organizer

Related Report

[Presentation] 意味表現学習とサポートベクタマシンを用いたCACシステムの評価2022

Author(s)

Organizer

Related Report

[Patent(Industrial Property Rights)] 医療用単語意味表現学習方法を用いた病名学習済みモデルとこれを用いた解釈性のある病名推定システム及びその推定方法2022

Inventor(s)

Industrial Property Rights Holder

Industrial Property Rights Type

Industrial Property Number

Filing Date

Related Report