Secure, Precise and Fast Sequential Pattern Mining with Learning Data Distribution

Research Project

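Project/Area Number	21K17746
Research Category	Grant-in-Aid for Early-Career Scientists
Allocation Type	Multi-year Fund
Review Section	Basic Section 60080: Database-related
Research Institution	Tokyo Institute of Technology
Principal Investigator	Le Hieu Hanh, Tokyo Institute of Technology, School of Computing, Assistant Professor (60813996)
Project Period (FY)	2021-04-01 – 2024-03-31
Project Status	Granted (Fiscal Year 2022)
Budget Amount *Note	¥4,420,000 (Direct Cost: ¥3,400,000, Indirect Cost: ¥1,020,000) Fiscal Year 2023: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000) Fiscal Year 2022: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000) Fiscal Year 2021: ¥2,080,000 (Direct Cost: ¥1,600,000, Indirect Cost: ¥480,000)
Keywords	data mining / privacy / medical data / differential privacy / recommendation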
Outline of Research at the Start	This study aims to present a method that eliminates the need for trust in sequential pattern mining (SPM) while preserving privacy, providing secure, precise, and fast sequential data analysis that carefully learns the data distribution. Execution time is to be reduced through parallel computation on modern hardware such as scalable multi-core CPUs. The feasibility of the proposed method will be studied using both open datasets and real medical data.
Outline of Annual Research Achievements	This study aims to present a method that eliminates the need for trust in sequential pattern mining (SPM) while preserving privacy, providing secure, precise, and fast sequential data analysis that carefully learns the data distribution. This year, the fundamental algorithms for analyzing sequential medical data without privacy preservation were studied. Specifically, several methods for analyzing sequence variants from multiple hospitals were designed and evaluated. A basic privacy-preserving SPM method was also studied in detail, and initial experimental results were obtained. To estimate sequence frequencies, an appropriate amount of noise is added to the original frequency to ensure privacy.
Current Status of Research Progress	3: Progress in research has been slightly delayed. Reason: The basic algorithm has been developed. However, the evaluation process has been delayed by constraints on the use of sensitive data.
Strategy for Future Research Activity	The evaluation process will be improved by using other datasets that are easier to access. In addition, ways to further improve the performance of the privacy-preserving data analysis will be studied.
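
The achievements summary above states that noise is added to each sequence's original frequency to ensure privacy, and the keywords mention differential privacy. The record does not say which noise distribution or parameters the project uses, so the following is only a minimal sketch assuming the standard Laplace mechanism with a privacy budget `epsilon`. The toy `DATABASE`, the function names, and the medical-order labels are hypothetical illustrations, not the project's data or code.

```python
import random
from typing import Sequence

# Hypothetical toy database: one list of medical orders per patient.
DATABASE = [
    ["admission", "blood_test", "surgery", "discharge"],
    ["admission", "surgery", "discharge"],
    ["admission", "blood_test", "discharge"],
]


def is_subsequence(pattern: Sequence[str], sequence: Sequence[str]) -> bool:
    """Return True if `pattern` occurs in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    # `item in it` advances the iterator, so order is enforced.
    return all(item in it for item in pattern)


def support(pattern: Sequence[str], database: Sequence[Sequence[str]]) -> int:
    """Count the sequences in `database` that contain `pattern`."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def laplace_noise(scale: float, rng: random.Random) -> float:
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_support(
    pattern: Sequence[str],
    database: Sequence[Sequence[str]],
    epsilon: float,
    rng: random.Random,
) -> float:
    """Release the support of `pattern` with Laplace noise (assumed mechanism)."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    # Adding or removing one patient's sequence changes the support by at
    # most 1, so the sensitivity is 1 and the noise scale is 1 / epsilon.
    return support(pattern, database) + laplace_noise(1.0 / epsilon, rng)


if __name__ == "__main__":
    rng = random.Random(2021)
    pattern = ["admission", "surgery"]
    print("true support:", support(pattern, DATABASE))
    print("noisy support:", noisy_support(pattern, DATABASE, epsilon=1.0, rng=rng))
```

A smaller `epsilon` gives stronger privacy but noisier frequencies. The sketch covers the release of a single pattern's frequency; releasing many patterns would require splitting the privacy budget across them, which is not shown here.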

Reports

(2 results)

2022 Research-status Report
2021 Research-status Report

Research Products

(12 results)

All: Journal Articles (3 results, of which 3 peer reviewed and 1 open access), Presentations (9 results, of which 1 at an international conference)

[Journal Article] Methods for Analyzing Medical-Order Sequence Variants in Sequential Pattern Mining for Electronic Medical Record Systems (2023)
- Author(s)/Presenter(s)
  Hieu Hanh Le, Tatsuhiro Yamada, Yuichi Honda, Takatoshi Sakamoto, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Journal Title
  ACM Transactions on Computing for Healthcare
  Volume: 4 Issue: 1 No.: 3 Pages: 1-28
- DOI
  10.1145/3561825
- Related Report
  2022 Research-status Report
- Peer Reviewed
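[Journal Article] シーケンスバリアントの比較と電子カルテの分析への応用 [Comparison of Sequence Variants and Its Application to the Analysis of Electronic Medical Records] (2023)
- Author(s)/Presenter(s)
  Yuqing Li, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Journal Title
  日本データベース学会データドリブンスタディーズ論文誌 [Database Society of Japan, Data-Driven Studies journal]
  Volume: 1 No.: 5 Pages: 1-8
- Related Report
  2022 Research-status Report
- Peer Reviewed / Open Access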
[Journal Article] Comparison of Sequence Variants and the Application in Electronic Medical Records (2022)
- Author(s)/Presenter(s)
  Yuqing Li, Hieu Hanh Le, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Journal Title
  Proceedings of the 33rd International Conference on Database and Expert Systems Applications (DEXA2022), Part 2
  Volume: 13427 Pages: 117-130
- DOI
  10.1007/978-3-031-12426-6_10
- ISBN
  9783031124259, 9783031124266
- Related Report
  2022 Research-status Report
- Peer Reviewed
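[Presentation] 動的に医療指示種類を変更したシーケンス解析における特徴的な治療パターン抽出 [Extraction of Characteristic Treatment Patterns in Sequence Analysis with Dynamically Changed Medical-Order Types] (2023)
- Author(s)/Presenter(s)
  黒川健人, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 15th Forum on Data Engineering and Information Management
- Related Report
  2022 Research-status Report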
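[Presentation] COVID-19に関する頻出医療指示パターンの時期による差異と差異発生時期の可視化 [Visualization of Period-Dependent Differences in Frequent COVID-19 Medical-Order Patterns and of When the Differences Arise] (2023)
- Author(s)/Presenter(s)
  Zhao Zitai, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 15th Forum on Data Engineering and Information Management
- Related Report
  2022 Research-status Report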
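[Presentation] クラスタリングを用いた多病院間の頻出医療指示パターン比較 [Comparison of Frequent Medical-Order Patterns across Multiple Hospitals Using Clustering] (2023)
- Author(s)/Presenter(s)
  安光夕輝, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 15th Forum on Data Engineering and Information Management
- Related Report
  2022 Research-status Report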
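[Presentation] COVID-19の異なる医療機関と時期における頻出治療パターンの比較 [Comparison of Frequent COVID-19 Treatment Patterns across Different Medical Institutions and Periods] (2022)
- Author(s)/Presenter(s)
  Zhao Zitai, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 42nd Joint Conference on Medical Informatics
- Related Report
  2022 Research-status Report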
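[Presentation] 数医療機関間の頻出医療指示パターン比較手法 [A Method for Comparing Frequent Medical-Order Patterns across Several Medical Institutions] (2022)
- Author(s)/Presenter(s)
  Haruo Yokota, Le Hieu Hanh, Li Yuqing, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki
- Conference Name
  The 26th Spring Conference of the Japan Association for Medical Informatics
- Related Report
  2022 Research-status Report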
[Presentation] MERJ: Medical Entity-Relation Extraction System for Japanese Clinical Texts (2022)
- Author(s)/Presenter(s)
  An Wang, Hieu Hanh Le, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 14th Forum on Data Engineering and Information Management (DEIM 202)
- Related Report
  2021 Research-status Report
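[Presentation] シーケンシャルパターンマイニングに基づく多病院間の頻出治療パターンの比較 [Comparison of Frequent Treatment Patterns across Multiple Hospitals Based on Sequential Pattern Mining] (2022)
- Author(s)/Presenter(s)
  Li Yuqing, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  Proceedings of the 14th Forum on Data Engineering and Information Management
- Related Report
  2021 Research-status Report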
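[Presentation] 医療データのシーケンス解析とその課題 [Sequence Analysis of Medical Data and Its Challenges] (2022)
- Author(s)/Presenter(s)
  Haruo Yokota, Le Hieu Hanh, Ryosuke Matsuo, Tomoyoshi Yamazaki, Kenji Araki
- Conference Name
  第１２回日本医療情報学会「医用人工知能研究会」人工知能学会「医用人工知能研究会」合同研究会 [12th joint meeting of the medical artificial intelligence study groups of the Japan Association for Medical Informatics and the Japanese Society for Artificial Intelligence]
- Related Report
  2021 Research-status Report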
[Presentation] Sequential Pattern Mining of Large Combinable Items with Values for a Set-of-items Recommendation (2021)
- Author(s)/Presenter(s)
  Hieu Hanh Le, Yutaka Horino, Tomoyoshi Yamazaki, Kenji Araki, Haruo Yokota
- Conference Name
  The 34th IEEE International Symposium on Computer-Based Medical Systems (CBMS 2021)
- Related Report
  2021 Research-status Report
- International Conference
