Big Data Processing with Compressed Secure Computation

Research Project

Project/Area Number	21H05052
Research Category	Grant-in-Aid for Scientific Research (S)
Allocation Type	Single-year Grants
Review Section	Broad Section J
Research Institution	The University of Tokyo
Principal Investigator	定兼邦彦東京大学, 大学院情報理工学系研究科, 教授 (20323090)
Co-Investigator(Kenkyū-buntansha)	坂本比呂志九州工業大学, 大学院情報工学研究院, 教授 (50315123) 清水佳奈早稲田大学, 理工学術院, 教授 (60367050) 渋谷哲朗東京大学, 医科学研究所, 教授 (60396893) 申吉浩学習院大学, 付置研究所, 教授 (60523587) 神保洸貴東京理科大学, 創域理工学部情報計算科学科, 助教 (80966630)
Project Period (FY)	2021-07-05 – 2026-03-31
Project Status	Granted (Fiscal Year 2025)
Budget Amount *help	¥158,990,000 (Direct Cost: ¥122,300,000、Indirect Cost: ¥36,690,000) Fiscal Year 2025: ¥30,680,000 (Direct Cost: ¥23,600,000、Indirect Cost: ¥7,080,000) Fiscal Year 2024: ¥30,680,000 (Direct Cost: ¥23,600,000、Indirect Cost: ¥7,080,000) Fiscal Year 2023: ¥30,680,000 (Direct Cost: ¥23,600,000、Indirect Cost: ¥7,080,000) Fiscal Year 2022: ¥30,680,000 (Direct Cost: ¥23,600,000、Indirect Cost: ¥7,080,000) Fiscal Year 2021: ¥36,270,000 (Direct Cost: ¥27,900,000、Indirect Cost: ¥8,370,000)
Keywords	秘匿計算 / 簡潔データ構造 / 学習
Outline of Research at the Start	「圧縮秘匿計算」という新概念を提案する．これは次のような概念である．(a) 秘匿計算：個人のプライバシーを保護するためにデータを暗号化したまま保存，計算する．(b) 圧縮索引：データに付加する補助情報を圧縮することで大規模データを省資源の計算機で高速に処理する．(c) 圧縮計算：データを圧縮することで冗長性を排除し，データからの学習・解析の性能・速度を向上させる．この技術を個人ゲノム医療解析，自然言語等の非定型データの解析に応用する．
Outline of Annual Research Achievements	秘密分散に基づく秘匿計算として，配列へのランダムアクセスを可能にするORAM，データ圧縮に用いる可変長符号の符号化・復号化，グループに分割されたデータのソート，AESを用いた暗号化についてそれぞれ効率的なプロトコルを開発した． Function Secret Sharing という手法を用い，秘密分散法を用いて格納された文字列からの全文検索と，決定木評価のためのプロトコルを開発した．連合学習は，複数のパーティが自身の保有するデータを公開することなく，統合されたデータの学習を可能とすることを目的する技術である．データのカーネル値への変換にデータを秘匿する性質（データ秘匿性）があることを示した．カーネル値に基づく学習を安全に委託するためのスキームを示し，次いで，委託スキームに基づいた連合学習のフレームワークを提示した．区間検索 (range search) は配列の一部分の要素に対して最大値等を計算する処理である．区間に関する情報を秘匿する単純な方法としては配列全体を読み込むものがあるが，通信量が多くなる．この問題に対し，セグメント木というデータ構造を用いることで通信量を削減した．区間に関する情報を秘匿するために，木の子ノードをランダムに並び替える処理を行っている．これにより，単純な方法より100倍高速にしている．秘密分散以外にも，完全準同型暗号についても高速化を行った．完全準同型暗号は速度が非常に遅いため，実用的とは言えない．そこで様々なアルゴリズムの基本となる，ソートの高速化を行った．これまでは比較演算とソーティングネットワークを用いたアルゴリズムが知られているが，本研究では計数ソートを提案し，高速化を実現した．秘密分散法に基づく秘匿計算ライブラリ csclib を開発し，その Python 版を公開した．
Current Status of Research Progress	Current Status of Research Progress 2: Research has progressed on the whole more than it was originally planned. Reason 本研究のテーマである，「圧縮秘匿計算」のための基本的な技術が開発できた．計算の高速化のための最も基本的な技術として二分探索がある．これは配列中の値を検索する際に全ての値を調べるのではなく一部の値のみを参照するだけで検索を行う手法だが，このような高速化は暗号化された値に対しては行いにくい．なぜなら配列のどの要素を参照したかという情報から，検索したい値についての情報が漏れてしまうからである．そのような情報を漏らさずに配列のアクセスを実現する分散 ORAM というデータ構造があるが，既存のものは効率があまり良くなかった．本研究ではこれを改善した．さらに，文字列データベースの検索に対し，二分探索アルゴリズムを用いた検索手法を提案した．これにより，検索したい文字列の長さに比例する回数の二分探索で，文字列を検索できるようになった．これは，分散 ORAM を単純に用いる場合よりも効率が良い．また，完全準同型暗号についても高速化を行ったが，その手法は秘密分散で用いられているものに基づくものである．これは，秘密分散法と完全準同型暗号という異なる手法に基づく秘匿計算を統一したアルゴリズムの開発に繋がる成果であり，今後の発展が期待できる．
Strategy for Future Research Activity	二分探索に基づく文字列検索のアルゴリズムとデータ構造を改良し，圧縮した文字列からの検索を実現する．そのためには，単調増加数列を圧縮して格納する必要がある．ただし，圧縮後のサイズから値に関する情報が漏れないようにするためにする必要があり，そのようなデータの圧縮法を考案する．機械学習においてもプライバシーの保護が重要であり，秘匿計算が必要となる．情報を秘匿しつつ高速に学習を行うアルゴリズムの開発を行う．開発中のライブラリ csclib の機能追加を行う．
Assessment Rating	Interim Assessment Comments (Rating) A: In light of the aim of introducing the research area into the research categories, the expected progress has been made in research.

Report

(7 results)

2023 Abstract (Interim Assessment) ( PDF ) Annual Research Report Interim Assessment (Comments) ( PDF )
2022 Annual Research Report
2021 Abstract ( PDF ) Comments on the Screening Results ( PDF ) Annual Research Report

Research Products
(46 results)

All 2024 2023 2022 2021 Other

All Int'l Joint Research (2 results) Journal Article (21 results) (of which Int'l Joint Research: 4 results, Peer Reviewed: 21 results, Open Access: 11 results) Presentation (20 results) (of which Int'l Joint Research: 7 results, Invited: 3 results) Remarks (3 results)

[Int'l Joint Research] University of Copenhagen(デンマーク)
- Related Report
  2023 Annual Research Report
[Int'l Joint Research] IIT Delhi(インド)
- Related Report
  2023 Annual Research Report
[Journal Article] Constant-Round Private Decision Tree Evaluation for Secret Shared Data2024
- Author(s)
  Cheng Nan、Gupta Naman、Mitrokotsa Aikaterini、Morita Hiraku、Tozawa Kazunari
- Journal Title
  
  Proceedings on Privacy Enhancing Technologies
  
  Volume: 2024 Issue: 1 Pages: 397-412
- DOI
  10.56553/popets-2024-0023
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Synthetic Data Generation Without Real Data: Uncovering Insights in Malware Detection2024
- Author(s)
  Liu Chris、Maeda Katsuyuki、Takai Junnosuke、Murota Keisuke、Shin Kilho
- Journal Title
  
  Lecture Notes in Networks and Systems Advances in Information and Communication
  
  Volume: - Pages: 235-255
- DOI
  10.1007/978-3-031-53963-3_17
- ISBN
  9783031539626, 9783031539633
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Prediction of specific surface area of metal?organic frameworks by graph kernels2024
- Author(s)
  Morikawa Yu、Shin Kilho、Kubouchi Masataka、Ohshima Hiroaki
- Journal Title
  
  The Journal of Supercomputing
  
  Volume: ー Issue: 9 Pages: 13027-13047
- DOI
  10.1007/s11227-024-05914-3
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Privacy Preserving Range Query Using Segment Tree2024
- Author(s)
  Shirotake Shusuke、Shimizu Kana
- Journal Title
  
  Proceedings of 58th Annual Conference on Information Sciences and Systems (CISS)
  
  Volume: - Pages: 1-6
- DOI
  10.1109/ciss59072.2024.10480202
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Single-Shuffle Card-Based Protocol with Eight Cards per Gate2023
- Author(s)
  Tozawa Kazunari、Morita Hiraku、Mizuki Takaaki
- Journal Title
  
  UCNC 2023, Lecture Notes in Computer Science
  
  Volume: 14003 Pages: 171-185
- DOI
  10.1007/978-3-031-34034-5_12
- ISBN
  9783031340338, 9783031340345
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data2023
- Author(s)
  Wang Zixuan、Zhou Yi、Takagi Tatsuya、Song Jiangning、Tian Yu-Shi、Shibuya Tetsuo
- Journal Title
  
  BMC Bioinformatics
  
  Volume: 24 Issue: 1
- DOI
  10.1186/s12859-023-05267-3
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Open Access / Int'l Joint Research
[Journal Article] Unbiased Locally Private Estimator for Polynomials of Laplacian Variables2023
- Author(s)
  Hillebrand Quentin、Suppakitpaisarn Vorapong、Shibuya Tetsuo
- Journal Title
  
  Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
  
  Volume: - Pages: 741-751
- DOI
  10.1145/3580305.3599537
- Related Report
  2023 Annual Research Report
- Peer Reviewed / Int'l Joint Research
[Journal Article] Privacy-Preserving Genomic Statistical Analysis Under Local Differential Privacy2023
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  LNCS
  
  Volume: 13942 Pages: 40-48
- DOI
  10.1007/978-3-031-37586-6_3
- ISBN
  9783031375859, 9783031375866
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Privacy-Preserving Publication of GWAS Statistics using Smooth Sensitivity2023
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  Proc. PST
  
  Volume: - Pages: 1-12
- DOI
  10.1109/pst58708.2023.10320160
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] A Joint Permute-and-Flip and Its Enhancement for Large-Scale Genomic Statistical Analysis2023
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  IEEE International Conference on Data Mining Workshops
  
  Volume: - Pages: 217-226
- DOI
  10.1109/icdmw60847.2023.00034
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] The Fine-Grained Complexity of?Approximately Counting Proper Connected Colorings (Extended Abstract)2023
- Author(s)
  Barish Robert D.、Shibuya Tetsuo
- Journal Title
  
  Combinatorial Optimization and Applications
  
  Volume: - Pages: 123-136
- DOI
  10.1007/978-3-031-49614-1_8
- ISBN
  9783031496134, 9783031496141
- Related Report
  2023 Annual Research Report
- Peer Reviewed
[Journal Article] Privacy-Preserving Statistical Analysis of Genomic Data Using Compressive Mechanism with Haar Wavelet Transform2023
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  Journal of Computational Biology
  
  Volume: 30 Issue: 2 Pages: 176-188
- DOI
  10.1089/cmb.2022.0246
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Private Evaluation of a Decision Tree based on Secret Sharing2023
- Author(s)
  Mohammad Nabil Ahmed, Kana Shimizu
- Journal Title
  
  Information Security and Cryptology - ICISC 2022, 25th International Conference, ICISC 2022, Seoul, South Korea, November 30–December 2, 2022, Revised Selected Papers
  
  Volume: 13849 Pages: 186-209
- Related Report
  2022 Annual Research Report
- Peer Reviewed
[Journal Article] Privacy-Preserving Feature Selection with Fully Homomorphic Encryption2022
- Author(s)
  Ono Shinji、Takata Jun、Kataoka Masaharu、I Tomohiro、Shin Kilho、Sakamoto Hiroshi
- Journal Title
  
  Algorithms
  
  Volume: 15 Issue: 7 Pages: 229-229
- DOI
  10.3390/a15070229
- Related Report
  2022 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] A Compression-Based Multiple Subword Segmentation for Neural Machine Translation2022
- Author(s)
  Nonaka Keita、Yamanouchi Kazutaka、I Tomohiro、Okita Tsuyoshi、Shimada Kazutaka、Sakamoto Hiroshi
- Journal Title
  
  Electronics
  
  Volume: 11 Issue: 7 Pages: 1014-1014
- DOI
  10.3390/electronics11071014
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Efficient Privacy-Preserving Variable-Length Substring Match for Genome Sequence2021
- Author(s)
  Yoshiki Nakagawa, Satsuya Ohata, Kana Shimizu
- Journal Title
  
  Proceedings of WABI
  
  Volume: LIPIcs 201
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Compression of Multiple k-Mer Sets by Iterative SPSS Decomposition2021
- Author(s)
  Kazushi Kitaya and Tetsuo Shibuya
- Journal Title
  
  Proceedings of WABI
  
  Volume: LIPIcs 201
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] More practical differentially private publication of key statistics in GWAS2021
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  Bioinformatics Advances
  
  Volume: 1 Issue: 1 Pages: 1-10
- DOI
  10.1093/bioadv/vbab004
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] UDON: Unsupervised Data SelectiON for Biomedical Entity Recognition2021
- Author(s)
  Akdemir Arda、Shibuya Tetsuo
- Journal Title
  
  Proceedings of 4th International Conference on Computing and Big Data (ICCBD)
  
  Volume: ACM Pages: 1-7
- DOI
  10.1145/3507524.3507525
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Journal Article] Differentially Private Linkage Analysis with TDT ? the case of two affected children per family2021
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  Proceedings of BIBM
  
  Volume: IEEE Pages: 765-770
- DOI
  10.1109/bibm52615.2021.9669365
- Related Report
  2021 Annual Research Report
- Peer Reviewed
[Journal Article] Efficient Differentially Private Methods for a Transmission Disequilibrium Test in Genome Wide Association Studies2021
- Author(s)
  Yamamoto Akito、Shibuya Tetsuo
- Journal Title
  
  Proceedings of Pacific Symposium on Biocomputing
  
  Volume: World Scientific Pages: 85-96
- DOI
  10.1142/9789811250477_0009
- Related Report
  2021 Annual Research Report
- Peer Reviewed / Open Access
[Presentation] Function Secret Sharing を用いた秘匿全文検索2024
- Author(s)
  内山智貴，清水佳奈
- Organizer
  暗号と情報セキュリティシンポジウム（SCIS2024)
- Related Report
  2023 Annual Research Report
[Presentation] 可変長符号を用いたデータ圧縮の秘匿化2024
- Author(s)
  瀬戸友暁, 定兼邦彦, 戸澤一成
- Organizer
  暗号と情報セキュリティシンポジウム（SCIS2024)
- Related Report
  2023 Annual Research Report
[Presentation] 秘匿分割データのソートとその応用2024
- Author(s)
  戸澤一成, 諌山航太, 定兼邦彦
- Organizer
  暗号と情報セキュリティシンポジウム（SCIS2024)
- Related Report
  2023 Annual Research Report
[Presentation] ツリーベース型マルチサーバORAMのラウンド数削減とその応用2024
- Author(s)
  諌山航太, 定兼邦彦, 戸澤一成
- Organizer
  暗号と情報セキュリティシンポジウム（SCIS2024)
- Related Report
  2023 Annual Research Report
[Presentation] 完全準同型暗号における計数ソートベースのソートプロトコル2024
- Author(s)
  西村拓海, 戸澤一成, 定兼邦彦
- Organizer
  情報処理学会コンピュータセキュリティ研究会
- Related Report
  2023 Annual Research Report
[Presentation] Toward Privacy Preserving Biomedical Data Analysis2023
- Author(s)
  渋谷哲朗
- Organizer
  the 10th International Conference on Biomedical and Bioinformatics Engineering
- Related Report
  2023 Annual Research Report
- Int'l Joint Research / Invited
[Presentation] Texture Image Classification Using Earth Mover’s Dista2023
- Author(s)
  Reo Otsu, Hiroshi Sakamoto
- Organizer
  11th International Symposium on Applied Engineering and Sciences
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Privacy Preserving Feature Selection2023
- Author(s)
  Koki Wakiyama, Hiroshi Sakamoto, Tomohiro I
- Organizer
  11th International Symposium on Applied Engineering and Sciences
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] Improved fast image generation from DCT coefficients2023
- Author(s)
  Yuki Haraguchi, Hiroshi Sakamoto
- Organizer
  11th International Symposium on Applied Engineering and Sciences
- Related Report
  2023 Annual Research Report
- Int'l Joint Research
[Presentation] 生命情報を安全に活用するプライバシ保護データマイニング2023
- Author(s)
  清水佳奈
- Organizer
  バイオメディカル・ファジィ・システム学会第36回年次大会
- Related Report
  2023 Annual Research Report
- Invited
[Presentation] 決定木評価のFunction Secret Sharingプロトコルの計算量の改善および並列化2023
- Author(s)
  高寺俊喜, 定兼邦彦, 戸澤一成
- Organizer
  コンピュータセキュリティシンポジウム
- Related Report
  2023 Annual Research Report
[Presentation] 秘密分散ベース秘匿AES計算2023
- Author(s)
  森田啓, 定兼邦彦, Peter Scholl, 戸澤一成
- Organizer
  コンピュータセキュリティシンポジウム
- Related Report
  2023 Annual Research Report
[Presentation] 秘密計算における可変長符号の符号化と復号2023
- Author(s)
  瀬戸友暁, 定兼邦彦, 戸澤一成
- Organizer
  情報セキュリティ研究会
- Related Report
  2023 Annual Research Report
[Presentation] 秘匿接尾辞ソーティングとその応用2023
- Author(s)
  神保洸貴, 定兼邦彦
- Organizer
  暗号と情報セキュリティシンポジウム (SCIS)
- Related Report
  2022 Annual Research Report
[Presentation] 秘密計算基数ソートの通信量の削減2023
- Author(s)
  吉田勇輝, 定兼邦彦, 戸澤一成
- Organizer
  暗号と情報セキュリティシンポジウム (SCIS)
- Related Report
  2022 Annual Research Report
[Presentation] 秘密計算基数ソートの通信量の削減2022
- Author(s)
  吉田勇輝，定兼邦彦
- Organizer
  情報処理学会アルゴリズム研究会 AL187-7
- Related Report
  2021 Annual Research Report
[Presentation] データ圧縮の機械学習と秘匿計算への応用2022
- Author(s)
  坂本比呂志
- Organizer
  第120回人工知能基本問題研究会
- Related Report
  2021 Annual Research Report
- Invited
[Presentation] Neural machine translation using subword segmentation with probabilistic locally consistent parsing2021
- Author(s)
  Keita Nonaka, Hiroshi Sakamoto, Tomohiro I
- Organizer
  9th International Symposium on Applied Engineering and Sciences (SAES2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Evaluating the execution time of an efficient multiparty sorting protocol on TFHE2021
- Author(s)
  Jun Takata, Shinji Ono, Tomohiro I, Hiroshi Sakamoto
- Organizer
  9th International Symposium on Applied Engineering and Sciences (SAES2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Presentation] Secure computing of eigenvalues and eigenvectors using fully homomorphic encryption2021
- Author(s)
  Kanta Moriyama, Hiroshi Sakamoto
- Organizer
  9th International Symposium on Applied Engineering and Sciences (SAES2021)
- Related Report
  2021 Annual Research Report
- Int'l Joint Research
[Remarks] 圧縮秘匿計算ライブラリ csclib
- URL
  https://researchmap.jp/sada/csclib
- Related Report
  2023 Annual Research Report
[Remarks] Kunihiko Sadakane
- URL
  https://researchmap.jp/sada/
- Related Report
  2021 Annual Research Report
[Remarks] 完全準同型暗号(TFHE)のための高機能ライブラリ
- URL
  https://github.com/hiroshi-kyutech/FTHE-tool
- Related Report
  2021 Annual Research Report

Big Data Processing with Compressed Secure Computation

Principal Investigator

定兼 邦彦 東京大学, 大学院情報理工学系研究科, 教授 (20323090)

¥158,990,000 (Direct Cost: ¥122,300,000、Indirect Cost: ¥36,690,000)

Current Status of Research Progress

Reason

Interim Assessment Comments (Rating)

Report

Research Products

[Int'l Joint Research] University of Copenhagen(デンマーク)

Related Report

[Int'l Joint Research] IIT Delhi(インド)

Related Report

[Journal Article] Constant-Round Private Decision Tree Evaluation for Secret Shared Data2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Synthetic Data Generation Without Real Data: Uncovering Insights in Malware Detection2024

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Prediction of specific surface area of metal?organic frameworks by graph kernels2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Efficient Privacy Preserving Range Query Using Segment Tree2024

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Single-Shuffle Card-Based Protocol with Eight Cards per Gate2023

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Unbiased Locally Private Estimator for Polynomials of Laplacian Variables2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Privacy-Preserving Genomic Statistical Analysis Under Local Differential Privacy2023

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Privacy-Preserving Publication of GWAS Statistics using Smooth Sensitivity2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] A Joint Permute-and-Flip and Its Enhancement for Large-Scale Genomic Statistical Analysis2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] The Fine-Grained Complexity of?Approximately Counting Proper Connected Colorings (Extended Abstract)2023

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Privacy-Preserving Statistical Analysis of Genomic Data Using Compressive Mechanism with Haar Wavelet Transform2023

Author(s)

Journal Title

DOI

Related Report

[Journal Article] Private Evaluation of a Decision Tree based on Secret Sharing2023

Author(s)

Journal Title

定兼邦彦東京大学, 大学院情報理工学系研究科, 教授 (20323090)