• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2021 Fiscal Year Annual Research Report

Efficient Query Processing for Learning-based Data Management

Research Project

Project/Area Number 19K11979
Research InstitutionOsaka University

Principal Investigator

肖 川  大阪大学, 情報科学研究科, 准教授 (10643900)

Project Period (FY) 2019-04-01 – 2022-03-31
Keywordsquery processing / ML + DB / high-dimensional data / similarity search
Outline of Annual Research Achievements

There were three major achievements in FY2021, the final year of this research project. First, we studied the problem of efficient query processing for embeddings, a fundamental operation in data science and machine learning tasks. By utilizing hierarchical graph structures, we proposed a novel indexing approach to approximate nearest neighbor search for real-valued high-dimensional data. The experimental evaluation showed that with the same query processing time constraint, the proposed approach improves recall rates by 3% - 10% when compared to existing solutions, and it requires less indexing time than existing solutions. We published our discoveries at the Proceedings of the VLDB Endowment (PVLDB), 2021. Second, we finished system prototyping and released the source codes of our software at GitHub. The released software includes the programs used in our papers published at ACM SIGMOD 2021 and PVLDB 2021. Third, we reported our discoveries in this project and gave a tutorial on querying high-dimensional data at ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2021, a top-tier conference of the data science research community.

  • Research Products

    (18 results)

All 2022 2021 Other

All Int'l Joint Research (2 results) Journal Article (2 results) (of which Int'l Joint Research: 1 results,  Peer Reviewed: 2 results,  Open Access: 2 results) Presentation (10 results) (of which Int'l Joint Research: 3 results) Remarks (4 results)

  • [Int'l Joint Research] シドニー工科大学(オーストラリア)

    • Country Name
      AUSTRALIA
    • Counterpart Institution
      シドニー工科大学
  • [Int'l Joint Research] 香港科技大学/深セン大学/深セン計算科学研究院(中国)

    • Country Name
      CHINA
    • Counterpart Institution
      香港科技大学/深セン大学/深セン計算科学研究院
  • [Journal Article] HSGAN: Reducing mode collapse in GANs by the latent code distance of homogeneous samples2022

    • Author(s)
      Simin Yu, Kuntian Zhang, Chuan Xiao, Joshua Zhexue Huang, Mark Junjie Li, Makoto Onizuka
    • Journal Title

      Computer Vision and Image Understanding

      Volume: 214 Pages: 103314~103314

    • DOI

      10.1016/j.cviu.2021.103314

    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] HVS: Hierarchical Graph Structure Based on Voronoi Diagrams for Solving Approximate Nearest Neighbor Search2021

    • Author(s)
      Kejing Lu, Mineichi Kudo, Chuan Xiao, Yoshiharu Ishikawa
    • Journal Title

      Proceedings of the VLDB Endowment

      Volume: 15 Pages: 246~258

    • DOI

      10.14778/3489496.3489506

    • Peer Reviewed / Open Access
  • [Presentation] JupySim: Jupyter Notebook Similarity Search System2022

    • Author(s)
      Misato Horiuchi, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
    • Organizer
      International Conference on Extending Database Technology (EDBT)
    • Int'l Joint Research
  • [Presentation] 深層生成モデルを用いた編集を意識した分子グラフ補完2022

    • Author(s)
      胡晟, 瀧川一学, 肖川
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] 学習型索引を用いた時系列データ検索の高速化2022

    • Author(s)
      松本和人, 肖川, 鬼塚真
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] Attention GANを用いたテーブルデータの欠測値補完2022

    • Author(s)
      河越淳, 董于洋, 野澤拓磨, 肖川
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] 結合カーディナリティ推定の中間結果を利用した結合順最適化2022

    • Author(s)
      川本孝太朗, 伊藤竜一, 佐々木勇和, 肖川, 鬼塚真
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] 統合型データベースにおける適応的2相ロックに基づく分散トランザクション制御2022

    • Author(s)
      三宅康太, 佐々木勇和, 肖川, 鬼塚真
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] モデル構造の自動チューニングを用いたパーソナライズド連合学習手法2022

    • Author(s)
      松田光司, 佐々木勇和, 肖川, 鬼塚真
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] 機械学習によるトランザクション処理性能の網羅的な評価2022

    • Author(s)
      池田悠人, 三宅康太, 肖川, 鬼塚真
    • Organizer
      第14回データ工学と情報マネジメントに関するフォーラム (DEIM)
  • [Presentation] High-Dimensional Similarity Query Processing for Data Science2021

    • Author(s)
      Jianbin Qin, Wei Wang, Chuan Xiao, Ying Zhang, Yaoshu Wang
    • Organizer
      ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)
    • Int'l Joint Research
  • [Presentation] BTGAN: Training GAN with Balanced Triplet Loss and Two-Branch Architecture2021

    • Author(s)
      Simin Yu, Kuntian Zhang, Chuan Xiao, Xianyu Bao, Joshua Zhexue Huang, Mark Junjie Li
    • Organizer
      International Joint Conference on Neural Networks (IJCNN)
    • Int'l Joint Research
  • [Remarks] 大阪大学 ビッグデータ工学講座 鬼塚研究室

    • URL

      http://www-bigdata.ist.osaka-u.ac.jp/ja/paper/

  • [Remarks] 名古屋大学 情報学研究科 データベース研究室(石川研究室)

    • URL

      https://www.db.is.i.nagoya-u.ac.jp/ja/research/publications/

  • [Remarks] Chuan Xiaoのホームページ

    • URL

      https://sites.google.com/site/chuanxiao1983/publication

  • [Remarks] Chuan XiaoのDBLPページ

    • URL

      https://dblp.org/pid/57/4384-1.html

URL: 

Published: 2022-12-28  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi