High-Dimension Low-Sample-Size Big Data Analysis by Higher Order Metrics

Research Project

Project/Area Number 17K00043
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Multi-year Fund
Section General
Research Field Statistical science
Research Institution University of Tsukuba

Principal Investigator

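Sato-Ilic Mika  University of Tsukuba, Faculty of Engineering, Information and Systems, Professor (60269214)

Co-Investigator (Kenkyū-buntansha) 青嶋 誠  University of Tsukuba, Faculty of Pure and Applied Sciences, Professor (90246679)
清水 信夫  The Institute of Statistical Mathematics, Department of Data Science, Assistant Professor (00332130)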
Project Period (FY) 2017-04-01 – 2020-03-31
Project Status Completed (Fiscal Year 2019)
Budget Amount
¥4,550,000 (Direct Cost: ¥3,500,000, Indirect Cost: ¥1,050,000)
Fiscal Year 2019: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2018: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
Fiscal Year 2017: ¥1,950,000 (Direct Cost: ¥1,500,000, Indirect Cost: ¥450,000)
Keywords Classification / Big Data / Scaling / Fuzzy Clustering / Scales of Clusters / Fuzzy Clustering Models / Regression Analysis / HDLSS Data / Data Fusion / Categorical Data / Geometrical Data / Big Data Analysis / Statistical Science / Higher Order Metrics / Clustering
Outline of Final Research Achievements

It has been shown theoretically that conventional statistical methods cannot be applied to high-dimension, low-sample-size (HDLSS) big data. In this research, we therefore developed a metric, defined in a common metric space, that can measure multiple datasets at once, and built a cluster metric model on it. We then evaluated the performance of the developed method, applied it to a range of datasets, and assessed its applicability.

Academic Significance and Societal Importance of the Research Achievements

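In general, when multiple high-dimension, low-sample-size (HDLSS) datasets are obtained, they together form typical HDLSS big data. We worked on developing new methods for analyzing such multiple datasets simultaneously. Specifically, we developed a metric that uses the clusters obtained in common across the datasets as a common scale, together with models that use this metric. This made it possible to explain the dynamic variation of high-dimensional data in a lower-dimensional space. We also developed a model that uses projection onto a common vector subspace to make various data structures comparable and reducible to a low-dimensional space. We showed that this method is also effective for fusing heterogeneous data, which is regarded as a problem in big data analysis.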

Report

(4 results)
  • 2019 Annual Research Report   Final Research Report ( PDF )
  • 2018 Research-status Report
  • 2017 Research-status Report
  • Research Products

    (37 results)

Journal Article (12 results) (of which Int'l Joint Research: 1 result, Peer Reviewed: 9 results, Open Access: 8 results)   Presentation (23 results) (of which Int'l Joint Research: 17 results, Invited: 11 results)   Book (2 results)

  • [Journal Article] Probabilistic Metric Based Multidimensional Scaling   2020

    • Author(s)
      Mika Sato-Ilic
    • Journal Title

      Procedia Computer Science

      Volume: 168 Pages: 65-72

    • DOI

      10.1016/j.procs.2020.02.258

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Quantification and Visualization for Difference of Fuzzy Clustering Results   2019

    • Author(s)
      Mika Sato-Ilic
    • Journal Title

      The 2019 IEEE International Conference on Fuzzy Systems

      Volume: 1 Pages: 1-6

    • Related Report
      2019 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Evaluation of Fuzzy Clustering for High-Dimensional Data by Principal Component Analysis   2019

    • Author(s)
      村山喬則,佐藤美佳
    • Journal Title

      Proceedings of the 35th Fuzzy System Symposium

      Volume: 1 Pages: 203-208

    • NAID

      130007772942

    • Related Report
      2019 Annual Research Report
    • Open Access
  • [Journal Article] Homogeneous Cluster Analysis   2018

    • Author(s)
      M. Sato-Ilic
    • Journal Title

      Procedia Computer Science, Elsevier

      Volume: 140 Pages: 269-275

    • DOI

      10.1016/j.procs.2018.10.320

    • Related Report
      2018 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Cluster-Scaled Regression Analysis for High-Dimension and Low-Sample Size Data   2018

    • Author(s)
      M. Sato-Ilic
    • Journal Title

      Advances in Smart Systems Research

      Volume: 7 Pages: 1-10

    • Related Report
      2018 Research-status Report
    • Peer Reviewed
  • [Journal Article] Multilayer Clustering Based on T-Norms for High-Dimension Low-Sample-Size Data   2018

    • Author(s)
      伊藤佳輝, 元田卓, 佐藤美佳
    • Journal Title

      Proceedings of the 34th Fuzzy System Symposium

      Volume: - Pages: 480-485

    • Related Report
      2018 Research-status Report
  • [Journal Article] Two-sample tests for high-dimension, strongly spiked eigenvalue models   2018

    • Author(s)
      Aoshima, M., Yata, K.
    • Journal Title

      Statistica Sinica

      Volume: 28 Pages: 43-62

    • DOI

      10.5705/ss.202016.0063

    • Related Report
      2018 Research-status Report 2017 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] A Survey of High Dimension Low Sample Size Asymptotics   2018

    • Author(s)
      M. Aoshima, D. Shen, H. Shen, K. Yata, Y. Zhou, J.S. Marron
    • Journal Title

      Special Issue in Honour of Peter Gavin Hall, Australian & New Zealand Journal of Statistics

      Volume: 60 Issue: 1 Pages: 4-19

    • DOI

      10.1111/anzs.12212

    • Related Report
      2018 Research-status Report 2017 Research-status Report
    • Peer Reviewed / Open Access / Int'l Joint Research
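  • [Journal Article] Dissimilarity for Aggregated Symbolic Data Using Chi-Squared Statistics and Its Application to Real Estate Information Data   2018

    • Author(s)
      清水信夫, 中野純司, 山本由和
    • Journal Title

      Proceedings of the Institute of Statistical Mathematics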

      Volume: 66 Pages: 279-294

    • Related Report
      2018 Research-status Report
    • Peer Reviewed
  • [Journal Article] Knowledge-based Comparable Predicted Values in Regression Analysis   2017

    • Author(s)
      M. Sato-Ilic
    • Journal Title

      Procedia Computer Science, Elsevier

      Volume: 114 Pages: 216-223

    • DOI

      10.1016/j.procs.2017.09.063

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] Identification and Scaling Methods based on Comparative Quantification for Dissimilarity Data   2017

    • Author(s)
      M. Sato-Ilic, P. Ilic
    • Journal Title

      The 2017 IEEE International Conference on Fuzzy Systems

      Volume: - Pages: 1-6

    • DOI

      10.1109/fuzz-ieee.2017.8015443

    • Related Report
      2017 Research-status Report
    • Peer Reviewed / Open Access
  • [Journal Article] A Simultaneous Fuzzy Clustering Method for Three-Way Multi-Source Data   2017

    • Author(s)
      矢吹健二、佐藤美佳
    • Journal Title

      Proceedings of the 33rd Fuzzy System Symposium

      Volume: - Pages: 441-446

    • NAID

      130007612747

    • Related Report
      2017 Research-status Report
    • Open Access
  • [Presentation] Statistical Data Science at A Crossroads (Keynote)   2020

    • Author(s)
      Mika Sato-Ilic
    • Organizer
      KES-Intelligent Decision Technologies, Smart Innovation, Systems and Technologies
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Probabilistic Metric Based Multidimensional Scaling   2019

    • Author(s)
      Mika Sato-Ilic
    • Organizer
      Complex Adaptive Systems 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research
  • [Presentation] Quantification and Visualization for Difference of Fuzzy Clustering Results   2019

    • Author(s)
      Mika Sato-Ilic
    • Organizer
      The 2019 IEEE International Conference on Fuzzy Systems
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Fuzzy clustering-based non-linear dimensionality reduction   2019

    • Author(s)
      Mika Sato-Ilic
    • Organizer
      13th CFE 2019 and 12th CMStatistics 2019
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Statistical Data Science Based on Soft Computing   2019

    • Author(s)
      Mika Sato-Ilic
    • Organizer
      MIT-Tsukuba Joint-Workshop on Data Systems Science towards Social and Business Innovations
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Evaluation of Fuzzy Clustering for High-Dimensional Data by Principal Component Analysis   2019

    • Author(s)
      村山喬則,佐藤美佳
    • Organizer
      The 35th Fuzzy System Symposium
    • Related Report
      2019 Annual Research Report
  • [Presentation] High-Dimensional Statistical Analysis: Non-Sparsity, Strongly Spiked Noise and HDLSS (Keynote)   2019

    • Author(s)
      Makoto Aoshima
    • Organizer
      The 7th International Workshop in Sequential Methodologies
    • Related Report
      2019 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] Measures of Correlation between Variables in Aggregated Symbolic Data   2019

    • Author(s)
      清水信夫, 中野純司, 山本由和
    • Organizer
      Japanese Joint Statistical Meeting 2019
    • Related Report
      2019 Annual Research Report
  • [Presentation] Cluster-Scaled Intelligent Data Analysis (Keynote)   2018

    • Author(s)
      M. Sato-Ilic
    • Organizer
      3rd International Conference on Smart Computing & Informatics
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Soft Clustering-based Models   2018

    • Author(s)
      M. Sato-Ilic
    • Organizer
      23rd International Conference on Computational Statistics (COMPSTAT 2018)
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Homogeneous Cluster Analysis   2018

    • Author(s)
      M. Sato-Ilic
    • Organizer
      Complex Adaptive Systems 2018
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Cluster-Scaled Regression Analysis for High-Dimension and Low-Sample Size Data   2018

    • Author(s)
      M. Sato-Ilic
    • Organizer
      Knowledge-Based and Intelligent Information & Engineering Systems - Intelligent Decision Technologies
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Multilayer Clustering Based on T-Norms for High-Dimension Low-Sample-Size Data   2018

    • Author(s)
      伊藤佳輝, 元田卓, 佐藤美佳
    • Organizer
      The 34th Fuzzy System Symposium
    • Related Report
      2018 Research-status Report
  • [Presentation] Anomaly Detection Method Based on Classification Structure   2018

    • Author(s)
      小林大悟, 佐藤美佳
    • Organizer
      Japanese Joint Statistical Meeting 2018
    • Related Report
      2018 Research-status Report
  • [Presentation] Dissimilarity between aggregated symbolic data using chi-squared statistics   2018

    • Author(s)
      N. Shimizu, J. Nakano, Y. Yamamoto
    • Organizer
      2018 Workshop in Symbolic Data Analysis
    • Related Report
      2018 Research-status Report
    • Int'l Joint Research
  • [Presentation] Modeling New Complex Data Structures (Keynote)   2017

    • Author(s)
      M. Sato-Ilic
    • Organizer
      Complex Adaptive Systems 2017
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Knowledge-based Comparable Predicted Values in Regression Analysis   2017

    • Author(s)
      M. Sato-Ilic
    • Organizer
      Complex Adaptive Systems 2017
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Cluster Identification and Scaling Methods based on Comparative Quantification for Dissimilarity Data   2017

    • Author(s)
      M. Sato-Ilic, P. Ilic
    • Organizer
      The 2017 IEEE International Conference on Fuzzy Systems
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] A Fuzzy Clustering based Data Fusion Method   2017

    • Author(s)
      M. Sato-Ilic
    • Organizer
      11th International Conference on Computational and Financial Econometrics and 10th International Conference of the ERCIM Working Group on Computational and Methodological Statistics
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research / Invited
  • [Presentation] Cluster-Scaled Forecasting Method For High-Dimension Low-Sample Size Data   2017

    • Author(s)
      M. Sato-Ilic
    • Organizer
      The 6th Japanese-German Symposium on Classification
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research
  • [Presentation] Asymmetric Clustering Methods based on Orthogonal Projector to the Intersection of Subspaces   2017

    • Author(s)
      M. Sato-Ilic
    • Organizer
      Japanese Joint Statistical Meeting 2017
    • Related Report
      2017 Research-status Report
  • [Presentation] A Simultaneous Fuzzy Clustering Method for Three-Way Multi-Source Data   2017

    • Author(s)
      矢吹健二、佐藤美佳
    • Organizer
      The 33rd Fuzzy System Symposium
    • Related Report
      2017 Research-status Report
  • [Presentation] Dissimilarities between Groups of Data   2017

    • Author(s)
      N. Shimizu, J. Nakano, Y. Yamamoto
    • Organizer
      New Zealand Statistical Association and the International Association of Statistical Computing 2017
    • Related Report
      2017 Research-status Report
    • Int'l Joint Research / Invited
  • [Book] Fuzzy Clustering Models and Their Related Concepts (Chapter 1), Fuzzy Approaches for Soft Computing and Approximate Reasoning: Theories and Applications   2020

    • Author(s)
      Mika Sato-Ilic
    • Publisher
      Springer, Switzerland
    • Related Report
      2019 Annual Research Report
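  • [Book] The Rationale of Service Science: Toward the Fusion of Data Science and Mathematical Sciences (サービスサイエンスの事訳―データサイエンスと数理科学の融合に向けてー)   2017

    • Author(s)
      イリチュ美佳、高木英明
    • Total Pages
      51
    • Publisher
      University of Tsukuba Press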
    • ISBN
      9784904074459
    • Related Report
      2017 Research-status Report

Published: 2017-04-28   Modified: 2021-02-19  