• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2009 Fiscal Year Final Research Report

MATHEMATICAL STATISTICS FOR DATA ANALYSIS IN HIGH DIMENSION, LOW SAMPLE SIZE CONTEXT AND ITS APPLICATIONS

Research Project

  • PDF
Project/Area Number 18300092
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section一般
Research Field Statistical science
Research InstitutionUniversity of Tsukuba

Principal Investigator

AOSHIMA Makoto  University of Tsukuba, 大学院・数理物質科学研究科, 教授 (90246679)

Co-Investigator(Kenkyū-buntansha) AKAHIRA Masafumi  筑波大学, 副学長 (70017424)
KOIKE Ken-ichi  筑波大学, 大学院・数理物質科学研究科, 准教授 (90260471)
OHYAUCHI Nao  筑波大学, 大学院・数理物質科学研究科, 助教 (40375374)
TASAKI Hiroyuki  筑波大学, 大学院・数理物質科学研究科, 准教授 (30179684)
KAWAMURA Kazuhiro  筑波大学, 大学院・数理物質科学研究科, 准教授 (40204771)
TAKAHASHI Hideto  筑波大学, 大学院・人間総合科学研究科, 准教授 (80261808)
MINAMI Nariyuki  慶應義塾大学, 医学部, 教授 (10183964)
Project Period (FY) 2006 – 2009
Keywords多変量解析 / 機械学習 / パターン認識 / モデル選択 / ノイズ / 生体生命情報学 / マイクロアレイ / 高次元データ
Research Abstract

We developed the high-dimension asymptotic theory for High Dimension, Low Sample Size (HDLSS) datasets under a general setup such as non-Gaussian distributions. We found several geometric structures of HDLSS datasets. We showed that the naive PCA is inconsistent in the HDLSS context. We proposed effective inference methods called (1) the noise-reduction methodology, and (2) the cross-data-matrix methodology. By using those methodologies, we gave consistent estimation for intrinsic dimensionality, eigenvalues, their limiting distributions, PC directions and PC scores in the HDLSS context. We applied those methodologies to the discriminant analysis and the cluster analysis in HDLSS data situations from a microarray study of prostate cancer.

  • Research Products

    (15 results)

All 2010 2009 2008 2007 2006 Other

All Journal Article (6 results) (of which Peer Reviewed: 6 results) Presentation (7 results) Book (1 results) Remarks (1 results)

  • [Journal Article] Asymptotic second-order consistency for two-stage estimation methodologies and its applications2010

    • Author(s)
      Aoshima, M., Yata, K.
    • Journal Title

      Ann.Inst.Statist.Math. 62(in press)

    • Peer Reviewed
  • [Journal Article] Intrinsic dimensionality estimation of high dimension, low sample size data with d-asymptotics2010

    • Author(s)
      Yata, K., Aoshima, M.
    • Journal Title

      Commun.Statist.-Theory and Meth. 39(in press)

    • Peer Reviewed
  • [Journal Article] Sequential estimation procedures for end points of support in a non-regular distribution2010

    • Author(s)
      Koike, K.
    • Journal Title

      Commun.Statist.-Theory and Meth. 39(in press)

    • Peer Reviewed
  • [Journal Article] PCA consistency for non-Gaussian data in high dimension, low sample size context2009

    • Author(s)
      Yata, K., Aoshima, M.
    • Journal Title

      Commun.Statist.-Theory and Meth. 38

      Pages: 2634-2652

    • Peer Reviewed
  • [Journal Article] 非正則推定における情報量の概念とその役割2008

    • Author(s)
      赤平昌文
    • Journal Title

      日本統計学会誌 37

      Pages: 329-342

    • Peer Reviewed
  • [Journal Article] A Bayesian view of the Hammersley-Chapman-Robbins-type inequality2007

    • Author(s)
      Akahira, M., Ohyauchi, N.
    • Journal Title

      Statistics 41

      Pages: 137-144

    • Peer Reviewed
  • [Presentation] Eigenvalue estimation in HDLSS context and its applications2008

    • Author(s)
      Aoshima, M.
    • Organizer
      日本学術振興会日露共同研究プロジェクト研究集会
    • Place of Presentation
      広島大学
    • Year and Date
      2008-11-04
  • [Presentation] Intrinsic dimensionality estimation of high dimension, low sample size data with geometric representation2008

    • Author(s)
      Aoshima, M.
    • Organizer
      International IISA Conference, University of Connecticut
    • Place of Presentation
      Connecticut, U.S.A.
    • Year and Date
      2008-05-24
  • [Presentation] Information inequality bounds in non-regular estimation2007

    • Author(s)
      大谷内奈穂
    • Organizer
      日本数学会秋季総合分科会, 統計数学分科会
    • Place of Presentation
      東北大学
    • Year and Date
      2007-09-23
  • [Presentation] Sequential estimation of a location parameter for the location-scale family of distributions in non-regular case2007

    • Author(s)
      Koike, K.
    • Organizer
      The 56th Session of the International Statistical Institute, Lisboa Congress Centre
    • Place of Presentation
      Lisboa, Portugal
    • Year and Date
      2007-08-27
  • [Presentation] Asymptotic second-order consistency for two-stage estimation methodologies and its applications2007

    • Author(s)
      Aoshima, M.
    • Organizer
      The 56th Session of the International Statistical Institute, Lisboa Congress Centre
    • Place of Presentation
      Lisboa, Portugal
    • Year and Date
      2007-08-25
  • [Presentation] Asymptotic second-order consistency for two-stage methodologies via covariance structures2007

    • Author(s)
      Aoshima, M.
    • Organizer
      First International Workshop in Sequential Methodologies
    • Place of Presentation
      Auburn, U.S.A.
    • Year and Date
      2007-07-25
  • [Presentation] The second order large- deviation efficiency for an exponential family of distributions2006

    • Author(s)
      Akahira, M.
    • Organizer
      日本学術振興会日露共同プロジェクト研究集会
    • Place of Presentation
      広島大学
    • Year and Date
      2006-08-07
  • [Book] 知識ベース : 電子情報通信基礎2010

    • Author(s)
      青嶋誠, 他
    • Publisher
      電子情報通信学会(印刷中)
  • [Remarks] ホームページ等 : つくばリポジトリ

    • URL

      https://www.tulips.tsukuba.ac.jp/portal/tulips-r.php

URL: 

Published: 2011-06-18   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi