• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

A STUDY OF FEATURE (ATTRIBUTE) SELECTION IN DATA MINING

Research Project

Project/Area Number 12680398
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionTokyo Denki University

Principal Investigator

MANABU Ichino  Tokyo Denki University, Department of Information and Arts, Professor., 理工学部, 教授 (40057245)

Project Period (FY) 2000 – 2001
Project Status Completed (Fiscal Year 2001)
Budget Amount *help
¥1,600,000 (Direct Cost: ¥1,600,000)
Fiscal Year 2001: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 2000: ¥1,100,000 (Direct Cost: ¥1,100,000)
Keywordspattern recognition / data mining / feature selection / neighborhood graph / feature evaluation / geometrical thickness / analysis of variance / Calhoun correlation coefficient / 相関係数 / 0-1整数計画 / 手書き文字認識 / 分離能力 / 記述の一般性
Research Abstract

The purpose of this research is to develop some methods of feature (attribute) selection in data mining. We report the results for feature selection in classification problems. Then, we report a new correlation coefficient which is applicable to various nonlinear relationships between feature variables.
1) Feature selection for classification problems When we have only a finite
number of samples, the classification performance may not be improved by the addition of new features used to describe samples. This means that we have to strike a balance between the interclass distinguishability and the generality of class descriptions. We introduced two graphs: the generality or dered mutual neighborhood graph and the generality ordered interclass mutual neighborhood graph, then we dev eloped a feature selection algorithm based on the modified zero-one integer programmirig and it's simplified algorithm.
2) Generalized correlation coefficient
Pearson's correlation coefficient is useful to detect causality between feature variables. However, this well known tool is not applicable to general nonlinear causal relations. If two feature variables follow to a functional structure, the sample distribution with respect to the feature variables has a geometrically thin structure. We developed a generalized correlation coefficient, called the Calhoun correlation coefficient. This new measure are able to evaluate various nonlinear functional relations and other geometrically this structures.

Report

(3 results)
  • 2001 Annual Research Report   Final Research Report Summary
  • 2000 Annual Research Report
  • Research Products

    (13 results)

All Other

All Publications (13 results)

  • [Publications] M.Ichino, H.Yaguchi: "An apparent simplicity appearing in pattern classification problems"Pattern Recognition. 33・9. 1567-1474 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 市野, 矢口, 野中: "幾何学的厚みに基づく相関係数"電子情報通信学会論文誌A. (採録). (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino: "Neighborhood Graphs in Classification Problems for Symbolic Data"Journal of the Japanese Society of Computational Statistics(accepted). (2002)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Manabu Ichino: "Neighborhood graphs in classification problems for symbolic data"Proceedings, International Conference on New Trends in Computational Statistics with Biomedical Applications. 191-202 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] Manabu Ichino: "A region-based fuzzy pattern classifier for symbolic data"18^<th> Research Report of Japanese Classification Society. 1-16 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino and H.Yaguchi: "An apparent simplicity appearing in pattern classification problems"Pattern recognition. 33-9. 1467-1474 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino, H.Yaguchi, and T.Nonaka: "Correlation coefficient based on gemetrical thikness"Trans. IEICE(A). (in press).

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino: "Neighborhodd graphs in classification problems for symbolic data"Journal of the Japanese Sociey of Computational Statistics. (accepted).

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino: "Neighborhood graphs in classification problems for symbolic data"Proceedings, International Conference on New trends in Computational Statistics with Biomedical Applications. 191-202 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] M.Ichino: "A region-based fuzzy pattern clasifier for symbolic data"18-th Research Report of Japanese Classification Society. 1-16 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 市野, 矢口, 野中: "幾何学的厚みに基づく相関係数"電子情報通信学会論文誌A 採録. (2002)

    • Related Report
      2001 Annual Research Report
  • [Publications] M. Ichino: "Neighborhood Graphs in Classification Problems for Symbolic Data"Journal of the Japanese society of Computational Statistics (accepted).

    • Related Report
      2001 Annual Research Report
  • [Publications] Manabu Ichino and Hiroyuki Yaguchi: "An apparent simplicity appearing in pattern classification problems."Pattern Recognition. 33・9. 1467-1474 (2000)

    • Related Report
      2000 Annual Research Report

URL: 

Published: 2000-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi