• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2001 Fiscal Year Final Research Report Summary

A STUDY OF FEATURE (ATTRIBUTE) SELECTION IN DATA MINING

Research Project

Project/Area Number 12680398
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionTokyo Denki University

Principal Investigator

MANABU Ichino  Tokyo Denki University, Department of Information and Arts, Professor., 理工学部, 教授 (40057245)

Project Period (FY) 2000 – 2001
Keywordspattern recognition / data mining / feature selection / neighborhood graph / feature evaluation / geometrical thickness / analysis of variance / Calhoun correlation coefficient
Research Abstract

The purpose of this research is to develop some methods of feature (attribute) selection in data mining. We report the results for feature selection in classification problems. Then, we report a new correlation coefficient which is applicable to various nonlinear relationships between feature variables.
1) Feature selection for classification problems When we have only a finite
number of samples, the classification performance may not be improved by the addition of new features used to describe samples. This means that we have to strike a balance between the interclass distinguishability and the generality of class descriptions. We introduced two graphs: the generality or dered mutual neighborhood graph and the generality ordered interclass mutual neighborhood graph, then we dev eloped a feature selection algorithm based on the modified zero-one integer programmirig and it's simplified algorithm.
2) Generalized correlation coefficient
Pearson's correlation coefficient is useful to detect causality between feature variables. However, this well known tool is not applicable to general nonlinear causal relations. If two feature variables follow to a functional structure, the sample distribution with respect to the feature variables has a geometrically thin structure. We developed a generalized correlation coefficient, called the Calhoun correlation coefficient. This new measure are able to evaluate various nonlinear functional relations and other geometrically this structures.

  • Research Products

    (10 results)

All Other

All Publications (10 results)

  • [Publications] M.Ichino, H.Yaguchi: "An apparent simplicity appearing in pattern classification problems"Pattern Recognition. 33・9. 1567-1474 (2000)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] 市野, 矢口, 野中: "幾何学的厚みに基づく相関係数"電子情報通信学会論文誌A. (採録). (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] M.Ichino: "Neighborhood Graphs in Classification Problems for Symbolic Data"Journal of the Japanese Society of Computational Statistics(accepted). (2002)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Manabu Ichino: "Neighborhood graphs in classification problems for symbolic data"Proceedings, International Conference on New Trends in Computational Statistics with Biomedical Applications. 191-202 (2001)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] Manabu Ichino: "A region-based fuzzy pattern classifier for symbolic data"18^<th> Research Report of Japanese Classification Society. 1-16 (2001)

    • Description
      「研究成果報告書概要(和文)」より
  • [Publications] M.Ichino and H.Yaguchi: "An apparent simplicity appearing in pattern classification problems"Pattern recognition. 33-9. 1467-1474 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Ichino, H.Yaguchi, and T.Nonaka: "Correlation coefficient based on gemetrical thikness"Trans. IEICE(A). (in press).

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Ichino: "Neighborhodd graphs in classification problems for symbolic data"Journal of the Japanese Sociey of Computational Statistics. (accepted).

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Ichino: "Neighborhood graphs in classification problems for symbolic data"Proceedings, International Conference on New trends in Computational Statistics with Biomedical Applications. 191-202 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
  • [Publications] M.Ichino: "A region-based fuzzy pattern clasifier for symbolic data"18-th Research Report of Japanese Classification Society. 1-16 (2001)

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2003-09-17  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi