• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

The theory of filter based feature selection and high-performance algorithms

Research Project

Project/Area Number 26280090
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypePartial Multi-year Fund
Section一般
Research Field Intelligent informatics
Research InstitutionGakushuin University

Principal Investigator

Kuboyama Tetsuji  学習院大学, 計算機センター, 教授 (80302660)

Co-Investigator(Kenkyū-buntansha) 申 吉浩  兵庫県立大学, 応用情報科学研究科, 教授 (60523587)
チャクラボルティ バサビ  岩手県立大学, ソフトウェア情報学部, 教授 (90305293)
橋本 隆子  千葉商科大学, 商経学部, 教授 (80551697)
川前 徳章  東京電機大学, 公私立大学の部局等, 研究員 (30447031)
Project Period (FY) 2014-04-01 – 2018-03-31
Project Status Completed (Fiscal Year 2017)
Budget Amount *help
¥15,860,000 (Direct Cost: ¥12,200,000、Indirect Cost: ¥3,660,000)
Fiscal Year 2016: ¥6,110,000 (Direct Cost: ¥4,700,000、Indirect Cost: ¥1,410,000)
Fiscal Year 2015: ¥6,240,000 (Direct Cost: ¥4,800,000、Indirect Cost: ¥1,440,000)
Fiscal Year 2014: ¥3,510,000 (Direct Cost: ¥2,700,000、Indirect Cost: ¥810,000)
Keywords特徴選択 / カテゴリカルデータ / 一貫性指標 / 変数間相互作用 / 変数選択 / フィルター型 / トピック抽出 / アルゴリズム / 疎データ / 機械学習
Outline of Final Research Achievements

We focus on feature selection algorithms that extract minimal subsets of features relevant to class labels from categorical data with high dimensional feature space. Filter-based feature selection consists of two important components; consistency measures between feature sets and class labels, and search strategies for minimal feature sets . Through theoretical and empirical analysis on these two components, we designed and implemented a very fast feature selection algorithm with high accuracy and scalability. We applied this algorithm to two applications; topic extraction from tweets, and pattern acquisition from graph-structured data.

Report

(5 results)
  • 2017 Annual Research Report   Final Research Report ( PDF )
  • 2016 Annual Research Report
  • 2015 Annual Research Report
  • 2014 Annual Research Report
  • Research Products

    (44 results)

All 2018 2017 2016 2015 2014 Other

All Int'l Joint Research (4 results) Journal Article (22 results) (of which Int'l Joint Research: 4 results,  Peer Reviewed: 22 results,  Open Access: 4 results,  Acknowledgement Compliant: 7 results) Presentation (13 results) (of which Int'l Joint Research: 1 results,  Invited: 2 results) Remarks (2 results) Funded Workshop (3 results)

  • [Int'l Joint Research] Digital Humanities, UCLA/simMachines社(米国)

    • Related Report
      2017 Annual Research Report
  • [Int'l Joint Research] ヴロツワフ工科大学(ポーランド)

    • Related Report
      2017 Annual Research Report
  • [Int'l Joint Research] Wroclaw Univ. of Science and Technology(Poland)

    • Related Report
      2016 Annual Research Report
  • [Int'l Joint Research] Digital Humanities, UCLA(米国)

    • Related Report
      2016 Annual Research Report
  • [Journal Article] Nearest Neighbor Search using Sketches as Quantized Images of Dimension Reduction2018

    • Author(s)
      Higuchi Naoya、Imamura Yasunobu、Kuboyama Tetsuji、Hirata Kouichi、Shinohara Takeshi
    • Journal Title

      7th International Conference on Pattern Recognition Applications and Methods (ICPRAM)

      Volume: LNCS 10857 Pages: 356-363

    • DOI

      10.5220/0006585003560363

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] sCwc/sLcc: Highly Scalable Feature Selection Algorithms.2017

    • Author(s)
      Kilho Shin, Tetsuji Kuboyama, Takako Hashimoto and Dave Shepard
    • Journal Title

      Information

      Volume: 8 Issue: 4 Pages: 159-159

    • DOI

      10.3390/info8040159

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access / Int'l Joint Research
  • [Journal Article] 機械学習を用いたソーシャルネットワークと履歴書の照合方式の提案2017

    • Author(s)
      橋本英奈、宮崎夏美、市野将嗣、久保山哲二、 越前功、吉浦裕
    • Journal Title

      情報処理学会論文誌

      Volume: 58(12)

    • NAID

      120006763739

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Acquisition of Multiple Block Preserving Outerplanar Graph Patterns by an Evolutionary Method for Graph Pattern Sets2017

    • Author(s)
      Fumiya Tokuhara, Tetsuhiro Miyahara, Tetsuji Kuboyama, Yusuke Suzuki, Tomoyuki Uchida
    • Journal Title

      Proceedings of 2017 IEEE 10th International Workshop on Computational Intelligence and Applications (IWCIA)

      Volume: - Pages: 191-197

    • DOI

      10.1109/iwcia.2017.8203583

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Context-Aware Fitness Function Based on Feature Selection for Evolutionary Learning of Characteristic Graph Patterns2017

    • Author(s)
      Tokuhara Fumiya、Miyahara Tetsuhiro、Kuboyama Tetsuji、Suzuki Yusuke、Uchida Tomoyuki
    • Journal Title

      Proc of 9th Asian Conference on Intelligent Information and Database Systems (ACIIDS)

      Volume: LNCS 10191 Pages: 748-757

    • DOI

      10.1007/978-3-319-54472-4_70

    • ISBN
      9783319544717, 9783319544724
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Topic life cycle extraction from big Twitter data based on community detection in bipartite networks2017

    • Author(s)
      Hashimoto Takako、Okamoto Hiroshi、Kuboyama Tetsuji、Shin Kilho
    • Journal Title

      Proc. of IEEE International Conference on Big Data

      Pages: 2740-2745

    • DOI

      10.1109/bigdata.2017.8258238

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Topic Extraction on Twitter Considering Author’s Role Based on Bipartite Networks2017

    • Author(s)
      Hashimoto Takako、Kuboyama Tetsuji、Okamoto Hiroshi、Shin Kilho
    • Journal Title

      Proc. of 20th International Conference on Discovery Science (DS)

      Volume: LNCS 10558 Pages: 239-247

    • DOI

      10.1007/978-3-319-67786-6_17

    • ISBN
      9783319677859, 9783319677866
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Topic Extraction from Millions of Tweets Based on Community Detection in Bipartite Networks2017

    • Author(s)
      Takako Hashimoto, Tetsuji Kuboyama, Hiroshi Okamoto, Kilho Shin
    • Journal Title

      Proc. in Information Modelling and Knowledge Bases XXIX, 27th International Conference on Information Modelling and Knowledge Bases (EJC})

      Volume: LNCS 10558

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Multiple Alignments of Data Objects and Generalized Center Star Algorithm2017

    • Author(s)
      Kilho Shin, Tetsuji Kuboyama, Tetsuhiro Miyahara, Kenji Tanaka
    • Journal Title

      Proc. in Fuzzy Systems and Data Mining III (FSDM)

    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Improving Classification Accuracy by Means of the Sliding Window Method in Consistency-Based Feature Selection2017

    • Author(s)
      Adrian Pino Angulo, Kilho Shin
    • Journal Title

      Proc. of 20th International Conference on Discovery Science (DS)

      Volume: LNCS 10558 Pages: 155-170

    • DOI

      10.1007/978-3-319-67786-6_12

    • ISBN
      9783319677859, 9783319677866
    • Related Report
      2017 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Topic Extraction Method from Millions of Tweets Based on Fast Feature Selection Technique CWC2016

    • Author(s)
      Takako Hashimoto, Dave Shepard, Tetsuji Kuboyama, Kilho Shin
    • Journal Title

      IEEE International Conference on Data Mining Workshops

      Volume: - Pages: 724-731

    • DOI

      10.1109/icdmw.2016.0107

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Int'l Joint Research
  • [Journal Article] Fast Hilbert Sort Algorithm Without Using Hilbert Indices2016

    • Author(s)
      Yasunobu Imamura, Takeshi Shinohara, Kouichi Hirata, Tetsuji Kuboyama
    • Journal Title

      Lecture Notes in Computer Science

      Volume: Vol.9939 Pages: 259-267

    • DOI

      10.1007/978-3-319-46759-7_20

    • ISBN
      9783319467580, 9783319467597
    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Using canonical representations of block tree patterns in acquisition of characteristic block preserving outerplanar graph patterns2016

    • Author(s)
      Fumiya Tokuhara, Tetsuhiro Miyahara, Yusuke Suzuki, Tomoyuki Uchida, Tetsuji Kuboyama
    • Journal Title

      9th IEEE International Workshop on Computational Intelligence and Applications

      Volume: - Pages: 93-99

    • DOI

      10.1109/iwcia.2016.7805755

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] A Fast and Accurate Feature Selection Algorithm Based on Binary Consistency Measure2016

    • Author(s)
      Kilho Shin, Seiya Miyaza
    • Journal Title

      Computational Intelligence

      Volume: 32(4) Issue: 4 Pages: 646-667

    • DOI

      10.1111/coin.12072

    • Related Report
      2016 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Breaking Anonymity of Social Network Accounts by Using Coordinated and Extensible Classifiers Based on Machine Learning2016

    • Author(s)
      Eina Hashimoto, Masatsugu Ichino, Tetsuji Kuboyama, Isao Echizen, Hiroshi Yoshiura
    • Journal Title

      Lecture Notes in Computer Science

      Volume: 9844 Pages: 455-470

    • DOI

      10.1007/978-3-319-45234-0_41

    • ISBN
      9783319452333, 9783319452340
    • Related Report
      2016 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Topic extraction from millions of tweets using singular value decomposition and feature selection2015

    • Author(s)
      Takako Hashimoto, Tetsuji Kuboyama, Basabi Chakraborty
    • Journal Title

      Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

      Volume: IEEE Catalog No. 36228 Pages: 1145-1150

    • DOI

      10.1109/apsipa.2015.7415451

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Super-CWC and super-LCC: Super fast feature selection algorithms2015

    • Author(s)
      Kilho Shin, Tetsuji Kuboyama, Takako Hashimoto, Dave Shepard
    • Journal Title

      Proc. of IEEE International Conference on Big Data

      Volume: IEEE Cat.No. CFP15BGD-USB Pages: 1-7

    • DOI

      10.1109/bigdata.2015.7363742

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Int'l Joint Research / Acknowledgement Compliant
  • [Journal Article] Tree PCA for Extracting Dominant Substructures from Labeled Rooted Trees2015

    • Author(s)
      Tomoya Yamazaki, Akihiro Yamamoto, Tetsuji Kuboyama
    • Journal Title

      Lecture Notes in Artificial Intelligence

      Volume: 9356 Pages: 316-323

    • DOI

      10.1007/978-3-319-24282-8_27

    • ISBN
      9783319242811, 9783319242828
    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Event Detection from Millions of Tweets Related to the Great East Japan Earthquake Using Feature Selection Technique.2015

    • Author(s)
      Takako Hashimoto, Dave Shepard, Tetsuji Kuboyama, Kilho Shin
    • Journal Title

      Proc. of IEEE International Conference on Data Mining Workshop

      Volume: IEEE Comp. Soc. Ord. No. E5653 Pages: 7-12

    • DOI

      10.1109/icdmw.2015.248

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Int'l Joint Research / Acknowledgement Compliant
  • [Journal Article] A Geometric Theory of Feature Selection and Distance-Based Measures2015

    • Author(s)
      Kilho Shin, Adrian Pino Angulo
    • Journal Title

      Proc. of IJCAI

      Volume: IJCAI2015

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access
  • [Journal Article] Fast and Accurate Steepest-Descent Consistency-Constrained Algorithms for Feature Selection2015

    • Author(s)
      Adrian Pino Angulo, Kilho Shin
    • Journal Title

      Machine Learning, Optimization, and Big Data, Lecture Notes in Computer Science

      Volume: 9432 Pages: 293-305

    • DOI

      10.1007/978-3-319-27926-8_26

    • ISBN
      9783319279251, 9783319279268
    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Acknowledgement Compliant
  • [Journal Article] Real Time Recommendations from Connoisseurs2015

    • Author(s)
      Noriaki Kawamae
    • Journal Title

      Proc. of the ACM SIGKDD

      Volume: KDD'15 Pages: 537-546

    • DOI

      10.1145/2783258.2783260

    • Related Report
      2015 Annual Research Report
    • Peer Reviewed / Open Access / Acknowledgement Compliant
  • [Presentation] クラスタ構造を仮定した場合の双クラスタリングアルゴリズムの解析2017

    • Author(s)
      山浦智佳子 (共著者: 小林靖明, 山本章博, 久保山哲二)
    • Organizer
      第103回人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      湯布院公民館
    • Year and Date
      2017-03-13
    • Related Report
      2016 Annual Research Report
  • [Presentation] モジュラリティを基準とした関係データに対する特徴選択2017

    • Author(s)
      紫藤佑介 (共著者: 山本章博,小林靖明,久保山哲二)
    • Organizer
      第103回人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      湯布院公民館
    • Year and Date
      2017-03-13
    • Related Report
      2016 Annual Research Report
  • [Presentation] Polishing Big Data for Interpretable Results and Simple Algorithm Design (Panel Discussion on New Research Challenges)2017

    • Author(s)
      Tetsuji Kuboyama
    • Organizer
      10th Asian Conference on Intelligent Information and Database Systems (ACIIDS)
    • Related Report
      2017 Annual Research Report
    • Int'l Joint Research / Invited
  • [Presentation] A Novel Hybrid Feature Selection Algorithm for Intrusion Detection2016

    • Author(s)
      Adrian Pino Angulo, 申吉浩
    • Organizer
      人工知能学会 第100回人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      熊本市民会館
    • Year and Date
      2016-03-27
    • Related Report
      2015 Annual Research Report
  • [Presentation] Feature selection based identification of crucial factors for successful advertising on mobile devices2016

    • Author(s)
      Chun-Cheng Liu, Goutam Chakraborty
    • Organizer
      人工知能学会 第99回人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      仙台市湯の原ホテル
    • Year and Date
      2016-01-21
    • Related Report
      2015 Annual Research Report
  • [Presentation] Tree PCAによる任意形状の木構造を抽出するアルゴリズム2016

    • Author(s)
      山崎朋哉, 山本章博, 久保山哲二
    • Organizer
      人工知能学会 第99回人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      仙台市湯の原ホテル
    • Year and Date
      2016-01-21
    • Related Report
      2015 Annual Research Report
  • [Presentation] 距離ベースの特徴選択指標2015

    • Author(s)
      申吉浩 (共著: Angulo Adrian Pino, 久保山 哲二)
    • Organizer
      第97回 人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      別府市
    • Year and Date
      2015-03-22
    • Related Report
      2014 Annual Research Report
  • [Presentation] Evolutionary Algorithms and various Evaluation Measures for Feature Subset Selection2015

    • Author(s)
      Basabi Chakraborty
    • Organizer
      Proceedings of International Conference on Electronic Design, Computer Networks & Automated Verification EDCAV 2015
    • Place of Presentation
      Shillong, India
    • Year and Date
      2015-01-29 – 2015-01-30
    • Related Report
      2014 Annual Research Report
    • Invited
  • [Presentation] De Morgan Property of Bayes Risk as A Feature Selection Measure2014

    • Author(s)
      Tetsuji Kuboyama (共著: kilho Shin)
    • Organizer
      Workshop on Graph-Based Algorithms for Big Data and Its Application (GABA2014) 査読あり
    • Place of Presentation
      慶応大学(日吉キャンパス)
    • Year and Date
      2014-11-23
    • Related Report
      2014 Annual Research Report
  • [Presentation] Mapping Kernels for Cyclically Ordered Trees2014

    • Author(s)
      Tetsuji Kuboyama (共著: Kouichi Hirata)
    • Organizer
      Workshop on Graph-Based Algorithms for Big Data and Its Application (GABA2014) 査読あり
    • Place of Presentation
      慶応大学(日吉キャンパス)
    • Year and Date
      2014-11-23
    • Related Report
      2014 Annual Research Report
  • [Presentation] On Bundled Query Processing for High Dimensional Similarity Search2014

    • Author(s)
      Yohei Nasu (共著: Naoki Kishikawa, Kei Tashima, Shin Kodama, Yasunobu Imamura, Takeshi Shinohara, Kouichi Hirata, Tetsuji Kuboyama)
    • Organizer
      Workshop on Graph-Based Algorithms for Big Data and Its Application (GABA2014) 査読あり
    • Place of Presentation
      慶応大学(日吉キャンパス)
    • Year and Date
      2014-11-23
    • Related Report
      2014 Annual Research Report
  • [Presentation] Central Point Selection for Dimension Reduction Projection Simple-Map Using Binary Quantization2014

    • Author(s)
      Quming Jin (共著: Masaya Nakashima, Takeshi Shinohara, Kouichi Hirata, Tetsuji Kuboyama)
    • Organizer
      Workshop on Graph-Based Algorithms for Big Data and Its Application (GABA2014) 査読あり
    • Place of Presentation
      慶応大学(日吉キャンパス)
    • Year and Date
      2014-11-23
    • Related Report
      2014 Annual Research Report
  • [Presentation] 特徴選択指標について2014

    • Author(s)
      久保山哲二 (共著: 申吉浩)
    • Organizer
      第94回 人工知能基本問題研究会(SIG-FPAI)
    • Place of Presentation
      根室市
    • Year and Date
      2014-07-24
    • Related Report
      2014 Annual Research Report
  • [Remarks] sCWCの実装

    • URL

      https://github.com/tkub/scwc

    • Related Report
      2017 Annual Research Report
  • [Remarks] sCWC: very fast feature selection for nominal data

    • URL

      https://github.com/tkub/scwc

    • Related Report
      2016 Annual Research Report
  • [Funded Workshop] CU-EE MSP/IEEE Signal Processing Society Thailand Section/ IEICE Bangkok Section Seminar - Big Data analytics2016

    • Place of Presentation
      Faculty of Engineering, Chulalongkorn University
    • Year and Date
      2016-03-08
    • Related Report
      2015 Annual Research Report
  • [Funded Workshop] High Dimensional Data Summarization for Discrete Structures (Special Session in SISA2016)2016

    • Place of Presentation
      Classic Kameo Hotel & Serviced Apartments, Ayutthaya
    • Related Report
      2016 Annual Research Report
  • [Funded Workshop] Social Data Analysis Seminar2015

    • Place of Presentation
      Digital Humanities, UCLA
    • Year and Date
      2015-06-26
    • Related Report
      2015 Annual Research Report

URL: 

Published: 2014-04-04   Modified: 2022-02-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi