Optimal Construction of Compression-based Feature Space

Research Project

Project/Area Number	15K00148
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	一般
Research Field	Multimedia database
Research Institution	The University of Electro-Communications
Principal Investigator	Koga Hisashi 電気通信大学, 大学院情報理工学研究科, 准教授 (40361836)
Project Period (FY)	2015-04-01 – 2019-03-31
Project Status	Completed (Fiscal Year 2018)
Budget Amount *help	¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000) Fiscal Year 2017: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000) Fiscal Year 2016: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000) Fiscal Year 2015: ¥1,040,000 (Direct Cost: ¥800,000、Indirect Cost: ¥240,000)
Keywords	圧縮ベースパターン認識 / 圧縮辞書 / トライ / 特徴空間 / データ圧縮 / 時系列データ / 辞書間距離 / 圧縮性特徴 / 集合間類似度 / 圧縮特徴空間 / 再圧縮率 / パターン認識 / 軸の独立性 / 集合間類似検索
Outline of Final Research Achievements	Compression based pattern recognition is an unsupervised data analysis technique which realizes data analysis without prior knowledge about the data to be analyzed. Its primary point is to measure the similarity between two data based on the compression rate. In particular, in order to exploit the standard pattern recognition algorithms such as SVM and k-means, this research deals with compression-based feature spaces in which an object is represented as a compression vector consisting of multiple compression ratios and studies their effective construction. As the main result, by exchanging the words among the compression dictionaries each of which is responsible for one dimension so that they may be more independent one another, we succeeded in improving the pattern recognition accuracy by 7 to 8% as compared with the previous method in literatures.
Academic Significance and Societal Importance of the Research Achievements	近年、人工知能が大流行しているが、学習データを用意する手間が大きいのが課題である。とくにIoTの時代となり、これまで分析対象とされなかった新種のデータを分析する必要に迫られているが、そのようなデータはそもそも性質が不明なので学習データを用意するのが難しい。圧縮ベースパターン認識は非教示でデータ分析をする技術であり、上記のような性質が不明なデータを分析するのが得意であり、学習データを構築するための要素技術として重要である。本研究はそのパターン認識やクラスタリング（データ分類）の精度改善に貢献した。

Report

(5 results)

2018 Annual Research Report Final Research Report ( PDF )
2017 Research-status Report
2016 Research-status Report
2015 Research-status Report

Research Products
(11 results)

All 2018 2017 2016

All Journal Article (6 results) (of which Peer Reviewed: 3 results, Acknowledgement Compliant: 1 results) Presentation (5 results)

[Journal Article] ユークリッド距離に基づく多観点非類似度とその分割最適化クラスタリングへの応用2018
- Author(s)
  藤原勇二，古賀久志，戸田貴久
- Journal Title
  
  人工知能学会研究資料
  
  Volume: SIG-FPAI-B509 Pages: 51-56
- NAID
  130008061396
- Related Report
  2017 Research-status Report
[Journal Article] 集合間類似度を用いたストリームデータのtop-k類似検索に対する高速な厳密解アルゴリズム2017
- Author(s)
  山崎智博，古賀久志，戸田貴久
- Journal Title
  
  信学技報
  
  Volume: 117(28) Pages: 1-8
- Related Report
  2017 Research-status Report
[Journal Article] 多観点類似度を用いた凝集型階層クラスタリング2017
- Author(s)
  藤原勇二, 古賀久志, 戸田貴久
- Journal Title
  
  情報科学技術フォーラム(FIT2017)
  
  Volume: 第2分冊 Pages: 121-124
- NAID
  170000184442
- Related Report
  2017 Research-status Report
[Journal Article] Fast Exact Algorithm to Solve Continuous Similarity Search for Evolving Queries2017
- Author(s)
  T. Yamazaki, H. Koga and T.Toda
- Journal Title
  
  springer LNCS
  
  Volume: 10648 Pages: 84-96
- DOI
  10.1007/978-3-319-70145-5_7
- ISBN
  9783319701448, 9783319701455
- Related Report
  2017 Research-status Report
- Peer Reviewed
[Journal Article] Improved Compression-Based Pattern Recognition Exploiting New Useful Features2017
- Author(s)
  T. Uchino, H. Koga, T. Toda
- Journal Title
  
  springer LNCS
  
  Volume: 10255 Pages: 363-371
- DOI
  10.1007/978-3-319-58838-4_40
- ISBN
  9783319588377, 9783319588384
- Related Report
  2017 Research-status Report
- Peer Reviewed
[Journal Article] Effective Construction of Compression-based Feature Space2016
- Author(s)
  H. Koga, Y. Nakajima and T. Toda
- Journal Title
  
  Proc. International Symposium on Information Theory and Its Applications (ISITA2016)
  
  Volume: - Pages: 116-120
- Related Report
  2016 Research-status Report
- Peer Reviewed / Acknowledgement Compliant
[Presentation] 品質パラメータの学習による動画像圧縮技術に基づく時系列分類手法の改良2018
- Author(s)
  村井建応，古賀久志，戸田貴久
- Organizer
  第39回情報理論とその応用シンポジウム
- Related Report
  2018 Annual Research Report
[Presentation] 圧縮性基底辞書の単語数均等化による特徴空間の改善2018
- Author(s)
  大内晶太，古賀久志
- Organizer
  第39回情報理論とその応用シンポジウム
- Related Report
  2018 Annual Research Report
[Presentation] 共通要素を類似度とするハッシュベース集合間類似検索手法の改善2018
- Author(s)
  鈴木聡，古賀久志，Gibran FUENTES PINEDA，戸田貴久
- Organizer
  第10回データ工学と情報マネジメントに関するフォーラム(DEIM2018)
- Related Report
  2017 Research-status Report
[Presentation] 圧縮ベースパターン認識に有用な新しい特徴量の抽出2016
- Author(s)
  内野太智，古賀久志，戸田貴久
- Organizer
  第39回情報理論とその応用シンポジウム(SITA2016)
- Place of Presentation
  高山グリーンホテル（岐阜県高山市）
- Year and Date
  2016-12-16
- Related Report
  2016 Research-status Report
[Presentation] 共通要素数を重視したハッシュベース集合間類似検索2016
- Author(s)
  板橋大樹，古賀久志，Gibran Fuentes Pineda GIBRAN，戸田貴久
- Organizer
  第8回データ工学と情報マネジメントに関するフォーラム(DEIM2016)
- Place of Presentation
  ヒルトン福岡シーホーク
- Year and Date
  2016-03-02
- Related Report
  2015 Research-status Report

Optimal Construction of Compression-based Feature Space

Principal Investigator

Koga Hisashi 電気通信大学, 大学院情報理工学研究科, 准教授 (40361836)

¥2,730,000 (Direct Cost: ¥2,100,000、Indirect Cost: ¥630,000)

Report

Research Products

[Journal Article] ユークリッド距離に基づく多観点非類似度とその分割最適化クラスタリングへの応用2018

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 集合間類似度を用いたストリームデータのtop-k類似検索に対する高速な厳密解アルゴリズム2017

Author(s)

Journal Title

Related Report

[Journal Article] 多観点類似度を用いた凝集型階層クラスタリング2017

Author(s)

Journal Title

NAID

Related Report

[Journal Article] Fast Exact Algorithm to Solve Continuous Similarity Search for Evolving Queries2017

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Improved Compression-Based Pattern Recognition Exploiting New Useful Features2017

Author(s)

Journal Title

DOI

ISBN

Related Report

[Journal Article] Effective Construction of Compression-based Feature Space2016

Author(s)

Journal Title

Related Report

[Presentation] 品質パラメータの学習による動画像圧縮技術に基づく時系列分類手法の改良2018

Author(s)

Organizer

Related Report

[Presentation] 圧縮性基底辞書の単語数均等化による特徴空間の改善2018

Author(s)

Organizer

Related Report

[Presentation] 共通要素を類似度とするハッシュベース集合間類似検索手法の改善2018

Author(s)

Organizer

Related Report

[Presentation] 圧縮ベースパターン認識に有用な新しい特徴量の抽出2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report

[Presentation] 共通要素数を重視したハッシュベース集合間類似検索2016

Author(s)

Organizer

Place of Presentation

Year and Date

Related Report