
Studies on Advanced Pattern Matching over Continuous Data Streams

Research Project

Project/Area Number 20700001
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation Type Single-year Grants
Research Field Fundamental theory of informatics
Research Institution Hokkaido University

Principal Investigator

KIDA Takuya  Hokkaido University, Graduate School of Information Science and Technology, Associate Professor (70343316)

Project Period (FY) 2008 – 2010
Project Status Completed (Fiscal Year 2010)
Budget Amount
¥4,290,000 (Direct Cost: ¥3,300,000, Indirect Cost: ¥990,000)
Fiscal Year 2010: ¥1,040,000 (Direct Cost: ¥800,000, Indirect Cost: ¥240,000)
Fiscal Year 2009: ¥1,690,000 (Direct Cost: ¥1,300,000, Indirect Cost: ¥390,000)
Fiscal Year 2008: ¥1,560,000 (Direct Cost: ¥1,200,000, Indirect Cost: ¥360,000)
Keywords Algorithm theory / Data streams / Pattern matching / Information retrieval / String processing / Data compression / Suffix trees / VF codes
Research Abstract

I studied fast, advanced pattern matching over continuous data streams, together with compression techniques that support it. For the former, I proposed a pattern matching algorithm named BPS, which is based on bit-parallel techniques and accepts complex queries over multi-dimensional data streams. With this algorithm, data streams can be searched with queries that combine numerical and categorical data as well as text. For the latter, I developed a novel data compression method named STVF coding, which is based on VF (variable-length-to-fixed-length) coding and is well suited to pattern matching. The method permits simple and fast keyword search on the compressed data while achieving compression ratios comparable to those of existing well-known compression methods.
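As a rough illustration of what "bit-parallel" means here, the sketch below implements the classic Shift-And matcher over a stream of symbols. It is not the BPS algorithm from this project, whose details are not given in this record; the function name `shift_and_search` and its interface are assumptions made for the example. Each bit of an integer tracks one prefix of the pattern, so a single shift and a single AND update all partial matches per incoming symbol.

```python
from typing import Iterable, Iterator


def shift_and_search(pattern: str, stream: Iterable[str]) -> Iterator[int]:
    """Yield the start index of every occurrence of `pattern` in `stream`.

    Hypothetical helper for illustration only: classic Shift-And,
    not the BPS algorithm described in the abstract.
    """
    if not pattern:
        return
    # One bitmask per symbol: bit i is set when pattern[i] equals that symbol.
    masks: dict[str, int] = {}
    for i, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << i)

    accept = 1 << (len(pattern) - 1)  # bit that signals a full match
    state = 0                         # bit i set: pattern[:i+1] matches a suffix of the input read so far
    for pos, ch in enumerate(stream):
        # Extend every partial match by one symbol and start a new one at bit 0.
        state = ((state << 1) | 1) & masks.get(ch, 0)
        if state & accept:
            yield pos - len(pattern) + 1


if __name__ == "__main__":
    print(list(shift_and_search("aba", "ababa")))
```

Because the matcher reads its input one symbol at a time and keeps only a single integer of state, it can be fed from any iterator, which is the property that makes this family of techniques attractive for continuous streams.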

Report

(4 results)
  • 2010 Annual Research Report   Final Research Report ( PDF )
  • 2009 Annual Research Report
  • 2008 Annual Research Report
  • Research Products

    (29 results)

Journal Article (12 results) (of which Peer Reviewed: 12 results) Presentation (15 results) Book (1 result) Remarks (1 result)

  • [Journal Article] Unsupervised Spam Detection by Document Probability Estimation with Maximal Overlap Method (2011)

    • Author(s)
      Takashi Uemura, Daisuke Ikeda, Takuya Kida, Hiroki Arimura
    • Journal Title

      Transactions of the Japanese Society for Artificial Intelligence Vol.26, No.1

      Pages: 297-306

    • NAID

      130000455378

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] Unsupervised Spam Detection by Document Probability Estimation with Maximal Overlap Method (2011)

    • Author(s)
      Takashi Uemura, Daisuke Ikeda, Takuya Kida, Hiroki Arimura
    • Journal Title

      Transactions of the Japanese Society for Artificial Intelligence

      Volume: Vol.26, No.1 Pages: 297-306

    • NAID

      130000455378

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] On Performance of Compressed Pattern Matching on VF Codes (2011)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Journal Title

      Proc.of Data Compression Conference 2011

      Volume: IEEE DCC.2011 Pages: 486-486

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
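  • [Journal Article] An Efficient Compressed Pattern Matching Algorithm on Codes Represented by a Parse Tree and Shared Strings (2010)

    • Author(s)
      Takuya Kida
    • Journal Title

      IEICE Transactions on Information and Systems (Japanese Edition) Vol.J93-D, No.6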

      Pages: 733-741

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
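  • [Journal Article] An Efficient Compressed Pattern Matching Algorithm on Codes Represented by a Parse Tree and Shared Strings (2010)

    • Author(s)
      Takuya Kida
    • Journal Title

      IEICE Transactions on Information and Systems (Japanese Edition)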

      Volume: Vol.J93-D, No.6 Pages: 733-741

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Training Parse Trees for Efficient VF Coding (2010)

    • Author(s)
      Takashi Uemura, Takuya Kida, Satoshi Yoshida, Tatsuya Asai, Seishi Okamoto
    • Journal Title

      Proc.of the 17th Symposium on String Processing and Information Retrieval (SPIRE2010)

      Volume: LNCS 6393 Pages: 179-184

    • Related Report
      2010 Annual Research Report
    • Peer Reviewed
  • [Journal Article] An Efficient Algorithm for Almost Instantaneous VF Code Using Multiplexed Parse Tree (2010)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Journal Title

      Proc.of Data Compression Conference 2010 IEEE DCC2010

      Pages: 219-228

    • NAID

      120006660801

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] STVF Codes: Efficient VF Coding Using Frequency-Pruned Suffix Trees (2009)

    • Author(s)
      Takuya Kida
    • Journal Title

      DBSJ Journal (Journal of the Database Society of Japan) Vol.8, No.1

      Pages: 125-130

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] A space-saving approximation algorithm for grammar-based compression (2009)

    • Author(s)
      H. Sakamoto, S. Maruyama, T. Kida, S. Shimozono
    • Journal Title

      IEICE Trans. on Information and Systems E92-D(2)

      Pages: 158-165

    • NAID

      10026807354

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Journal Article] STVF Codes: Efficient VF Coding Using Frequency-Pruned Suffix Trees (2009)

    • Author(s)
      Takuya Kida
    • Journal Title

      DBSJ Journal (Journal of the Database Society of Japan) Vol.8

      Pages: 125-130

    • Related Report
      2009 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Flexible Framework for Time-Series Pattern Matching over Multi-Dimension Data Stream (2009)

    • Author(s)
      Takuya Kida, Tomoya Saito, and Hiroki Arimura
    • Journal Title

      Proc. of New Frontiers in Applied Data Mining : PAKDD 2008 International Workshops LNAI5433

      Pages: 1-12

    • Related Report
      2008 Annual Research Report
    • Peer Reviewed
  • [Journal Article] Efficient Keyword Extraction in Web Browsing and Its Use (2008)

    • Author(s)
      Takashi Uemura, Takuya Kida, Hiroki Arimura
    • Journal Title

      IPSJ Transactions on Databases (TOD) Vol.38

      Pages: 49-60

    • NAID

      110007990001

    • Related Report
      2010 Final Research Report
    • Peer Reviewed
  • [Presentation] On Performance of Compressed Pattern Matching on VF Codes (2011)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Organizer
      Proc.of Data Compression Conference 2011, p.486
    • Place of Presentation
      Utah, USA
    • Year and Date
      2011-03-30
    • Related Report
      2010 Final Research Report
  • [Presentation] A Combination of Variable-length-to-Fixed-length Coding with Arithmetic Coding for Efficient Compression and Pattern Matching (2010)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Organizer
      In 5th Workshop on Compression, Text, and Algorithms
    • Place of Presentation
      Grand Faro hotel, Los Cabos, Mexico
    • Year and Date
      2010-10-14
    • Related Report
      2010 Annual Research Report
  • [Presentation] Training Parse Trees for Efficient VF Coding (2010)

    • Author(s)
      Takashi Uemura, Takuya Kida, Satoshi Yoshida, Tatsuya Asai, Seishi Okamoto
    • Organizer
      Proc.of the 17th Symposium on String Processing and Information Retrieval (SPIRE2010), LNCS 6393, pp.179-184
    • Place of Presentation
      Los Cabos, Mexico
    • Year and Date
      2010-10-12
    • Related Report
      2010 Final Research Report
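  • [Presentation] Improving the Compression Ratio of VF Codes by Training Parse Trees (2010)

    • Author(s)
      Takashi Uemura, Satoshi Yoshida, Takuya Kida
    • Organizer
      GCOE Domestic Industry-Academia Symposium for Supporting Young Researchers
    • Place of Presentation
      Hokkaido University, Sapporo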
    • Year and Date
      2010-10-06
    • Related Report
      2010 Annual Research Report
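  • [Presentation] On Improving Compression Ratio by Combining VF Codes with Arithmetic Codes (2010)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Organizer
      IPSJ Joint Meeting of the 150th SIG Database Systems and the 99th SIG Information Fundamentals and Access Technologies
    • Place of Presentation
      Aoyama Gakuin University, Tokyo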
    • Year and Date
      2010-08-04
    • Related Report
      2010 Annual Research Report
  • [Presentation] An Efficient Algorithm for Almost Instantaneous VF Code Using Multiplexed Parse Tree (2010)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Organizer
      In Proc.of Data Compression Conference 2010 (DCC 2010), 219-228
    • Place of Presentation
      Utah, USA
    • Year and Date
      2010-03-25
    • Related Report
      2010 Final Research Report
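  • [Presentation] Efficient AIVF Codes Using a Virtual Multiplexed Parse Tree (2009)

    • Author(s)
      Satoshi Yoshida, Takuya Kida
    • Organizer
      Joint Meeting of the 148th Database Systems and the 95th Fundamental Informatics Research Groups
    • Place of Presentation
      Kobe Fashion Mart (Hyogo Prefecture)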
    • Year and Date
      2009-07-28
    • Related Report
      2009 Annual Research Report
  • [Presentation] A Compressed Pattern Matching Algorithm on VF Codes (2009)

    • Author(s)
      Takuya Kida
    • Organizer
      IEICE Technical Committee on Computation (COMP)
    • Place of Presentation
      Hokkaido University
    • Year and Date
      2009-06-29
    • Related Report
      2009 Annual Research Report
  • [Presentation] Suffix Tree Based VF-Coding for Compressed Pattern Matching (2009)

    • Author(s)
      Takuya Kida
    • Organizer
      In Proc.Data Compression Conference 2009 IEEE press, p.449
    • Place of Presentation
      Utah, USA
    • Year and Date
      2009-03-17
    • Related Report
      2010 Final Research Report
  • [Presentation] Suffix Tree Based VF-Coding for Compressed Pattern Matching (2009)

    • Author(s)
      Takuya Kida
    • Organizer
      Data Compression Conference 2009
    • Place of Presentation
      Snowbird, Utah, USA
    • Year and Date
      2009-03-17
    • Related Report
      2008 Annual Research Report
  • [Presentation] VF Coding with Frequency-Pruned Suffix Trees (2009)

    • Author(s)
      Takuya Kida
    • Organizer
      The 1st Forum on Data Engineering and Information Management (DEIM2008)
    • Place of Presentation
      Yamaha Resort Tsumagoi, Kakegawa, Shizuoka
    • Year and Date
      2009-03-08
    • Related Report
      2008 Annual Research Report
  • [Presentation] Efficient Serial Episode Mining with Minimal Occurrences (2009)

    • Author(s)
      Hideyuki Ohtani, Takuya Kida, Takeaki Uno, Hiroki Arimura
    • Organizer
      Proc.of The 3rd International Conference on Ubiquitous Information Management and Communication (ICUIMC 2009), 471-479
    • Place of Presentation
      Suwon, Korea
    • Year and Date
      2009-01-16
    • Related Report
      2010 Final Research Report
  • [Presentation] Efficient Serial Episode Mining with Minimal Occurrences (2009)

    • Author(s)
      Hideyuki Ohtani, Takuya Kida, Takeaki Uno, Hiroki Arimura
    • Organizer
      The 3rd International Conference on Ubiquitous Information Management and Communication (ICUIMC 2009)
    • Place of Presentation
      Sungkyunkwan University, Suwon, Korea
    • Related Report
      2008 Annual Research Report
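  • [Presentation] A Two-Dimensional Pattern Matching Algorithm for JPEG Images (2008)

    • Author(s)
      中野智晴, Takuya Kida
    • Organizer
      Technical Committee on Pattern Recognition and Media Understanding, PRMU2008
    • Place of Presentation
      Kumamoto University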
    • Year and Date
      2008-12-18
    • Related Report
      2008 Annual Research Report
  • [Presentation] Flexible Framework for Time-Series Pattern Matching over Multi-Dimension Data Stream (2008)

    • Author(s)
      Takuya Kida, Tomoya Saito, Hiroki Arimura
    • Organizer
      Proc.the First International Workshop on Algorithms for Large-Scale Information Processing in Knowledge Discovery (ALSIP 2008), in conjunction with PAKDD 2008, 5-16
    • Place of Presentation
      Hotel Seagull Tempozan, Osaka
    • Year and Date
      2008-05-20
    • Related Report
      2010 Final Research Report
  • [Book] Two-Dimensional Approximate Pattern Matching for JPEG Images, Image Lab (画像ラボ) (2009)

    • Author(s)
      中野智晴, Takuya Kida
    • Publisher
      Nihon Kogyo Shuppan (日本工業出版)
    • Related Report
      2010 Final Research Report
  • [Remarks] Website, etc.

    • URL

      http://www-ikn.ist.hokudai.ac.jp/~kida/publication.html

    • Related Report
      2010 Final Research Report

Published: 2008-04-01   Modified: 2016-04-21  
