
2006 Fiscal Year Final Research Report Summary

Automated synthesis of frequent event-sequences corpus from large-scale textual data and its application to WEB content tracking

Research Project

Project/Area Number 16500078
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution University of Yamanashi

Principal Investigator

IWANUMA Koji  University of Yamanashi, Interdisciplinary Graduate School of Medicine and Engineering, Professor (30176557)

Project Period (FY) 2004 – 2006
Keywords sequential data mining / frequent sequence / text / WEB / online algorithm / newspaper article / relaxation method / event sequence corpus
Research Abstract

In this research, we studied and developed the following technologies:
1. A novel and rational frequency measure, called the Total Frequency Measure, which satisfies the anti-monotonic property and never causes duplicate counting within a single very long data sequence.
2. A fast online sequential data mining algorithm for extracting frequent subsequences within the framework of an infinite-length window.
3. A fast sequential mining algorithm based on the relaxation method, intended for the framework of a finite-length window.
4. An intelligent sequential data mining method that uses an integrated occurrence criterion combining frequency and information gain for subsequences.
5. A sequential pattern mining method for Web access logs, which makes it possible to analyze access log data while taking page-staying time sequences into account.
6. A new method for extracting important keywords and/or phrases from newspaper articles in a huge newspaper corpus.
We demonstrated the significance of the above technologies through a large number of evaluation experiments.
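
The two properties named in item 1 can be illustrated with a small sketch. It is not the Total Frequency Measure as defined in the project's papers; the function names (`head_frequency`, `min_sub_frequency`), the window width, and the toy sequence are all assumptions made for illustration. The sketch counts each window start at most once, so several embeddings of a pattern inside one window are not counted repeatedly, and it takes the minimum over all sub-patterns, which makes the resulting measure anti-monotone by construction.

```python
from itertools import combinations


def is_subsequence(pattern, window):
    """True if the items of pattern appear in window in order (gaps allowed)."""
    it = iter(window)
    # 'x in it' advances the iterator, so order is respected.
    return all(x in it for x in pattern)


def head_frequency(seq, pattern, width):
    """Count start positions i where seq[i] equals the first item of the
    (non-empty) pattern and the pattern is a subsequence of seq[i:i+width].
    Each start position contributes at most 1, however many embeddings exist."""
    return sum(
        1
        for i in range(len(seq))
        if seq[i] == pattern[0] and is_subsequence(pattern, seq[i:i + width])
    )


def min_sub_frequency(seq, pattern, width):
    """Illustrative anti-monotone measure (an assumption, not the authors'
    definition): the minimum head frequency over all non-empty
    subsequences of the pattern."""
    subs = {
        tuple(pattern[i] for i in idx)
        for r in range(1, len(pattern) + 1)
        for idx in combinations(range(len(pattern)), r)
    }
    return min(head_frequency(seq, s, width) for s in subs)


seq = "aab"
print(head_frequency(seq, ("a", "b"), 3))     # 2
print(head_frequency(seq, ("b",), 3))         # 1
print(min_sub_frequency(seq, ("a", "b"), 3))  # 1
```

On this toy input the plain window count rates the longer pattern `ab` (2) above its sub-pattern `b` (1), which violates anti-monotonicity and would make Apriori-style pruning unsound. The minimum over sub-patterns can never exceed the value of any sub-pattern, so the pruning condition holds.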

  • Research Products

    (18 results)


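  • [Journal Article] An Intelligent Sequential Data Mining Method Based on Information Content and Frequency (2007)

    • Author(s)
      Naoki Ohtsuka, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Japanese Society for Artificial Intelligence, SIG on Data Mining and Statistical Mathematics, Technical Report SIG-DMSM-A603-12

      Pages: 81-88

    • Description
      From the Final Research Report Summary (Japanese)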
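  • [Journal Article] Fast Mining of Frequent Subsequences from Sequential Data Based on the Relaxation Method (2006)

    • Author(s)
      Yasushi Maruyama, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.5, LF-006

      Pages: 113-116

    • Description
      From the Final Research Report Summary (Japanese)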
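  • [Journal Article] A Fast Semi-automatic Generation Method for Domain-Specific Search Engines (2006)

    • Author(s)
      Reiko Miyagawa, Yuki Suzuki, Hidetomo Nabeshima, Koji Iwanuma
    • Journal Title

      Information Science Technical Letters Vol.5, LL-005

      Pages: 335-358

    • Description
      From the Final Research Report Summary (Japanese)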
  • [Journal Article] Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-automatic Training-Example Generation (2006)

    • Author(s)
      H.Nabeshima, R.Miyagawa, Y.Suzuki, K.Iwanuma
    • Journal Title

      Proceedings of the International Conference on Web Intelligence 2006 (WI'06)

      Pages: 769-772

    • Description
      From the Final Research Report Summary (Japanese)
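  • [Journal Article] Sequential Data Mining for Web Access Logs: Analysis of Page-Staying Time Sequences (2006)

    • Author(s)
      Osamu Yoshida, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      IEICE Technical Report AI2006-14

      Pages: 13-18

    • Description
      From the Final Research Report Summary (Japanese)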
  • [Journal Article] A Fast Extraction of Frequent Subsequences Based on Relaxation Method from a Single Data Sequence (2006)

    • Author(s)
      Yasushi Maruyama, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.5

      Pages: 113-116

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Rapid Semi-automatic Synthesis of Domain-Specific Web Search Engines (2006)

    • Author(s)
      Reiko Miyagawa, Yuki Suzuki, Hidetomo Nabeshima, Koji Iwanuma
    • Journal Title

      Information Science Technical Letters Vol.5

      Pages: 355-358

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-automatic Training-Example Generation (2006)

    • Author(s)
      H.Nabeshima, R.Miyagawa, Y.Suzuki, K.Iwanuma
    • Journal Title

      Proceedings of the International Conference on Web Intelligence (WI06)

      Pages: 769-772

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Sequential Pattern Mining for Web Access Logs: Analysis of Page-Staying Time Sequences (2006)

    • Author(s)
      Osamu Yoshida, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      IEICE Technical Report AI2006-14

      Pages: 7-12

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Intelligent Sequential Data Mining Based on Self-Information and Frequency (2006)

    • Author(s)
      Naoki Ohtsuka, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      JSAI Technical Report SIG-DMSM-603

      Pages: 81-88

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Extracting Frequent Subsequences from a Single Long Data Sequence: A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm (2005)

    • Author(s)
      K.Iwanuma, R.Ishihara, Y.Takano, H.Nabeshima
    • Journal Title

      Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005)

      Pages: 186-193

    • Description
      From the Final Research Report Summary (Japanese)
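  • [Journal Article] An Online Algorithm for Extracting Subsequences That Occur Frequently in a Large-Scale Data Sequence (2005)

    • Author(s)
      Ryuichi Ishihara, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.4, LF-002

      Pages: 89-92

    • Description
      From the Final Research Report Summary (Japanese)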
  • [Journal Article] On-line Extraction Algorithm of Frequent Subsequences from a Single Very-Long (2005)

    • Author(s)
      Ryuichi Ishihara, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.4

      Pages: 89-92

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] Extracting Frequent Subsequences from a Single Long Data Sequence: A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm (2005)

    • Author(s)
      Koji Iwanuma, Ryuichi Ishihara, Yo Takano, Hidetomo Nabeshima
    • Journal Title

      Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005)

      Pages: 186-193

    • Description
      From the Final Research Report Summary (English)
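  • [Journal Article] An Occurrence Measure for Sequential Patterns on a Single Very Long Data Sequence and Its Anti-Monotonicity (2004)

    • Author(s)
      Yo Takano, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.3, LF-012

      Pages: 115-118

    • Description
      From the Final Research Report Summary (Japanese)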
  • [Journal Article] On Anti-Monotone Frequency Measures for Extracting Sequential Patterns from a Single Very-Long Data Sequence (2004)

    • Author(s)
      K.Iwanuma, Y.Takano, H.Nabeshima
    • Journal Title

      Proceedings of IEEE International Conference on Cybernetics and Intelligence

      Pages: WP6.5

    • Description
      From the Final Research Report Summary (Japanese)
  • [Journal Article] A Frequency Measure of Sequential Patterns on a Single Very-Large Data Sequence and its Anti-Monotonicity (2004)

    • Author(s)
      Yo Takano, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.3

      Pages: 115-118

    • Description
      From the Final Research Report Summary (English)
  • [Journal Article] On Anti-Monotone Frequency Measures for Extracting Sequential Patterns from a Single Very-Long Data Sequence (2004)

    • Author(s)
      Koji Iwanuma, Yo Takano, Hidetomo Nabeshima
    • Journal Title

      Proceedings of IEEE International Conference on Cybernetics and Intelligence Paper No. WP6.5

    • Description
      From the Final Research Report Summary (English)

Published: 2008-05-27  
