• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Automated synthesis of frequent event-sequences corpus from large-scale textual data and its application to WEB content tracking

Research Project

Project/Area Number 16500078
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field Intelligent informatics
Research InstitutionUniversity of Yamanashi

Principal Investigator

IWANUMA Koji  University of Yamanashi, Department of Research Interdisciplinary Graduate School of Medicine and Engineering, Professor, 大学院医学工学総合研究部, 教授 (30176557)

Project Period (FY) 2004 – 2006
Project Status Completed (Fiscal Year 2006)
Budget Amount *help
¥3,600,000 (Direct Cost: ¥3,600,000)
Fiscal Year 2006: ¥1,000,000 (Direct Cost: ¥1,000,000)
Fiscal Year 2005: ¥900,000 (Direct Cost: ¥900,000)
Fiscal Year 2004: ¥1,700,000 (Direct Cost: ¥1,700,000)
Keywordssequential data mining / frequent sequence / text / WEB / online algorithm / newspaper article / relaxation method / event sequence corpus / プラウジング支援 / データマイング
Research Abstract

This research we studied and developed the following technologies:
1. a novel and rational frequency measure, called Total Frequency Measure, which satisfies anti-monotonic property and never causes duplicated counting within a very long single data sequence.
2. a online fast sequential data mining algorithm for extracting frequent subsequences within the framework of a infinite-length window.
3. a fast sequential mining algorithm based on the relaxation method which is intended for use for the framework of a finite-length window.
4. a intelligent sequential data mining method which uses an integrated occurrence criteria of frequency and information gain for subsequences.
5. a sequential pattern mining method for WEB access logs, which enables us to analyze access log data with considering page-staying time sequences
6. a new method for extracting important key words and/or phrases from newspaper articles in a huge newspaper corpus.
We showed the significance of the above technologies throughout huge amounts of experiments for evaluation.

Report

(4 results)
  • 2006 Annual Research Report   Final Research Report Summary
  • 2005 Annual Research Report
  • 2004 Annual Research Report
  • Research Products

    (33 results)

All 2007 2006 2005 2004

All Journal Article (33 results)

  • [Journal Article] 情報量と頻度に基づく知的系列データマイニング手法2007

    • Author(s)
      大塚尚貴, 岩沼宏治, 鍋島英知
    • Journal Title

      人工知能学会 データマイニングと統計数理研究会資料 SIG-DMSM-A603-12

      Pages: 81-88

    • NAID

      130008079455

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 緩和法に基づく系列データからの頻出部分系列の高速マイニング2006

    • Author(s)
      丸山育嗣, 岩沼宏治, 鍋島英知
    • Journal Title

      第5回情報科学技術レターズ LF-006

      Pages: 113-116

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Annual Research Report 2006 Final Research Report Summary
  • [Journal Article] 専門検索エンジンの高速半自動生成法2006

    • Author(s)
      宮川礼子, 鈴木 悠生, 鍋島英知, 岩沼宏治
    • Journal Title

      第5回情報科学技術レターズ LL-005

      Pages: 335-358

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-automatic Training-Example Generation2006

    • Author(s)
      H.Nabeshima, R.Miyagawa, Y.Suzuki, K.Iwanuma
    • Journal Title

      Proceedings of the International Conference on Web Intelligence 2006 (WI'06)

      Pages: 769-772

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Webアクセスログに対する系列データマイニング-ページ滞在時間系列の解析-2006

    • Author(s)
      吉田修, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告 AI2006-14

      Pages: 13-18

    • NAID

      110005717345

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] A Fast Extraction of Frequent Subsequences Based on Relaxation Method from a Single Data Sequence2006

    • Author(s)
      Yasushi Maruyama, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.5

      Pages: 113-116

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Semi-automatic Synthesis of Domain-Specific Web Search Engines2006

    • Author(s)
      Reiko Miyagawa, Yuki Suzuki, Hidetomo Nabeshima, Koji Iwanuma Rapid
    • Journal Title

      Information Science Technical Letters Vol.5,

      Pages: 355-358

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-automatic Training-Example Generation2006

    • Author(s)
      H.Nabeshima, R.Miyagawa, Y.Suzuki, K.Iwanuma
    • Journal Title

      Proceedings of the International Conference on Web Intelligence (WI06)

      Pages: 769-772

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Analysis of page-staying time sequences2006

    • Author(s)
      Osamu Yoshida Koji Iwanuma, Hidetomo Nabeshima Sequential Pattern Mining for Web Access Logs
    • Journal Title

      IEICE Technical Report AI2006-14

      Pages: 7-12

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Nabeshima Intelligent Sequential Data Mining Based on Self-Informration and Frequency2006

    • Author(s)
      Naoki Ohtsuka, Koji Iwanuma, Hidetomo
    • Journal Title

      JSAI Technical Report SIG-DMSM-603

      Pages: 81-88

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 専門検索エンジンの高速半自動生成法,2006

    • Author(s)
      宮川礼子, 鈴木 悠生, 鍋島英知, 岩沼宏治
    • Journal Title

      第5回情報科学技術レターズ LL-005

      Pages: 113-116

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-automatic Training-Example Generation2006

    • Author(s)
      H.Nabeshima, R.Miyagawa, Y.Suzuki, K.Iwamura:
    • Journal Title

      Proceedings of the International Conference on Web Intelligence 2006 (WI'06),

      Pages: 769-772

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Webアクセスログに対する系列データマイニング-ページ滞在時間系列の解析2006

    • Author(s)
      吉田修, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告 AI2006-14

      Pages: 7-12

    • NAID

      110005717345

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 極大系列抽出を目的とする系列包含検査の高速化アルゴリズム2006

    • Author(s)
      市川博規, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告 AI2006-13

      Pages: 7-12

    • NAID

      110005717344

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 情報量と頻度に基づく知的系列データマイニング手法2006

    • Author(s)
      大塚尚貴, 岩沼宏治, 鍋島英知
    • Journal Title

      人工知能学会 データマイニングと統計数理研究会資料 SIG-DMSM-603

      Pages: 81-88

    • NAID

      130008079455

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 専門検索エンジンの半自動生成を目的とした類似度に基づくWEB学習データの精製2006

    • Author(s)
      宮川礼子, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告

    • NAID

      110004680124

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 背景記事集合の類似度に基づく新聞記事のクラスタリング2006

    • Author(s)
      広瀬千夏, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告

    • NAID

      110004662875

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Extracting Frequent Subsequences from a Single Long Data Sequence : A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm2005

    • Author(s)
      K.Iwanuma, R.Ishihara, Y.Takano, H.Nabeshima
    • Journal Title

      Proceedings of the 5^<th> IEEE International Conference on Data Mining (ICDM 2005)

      Pages: 186-193

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary 2005 Annual Research Report
  • [Journal Article] 大規模データ系列中に頻出する部分系列のオンライン抽出アルゴリズム2005

    • Author(s)
      石原龍一, 岩沼宏治, 鍋島英知
    • Journal Title

      第4回情報科学技術レターズ LF-002

      Pages: 89-92

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary 2005 Annual Research Report
  • [Journal Article] On-line Extraction Algorithm of Frequent Subsequences from a Single Very-Long2005

    • Author(s)
      Ryuichi Ishihara, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.4

      Pages: 89-92

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Extracting Frequent Subsequences from a Single Long Data Sequence : A Novel Anti-Monotonic Measure and a Simple On-Line Algorithm2005

    • Author(s)
      Koji.Iwanuma, Ryuichi Ishihara, Yo Takano, Hidetomo Nabeshima
    • Journal Title

      Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005)

      Pages: 186-193

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 系列パターンマイニングにおけるアイテム集合間の連結強度による頻出部分系列の絞込み2005

    • Author(s)
      大塚尚貴, 岩沼宏治, 鍋島英知
    • Journal Title

      電子情報通信学会技術研究報告

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 検索隠し味の半自動生成を目的とした訓練データの精製2005

    • Author(s)
      鈴木悠生, 鍋島英知, 岩沼宏治
    • Journal Title

      電子情報通信学会技術研究報告

    • NAID

      110003205646

    • Related Report
      2005 Annual Research Report
  • [Journal Article] 大規模時系列データ中の頻出パターンのオンライン抽出アルゴリズム2005

    • Author(s)
      石原龍一, 岩沼宏治, 鍋島英知
    • Journal Title

      IEICE SIG Notes

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 多重リンクを考慮するハイパーリンク最重要箇所の同定法とブラウジング支援の応用2005

    • Author(s)
      林直弘, 岩沼宏治, 鍋島英知
    • Journal Title

      IEICE SIG Notes

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 診療系関連語彙テンプレートの自動生成とWebページの自動統合2005

    • Author(s)
      須田真行, 岩沼宏治, 鍋島英知
    • Journal Title

      IEICE SIG Notes

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 新聞記事のイベント想起語群の自動生成2005

    • Author(s)
      広瀬千夏, 岩沼宏治, 鍋島英知
    • Journal Title

      IEICE SIG Notes

    • Related Report
      2004 Annual Research Report
  • [Journal Article] 単一の長大なデータ系列上の系列パターンの出現尺度とその逆単調性2004

    • Author(s)
      高野洋, 岩沼宏治, 鍋島英知
    • Journal Title

      第3回情報科学技術レターズ LF-012

      Pages: 115-118

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] On Anti-Monotone Frequency Measures for Extracting Sequential Patterns from a Single Very-Long Data Sequence2004

    • Author(s)
      K.Iwanuma, Y.Takano, H.Nabeshima
    • Journal Title

      Proceedings of IEEE International Conference on Cybernetics and Intelligence

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] A Frequency Measure of Sequential Patterns on a Single Very-Large Data Sequence and its Anti-Monotonicity2004

    • Author(s)
      Yo Takano, Koji Iwanuma, Hidetomo Nabeshima
    • Journal Title

      Information Science Technical Letters Vol.3

      Pages: 115-118

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] On Anti-Monotone Frequency Measures for Extracting Sequential Patterns from a Single Very-Long Data Sequence2004

    • Author(s)
      Koji Iwanuma, Yo Takano, Hidetomo Nabeshima
    • Journal Title

      Proceedings of IEEE International Conference on Cybernetics and Intelligence Paper No. WP6.5

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] 単一の長大なデータ系列上の系列パターンの出現尺度とその逆単調性2004

    • Author(s)
      高野洋, 岩沼宏治, 鍋島英知
    • Journal Title

      第3回情報科学技術レターズ Lf-012

      Pages: 115-118

    • Related Report
      2004 Annual Research Report
  • [Journal Article] On Anti-Monotone Frequency Measures for Extracting Sequential Patterns from a Single Very-Long Data Sequence2004

    • Author(s)
      K.Iwanuma, Y.Takano, H.Nabeshima
    • Journal Title

      Proceedings of 2004 IEEE International Conference on Cybernetics and Intelligence(CIS'04)

    • Related Report
      2004 Annual Research Report

URL: 

Published: 2004-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi