• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Research of Association Rule Mining Parallel Processing on Very Large Multi-processors

Research Project

Project/Area Number 11558030
Research Category

Grant-in-Aid for Scientific Research (B).

Allocation TypeSingle-year Grants
Section展開研究
Research Field 計算機科学
Research InstitutionUniversity of Tokyo

Principal Investigator

KITSUREGAWA Masaru  University of Tokyo, Institute of Industrial Science, Professor, 生産技術研究所, 教授 (40161509)

Co-Investigator(Kenkyū-buntansha) HAYASHI Hiroshi  University of Tokyo, Research Associate, 生産技術研究所, 助手 (50282596)
NAKAYAMA Masaya  University of Tokyo, Associate Professor, 情報基盤センター, 助教授 (90217943)
NAKANO Miyuki  Univ.of Tokyo, Institute of Industrial Science, Research Associate, 生産技術研究所, 助手 (30227863)
TORII Syunichi  Hitachi, Ltd.Main Researcher, ビジネスソリューション開発本部, 技術主幹
Project Period (FY) 1999 – 2000
Project Status Completed (Fiscal Year 2000)
Budget Amount *help
¥13,600,000 (Direct Cost: ¥13,600,000)
Fiscal Year 2000: ¥6,500,000 (Direct Cost: ¥6,500,000)
Fiscal Year 1999: ¥7,100,000 (Direct Cost: ¥7,100,000)
Keywordsdata mining / parallel database processing / distributed processing / 並列データベース
Research Abstract

In this research, we developed a parallel association rule mining algorithm and implemented the algorithm on the large multi-processors (100 processors). Then, we tried to make the proposed algorihtm fit for a practical use.
At the first year, we designed a parallel association rule mining algorithm considering the taxonomy for sequence data. We adopt a hashing method to the candidate rules so that we can easily achieve scalable performance on the environment of large number of processors. Then, the preliminary experiment was done by using the PC cluster. After the experiment, we investigated a run time load balancing method considering the taxonomy class and the frequency.
At the second year, we implemented our parallel association rule mining algorithm by SQL and executed our algorithm on the PC cluster and the yendor DBMS.Then, we investigated the pracical use of our algorithm on the large DBMS engine. Comparing our results with the special mining program written in C, we showed that the performance of SQL data mining algorithm with several nodes is almost equal to that of the special program. At last, we clarify the effectiveness of SQL parallel mining algorithm by considering reducing ratio of SQL algorithm to the special mining program.

Report

(3 results)
  • 2000 Annual Research Report   Final Research Report Summary
  • 1999 Annual Research Report
  • Research Products

    (19 results)

All Other

All Publications (19 results)

  • [Publications] Takahiko Shintani and Masaru Kitsuregawa: "Parallel Generalized Association Rule Mining on Large Scale PC Cluster"Large-Scale Parallel Data Mining ISBN 3-540-67194-3. 145-160 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Masaru Kitsuregawa, Takahiko Shintani, Masahisa Tamura, Iko Pramudiono: "Parallel Data Mining on Large Scale PC cluster (Key note address)"WAIM. 15-26 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Masaru Kitsuregawa, Takahiko Shintani, Takeshi Yoshizawa, Iko Pramudiono: "Web Log Mining and Parallel SQL Based Execution (Key note address)"International Workshop on Databases in Networked Information Systems (DNIS2000),, University of Aizu. 20-32 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Iko Pramdiono, Takahiko Shintani, Takayuki Tamura, Masaru Kitsuregawa: "Parallel SQL Based Association Rule Mining on Large Scale PC Cluster : Performance Comparison with Directly Coded C Implementation"Proceedings of Third Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD99). 94-98 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Takahiko Shintani, Masaru Kitsuregawa: "Parallel Generalized Association rule Mining on Large Scale PC Cluster"Proceedings of Workshop on Large-Scale Parallel KDD Systems. 35-44 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Iko Pramdiono, Takahiko Shintani, Takayuki Tamura, Masaru Kitsuregawa: "Mining Generalized Association Rule using Parallel RDB Engine on PC Cluster"Proceedings of First International Conference on Data Warehousing and Knowledege Discovery (DAWAK99). 281-292 (1999)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Takahiko Shintani and Masaru Kitsuregawa: "Parallel Generalized Association Rule Mining on Large Scale PC Cluster"Large-Scale Parallel Data Mining. ISBN 3-540-67194-3. 145-160 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Masaru Kitsuregawa, Takahiko Shintani, Masahisa Tamura, Iko Pramudiono: "Parallel Data Mining on Large Scale PC cluster (Key note address)"WAIM 2000. 15-26 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Masaru Kitsuregawa, Takahiko Shintani, Takeshi Yoshizawa, Iko Pramudiono: "Web Log Mining and Parallel SQL Based Execution (Key note address)"International Workshop on Databases in Networked Information Systems (DNIS2000). 20-32 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Iko Pramdiono, Takahiko Shintani, Takayuki Tamura, Masaru Kitsuregawa: "Parallel SQL Based Association Rule Mining on Large Scale PC Cluster : Performance Comparison with Directly Coded C Implementation"Proceedings of Third Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD99). 94-98 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Takahiko Shintani, Masaru Kitsuregawa: "Parallel Generalized Association rule Mining on Large Scale PC Cluster"Proceedings of Workshop on Large-Scale Parallel KDD Systems. 25-44 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Iko Pramdiono, Takahiko Shintani, Takayuki Tamura, Masaru Kitsuregawa: "Mining Generalized Association Rule using Parallel RDB Engine on PC Cluster"Proceedings of First International Conference on Data Warehousing and Knowledege Discovery (DAWAK99). 28-292 (1999)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2000 Final Research Report Summary
  • [Publications] Takahiko Shintani and Masaru Kitsuregawa: "Parallel Generalized Association Rule Mining on Large Scale PC Cluster"Large-Scale Parallel Data Mining ISBN 3-540-67194-3. 145-160 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Masaru Kitsuregawa,Takahiko Shintani,Masahisa Tamura,Iko Pramudiono: "Parallel Data Mining on Large Scale PC cluster (Key note address)"WAIM. 15-26 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Masaru Kitsuregawa,Takahiko Shintani,Takeshi Yoshizawa,Iko Pramudiono: "Web Log Mining and Parallel SQL Based Execution (Key note address)"International Workshop on Databases in Networked Information Systems (DNIS2000),,University of Aizu. 20-32 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] Iko Pramdino,Takahiko Shintani,Takayuki Tamura,Masaru Kitagawa: "Parallel SQL Based Association Rule Mining on Large Scale PC Cluster : Performance Comparison with Directly Coded C Implementation"Proceedings of Third Pacific-Asea Conference on Knowledge Discovery and Data Mining (PAKDD99). 94-98 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Takahiko Shintani,Masaru Kitsuregawa: "Parallel Generalized Association rule Mining on Large Scale PC Cluster"Proceedings of Workshop on Large-Scale Parallel KDD Systems. 35-44 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Iko Pramdiono,Takahiko Shintani,Takayuki Tamura,Masaru Kitsuregawa: "Mining Generalized Association Rule using Parallel RBD Engine on PC Cluster"Proceedings of First International Conference on Data Warehousing and Knoledege Discovery (DAWAK99). 281-292 (1999)

    • Related Report
      1999 Annual Research Report
  • [Publications] Takahiko Shintani,Masato Oguchi,Masaru Kitsuregawa: "Performance Analysis for Parallel Generalized Association Rule Mining on a Large Scale PC Cluster"Euro-par'99 Parallel Processing 5th International Euro-Par Conference. 1455-1459 (1999)

    • Related Report
      1999 Annual Research Report

URL: 

Published: 1999-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi