WebGraph-Analysis of Discrete Structures of the Internet and Development of their Optimization Algorithms

Research Project

Project/Area Number 15500015
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Fundamental theory of informatics
Research Institution Osaka Prefecture University

Principal Investigator

UNO Yushi  Osaka Prefecture University, Graduate School of Science, Assistant Professor (60244670)

Project Period (FY) 2003 – 2006
Project Status Completed (Fiscal Year 2006)
Budget Amount
¥3,600,000 (Direct Cost: ¥3,600,000)
Fiscal Year 2006: ¥500,000 (Direct Cost: ¥500,000)
Fiscal Year 2005: ¥600,000 (Direct Cost: ¥600,000)
Fiscal Year 2004: ¥1,000,000 (Direct Cost: ¥1,000,000)
Fiscal Year 2003: ¥1,500,000 (Direct Cost: ¥1,500,000)
Keywords webgraph / data mining / enumeration problem / graph algorithms / community / web algorithms
Research Abstract

On the explosively growing Web, which can be regarded as a huge database, it is extremely important not only to obtain primary information but also to find hidden information that naive retrieval cannot reach. This task is often called 'web mining'. Web structure mining, in particular, aims to find hidden communities that share common interests in specific topics by focusing on the webgraph, which represents the link structure among web pages. In this model, the set of web pages of a community, or its core, is usually assumed to form a dense subgraph or a frequent inherent substructure of the webgraph, and web structure mining is realized by extracting such substructures from the webgraph.
Among the substructures proposed as communities, Kleinberg's hub-authority biclique model is well known and attractive. Some experimental studies in this direction enumerate (a subset of) the bicliques in the webgraph and succeed in mining communities (or their cores). However, since there exist a potentially enormous number of bicliques, it has become quite hard to carry out an exhaustive enumeration and obtain effective results on the recent Web.
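To make the biclique model above concrete, the following is a minimal sketch, not the project's algorithm. It assumes a tiny directed graph given as a hypothetical `out_links` dictionary and enumerates maximal (hubs, authorities) pairs by brute force: every page in the hub set links to every page in the authority set, and neither side can be enlarged. The function name and the sample data are illustrative assumptions.

```python
from itertools import combinations


def maximal_bicliques(out_links):
    """Enumerate maximal (hubs, authorities) bicliques by brute force.

    out_links maps each page to the set of pages it links to.
    The running time is exponential in the number of pages, so this
    is only usable on toy graphs.
    """
    pages = sorted(out_links)
    found = set()
    for r in range(1, len(pages) + 1):
        for hubs in combinations(pages, r):
            # Authorities: pages that every chosen hub links to.
            auths = set.intersection(*(set(out_links[h]) for h in hubs))
            if not auths:
                continue
            # Close the hub side: all pages that link to every authority.
            closed_hubs = frozenset(
                p for p in pages if auths <= set(out_links[p])
            )
            found.add((closed_hubs, frozenset(auths)))
    return found


if __name__ == "__main__":
    # Hypothetical toy webgraph: three hub-like pages, two authority-like pages.
    toy = {
        "h1": {"a1", "a2"},
        "h2": {"a1", "a2"},
        "h3": {"a2"},
        "a1": set(),
        "a2": set(),
    }
    for hubs, auths in maximal_bicliques(toy):
        print(sorted(hubs), "->", sorted(auths))
```

Even on this toy graph the subsets of pages are tried exhaustively, which hints at why the number of bicliques becomes a practical obstacle at web scale.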
Our contributions in this series of research are summarized as follows:
(1) We implemented an efficient algorithm for enumerating maximal bicliques in a given graph and ran it on real web data. As a result, we identified the structures that obstruct exhaustive enumeration of bicliques and also revealed their semantic meaning.
(2) Instead of the conventional structures above, we adopted a new structure called 'isolated cliques' as candidates for communities in the Web. Their definition leads to a very efficient enumeration algorithm, which enables an exhaustive enumeration over the entire Web. As a result, we found that most isolated cliques reside in single domains and represent menu structures, which sometimes indicate harmful link-farm spam. This suggests the effectiveness of isolated cliques as a substructure of the webgraph.
(3) By observing the real webgraph, we found a new frequent substructure of the Web, which we named 'isolated stars'. We designed and implemented an efficient algorithm for enumerating them and ran an enumeration experiment on real web data. We also confirmed the effectiveness of isolated stars as a substructure of the webgraph.
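As an illustration of the 'isolated cliques' in item (2), the sketch below checks whether a vertex set is an isolated clique in a small undirected graph. The abstract does not state the definition, so the code assumes the c-isolation condition commonly used in the literature: a clique on k vertices is c-isolated if fewer than c·k edges connect it to the rest of the graph. The function names, the parameter `c`, and the sample graph are hypothetical, and this is a checker only, not the efficient enumeration algorithm the project refers to.

```python
from itertools import combinations


def is_clique(adj, nodes):
    """True if every pair of the given nodes is adjacent."""
    return all(v in adj[u] for u, v in combinations(nodes, 2))


def outgoing_edges(adj, nodes):
    """Count edges with exactly one endpoint inside the node set."""
    members = set(nodes)
    return sum(1 for u in members for v in adj[u] if v not in members)


def is_isolated_clique(adj, nodes, c=1.0):
    """Assumed definition: a clique with fewer than c * k outgoing edges."""
    nodes = list(nodes)
    return is_clique(adj, nodes) and outgoing_edges(adj, nodes) < c * len(nodes)


if __name__ == "__main__":
    # Hypothetical undirected graph (symmetric adjacency sets):
    # a triangle a-b-c attached to a short path c-d-e.
    adj = {
        "a": {"b", "c"},
        "b": {"a", "c"},
        "c": {"a", "b", "d"},
        "d": {"c", "e"},
        "e": {"d"},
    }
    print(is_isolated_clique(adj, ["a", "b", "c"]))  # one outgoing edge, k = 3
    print(is_isolated_clique(adj, ["c", "d"]))       # three outgoing edges, k = 2
```

Under this assumed definition, a tightly interlinked group of pages with few links to the rest of the graph qualifies, which matches the intuition behind the menu structures and link farms mentioned above.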

Report

(5 results)
  • 2006 Annual Research Report   Final Research Report Summary
  • 2005 Annual Research Report
  • 2004 Annual Research Report
  • 2003 Annual Research Report

Research Products

(28 results)

All Journal Article (26 results) Publications (2 results)

  • [Journal Article] On computing longest paths in small graph classes (2007)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      International Journal of Foundations of Computer Science (to be determined)

    • NAID

      120001063250

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] On computing longest paths in small graph classes (2007)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      International Journal of Foundations of Computer Science (to be determined)

    • NAID

      120001063250

    • Related Report
      2006 Annual Research Report
  • [Journal Article] Mining communities and detecting link farms in the Web by isolated cliques (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Proc. 2nd International Conference on Knowledge Engineering and Decision Support

      Pages: 179-187

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Annual Research Report 2006 Final Research Report Summary
  • [Journal Article] An experimental study of the Webgraph - Structural properties and web mining - (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Proc. 19th Workshop on Systems and Circuits

      Pages: 301-306

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Annual Research Report 2006 Final Research Report Summary
  • [Journal Article] Web structure mining by isolated stars (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Proc. 4th Workshop on Algorithms and Models for the Web-Graph (to be determined)

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] ウェブグラフ-その性質と利用 (The webgraph: its properties and uses) (2006)

    • Author(s)
      Y.Uno
    • Journal Title

      日本オペレーションズ・リサーチ学会誌 Vol. 51, No. 12

      Pages: 757-763

    • NAID

      110004997636

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Annual Research Report 2006 Final Research Report Summary
  • [Journal Article] Minimum edge ranking spanning trees of split graphs (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Discrete Applied Mathematics 154

      Pages: 2373-2386

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Mining communities and detecting link farms in the Web by isolated cliques. (2006)

    • Author(s)
      Y.Uno, Y.Ota, A.Uemichi, M.Umano
    • Journal Title

      Proc.2nd International Conference on Knowledge Engineering and Decision Support

      Pages: 179-187

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] An experimental study of the Webgraph -Structural properties and web mining-. (2006)

    • Author(s)
      Y.Uno, Y.Ota, A.Uemichi, M.Umano
    • Journal Title

      Proc.19th Workshop on Systems and Circuits

      Pages: 301-306

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Web structure mining by isolated stars. (2006)

    • Author(s)
      Y.Uno, Y.Ota, A.Uemichi, M.Umano
    • Journal Title

      Proc.4th Workshop on Algorithms and Models for the Web-Graph

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Minimum edge ranking spanning trees of split graphs. (2006)

    • Author(s)
      K.Makino, Y.Uno, T.Ibaraki
    • Journal Title

      Discrete Applied Mathematics 154

      Pages: 2373-2386

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Web structure mining by isolated stars (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Proc. 4th Workshop on Algorithms and Models for the Web-Graph

    • Related Report
      2006 Annual Research Report
  • [Journal Article] 孤立クリークを用いたウェブ構造マイニングとリンクファームの検出 (Web structure mining and link farm detection using isolated cliques) (2006)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      IEICE Technical Report, SIG-WI2 17

      Pages: 83-88

    • Related Report
      2005 Annual Research Report
  • [Journal Article] Laminar structure of Ptolemaic graph and its applications (2005)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Lecture Notes in Computer Science, Springer 3827

      Pages: 186-195

    • NAID

      110003225064

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Laminar structure of Ptolemaic graph and its applications. (2005)

    • Author(s)
      R.Uehara, Y.Uno
    • Journal Title

      Lecture Notes in Computer Science 3827(Springer)

      Pages: 186-195

    • NAID

      110003225064

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] An experimental study of the web graph. (2005)

    • Author(s)
      Y.Uno
    • Journal Title

      Proc.17th IFORS Conference

      Pages: 25-25

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] An experimental study on the Web graph (2005)

    • Author(s)
      Y.Uno
    • Journal Title

      Proc.17th IFORS Conference

      Pages: 25-25

    • Related Report
      2005 Annual Research Report
  • [Journal Article] On the laminar structure of Ptolemaic and DH graphs (2005)

    • Author(s)
      Y.Uno, R.Uehara
    • Journal Title

      Lecture Notes in Computer Science, Springer 3827

      Pages: 186-195

    • Related Report
      2005 Annual Research Report
  • [Journal Article] On the laminar structure of Ptolemaic and DH graphs (2005)

    • Author(s)
      Y.Uno, R.Uehara
    • Journal Title

      IEICE Technical Report, SIG-COMP (forthcoming)

    • Related Report
      2004 Annual Research Report
  • [Journal Article] An experimental study on the Web graph (2005)

    • Author(s)
      Y.Uno
    • Journal Title

      Proc.17th IFORS Conference (forthcoming)

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Efficient algorithms for the longest path problem (2004)

    • Author(s)
      Y.Uno, et al.
    • Journal Title

      Lecture Notes in Computer Science (Springer) 3341

      Pages: 871-883

    • Description
      From the Summary of Research Results Report (Japanese version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Efficient algorithms for the longest path problem. (2004)

    • Author(s)
      R.Uehara, Y.Uno
    • Journal Title

      Lecture Notes in Computer Science 3341(Springer)

      Pages: 871-883

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Journal Article] Longest paths in small graph classes (2004)

    • Author(s)
      R.Uehara, Y.Uno
    • Journal Title

      IEICE Technical Report, SIG-COMP 104,55

      Pages: 53-60

    • NAID

      110003178872

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Efficient algorithms for the longest path problem (2004)

    • Author(s)
      R.Uehara, Y.Uno
    • Journal Title

      Lecture Notes in Computer Science (Springer) 3341

      Pages: 871-883

    • Related Report
      2004 Annual Research Report
  • [Journal Article] Mining communities in the Web (2004)

    • Author(s)
      Y.Uno, F.Ueda
    • Journal Title

      Proc.Int'l Symp.on Discrete Algorithms and Optimization

      Pages: 11-11

    • Related Report
      2004 Annual Research Report
  • [Journal Article] On computing longest paths in small graph classes.

    • Author(s)
      R.Uehara, Y.Uno
    • Journal Title

      International Journal of Foundations of Computer Science. (to appear)

    • NAID

      120001063250

    • Description
      From the Summary of Research Results Report (English version)
    • Related Report
      2006 Final Research Report Summary
  • [Publications] R.Uehara, Y.Uno: "Longest paths in small graph classes" IEICE Technical Report, SIG-COMP. (to appear). (2004)

    • Related Report
      2003 Annual Research Report
  • [Publications] Y.Uno, F.Ueda: "Mining communities in the Web" Proc.Int'l Symp.on Discrete Algorithms and Optimization. 11-11 (2004)

    • Related Report
      2003 Annual Research Report

Published: 2003-04-01   Modified: 2016-04-21  