WebGraph-Analysis of Discrete Structures of the Internet and Development of their Optimization Algorithms

Research Project

Project/Area Number	15500015
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	General
Research Field	Fundamental theory of informatics
Research Institution	Osaka Prefecture University
Principal Investigator	UNO Yushi Osaka Prefecture University, Graduate School of Science, Assistant Professor (60244670)
Project Period (FY)	2003 – 2006
Project Status	Completed (Fiscal Year 2006)
Budget Amount	¥3,600,000 (Direct Cost: ¥3,600,000); Fiscal Year 2006: ¥500,000 (Direct Cost: ¥500,000); Fiscal Year 2005: ¥600,000 (Direct Cost: ¥600,000); Fiscal Year 2004: ¥1,000,000 (Direct Cost: ¥1,000,000); Fiscal Year 2003: ¥1,500,000 (Direct Cost: ¥1,500,000)
Keywords	webgraph / data mining / enumeration problem / graph algorithms / community / web algorithms
Research Abstract	As the Web evolves explosively, it can be regarded as a huge database, and it is extremely important not only to obtain primary information from it but also to find hidden information that naive retrieval cannot reach. This task is often called 'web mining'. Web structure mining, in particular, aims to find, for example, hidden communities that share common interests in specific topics, by focusing on the webgraph, which represents the link structure among web pages. In this model, the set of web pages of a community, or of its core, is usually assumed to form a dense subgraph or a frequently occurring substructure of the webgraph, and web structure mining is carried out by extracting such substructures from the webgraph. Among the substructures proposed as communities, Kleinberg's hub-authority biclique model is well known and attractive. Some experimental studies in this direction enumerate (a subset of) the bicliques in the webgraph and have succeeded in mining communities (or their cores). However, since there exist a potentially enormous number of bicliques, it has become quite hard to carry out an exhaustive enumeration and to obtain useful results on the recent Web. Our contributions in this series of research are summarized as follows.
(1) We implemented an efficient algorithm for enumerating maximal bicliques in a given graph and ran it on real web data. As a result, we identified the structures that obstruct an exhaustive enumeration of bicliques and also revealed their semantic meanings.
(2) Instead of the conventional structures above, we adopted a new structure called 'isolated cliques' as candidates for communities in the Web. Their definition leads to a very efficient enumeration algorithm, which enables an exhaustive enumeration from the entire Web. As a result, we found that most isolated cliques reside in single domains and represent menu structures, which sometimes indicate harmful link farm spam. This suggests the effectiveness of isolated cliques as a substructure of the webgraph.
(3) By observing the real webgraph, we found a new frequent substructure of the Web, which we named 'isolated stars'. We designed and implemented an efficient algorithm for enumerating them and performed an enumeration experiment on real web data. We also confirmed the effectiveness of isolated stars as a substructure of the Web.
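The abstract does not state how an isolated clique is defined. As a rough illustration only, the sketch below assumes one commonly used definition: a clique on k vertices is c-isolated when fewer than c·k edges leave it. The function names, the parameter `c`, and the toy graph are all hypothetical and are not taken from the project's implementation; the code merely tests a given vertex set and is not an enumeration algorithm.

```python
from itertools import combinations

def is_clique(adj, nodes):
    """Return True if every pair of distinct vertices in `nodes` is adjacent."""
    return all(v in adj[u] for u, v in combinations(nodes, 2))

def outgoing_edges(adj, nodes):
    """Count edges with exactly one endpoint inside `nodes`."""
    inside = set(nodes)
    return sum(1 for u in inside for v in adj[u] if v not in inside)

def is_isolated_clique(adj, nodes, c=1.0):
    """Assumed definition: a clique on k vertices is c-isolated
    if fewer than c * k edges leave it."""
    nodes = list(nodes)
    return is_clique(adj, nodes) and outgoing_edges(adj, nodes) < c * len(nodes)

# Hypothetical toy graph: pages a, b, c all link to each other
# (a menu-like structure); only c links out to d, and d links on to e.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d"), ("d", "e")]
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)  # treat links as undirected
    adj.setdefault(v, set()).add(u)

print(is_isolated_clique(adj, {"a", "b", "c"}))  # one outgoing edge, 1 < 3
print(is_isolated_clique(adj, {"c", "d"}))       # three outgoing edges, 3 >= 2
```

Under this assumed definition, the triangle {a, b, c} qualifies because only one edge leaves it, while the pair {c, d} does not, since it is more connected to the outside than to itself.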

Report

(5 results)

2006 Annual Research Report Final Research Report Summary
2005 Annual Research Report
2004 Annual Research Report
2003 Annual Research Report

Research Products

(28 results)

All Journal Article (26 results) Publications (2 results)

[Journal Article] On computing longest paths in small graph classes (2007)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  International Journal of Foundations of Computer Science (to appear)
- NAID
  120001063250
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] On computing longest paths in small graph classes (2007)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  International Journal of Foundations of Computer Science (to appear)
- NAID
  120001063250
- Related Report
  2006 Annual Research Report
[Journal Article] Mining communities and detecting link farms in the Web by isolated cliques (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Proc. 2nd International Conference on Knowledge Engineering and Decision Support
  
  Pages: 179-187
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Annual Research Report 2006 Final Research Report Summary
[Journal Article] An experimental study of the Webgraph - Structural properties and web mining - (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Proc. 19th Workshop on Systems and Circuits
  
  Pages: 301-306
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Annual Research Report 2006 Final Research Report Summary
[Journal Article] Web structure mining by isolated stars (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Proc. 4th Workshop on Algorithms and Models for the Web-Graph (to appear)
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] The webgraph: its properties and use (in Japanese) (2006)
- Author(s)
  Y.Uno
- Journal Title
  
  日本オペレーションズ・リサーチ学会誌 Vol. 51, No. 12
  
  Pages: 757-763
- NAID
  110004997636
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Annual Research Report 2006 Final Research Report Summary
[Journal Article] Minimum edge ranking spanning trees of split graphs (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Discrete Applied Mathematics 154
  
  Pages: 2373-2386
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Mining communities and detecting link farms in the Web by isolated cliques (2006)
- Author(s)
  Y.Uno, Y.Ota, A.Uemichi, M.Umano
- Journal Title
  
  Proc.2nd International Conference on Knowledge Engineering and Decision Support
  
  Pages: 179-187
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] An experimental study of the Webgraph - Structural properties and web mining - (2006)
- Author(s)
  Y.Uno, Y.Ota, A.Uemichi, M.Umano
- Journal Title
  
  Proc.19th Workshop on Systems and Circuits
  
  Pages: 301-306
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Web structure mining by isolated stars (2006)
- Author(s)
  Y.Uno, Y.Ota, A.Uemichi, M.Umano
- Journal Title
  
  Proc.4th Workshop on Algorithms and Models for the Web-Graph
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Minimum edge ranking spanning trees of split graphs (2006)
- Author(s)
  K.Makino, Y.Uno, T.Ibaraki
- Journal Title
  
  Discrete Applied Mathematics 154
  
  Pages: 2373-2386
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Web structure mining by isolated stars (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Proc. 4th Workshop on Algorithms and Models for the Web-Graph
- Related Report
  2006 Annual Research Report
[Journal Article] Web structure mining and link farm detection using isolated cliques (in Japanese) (2006)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  IEICE Technical Report, SIG-WI2 17
  
  Pages: 83-88
- Related Report
  2005 Annual Research Report
[Journal Article] Laminar structure of Ptolemaic graph and its applications (2005)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Lecture Notes in Computer Science, Springer 3827
  
  Pages: 186-195
- NAID
  110003225064
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Laminar structure of Ptolemaic graph and its applications (2005)
- Author(s)
  R.Uehara, Y.Uno
- Journal Title
  
  Lecture Notes in Computer Science 3827(Springer)
  
  Pages: 186-195
- NAID
  110003225064
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] An experimental study of the web graph (2005)
- Author(s)
  Y.Uno
- Journal Title
  
  Proc.17th IFORS Conference
  
  Pages: 25-25
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] An experimental study on the Web graph (2005)
- Author(s)
  Y.Uno
- Journal Title
  
  Proc.17th IFORS Conference
  
  Pages: 25-25
- Related Report
  2005 Annual Research Report
[Journal Article] On the laminar structure of Ptolemaic and DH graphs (2005)
- Author(s)
  Y.Uno, R.Uehara
- Journal Title
  
  Lecture Notes in Computer Science, Springer 3827
  
  Pages: 186-195
- Related Report
  2005 Annual Research Report
[Journal Article] On the laminar structure of Ptolemaic and DH graphs (2005)
- Author(s)
  Y.Uno, R.Uehara
- Journal Title
  
  IEICE Technical Report, SIG-COMP (to appear)
- Related Report
  2004 Annual Research Report
[Journal Article] An experimental study on the Web graph (2005)
- Author(s)
  Y.Uno
- Journal Title
  
  Proc.17th IFORS Conference (to appear)
- Related Report
  2004 Annual Research Report
[Journal Article] Efficient algorithms for the longest path problem (2004)
- Author(s)
  Y.Uno, et al.
- Journal Title
  
  Lecture Notes in Computer Science (Springer) 3341
  
  Pages: 871-883
- Description
  From "Final Research Report Summary (Japanese)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Efficient algorithms for the longest path problem (2004)
- Author(s)
  R.Uehara, Y.Uno
- Journal Title
  
  Lecture Notes in Computer Science 3341(Springer)
  
  Pages: 871-883
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Journal Article] Longest paths in small graph classes (2004)
- Author(s)
  R.Uehara, Y.Uno
- Journal Title
  
  IEICE Technical Report, SIG-COMP 104,55
  
  Pages: 53-60
- NAID
  110003178872
- Related Report
  2004 Annual Research Report
[Journal Article] Efficient algorithms for the longest path problem (2004)
- Author(s)
  R.Uehara, Y.Uno
- Journal Title
  
  Lecture Notes in Computer Science (Springer) 3341
  
  Pages: 871-883
- Related Report
  2004 Annual Research Report
[Journal Article] Mining communities in the Web (2004)
- Author(s)
  Y.Uno, F.Ueda
- Journal Title
  
  Proc.Int'l Symp.on Discrete Algorithms and Optimization
  
  Pages: 11-11
- Related Report
  2004 Annual Research Report
[Journal Article] On computing longest paths in small graph classes.
- Author(s)
  R.Uehara, Y.Uno
- Journal Title
  
  International Journal of Foundations of Computer Science. (to appear)
- NAID
  120001063250
- Description
  From "Final Research Report Summary (English)"
- Related Report
  2006 Final Research Report Summary
[Publications] R.Uehara, Y.Uno: "Longest paths in small graph classes" IEICE Technical Report, SIG-COMP. (to appear). (2004)
- Related Report
  2003 Annual Research Report
[Publications] Y.Uno, F.Ueda: "Mining communities in the Web" Proc.Int'l Symp.on Discrete Algorithms and Optimization. 11-11 (2004)
- Related Report
  2003 Annual Research Report
