• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to previous page

Search engine using automatic web page ranking procedure and automatic classification

Research Project

Project/Area Number 12558038
Research Category

Grant-in-Aid for Scientific Research (B)

Allocation TypeSingle-year Grants
Section展開研究
Research Field 情報システム学(含情報図書館学)
Research InstitutionKeio University

Principal Investigator

UEDA Shuichi  Keio University, Faculty of Letters, Professor, 文学部, 教授 (50134218)

Co-Investigator(Kenkyū-buntansha) WATANABE Michiko  Railway Technical Research Institute, Technical Support Center, Researcher, 技術支援部, 技師
KUNO Takashi  Sakushin Gakuin University, Women's College, lecturer, 女子短期大学部, 専任講師 (30310212)
AGATA Teru  Asia University, Faculty of International Relations, lecturer, 国際関係学部, 専任講師 (80306505)
Project Period (FY) 2000 – 2001
Project Status Completed (Fiscal Year 2001)
Budget Amount *help
¥5,600,000 (Direct Cost: ¥5,600,000)
Fiscal Year 2001: ¥2,900,000 (Direct Cost: ¥2,900,000)
Fiscal Year 2000: ¥2,700,000 (Direct Cost: ¥2,700,000)
KeywordsWorld Wide Web / Search engine / Web page / Automatic Classification / サーチエンジン / 自動格付け
Research Abstract

The amount of World Wide Web (WWW) pages has grown dramatically over the last few years with the growth of internet. It is estimated that there are currently over 3,200 million WWW pages. In order to satisfy the requirement for new search engines for WWW pages, it is necessary to develop automatic mechanisms for the deletion of less important pages, judgment of usefulness of pages, and subject classification for Web pages.
The first year, the automatic judging procedure for page type was developed. Web page were typed manually to standard pages, top pages, contents pages, bulletin boards, chat pages, link pages, diary pages, and input forms. The automatic judgment method based on quantitative analysis of judged pages was developed. The algorithm of a type judgment was based on the frequency of appearance of HTML tags, page length or words in titles and file names obtained from Web pages in Japanese.
In the second year, the total amount of a Web page was estimated, and automatic judgment system of useful Web pages and automatic classification system were developed. The algorithm of automatic judgment system is based on the morphological analysis of pages which obtained the high score by the judgment of "being good sources of information". In order to classify WWW pages in Japanese by subject, we present two classification algorithms based on relative frequencies of terms and information retrieval technique using vector-space model.
These methods are included in the search engine and it participated in the 2nd NTCIR workshop Web task.

Report

(3 results)
  • 2001 Annual Research Report   Final Research Report Summary
  • 2000 Annual Research Report
  • Research Products

    (22 results)

All Other

All Publications (22 results)

  • [Publications] 上田修一: "情報源としてのWWW"メディア・コミュニケーション. 51. 42-50 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 安形 輝: "WWW調査におけるサンプル集合の収集法"三田図書館・情報学会研究大会発表論文集. 2000. 37-40 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 久野高志他: "Webページのタイプ判定法"日本図書館情報学会研究大会発表要綱. 2000. 55-58 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 上田修一他: "Webページ評価の視点と基準"三田図書館・情報学会研究大会発表論文集. 2000. 33-36 (2000)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 安形輝他: "World Wide Webの測定:Webページ推定手法の比較"三田図書館・情報学会研究大会発表論文集. 2001. 17-20 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 久野高志他: "情報検索システムとしてみたサーチエンジン"日本図書館情報学会研究大会発表要綱. 2001. 47-50 (2001)

    • Description
      「研究成果報告書概要(和文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] UEDA, Shuichi: "WWW as information resources"Media Communication. 51 (2). 42-50 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] AGATA, Teru, et al: "Sampling Mathods for WWW survey"Proceedings of MITA Society for Library and Information Science 2000. 37-40 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] KUNO, Takashi, et al.: "Clustering technique of Web page types"Proceedings of Japan Society for Library and Information Science 2000. 55-58 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] UEDA, Shuichi et al.: "Viewpoints for Evaluation of Web pages"Proceedings of MITA Society for Library and Information Science 2000. 33-36 (2000)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] AGATA, Teru, et al.: "Measurement of World Wide Web"Proceedings of MITA Society for Library and Information Science 2001. 17-20 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] KUNO, Takashi, et al.: "Search Engine in Information Retrieval"Proceedings of Japan Society for Library and Information Science 2001. 47-50 (2001)

    • Description
      「研究成果報告書概要(欧文)」より
    • Related Report
      2001 Final Research Report Summary
  • [Publications] 上田修一: "情報源としてのWWW"メディア・コミュニケーション. 51. 42-50 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 安形 輝: "WWW調査におけるサンプル集合の収集法"三田図書館・情報学会研究大会発表論文集. 2000. 37-40 (2000)

    • Related Report
      2001 Annual Research Report
  • [Publications] 久野高志他: "Webページのタイプ判定法"日本図書館情報学会研究大会発表要綱. 2000. 55-58 (2000)

    • Related Report
      2001 Annual Research Report
  • [Publications] 上田修一他: "Webページ評価の視点と基準"三田図書館・情報学会研究大会発表論文集. 2000. 33-36 (2000)

    • Related Report
      2001 Annual Research Report
  • [Publications] 安形 輝他: "World Wide Webの測定:Webページ推定手法の比較"三田図書館・情報学会研究大会発表論文集. 2001. 17-20 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 久野高志他: "情報検索システムとしてみたサーチエンジン"日本図書館情報学会研究大会発表要綱. 2001. 47-50 (2001)

    • Related Report
      2001 Annual Research Report
  • [Publications] 上田修一: "情報源としてのWWW"メディア・コミュニケーション. 50. 42-50 (2001)

    • Related Report
      2000 Annual Research Report
  • [Publications] 安形輝: "WWW調査におけるサンプル集合の収集法"三田図書館・情報学会研究大会発表論文集. 2000. 37-40 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 久野高志 他: "Webページのタイプ判定法"日本図書館情報学会研究大会発表要綱. 2000. 55-58 (2000)

    • Related Report
      2000 Annual Research Report
  • [Publications] 上田修一 他: "Webページ評価の視点と基準"三田図書館・情報学会研究大会発表論文集. 2000. 33-36 (2000)

    • Related Report
      2000 Annual Research Report

URL: 

Published: 2000-04-01   Modified: 2016-04-21  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi