
2011 Fiscal Year Final Research Report

Knowledge Discovery from Numbers in Text

Research Project

Project/Area Number 22700137
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation Type Single-year Grants
Research Field Intelligent informatics
Research Institution The University of Tokyo

Principal Investigator

YOSHIDA Minoru  The University of Tokyo, Information Technology Center, Assistant Professor (40361688)

Project Period (FY) 2010 – 2011
Keywords Natural language processing / Numerical information / Text mining / Suffix arrays
Research Abstract

We studied methods for processing numbers written in text in order to discover relations between words and numbers. We indexed texts using suffix arrays augmented with functions that search digit strings as numbers, so that queries can include numeric ranges. The search runs in reasonable time even on large texts, which enabled us to obtain relations between words and numbers from such texts interactively. We also studied methods for mining texts that contain many numbers.
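
The kind of query described in the abstract can be illustrated with a small sketch. It is not the project's implementation: the function names (`build_suffix_array`, `find_prefix`, `build_number_index`, `search`), the naive suffix-array construction, and the separate value-sorted number index are all assumptions made for illustration. The sketch finds every occurrence of a word with a binary search over a suffix array, then keeps only the occurrences that are immediately followed by a number inside a given range.

```python
import bisect
import re

NUMBER = re.compile(r"\d+(?:\.\d+)?")  # digit strings, optionally with a decimal part


def build_suffix_array(text):
    # Naive construction (sorts whole suffixes); adequate only for toy inputs.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_prefix(text, sa, pattern):
    """Return the sorted start positions of all occurrences of `pattern`."""
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-character prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start, hi = lo, len(sa)
    while lo < hi:  # first suffix whose m-character prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


def build_number_index(text):
    # (value, position) pairs sorted by numeric value, not by digit string,
    # so that "95" sorts before "120".
    return sorted((float(m.group()), m.start()) for m in NUMBER.finditer(text))


def search(text, sa, num_index, prefix, low, high):
    """Numbers in [low, high] that directly follow an occurrence of `prefix`."""
    starts = {pos + len(prefix) for pos in find_prefix(text, sa, prefix)}
    values = [v for v, _ in num_index]
    left = bisect.bisect_left(values, low)
    right = bisect.bisect_right(values, high)
    return [(v, p) for v, p in num_index[left:right] if p in starts]


text = "price 120 yen. price 95 yen. weight 120 g. price 300 yen."
sa = build_suffix_array(text)
num_index = build_number_index(text)
print(search(text, sa, num_index, "price ", 100, 200))  # [(120.0, 6)]
```

Sorting the number index by numeric value is what lets a range such as 100 to 200 be answered with two binary searches; a plain suffix array orders digit strings lexicographically, which would place "120" before "95".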

  • Research Products

    (10 results)


Journal Article (4 results) (of which Peer Reviewed: 3 results) Presentation (5 results) Book (1 result)

  • [Journal Article] Person Name Disambiguation Applying Two-Stage Clustering to Word Weighting (2010)

    • Author(s)
      吉田稔、池田雅紀、小野真吾、佐藤一誠、中川裕志
    • Journal Title

      Journal of the Database Society of Japan (日本データベース学会論文誌)

      Volume: Vol.9, No.2 Pages: 19-24

    • Peer Reviewed
  • [Journal Article] Applications of Text Mining (2010)

    • Author(s)
      吉田稔, 中川裕志
    • Journal Title

      The Journal of Information Science and Technology Association (情報の科学と技術)

      Volume: Vol.60, No.6 Pages: 230-235

  • [Journal Article] Person Name Disambiguation by Bootstrapping (2010)

    • Author(s)
      Minoru Yoshida, Masaki Ikeda, Shingo Ono, Issei Sato, and Hiroshi Nakagawa
    • Journal Title

      Proceedings of SIGIR-2010

      Pages: 10-17

    • Peer Reviewed
  • [Journal Article] Mining Numbers in Text Using Suffix Arrays and Clustering Based on Dirichlet Process Mixture Models (2010)

    • Author(s)
      Minoru Yoshida, Issei Sato, Hiroshi Nakagawa, Akira Terada
    • Journal Title

      Proceedings of PAKDD-2010

      Pages: 230-237

    • Peer Reviewed
  • [Presentation] Predicting Common Cold Epidemics Using Social Media (2012)

    • Author(s)
      谷田和章,荒牧英治,佐藤一誠,吉田稔,中川裕志
    • Organizer
      The 18th Annual Meeting of the Association for Natural Language Processing
    • Place of Presentation
      Hiroshima
    • Year and Date
      2012-03-15
  • [Presentation] An Attempt at Supporting Equipment Anomaly Diagnosis with Text Mining (2012)

    • Author(s)
      吉田稔,中川裕志,渋谷久恵,前田俊二
    • Organizer
      The 4th Forum on Data Engineering and Information Management
    • Place of Presentation
      Kobe
    • Year and Date
      2012-03-04
  • [Presentation] An Attempt at Predicting Trading Volume by Clustering News Articles (2011)

    • Author(s)
      吉田稔,中川裕志,石田智也,中嶋啓浩,松井藤五郎,和泉潔,池田翔,本多隆虎
    • Organizer
      The 25th Annual Conference of the Japanese Society for Artificial Intelligence
    • Place of Presentation
      Morioka
    • Year and Date
      2011-06-02
  • [Presentation] Web People Search: Person Name Disambiguation and Other Problems (Tutorial) (2010)

    • Author(s)
      Minoru Yoshida, Hiroshi Nakagawa
    • Organizer
      The 2nd Asian Conference on Machine Learning (ACML 2010)
    • Year and Date
      2010-11-08
  • [Presentation] ITC-UT: Tweet Categorization by Query Categorization for On-line Reputation Management (2010)

    • Author(s)
      Minoru Yoshida, Shin Matsushima, Shingo Ono, Hiroshi Nakagawa
    • Organizer
      WePS-3, CLEF 2010 Labs
    • Year and Date
      2010-09-23
  • [Book] Information Extraction from the Internet (2011)

    • Author(s)
      Minoru Yoshida, Hiroshi Nakagawa, Akira Terada
    • Chapter
      On-demand Synonym Extraction Using Suffix Arrays
    • Pages
      73-87


Published: 2013-07-31  
