2004 Fiscal Year Final Research Report Summary

Analysis of the Relationship between Proper nouns in Large Scale Corpus

Research Project

Project/Area Number 15500090
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type: Single-year Grants
Section: General
Research Field: Intelligent informatics
Research Institution: Toyohashi University of Technology

Principal Investigator

UMEMURA Kyoji  Toyohashi University of Technology, Faculty of Engineering, Information and Computer Sciences, Professor (80273324)

Project Period (FY) 2003 – 2004
Keywords: Statistical Analysis / Support Vector Machine / Medical System / Synonym
Research Abstract

In the first year, we built a computer cluster from parts and developed a specialized software package for frequency analysis. Although most of this work combines existing results, it gave us a powerful environment for analyzing the corpus. In the second year, we used a support vector machine (SVM) to detect keywords in the corpus. The input to the SVM is a set of statistical values computed for many strings, and the SVM judges whether each string is a keyword or not. Since this method does not use any kind of dictionary, the identical program works for both Japanese and Chinese. It is a notable result that keywords can be extracted without any dictionary: all we need are samples of keywords in each language. We also applied our environment to the analysis of disease names in a medical information system. The data in the system consist of 7 years of medical records. Without our environment, it would have been very difficult to analyze the data and obtain synonyms of disease names from it.
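The dictionary-free keyword detection described above can be illustrated with a minimal sketch. It is not the project's software. The abstract does not say which statistics were fed to the SVM, so the three features used here (log term frequency, log document frequency, and the log count of documents containing the string at least twice) are assumptions chosen for illustration. The function names `string_stats`, `train_linear_svm`, and `decision` are hypothetical, and the trainer is a plain subgradient-descent linear SVM standing in for whatever SVM package was actually used. Because the statistics are computed on raw character strings, nothing in the code depends on the language of the text.

```python
import math


def string_stats(s, docs):
    """Return illustrative statistics for string s over a list of documents.

    Features (assumed for this sketch): log(1 + tf), log(1 + df), log(1 + df2),
    where df2 counts the documents that contain s at least twice.
    """
    counts = [d.count(s) for d in docs]  # non-overlapping occurrences per document
    tf = sum(counts)
    df = sum(1 for c in counts if c >= 1)
    df2 = sum(1 for c in counts if c >= 2)
    return [math.log1p(tf), math.log1p(df), math.log1p(df2)]


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Fit a linear SVM by subgradient descent on the L2-regularized hinge loss.

    X is a list of feature vectors and y a list of labels in {+1, -1}.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            w = [wj * (1.0 - lr * lam) for wj in w]  # regularization step
            if margin < 1.0:  # hinge loss is active for this sample
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def decision(w, b, x):
    """Signed score; a positive value means 'keyword' in this sketch."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


if __name__ == "__main__":
    # Toy corpus and labeled samples, made up purely for illustration.
    docs = [
        "support vector machine and support vector regression",
        "the support vector machine is a classifier",
        "the corpus and the dictionary and the cluster",
        "a corpus is a large set of documents",
    ]
    samples = {"support vector": 1, "corpus": 1, "the ": -1, " a": -1}
    X = [string_stats(s, docs) for s in samples]
    y = list(samples.values())
    w, b = train_linear_svm(X, y)
    for candidate in ["vector machine", " and "]:
        print(candidate, decision(w, b, string_stats(candidate, docs)))
```

The only language-specific input is the set of labeled sample strings, which matches the abstract's remark that samples of keywords in each language are all that is needed. On a corpus this small the scores are not meaningful, so no expected output is given.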

  • Research Products

    (6 results)

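  • [Journal Article] Discovery of Related Disease Names by Data Mining of a Medical Information System (2005)

    • Author(s)
      Pattamon, Umemura
    • Journal Title

      IPSJ Programming Symposium (oral presentation)

      Pages: 6

    • Description
      From the "Summary of Research Results Report (Japanese)"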
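  • [Journal Article] A Similarity Measure for Estimating One-to-Many Relations When Frequencies Differ Greatly (2005)

    • Author(s)
      Okabe, Umemura
    • Journal Title

      IPSJ Informatics Symposium 2005 (oral presentation)

      Pages: 8

    • Description
      From the "Summary of Research Results Report (Japanese)"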
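  • [Journal Article] Keyword Estimation Using SVM and Generalized Document Frequency (2004)

    • Author(s)
      Ogata, Terao, Umemura
    • Journal Title

      Workshop on Named Entities and Technical Term Extraction, held with NLP2004, the 10th Annual Meeting of the Association for Natural Language Processing (oral presentation)

      Pages: 4

    • Description
      From the "Summary of Research Results Report (Japanese)"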
  • [Journal Article] Japanese Multiword Extraction using SVM and Adaptation (2004)

    • Author(s)
      T.Ogata, K.Terao, K.Umemura
    • Journal Title

      LREC-2004 Workshop on Methodologies and Evaluation of Multiword Units in Real-world Applications (oral presentation)

      Pages: 4

    • Description
      From the "Summary of Research Results Report (Japanese)"
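  • [Journal Article] Technical Term Extraction Using the Degree of Repetition of Bigrams (2004)

    • Author(s)
      Nakase, Umemura
    • Journal Title

      46th Meeting of the Special Interest Group on Digital Documents, Vol. 2004, No. 97

      Pages: 6

    • Description
      From the "Summary of Research Results Report (Japanese)"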
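  • [Journal Article] A Counting Method for Generalized Document Frequency on Large-Scale Corpora (2003)

    • Author(s)
      Terao Kenichiro, Umemura Kyoji
    • Journal Title

      IPSJ Summer Programming Symposium (oral presentation)

      Pages: 12

    • Description
      From the "Summary of Research Results Report (Japanese)"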

Published: 2006-07-11  