
2013 Fiscal Year Final Research Report

Building Named Entity Recognizers by combining a large-scale lexicon and corpora

Research Project

Project/Area Number 23700159
Research Category

Grant-in-Aid for Young Scientists (B)

Allocation Type Multi-year Fund
Research Field Intelligent informatics
Research Institution Tohoku University

Principal Investigator

OKAZAKI Naoaki  Tohoku University, Graduate School of Information Sciences, Associate Professor (50601118)

Project Period (FY) 2011 – 2012
Keywords Natural language processing / Named entity recognition
Research Abstract

This research aims to build Named Entity Recognizers at low cost. A Named Entity Recognizer extracts mentions of entities or concepts belonging to specific semantic classes (e.g., product names and disease names) from text. To achieve this goal, the project addresses three challenges: (1) automatically acquiring training data in which mentions are annotated with semantic classes; (2) building Named Entity Recognizers from the automatically acquired training data; and (3) evaluating the resulting Named Entity Recognizers. We proposed a method that improves the quality of automatically acquired training data by using reference information in the dictionary, and demonstrated its effectiveness experimentally. We also proposed a method for mining context gazetteers, which are dependency paths that appear around expressions of the target semantic classes, and confirmed that they improve the accuracy of Named Entity Recognizers.
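
Challenge (1) in the abstract, automatic acquisition of annotated training data, can be illustrated with a minimal Python sketch. It assumes a simple greedy longest-match lookup of token spans in a lexicon and is not the project's actual method, which additionally uses reference information in the dictionary to improve the quality of the matches. The function name `annotate_with_lexicon`, the toy lexicon, and the BIO tag scheme are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lexicon: lower-cased token tuples mapped to a semantic class.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("breast", "cancer"): "DISEASE",
    ("cancer",): "DISEASE",
    ("aspirin",): "DRUG",
}


def annotate_with_lexicon(
    tokens: List[str], lexicon: Dict[Tuple[str, ...], str]
) -> List[str]:
    """Assign BIO tags to tokens by greedy longest-match lexicon lookup."""
    max_len = max((len(entry) for entry in lexicon), default=0)
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate span first so that "breast cancer"
        # wins over the shorter entry "cancer".
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            span = tuple(t.lower() for t in tokens[i:i + n])
            label = lexicon.get(span)
            if label is not None:
                tags[i] = "B-" + label
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + label
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return tags


tokens = "Aspirin may lower breast cancer risk".split()
tags = annotate_with_lexicon(tokens, LEXICON)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Plain lexicon matching of this kind tends to produce noisy labels, since ambiguous surface forms are tagged regardless of context. That noise is the problem the abstract's quality-improvement step is aimed at.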

  • Research Products

    (13 results)


Journal Article (6 results, of which Peer Reviewed: 5 results), Presentation (7 results)

  • [Journal Article] Named entity recognition with multiple segment representations (2013)

    • Author(s)
      Han-Cheol Cho, Naoaki Okazaki, Makoto Miwa, Jun'ichi Tsujii
    • Journal Title

      Information Processing & Management

      Volume: Vol.49, No.4 Pages: 954-965

    • DOI

      10.1016/j.ipm.2013.03.002

    • Peer Reviewed
  • [Journal Article] Learning Abbreviations from Chinese and English Terms by Modeling Non-local Information (2013)

    • Author(s)
      Xu Sun, Naoaki Okazaki, Junichi Tsujii, Houfeng Wang
    • Journal Title

      ACM Transactions on Asian Language Information Processing

      Volume: Vol.12, No.2 Pages: 5:1-5:17

    • DOI

      10.1145/2461316.2461317

    • Peer Reviewed
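  • [Journal Article] Collecting Misinformation Based on Correction Patterns and Analyzing Its Spread [訂正パターンに基づく誤情報の収集と拡散状況の分析] (2013)

    • Author(s)
      Keita Nabeshima, Kento Watanabe, Junta Mizuno, Naoaki Okazaki, Kentaro Inui
    • Journal Title

      Journal of Natural Language Processing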

      Volume: Vol.20, No.3 Pages: 461-484

    • DOI

      10.5715/jnlp.20.461

    • Peer Reviewed
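  • [Journal Article] Set Expansion Using Sibling Relations between Categories [カテゴリ間の兄弟関係を活用した集合拡張] (2013)

    • Author(s)
      Sho Takase, Naoaki Okazaki, Kentaro Inui
    • Journal Title

      Journal of Natural Language Processing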

      Volume: Vol.20, No.2 Pages: 273-296

    • DOI

      10.5715/jnlp.20.273

    • Peer Reviewed
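  • [Journal Article] Analysis by Language Processing: Analyzing Activity Reports of the Japan Dietetic Association [言語処理による分析 - 日本栄養士会活動報告の分析] (2012)

    • Author(s)
      Naoaki Okazaki, Keita Nabeshima, Kentaro Inui
    • Journal Title

      Journal of the Japan Dietetic Association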

      Volume: Vol.55, No.12 Pages: 6-8

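  • [Journal Article] A Simple and Fast Algorithm for Approximate String Matching with Set Similarity [集合間類似度に対する簡潔かつ高速な類似文字列検索アルゴリズム] (2011)

    • Author(s)
      Naoaki Okazaki, Jun'ichi Tsujii
    • Journal Title

      Journal of Natural Language Processing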

      Volume: Vol.18, No.2 Pages: 89-118

    • DOI

      10.5715/jnlp.18.89

    • Peer Reviewed
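  • [Presentation] Acquiring Place Name and Address Pairs Using the Structure of Web Documents [ウェブ文書の構造を利用した場所名・住所ペアの獲得] (2013)

    • Author(s)
      Takahiro Sato, Naoaki Okazaki, Kentaro Inui
    • Organizer
      The 27th Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2013)
    • Place of Presentation
      Toyama International Conference Center (Toyama Prefecture)
    • Year and Date
      2013-06-04 to 2013-06-07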
  • [Presentation] Inducing Context Gazetteers from Encyclopedic Database for Named Entity Recognition (2013)

    • Author(s)
      Han-Cheol Cho, Naoaki Okazaki, Kentaro Inui
    • Organizer
      Proceedings of the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), pp.378-389
    • Place of Presentation
      Gold Coast, Australia
    • Year and Date
      2013-04-14 to 2013-04-17
  • [Presentation] Exploiting Dependency Context Gazetteers for Named Entity Recognition (2013)

    • Author(s)
      Han-Cheol Cho, Naoaki Okazaki, Kentaro Inui
    • Organizer
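      The 19th Annual Meeting of the Association for Natural Language Processing (NLP2013), pp. 220-223
    • Place of Presentation
      Nagoya University (Aichi Prefecture)
    • Year and Date
      2013-03-13 to 2013-03-15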
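  • [Presentation] Toward Acquiring Relational Knowledge from Noun Categories [名詞カテゴリからの関係知識獲得に向けて] (2012)

    • Author(s)
      Sho Takase, Naoaki Okazaki, Kentaro Inui
    • Organizer
      NLP Wakate no Kai (Young Researcher Association for NLP Studies), 7th Symposium
    • Place of Presentation
      Tohoku University (Miyagi Prefecture)
    • Year and Date
      2012-09-03 to 2012-09-04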
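  • [Presentation] Set Expansion Using Hierarchical Relations between Semantic Categories [意味カテゴリの階層関係を活用した集合拡張] (2012)

    • Author(s)
      Sho Takase, Naoaki Okazaki, Kentaro Inui
    • Organizer
      The 18th Annual Meeting of the Association for Natural Language Processing (NLP2012), pp. 475-478
    • Place of Presentation
      Hiroshima City University (Hiroshima Prefecture)
    • Year and Date
      2012-03-14 to 2012-03-16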
  • [Presentation] Set Expansion using Sibling Relations between Semantic Categories (2012)

    • Author(s)
      Sho Takase, Naoaki Okazaki, Kentaro Inui
    • Organizer
      Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation (PACLIC 26), pp. 567-576
    • Place of Presentation
      Bali, Indonesia
    • Year and Date
      2012-11-09
  • [Presentation] Automatic Acquisition of Huge Training Data for Bio-Medical Named Entity Recognition (2011)

    • Author(s)
      Yu Usami, Han-Cheol Cho, Naoaki Okazaki, Jun'ichi Tsujii
    • Organizer
      Proceedings of BioNLP 2011 Workshop, pp. 65-73
    • Place of Presentation
      Portland, Oregon, USA
    • Year and Date
      2011-06-23


Published: 2015-06-25  
