
2004 Fiscal Year Final Research Report Summary

Establishment of Reading Support System of Ancient Documents aided by Handwritten Character Recognition Technology

Research Project

Project/Area Number 14580432
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation Type Single-year Grants
Section General
Research Field Intelligent informatics
Research Institution Osaka Electro-Communication University

Principal Investigator

UMEDA Michio  Osaka Electro-Communication University, Faculty of Information Science & Arts, Professor (30213490)

Project Period (FY) 2002 – 2004
Keywords ancient documents / character recognition / character segmentation / character spotting / feature extraction / neural network / document reading / expert system
Research Abstract

This research presents a character segmentation and spotting method for ancient documents. The segmentation method uses the results of character recognition to cope with cursive script and the mutual encroachment of characters, both of which are peculiar to ancient documents. The spotting method extracts only previously designated characters from a character string. As an initial segmentation, the character-string pattern is divided into connected components by labelling. Each connected component is enclosed in a rectangle, and the character patterns are separated from one another using the shape of the rectangles, such as their height and width. Next, individual character recognition is applied to each segmented pattern. Based on the recognition results, rectangles for which segmentation failed are identified, and re-segmentation is applied to the string that contains them, so that the string is expected to be divided at the best positions. For spotting, a neural network corresponding to the previously designated character is prepared. The difference between the input and output values of the network, applied to a segmented pattern, is calculated, and patterns that satisfy the condition are extracted as spotting results. In an extraction experiment on 615 character strings with 5 designated characters, the correct spotting rate was 94.22% with the re-segmentation process, compared with 87.58% without it. A reading support system for ancient documents, intended for beginners, was built using the segmentation and spotting methods.
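The initial segmentation and the spotting test described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the vertical-gap merging rule, the `max_gap` parameter, the mean-squared-error measure, and the "error at or below a threshold" condition are all assumptions made for illustration; the abstract states only that rectangle height and width are used and that an unspecified condition on the input/output difference is checked. The `reconstruct` callable is a hypothetical stand-in for the trained network.

```python
from collections import deque

def label_components(img):
    """Label 8-connected components of a binary image (rows of 0/1).

    Returns one bounding rectangle per component as
    (top, left, bottom, right), with inclusive coordinates.
    """
    h = len(img)
    w = len(img[0]) if h else 0
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if not img[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes

def merge_boxes(boxes, max_gap=1):
    """Merge rectangles that are vertically close (assumed rule).

    Assumes a single vertically written string. Two rectangles are
    merged when at most max_gap blank rows separate them.
    """
    merged = []
    for box in sorted(boxes):
        if merged and box[0] - merged[-1][2] - 1 <= max_gap:
            t, l, b, r = merged[-1]
            merged[-1] = (min(t, box[0]), min(l, box[1]),
                          max(b, box[2]), max(r, box[3]))
        else:
            merged.append(box)
    return merged

def spot(features, reconstruct, threshold):
    """Return indices of patterns whose mean squared input/output
    difference is at or below threshold (assumed condition)."""
    hits = []
    for i, x in enumerate(features):
        y = reconstruct(x)  # hypothetical network forward pass
        err = sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)
        if err <= threshold:
            hits.append(i)
    return hits

img = [
    [0, 1, 1, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0],
]
boxes = label_components(img)
candidates = merge_boxes(boxes, max_gap=1)
```

In this toy image the two upper strokes are separated by a single blank row and are merged into one candidate rectangle, while the lower stroke stays separate. In the method described above, rectangles whose recognition result indicates a failed segmentation would then be re-segmented; that step is not sketched here.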

  • Research Products

    (4 results)


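  • [Journal Article] Character Segmentation and Recognition of Ancient Documents Based on Thinning of Background Region (背景領域の細線化に基づく古文書の文字切り出しと認識) 2004

    • Author(s)
      Michio UMEDA, Tomohiro HASHIMOTO
    • Journal Title

      IPSJ Journal (情報処理学会論文誌) 45

      Pages: 1188-1197

    • Description
      From "Final Research Report Summary (Japanese)"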
  • [Journal Article] Character Segmentation and Recognition of Ancient Documents Based on Thinning of Background Region 2004

    • Author(s)
      Michio UMEDA, Tomohiro HASHIMOTO
    • Journal Title

      IPSJ Journal Vol.45 No.4

      Pages: 1188-1197

    • Description
      From "Final Research Report Summary (English)"
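  • [Journal Article] Character Spotting of Historical Documents Using Pattern Segmentation Aided by Recognition Processing (認識処理を援用した文字切り出しによる古文書のキャラクタスポッティング) 2002

    • Author(s)
      Michio UMEDA, Tomohiro HASHIMOTO
    • Journal Title

      Trans. IEE of Japan (電気学会論文誌) 122-C

      Pages: 1876-1884

    • Description
      From "Final Research Report Summary (Japanese)"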
  • [Journal Article] Character Spotting of Historical Documents Using Pattern Segmentation Aided by Recognition Processing 2002

    • Author(s)
      Michio UMEDA, Tomohiro HASHIMOTO
    • Journal Title

      Trans.IEE of Japan Vol.122-C No.11

      Pages: 1876-1884

    • Description
      From "Final Research Report Summary (English)"

Published: 2006-07-11  
