• Search Research Projects
  • Search Researchers
  • How to Use
  1. Back to project page

2004 Fiscal Year Final Research Report Summary

Improvement and performance evaluation of the mathematical formula recognition method for digitalization of mathematical journals

Research Project

Project/Area Number 14580446
Research Category

Grant-in-Aid for Scientific Research (C)

Allocation TypeSingle-year Grants
Section一般
Research Field 情報システム学(含情報図書館学)
Research InstitutionShinshu University

Principal Investigator

OKAMOTO Masayuki  Shinshu University, Department of Information Engineering, Professor, 工学部, 教授 (50109196)

Co-Investigator(Kenkyū-buntansha) SUZUKI Masakazu  Kyushu University, Graduate School of Mathematics, Professor, 大学院・数理学研究院, 教授 (20112302)
Project Period (FY) 2002 – 2004
KeywordsMathematical formula Recognition / Document Image Processing / Character Recognition / Pattern Recognition
Research Abstract

This research project aimed improvement and performance evaluation of the mathematical formula recognition system which has been developed in our laboratory. Automatic recognition of mathematical formula plays an important roles in digitization of scientific or engineering documents. But current OCR systems can not deal with mathematical formulas due to their two dimensional layout of characters or symbols.
We have collaborated with Professor Michler of the University of Essen, Germany, on the project of "Retro-digitalization of mathematical journals, and their integration searchable digital libraries". In this project, we developed a mathematical formula recognition system. This time, we improved this system in order to deal with the problems such as wide variety of formula types, low printing quality, and touching or separated characters and symbols. To evaluate the recognition performance, two kinds of mathematical journals were scanned and a Ground-Truth of formula images were created. This Ground-Truth includes 21472 formula images. The results of performance evaluation with respect to the recognition of symbols and structures are 99.4% and 99.09% respectively, This results show the potential of OCR which can convert scientific documents into electronic forms.

  • Research Products

    (12 results)

All 2005 2003 2002

All Journal Article (12 results)

  • [Journal Article] 大量の印刷数式画像を用いた数式認識システムの性能評価2005

    • Author(s)
      北原卓, 仲正幸, 岡本正行
    • Journal Title

      電子情報通信学会技術研究報告 PRMU2004-212-230

      Pages: 31-36

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] 英文数学文書の正解付き文字・記号画像データベース2005

    • Author(s)
      野村明弘, 内田誠一, 鈴木昌和
    • Journal Title

      電子情報通信学会技術研究報告 PRMU2004-212-230

      Pages: 37-42

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Performance Evaluation of a Mathematical Formula Recognition System with a Large Scale of Printed Formula Images2005

    • Author(s)
      T.Kitahara
    • Journal Title

      IEICE Technical Report PRMU2004-212-230

      Pages: 31-36

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] A Ground-Truthed Mathematical Character and Symbol Image Database2005

    • Author(s)
      A.Nomura
    • Journal Title

      IEICE Technical Report PRMU2004-212-230

      Pages: 37-42

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Detection and Segmentation of Touching Characters in Mathematical Expressions2003

    • Author(s)
      A.Nomura, K.Michishita S.Uchida, M.Suzuki
    • Journal Title

      Proceedings of ICDAR2003

      Pages: 126-130

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Detection of Matrices and Segmentation of Matrix Elements in Scanned Images of Scientific Documents2003

    • Author(s)
      T.Kanahori, M.Suzuki
    • Journal Title

      Proceedings of ICDAR2003

      Pages: 433-437

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] 数式認識システムについての一考察2003

    • Author(s)
      中塚 翼, 仲正幸, 岡本正行
    • Journal Title

      「科学情報の自動処理とその応用をめぐる諸問題」研究集会資料

      Pages: 30-33

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Detection and Segmentation of Touching Characters in Mathematical Expressions2003

    • Author(s)
      A.Nomura
    • Journal Title

      Proceedings of ICDAR2003

      Pages: 126-130

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] Detection of Matrices and Segmentation of Matrix Elements in Scanned Images of Scientific Documents2003

    • Author(s)
      T.Kanahori
    • Journal Title

      Proceedings of ICDAR2003

      Pages: 433-437

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] A Discussion on Mathematical Formula Recognition System2003

    • Author(s)
      T.Nakatsuka
    • Journal Title

      Report on Problems on Automatic Processing of Scientific Information and Its Applications

      Pages: 30-33

    • Description
      「研究成果報告書概要(欧文)」より
  • [Journal Article] 数式認識性能評価用データベースの作成2002

    • Author(s)
      中塚 翼, 仲正幸, 岡本正行
    • Journal Title

      科学技術分野における電子的情報処理に関する研究集会資料

      Pages: 11-13

    • Description
      「研究成果報告書概要(和文)」より
  • [Journal Article] Ground Truth for Performance Evaluation of Mathematical Formula Recognition2002

    • Author(s)
      T.Nakatsuka
    • Journal Title

      Report on Electronic Information Processing in the Scientific and Engineering Field

      Pages: 11-13

    • Description
      「研究成果報告書概要(欧文)」より

URL: 

Published: 2006-07-11  

Information User Guide FAQ News Terms of Use Attribution of KAKENHI

Powered by NII kakenhi