Grant-in-Aid for Scientific Research (B)
|Allocation Type||Single-year Grants |
|Research Institution||KYUSHU UNIVERSITY |
SUZUKI Masakazu Kyushu University, Faculty of Mathematics, Professor, 大学院・数理学研究院, 教授 (20112302)
OKAMOTO Masayuki Shinshu University, Faculty of Engineering, Professor, 工学部, 教授 (50109196)
UCHIDA Seiichi Kyushu University, Faculty of Information Systems, Associate Professor, 大学院・システム情報科学研究院, 助教授 (70315125)
TAMARI Fumikazu Fukuoka University of Education, Faculty of Education, Professor, 教育学部, 教授 (70036937)
FUJIMOTO Mitsushi Fukuoka University of Education, Faculty of Education, Associate Professor, 教育学部, 助教授 (20270241)
KANAHORI Toshihiro Tsukuba University of Technology, Gakunai Kyodo Riyou Shisetsu, Associate Professor, 共同利用施設等, 助教授 (00352568)
大武 信之 筑波技術短期大学, 教育方法開発センター, 助教授 (10223851)
黄瀬 浩一 大阪府立, 工学部, 助教授 (80224939)
|Project Period (FY)
2002 – 2005
Completed (Fiscal Year 2005)
|Budget Amount *help
¥14,400,000 (Direct Cost: ¥14,400,000)
Fiscal Year 2005: ¥2,900,000 (Direct Cost: ¥2,900,000)
Fiscal Year 2004: ¥2,800,000 (Direct Cost: ¥2,800,000)
Fiscal Year 2003: ¥2,800,000 (Direct Cost: ¥2,800,000)
Fiscal Year 2002: ¥5,900,000 (Direct Cost: ¥5,900,000)
|Keywords||Formula recognition / Math recognition / Structure analysis / Optical Character recognition / Document digitization / Assist technology / Visually Impaired|
1.Throughout the research period, we build a ground-truthed database of page images of mathematical articles. Using the database, we developed and improved the math symbol recognition engine and the segmentation method of text areas and math expression areas. A part of the database is now open to public on our web site.
2.To improve the math structure analysis method base on virtual link network developed in the previous research, we adjusted the cost of the links of the network in detail using the database above. On the other hand, we introduced a notion of "center band", calculated robustly against mis-recognition of characters, to stabilize considerably the structure analysis of math expressions.
3.We developed a method to segment touched characters in math expressions using the matching of sub-patterns with other non-touched characters patterns in the same page. We also extended a framework used frequently to segment characters in text areas in a way adapted to math formulae images
4.We developed a method to recognize complicated matrices including repeat symbols or area symbols, using variable block pattern elements.
5.We investigated the method to detect bibliographic data and logical structure of math papers from the recognition results.
6.We finally studied the recognition of commutative diagrams in math papers and graphs of elementary functions in the figures of math texts as well. These are however still on the state of trial research.
7.A math document recognition software "Infty Reader" developed using the results of this research is available freely from the web site : http://www.inftyproject.org./