Establishment of Reading Support System of Ancient Documents aided by Handwritten Character Recognition Technology

Research Project

Project/Area Number	14580432
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	Intelligent informatics
Research Institution	Osaka Electro-Communication University
Principal Investigator	UMEDA Michio Osaka Electro-Communication University, Faculty of Information Science & Arts, Professor, 総合情報学部, 教授 (30213490)
Project Period (FY)	2002 – 2004
Project Status	Completed (Fiscal Year 2004)
Budget Amount *help	¥3,300,000 (Direct Cost: ¥3,300,000) Fiscal Year 2004: ¥700,000 (Direct Cost: ¥700,000) Fiscal Year 2003: ¥1,100,000 (Direct Cost: ¥1,100,000) Fiscal Year 2002: ¥1,500,000 (Direct Cost: ¥1,500,000)
Keywords	ancient documents / character recognition / character segmentation / character spotting / feature extraction / neural network / document reading / expert system / 細線化
Research Abstract	This research presents a character segmentation and spotting method of ancient documents. In the segmentation method, the result of character recegnition process is utilized to cope with the cursive scripts and the mutual encroachment of characters which are peculiar to the ancient documents. In the spotting method, the previously designated characters are only extracted from the characters string. As an early segmentation, the characters string pattern is divided into the same cennected component by using the labelling processing. The area composed of the same component is surrounded with a rectangle and each character pattern is segmented each other by using the shape of rectangle such as height and width. Next, the individual character recognition technology is applied to the segmented pattern. From the recognition result, the rectangle failed in the segmentation is picked up and the re-segmantation is applied to the string contains this rectangle. Therefore, it is expected that the string is divided at the best position. On the other hand the neural network which corresponds to the previously designated character is prepared. The difference between input and output values of the network applied to the segmented pattern is calculated and the pattern which satisfies the condition is extracted as a spotting result. From the extraction experiment applied to 615 characters strings, the correct spotting rate of 94.22% was obtained to 5 designated characters by using the re-segmentation process, but the rate was 87.58% without the re-segmentation process. A reading support system of ancient documents for beginners was established by using the segmentation and spotting method.

Report

(4 results)

2004 Annual Research Report Final Research Report Summary
2003 Annual Research Report
2002 Annual Research Report

Research Products
(10 results)

All 2005 2004 2002 Other

All Journal Article (5 results) Publications (5 results)

[Journal Article] 頭部輪郭に着目した実時間歩行者計数システム2005
- Author(s)
  奥田隆史, 今井和正, 梅田三千雄
- Journal Title
  
  大阪電気通信大学ISC Technical Report ISC2004-02
  
  Pages: 13-19
- NAID
  40006690318
- Related Report
  2004 Annual Research Report
[Journal Article] 背景領域の細線化に基づく古文書の文字切り出しと認識2004
- Author(s)
  梅田三千雄, 橋本智広
- Journal Title
  
  情報処理学会論文誌 45
  
  Pages: 1188-1197
- NAID
  110002712166
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Character Segmentation and Recognition of Ancient Documents Based on Thinning of Background Region2004
- Author(s)
  Michio UMEDA, Tomohiro HASHIMOTO
- Journal Title
  
  IPSJ Journal Vol.45 No.4
  
  Pages: 1188-1197
- NAID
  110002712166
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] 認識処理を援用した文字切り出しによる古文書のキャラクタスポッティング2002
- Author(s)
  梅田三千雄, 橋本智広
- Journal Title
  
  電気学会論文誌 122-C
  
  Pages: 1876-1884
- NAID
  10010454224
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Character Spotting of Historical Documents Using Pattern Segmentation Aided by Recognition Processing2002
- Author(s)
  Michio UMEDA, Tomohiro HASHIMOTO
- Journal Title
  
  Trans.IEE of Japan Vol.122-C No.11
  
  Pages: 1876-1884
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Publications] 梅田三千雄, 橋本智広: "背景領域の細線化に基づく古文書の文字切り出しと認識"情報処理学会論文誌. 45.4(掲載予定). (2004)
- Related Report
  2003 Annual Research Report
[Publications] 三崎揮市, 本庄大介, 梅田三千雄: "筆者認識研究用手書き文字データベースと表示・解析ソフトの作成"日本鑑識科学技術学会誌. 7・1. 71-81 (2002)
- Related Report
  2002 Annual Research Report
[Publications] 梅田三千雄, 三好健生, 三崎揮市: "自己想起型ニューラルネットワークによる筆者識別と照合"電気学会論文誌(C). 122-C・11. 1869-1875 (2002)
- Related Report
  2002 Annual Research Report
[Publications] 梅田三千雄, 橋本智広: "認識処理した援用した文字切り出しによる古文書のキャラクタスポッティング"電気学会論文誌(C). 122-C・11. 1876-1884 (2002)
- Related Report
  2002 Annual Research Report
[Publications] 梅田三千雄, 本庄大介: "完全一致法を用いた手書き住所文字列の認識"情報処理学会論文誌. 44・1. 11-20 (2003)
- Related Report
  2002 Annual Research Report

Establishment of Reading Support System of Ancient Documents aided by Handwritten Character Recognition Technology

Principal Investigator

UMEDA Michio Osaka Electro-Communication University, Faculty of Information Science & Arts, Professor, 総合情報学部, 教授 (30213490)

¥3,300,000 (Direct Cost: ¥3,300,000)

Report

Research Products

[Journal Article] 頭部輪郭に着目した実時間歩行者計数システム2005

Author(s)

Journal Title

NAID

Related Report

[Journal Article] 背景領域の細線化に基づく古文書の文字切り出しと認識2004

Author(s)

Journal Title

NAID

Description

Related Report

[Journal Article] Character Segmentation and Recognition of Ancient Documents Based on Thinning of Background Region2004

Author(s)

Journal Title

NAID

Description

Related Report

[Journal Article] 認識処理を援用した文字切り出しによる古文書のキャラクタスポッティング2002

Author(s)

Journal Title

NAID

Description

Related Report

[Journal Article] Character Spotting of Historical Documents Using Pattern Segmentation Aided by Recognition Processing2002

Author(s)

Journal Title

Description

Related Report

[Publications] 梅田三千雄, 橋本智広: "背景領域の細線化に基づく古文書の文字切り出しと認識"情報処理学会論文誌. 45.4(掲載予定). (2004)

Related Report

[Publications] 三崎揮市, 本庄大介, 梅田三千雄: "筆者認識研究用手書き文字データベースと表示・解析ソフトの作成"日本鑑識科学技術学会誌. 7・1. 71-81 (2002)

Related Report

[Publications] 梅田三千雄, 三好健生, 三崎揮市: "自己想起型ニューラルネットワークによる筆者識別と照合"電気学会論文誌(C). 122-C・11. 1869-1875 (2002)

Related Report

[Publications] 梅田三千雄, 橋本智広: "認識処理した援用した文字切り出しによる古文書のキャラクタスポッティング"電気学会論文誌(C). 122-C・11. 1876-1884 (2002)

Related Report

[Publications] 梅田三千雄, 本庄大介: "完全一致法を用いた手書き住所文字列の認識"情報処理学会論文誌. 44・1. 11-20 (2003)

Related Report