On the Chinese Character Full-text Retrieval System by WWW and its Implementation
Project/Area Number |
09610372
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | 一般 |
Research Field |
Asian history
|
Research Institution | Osaka International University for Women |
Principal Investigator |
OKETANI Ikuo Faculty of Human Sciences, Osaka International University for Women Professor, 人間科学部, 教授 (90169269)
|
Project Period (FY) |
1997 – 1998
|
Project Status |
Completed (Fiscal Year 1998)
|
Budget Amount *help |
¥2,100,000 (Direct Cost: ¥2,100,000)
Fiscal Year 1998: ¥300,000 (Direct Cost: ¥300,000)
Fiscal Year 1997: ¥1,800,000 (Direct Cost: ¥1,800,000)
|
Keywords | internet / KWIC (keyword in context) / Non-standard character (gaiji) processing / text-retrieval function / image-display function |
Research Abstract |
The aim of our study is to construct and make publicly available a Chinese character full-text retrieval system by using a personal computer with workstation and Windows environment. The documents employed in the study are : the Ryukyu Kahu Texts (Ryukyu Genealogy) (a private family document made by the Ryukyu government and an important record to study the structure and characters of the Ryukyu Kingdom) ; the Ryukyu-koku Hyojo-sho Monjo texts (The Ryukyu Kingdom Official Documents) (the documents made at the Hyojo-sho, the Ryukyu government supreme office for political and diplomatic decisions) ; the Oshima Hikkl and related Chinese documents. The following points were considered in constructing the database : 1. The construction of Chinese character full-text database and the image input of the original texts ; 2. The correlation of the text retrieval function (including KWIC display) and the image display function ; 3. The implementation of the non-standard character (gaiji) input funct
… More
ion using the non-standard character input method (parts of Chinese characters and specific tags) and the character-string retrieval function ; 4. The use of non-standard character font using e-Character (Unicode, 673 characters ; provided by Professor Tetsuya Katsumura, Institute for Research in Humanities, Kyoto University), and the creation of non-standard characters(288 characters) from parts of Chinese characters ; 5. The construction of a database of non-standard character attributes (radicals, the number of strokes of a radical, the total number of strokes of a character, Unicode number, Dai-kanwa code number, Chinese pronunciation of a character, etc) ; 6. The publication of a Chinese character full-text retrieval system on the internet, and the distribution of the database of non-standard character attributes and the non-standard character fonts (Gif form file) using the FTP function ; 7. The storage and analysis of log files of a retrieval word and the implementation of various forms of statistical processing of documents. The results of this study are accessible on the internet in the form of our text database and the image database of historical materials (URL : http : //www. okinawa. oiu. ac. jp/) Less
|
Report
(3 results)
Research Products
(15 results)