Grant-in-Aid for International Scientific Research.
|Section||Joint Research .|
|Research Institution||Hiroshima City University|
REN Fuji Faculty of Information Sciences, Hiroshima City University, Associate Professor, 情報科学部, 助教授 (20264947)
ZHENSHENG Lu 清華大学, 中文学部, 教授
YOULIANG Jia 大連理工大学, 計算機学部, 教授
JIANYUN Nie モントリオール大学, 情報研究学部, 助教授
MARK Davis ニューメキシコ州立大学, CRL, 研究員
KITAKAMI Hajime Faculty of Information Sciences, Hiroshima City University Professor, 情報科学部, 教授 (50234240)
NIE Jianyun Informatique et Recherche oprationnelle, Universite de Montreal, Associate Profe, 情報研究学部, 助教授
REN Fuchun 華北電力大学, 工学部, 助教授
目良 和也 広島市立大学, 情報科学部, 助手 (50285425)
CHEN Chunxiang Information Certer, Hiroshima Prefetural University, Associate Professor, 情報教育センター, 助教授 (90264944)
LUO Zhenshen 清華大学, 中文学部, 教授
DAVIS Mark CRL,New Mexico State University, Researcher, CRL, 研究員
LUO Zhensheng Department of Chinese Lenguage and Literature, Tsinghua University, Professor
|Project Fiscal Year
1997 – 1998
Completed(Fiscal Year 1998)
|Budget Amount *help
¥9,300,000 (Direct Cost : ¥9,300,000)
Fiscal Year 1998 : ¥4,100,000 (Direct Cost : ¥4,100,000)
Fiscal Year 1997 : ¥5,200,000 (Direct Cost : ¥5,200,000)
|Keywords||Corpus / Chinese / Japanese / English / Machine Translation / Natural Language Processing / Information Retrival / knowledge acquisition / コーパス / 中国語 / 日本語 / 英語 / 自然言語処理 / 機械翻訳 / 情報検索 / 知識獲得 / 統計情報 / 辞書|
In this project, We studied Japanese-Chinese-English Translation Corpus and Its Application.
(l)We have proposed an algorithm for automatically aligning Chinese and Japanese parallel Corpora. The algorithm initially attempts to align the Chinese and Japanese sentences using measurements of the differences between sentence lengths in the two documents.
(2)We have proposed a new concept called "Sensitive Word", and described the segmentation problem and the impact of the sensitive word on natural language processing. We have shown the result of our examination.
(3)A crucial problem in rule-based machine translation is the acquisition of translation knowledge. Many studies have been conducted for automatic acquisition in the past, but they require a great deal of annotated examples. In this project, we have developed a semi-automatic mechanism to support knowledge acquisition from translation examples in Japanese-Chinese environment.
(4) We have proposed a relaxed segmentation process for Chinese which extracts not noly the longest words as one usually does, but also all the short words implied. Special rules are also designed in order to recongnize and standardize sepcial words such as proper names and nominal pre-determiners. It is shown that IR based on this segmentation leads to a higher effectiveness than bigrams.
(5) We have taken a new hybrid multi-engine approach to machine translation, which can take the advantages of the previously proposed methods and get rid of their disadvantages.
(6) We will publisize our "Translation corpora" and "Chinese dictionary" on the Web after we improve the multi-lingual envionment.