2011 Fiscal Year Final Research Report
Building a Native/Non-native English Language Technical Paper Corpus from Web and its Release and Application
Project/Area Number |
20320082
|
Research Category |
Grant-in-Aid for Scientific Research (B)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Foreign language education
|
Research Institution | Kyushu University |
Principal Investigator |
TOMIURA Yoichi Kyushu University, Faculty of Information Science and Electrical Engineering, Professor (10217523)
|
Co-Investigator(Kenkyū-buntansha) |
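TANAKA Shosaku Ritsumeikan University, College of Letters, Associate Professor (00325549)
GOTO Kazuaki Setsunan University, Faculty of Foreign Studies, Lecturer (90397662)
HAYAMA Megumi Dokkyo University, Faculty of Foreign Languages, Associate Professor (60409555)
ANDO Nahoko Kyushu University, Graduate School, Faculty of Law, Research Fellow (50380655)
SHIBATA Masahiro Kyushu University, Research Institute for Information Technology, Academic Researcher (00452813)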
|
Project Period (FY) |
2008 – 2011
|
Keywords | Corpus / Web / English quality assessment / Hypothesis testing / English writing support / English education / Copyright |
Research Abstract |
We developed a method for collecting English-language technical papers from personal web pages using a web search engine, and a statistical method for estimating the English quality of a document based on the characteristics of its part-of-speech sequences. Using these methods, we then developed a system for building a large-scale corpus of English-language technical papers from the Web, in which each paper is annotated with information about its English quality. We also investigated copyright issues and the points to consider when building a corpus from the Web and releasing it.
|
Research Products
(13 results)