Budget Amount |
¥3,580,000 (Direct Cost: ¥3,100,000, Indirect Cost: ¥480,000)
Fiscal Year 2009: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000)
Fiscal Year 2008: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
Fiscal Year 2007: ¥1,500,000 (Direct Cost: ¥1,500,000)
|
Research Abstract |
The goal of this research project is to explore the potential for automatic Web page classification based on non-topical categories, in addition to topical categories. Two kinds of classification were explored. The first was classification by document type: a search engine was developed to automatically detect academic articles on the Web. PDF files were collected from the Web and classified using attributes such as the terms appearing in them. The second was the development and use of a new test collection for automatically labeling sentences with ten human values. Experimental results from this preliminary study appear promising and point to productive directions for future work.
|