Budget Amount *help |
¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2009: ¥650,000 (Direct Cost: ¥500,000、Indirect Cost: ¥150,000)
Fiscal Year 2008: ¥910,000 (Direct Cost: ¥700,000、Indirect Cost: ¥210,000)
|
Research Abstract |
Time-series data of keywords within blogs, news, and spam is analyzed in terms of auto-correlation to find periodic topics in these information sources. The information is collected from Japanese blog sites and news sites. Spam blogs are then separated from legitimate blogs using a spam filtering system. To find differences among the three sources, an analysis system is developed to find periodic topics based on auto-correlation. Employing this system, distribution periods of keywords within each information source, weekly keywords, and yearly keywords are extracted. In terms of distribution and keywords, characteristics of information sources are illustrated. According to the results, periodic blog topics are TV programs, hobbies, and social events. Periodic news topics are political and economical events. Periodic topics in spam are automatically copied-and-pasted email newsletters and affiliates.
|