Studies on Stream Mining in Web Archive
Project/Area Number |
19500098
|
Research Category |
Grant-in-Aid for Scientific Research (C)
|
Allocation Type | Single-year Grants |
Section | General |
Research Field |
Media informatics/Database
|
Research Institution | Nanzan University |
Principal Investigator |
KAWANO Hiroyuki Nanzan University, Faculty of Information Sciences and Engineering, Professor (70224813)
|
Project Period (FY) |
2007 – 2010
|
Project Status |
Completed (Fiscal Year 2010)
|
Budget Amount |
¥4,160,000 (Direct Cost: ¥3,200,000, Indirect Cost: ¥960,000)
Fiscal Year 2010: ¥650,000 (Direct Cost: ¥500,000, Indirect Cost: ¥150,000)
Fiscal Year 2009: ¥910,000 (Direct Cost: ¥700,000, Indirect Cost: ¥210,000)
Fiscal Year 2008: ¥1,430,000 (Direct Cost: ¥1,100,000, Indirect Cost: ¥330,000)
Fiscal Year 2007: ¥1,170,000 (Direct Cost: ¥900,000, Indirect Cost: ¥270,000)
|
Keywords | Digital archive / Content distribution / Reputation model / Web archive / Web crawling / Hierarchical storage / File format / Peer-to-peer system |
Research Abstract |
The size of web archives is increasing exponentially, and many national libraries and the IIPC (International Internet Preservation Consortium) are working to establish guidelines for the long-term preservation of digital content. In this research, from the viewpoint of data mining techniques for reputation models, we reconsider a growth model of storage volume in web archive systems. We discuss a basic architecture for a hierarchical storage system based on the characteristics of memory devices such as RAM, HDDs, magnetic tapes, and disks. We improve the file-moving algorithm by using file retrieval patterns and access frequencies.
|
Report
(6 results)
Research Products
(32 results)