A Study on Top-K algorithm for Large Unordered Tree Databases
Project/Area Number |
24650042
|
Research Category |
Grant-in-Aid for Challenging Exploratory Research
|
Allocation Type | Multi-year Fund |
Research Field |
Media informatics/Database
|
Research Institution | National Institute of Informatics |
Principal Investigator |
TAKASU Atsuhiro 国立情報学研究所, コンテンツ科学研究系, 教授 (90216648)
|
Project Period (FY) |
2012-04-01 – 2015-03-31
|
Project Status |
Completed (Fiscal Year 2014)
|
Budget Amount *help |
¥3,900,000 (Direct Cost: ¥3,000,000、Indirect Cost: ¥900,000)
Fiscal Year 2014: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2013: ¥1,560,000 (Direct Cost: ¥1,200,000、Indirect Cost: ¥360,000)
Fiscal Year 2012: ¥780,000 (Direct Cost: ¥600,000、Indirect Cost: ¥180,000)
|
Keywords | 木構造データ検索 / トップK検索 / インデキシング / 数式検索 |
Outline of Final Research Achievements |
Trees are used for representing and processing various data such as XML documents and mathematical formulas. We studied efficient tree matching and retrieval algorithms. This study focuses on the algorithms for unordered trees that generally require high computation cost. We first proposed an unordered tree matching algorithm that is especially effective for narrow trees and developed a program that can calculate the similarity of mid sized trees within reasonable processing time. For processing large tree databases, we developed efficient indices that can detect candidate trees from the database. For the case that tree structure is important for retrieval, we made a metric space-based index that converts each tree to a feature vector then makes a metric space for the vectors. Then, we apply a pivot-based indexing technique.
|
Report
(4 results)
Research Products
(6 results)