2015 Fiscal Year Final Research Report

Analysis of Repetition Structure in Huge Sequences

Project/Area Number 25280079
Research Category Grant-in-Aid for Scientific Research (B)

Allocation Type Partial Multi-year Fund
Section General
Research Field Intelligent informatics
Research Institution Hokkaido University

Principal Investigator

NAKAMURA Atsuyoshi  Hokkaido University, Graduate School of Information Science and Technology, Associate Professor (50344487)

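Co-Investigator (Kenkyū-buntansha) KUDO Mineichi  Hokkaido University, Graduate School of Information Science and Technology, Professor (60205101)
TAKIGAWA Ichigaku  Hokkaido University, Graduate School of Information Science and Technology, Associate Professor (10374597)
Co-Investigator (Renkei-kenkyūsha) MAMITSUKA Hiroshi  Kyoto University, Institute for Chemical Research, Professor (00346107)
KIDA Takuya  Hokkaido University, Graduate School of Information Science and Technology, Associate Professor (70343316)
OKUBO Yoshiaki  Hokkaido University, Graduate School of Information Science and Technology, Assistant Professor (40271639)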
Project Period (FY) 2013-04-01 – 2016-03-31
Keywords Knowledge discovery and data mining / Sequence mining / Genome information processing / Frequent pattern mining
Outline of Final Research Achievements

We developed an algorithm for enumerating frequent approximate string patterns, and proposed extracting the occurrence regions of the enumerated patterns as a method for finding interspersed repetitive elements in a huge sequence such as a DNA sequence. Occurrences of the patterns produced by the proposed method have clear boundaries, so essentially the same region is unlikely to be counted more than once. Furthermore, our enumeration algorithm runs very fast and uses little memory. In our experiments on human chromosome 21, half of the known Alu regions, which are well-known interspersed repetitive elements, were extracted as occurrence regions of 100 representative patterns selected from the enumerated frequent approximate patterns.
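
To make the two steps in the outline concrete (enumerate frequent approximate patterns, then extract their occurrence regions), here is a minimal brute-force sketch. It is not the project's algorithm, which the report describes only at a high level. It assumes fixed-length patterns, approximate matching by Hamming distance (substitutions only), and candidate patterns drawn from substrings of the sequence itself. The function names and the parameters `k`, `max_mismatches`, and `min_support` are hypothetical and chosen for illustration.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def approximate_occurrences(text, pattern, max_mismatches):
    """Start positions where `pattern` matches `text` with at most
    `max_mismatches` substitutions (assumed matching model)."""
    k = len(pattern)
    return [
        i
        for i in range(len(text) - k + 1)
        if hamming(text[i:i + k], pattern) <= max_mismatches
    ]


def frequent_approximate_patterns(text, k, max_mismatches, min_support):
    """Brute-force enumeration: every distinct length-k substring of `text`
    is a candidate; keep those with at least `min_support` approximate
    occurrences. Quadratic in len(text), so only suitable for toy inputs."""
    candidates = {text[i:i + k] for i in range(len(text) - k + 1)}
    frequent = {}
    for pattern in sorted(candidates):
        starts = approximate_occurrences(text, pattern, max_mismatches)
        if len(starts) >= min_support:
            frequent[pattern] = starts
    return frequent


def occurrence_regions(starts, k):
    """Merge overlapping or adjacent length-k occurrences into half-open
    (start, end) regions, so the same stretch is reported only once."""
    regions = []
    for s in sorted(starts):
        e = s + k
        if regions and s <= regions[-1][1]:
            regions[-1][1] = max(regions[-1][1], e)
        else:
            regions.append([s, e])
    return [tuple(r) for r in regions]


if __name__ == "__main__":
    sequence = "ACGTACGAACGT"
    patterns = frequent_approximate_patterns(
        sequence, k=4, max_mismatches=1, min_support=3
    )
    for pattern, starts in patterns.items():
        print(pattern, starts, occurrence_regions(starts, 4))
```

The merging step is a simple stand-in for the "clear boundaries" property mentioned above: in this sketch, overlapping hits are collapsed after the fact, whereas the report states that the proposed patterns themselves have well-delimited occurrences.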

Free Research Field

Intelligent informatics

Published: 2017-05-10  
