研究課題/領域番号 |
26700030
|
研究機関 | 国立研究開発法人産業技術総合研究所 |
研究代表者 |
Frith Martin 国立研究開発法人産業技術総合研究所, ゲノム情報研究センター, 主任研究員 (40462832)
|
研究期間 (年度) |
2014-04-01 – 2017-03-31
|
キーワード | バイオインフォマティクス / 生体情報学 / 比較ゲノム |
研究実績の概要 |
In this first year, groundwork for the project was laid. I improved computational methods to accurately compare and align genome sequences, in particular, taking careful account of rearrangements, such as inversions and translocations. I also improved methods to compare genomic DNA to protein sequence databases, which is important for annotating genomic elements. I also began a collaboration with Drs. Y Suzuki (Tokyo University) and W Makalowski (Muenster University) on analyzing DNA sequence data obtained with the new "nanopore" sequencing technique. This is a promising way to obtain new genomic sequence: it is cheap, fast, and produces long reads, but at present it has a very high error rate. Comparing high-error sequences is quite similar to comparing distantly-related genomes.
|
現在までの達成度 (区分) |
現在までの達成度 (区分)
4: 遅れている
理由
I aim to hire a postdoctoral fellow to assist with this project. Since this is my first ever funding, I could not start to advertise for a postdoctoral position until I knew that I would receive the funding. This year, I began the process of interviewing several candidates. The project can proceed more quickly once someone is hired to work on it.
|
今後の研究の推進方策 |
In the second year, we will finalize and publish the improved computational methods developed in the first year. We will also apply them to compare the human genome to genomes of various other species. We will also work on training alignment parameters. Distantly-related DNA sequences have certain frequencies of substitutions, insertions, and deletions: we will learn these from the data, so that we can then align such sequences more accurately. We have started doing this for nanopore DNA sequences, but the aim is to develop a general method that works for all kinds of DNA data. Since different kinds of DNA (e.g. protein-coding versus non-coding) will differ in these parameters, the training will have to be done in a careful way: separately for these different kinds of DNA (the main interest of this project being non-protein-coding DNA.)
|
次年度使用額が生じた理由 |
The main cost is to hire a postdoctoral fellow, to help do the work of this project. We also need to travel, to attend conferences and meet other researchers in related fields: our research will become more powerful and useful if we can collaborate with complementary researchers, such as those sequencing new genomes. Finally, since our project uses computers for deep analysis of large genomic datasets, we need powerful computer hardware.
|
次年度使用額の使用計画 |
The funding will be used to hire one postdoctoral fellow, and purchase startup equipment for him/her (a capable computer and standard office software). We will then attend several international and domestic conferences, including the 29th International Mammalian Genome Conference, and the Genome Informatics Workshop. Also, we will need to pay to publish our results in open-access journals.
|