Project/Area Number |
05558024
|
Research Category |
Grant-in-Aid for Developmental Scientific Research (B)
|
Allocation Type | Single-year Grants |
Research Field |
Statistical science
|
Research Institution | The Institute of Statistical Mathematics |
Principal Investigator |
HASEGAWA Masami The Inst.of Statist.Math., Prof., 予測制御研究系, 教授 (60011657)
|
Co-Investigator(Kenkyū-buntansha) |
YANO Taka-aki Showa Univ., Assoc.Prof., 教養部, 助教授 (70081792)
KISHINO Hirohisa Univ.Tokyo, Assoc.Prof., 教養学部, 助教授 (00141987)
HASHIMOTO Tetsuo The Inst.of Statist.Math., Assoc.Prof., 教育情報センター, 助教授 (50208451)
安永 照雄 大阪大学, 遺伝情報実験施設, 助教授
|
Project Period (FY) |
1993 – 1995
|
Project Status |
Completed (Fiscal Year 1995)
|
Budget Amount *help |
¥12,900,000 (Direct Cost: ¥12,900,000)
Fiscal Year 1995: ¥1,200,000 (Direct Cost: ¥1,200,000)
Fiscal Year 1994: ¥3,000,000 (Direct Cost: ¥3,000,000)
Fiscal Year 1993: ¥8,700,000 (Direct Cost: ¥8,700,000)
|
Keywords | evolutionary tree / maximum likelihood / Markov model / mitochondrial DNA / amino acid substitution / transition matrix / approximate likelihood / local rearrangement / RNA / 蛋白質 / 確率モデル / タンパク質 / アミノ酸置換 / モデル / シミュレーション / MOLPHY |
Research Abstract |
We have developed maximum likelihood (ML) methods for inferring evolutionary trees from DNA and protein sequences. Since it is important to use realistic models for the substitution process in the ML analyzes, we have developed several models for the nucleotide and amino acid substitutions. Among them, a Markov model for amino acid substitution of mitochondrial DNA-encoded proteins might be the most important. In recent years, mitochondrial DNA sequences have been used widely in molecular phylogenetics. However, since the genetic code of mitochondria differs a little from the universal code, and since most of the mitochondrial proteins are membraneous, the transition matrix specific to the mtDNA-encoded proteins might be different from those of the models which are mainly based on the proteins encoded by nuclear DNA.Therefore, we estimated the matrix for the mtDNA-encoded proteins from 22 completely sequenced data of vertebrates by the ML.This matrix would be useful in analyzing mtDNA-encoded proteins. Since the ML method is computationally intensive, mainly due to a huge number of possible trees, there exists several limitations in applying the method to real biological problems. We have developed several methods to reduce the computational burden of the ML analyzes ; i.e., (1) approximate likelihood method to reduce the number of candidate trees, (2) star-decomposition method for topology search, (3) local rearrangement method for topology search. These results have been implemented in our program package MOLPHY,which are available from the network and have already been used widely in the world.
|