Genome-wide DNA and protein conformational dynamics: sequence-based prediction and tissue-specific profiling

Research Project

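Project/Area Number	15K00419
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Multi-year Fund
Section	General
Research Field	Life / Health / Medical informatics
Research Institution	National Institutes of Biomedical Innovation, Health and Nutrition
Principal Investigator	Shandar Ahmad, National Institutes of Biomedical Innovation, Health and Nutrition, Other Departments, Researcher (80463298)
Project Period (FY)	2015-04-01 – 2016-03-31
Project Status	Discontinued (FY2015)
Budget Amount	4,290 thousand yen (Direct Cost: 3,300 thousand yen, Indirect Cost: 990 thousand yen); FY2017: 1,430 thousand yen (Direct Cost: 1,100 thousand yen, Indirect Cost: 330 thousand yen); FY2016: 1,430 thousand yen (Direct Cost: 1,100 thousand yen, Indirect Cost: 330 thousand yen); FY2015: 1,430 thousand yen (Direct Cost: 1,100 thousand yen, Indirect Cost: 330 thousand yen)
Keywords	DNA structure / DNA dynamics / Transcription / Protein-DNA interactions / Machine learning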
Outline of Annual Research Achievements	All three objectives outlined in the project proposal have been achieved. First, we developed conformational ensembles of DNA sequences, using molecular dynamics (MD) snapshot data for this purpose. There are 136 possible tetranucleotide sequences, and each of them had four flanking bases at either terminus, leading to 136 unique 12-mers. We developed a support vector machine (SVM) based model that takes a 5-mer sequence as input and returns predicted conformational ensemble populations for 12 conformational parameters in 5 bins each. Various benchmarks confirmed the high accuracy of this method. We applied the newly developed tool to predict the DNA conformational dynamics of the whole mouse and human genomes. Using the genome-wide predicted values, we studied binding sites and their flanking regions for more than 1000 transcription factors (TFs) in embryonic stem (ES) cells, and for one TF (STAT3) in four different cell types in greater detail. We showed that binding-site flanking regions as far as 200 bases from the binding motif center carry significant conformational biases, which can distinguish them from the rest of the genome. Separately, we developed a method to predict DNA-binding proteins by using gene expression and sequence information together. We found that gene expression profiles and their global co-expression patterns can be useful in identifying proteins with weak DNA-binding signals at the sequence level. Some of the results of this project are available via bioRxiv, while others are being prepared for publication.
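
The figure of 136 tetranucleotides in the outline above is not derived in the report. A likely reading, stated here as an assumption, is that a tetranucleotide and its reverse complement describe the same double-stranded step, so the 256 possible 4-mers collapse to 136 distinct ones. The short Python sketch below checks that arithmetic; the function names are illustrative and are not taken from the project's software.

```python
from itertools import product

# Translation table for Watson-Crick base pairing.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_kmers(k: int) -> list[str]:
    """List k-mers, counting a k-mer and its reverse complement once.

    Assumption: strand equivalence is the reason the report counts 136
    tetranucleotides rather than 4**4 = 256.
    """
    seen = set()
    for bases in product("ACGT", repeat=k):
        kmer = "".join(bases)
        # Keep the alphabetically smaller member of each pair as its label.
        seen.add(min(kmer, reverse_complement(kmer)))
    return sorted(seen)


tetramers = unique_kmers(4)
# Self-complementary (palindromic) tetramers have no distinct partner.
palindromes = [t for t in tetramers if t == reverse_complement(t)]

print(len(tetramers), len(palindromes))  # prints: 136 16
```

Of the 256 tetramers, 16 are their own reverse complement and the remaining 240 pair up into 120, which gives 120 + 16 = 136.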

Report

(1 result)

2015 Annual Research Report

Research Products
(1 result)

All: Journal Article (1 result) (of which Open Access: 1, Acknowledgement Compliant: 1)

[Journal Article] Genome-wide transcription factor activities are explained by intrinsic conformational dynamics of binding-sites and distal flanking-regions (2015)
- Author(s)/Presenter(s)
  Munazah Andrabi, Andrew Paul Hutchins, Diego Miranda-Saavedra, Hidetoshi Kono, Ruth Nussinov, Kenji Mizuguchi, Shandar Ahmad
- Journal Title
  bioRxiv
  Volume: June 9
- DOI
  10.1101/020602
- Related Report
  2015 Annual Research Report
- Open Access / Acknowledgement Compliant