配分額 *注記 |
4,290千円 (直接経費: 3,300千円、間接経費: 990千円)
2017年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
2016年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
2015年度: 1,430千円 (直接経費: 1,100千円、間接経費: 330千円)
|
研究実績の概要 |
All the three objectives outlined in the project proposal have been achieved. First of all, we developed conformational ensembles of DNA sequences. Molecular dynamics (MD) snapshots data were used for this purpose. There are 136 possible tetranucleotide sequences and all of them had four flanking bases on either terminal leading to 136 unique 12-mers.We developed a support vector machine (SVM) based model, which takes a 5-mer sequence as input and returns predicted conformational ensemble populations for 12 conformational parameters in 5 bins each. Various benchmarks confirmed high accuracy of this method. We applied the newly developed tool to predict DNA conformational dynamics of the whole mouse and human genomes. Using genome-wide predicted values, we studied binding sites and their flanking regions in more than 1000 transcription factors in Emrbyonic Stem (ES) cells and one TF (STAT3) in four different cell types and in greater details. We showed that binding site flanking regions as far as 200 bases from the binding motif center carry significant conformational biases, which can distinguish binding-site flanking regions from rest of the genome. Separately, we also developed a method to predict DNA-binding proteins by using gene expression and sequence information together. We found that gene expression profiles and their global co-expression patterns can be useful in identifying proteins with week DNA-binding signals at the sequence level. Some of the results in this project are available via bioarxiv while others are being prepared for publication.
|