A Hybrid Approach to Computationally Hard Problems : Combining Approximation, Parallelization, and Randomization

Research Project

Project/Area Number	14580390
Research Category	Grant-in-Aid for Scientific Research (C)
Allocation Type	Single-year Grants
Section	一般
Research Field	計算機科学
Research Institution	Tokyo Denki University
Principal Investigator	CHEN Zhi-zhong Tokyo Denki University, Mathematical Sciences, Associate Professor, 理工学部, 助教授 (00242933)
Project Period (FY)	2002 – 2004
Project Status	Completed (Fiscal Year 2004)
Budget Amount *help	¥1,600,000 (Direct Cost: ¥1,600,000) Fiscal Year 2004: ¥500,000 (Direct Cost: ¥500,000) Fiscal Year 2003: ¥500,000 (Direct Cost: ¥500,000) Fiscal Year 2002: ¥600,000 (Direct Cost: ¥600,000)
Keywords	NP-hardness / Approximation Algorithms / Randomized Algorithms / Parallel Algorithms / NMR Spectral Peak Assignment / k-th Phylogenetic Roots / Sequence Alignment Problems / Heuristics / 最大巡回セールスマン問題 / Max SNP困難性 / k-次生物系統木再構築問題 / 蛋白質NMRピーク割り当て問題 / 二部マッチング問題 / 文字列編集問題 / 1平面グラフの彩色問題 / 近以アルゴリズム / 分岐限定法 / 貪欲濾過法
Research Abstract	In this research project, we studied the effect of a hybrid approach to computationally hard problems. This approach combines three basic approaches (namely, approximation, randomization, and parallelization) to computationally hard problems. Previously, not so many algorithms were based on such a hybrid approach. The main purpose here is to use this hybrid approach to solve computationally hard problems that have not been solved so far. This may lead to the finding of new design techniques of efficient algorithms for hard problems. We focused on three computationally hard problems arising from the field of computational biology. The first is the protein NMR peak assignment problem which is crucial towards the automation of assigning a group of "spin systems" obtained experimentally to a protein sequence of amino acids. We formulated this problem as an interval scheduling problem (ISP), where a protein sequence P of amino acids is viewed as a discrete time interval I (the amino acids on … More P one-to-one correspond to the time units of I), each subset S of spin systems that are known to originate from consecutive amino acids from P is viewed as a "job" js, the preference of assigning S to a subsequence Q of consecutive amino acids on P is viewed as the profit of executing job js in the subinterval of I corresponding to Q, and the goal is to maximize the total profit of executing the jobs (on a single machine) during I. We showed that the interval scheduling problem is Max SNP-hard (even if each job takes either one or two consecutive time units), and designed an efficient 2-approximation algorithm for it. However, our experiments show that the 2-approximation algorithm does not output satisfactory assignments in practice. The reason is as follows : In the real practice of protein NMR peak assignment, each job js usually requires at most 10 consecutive time units, and typically the jobs that require one or two consecutive time units are the most difficult to assign/schedule. For this reason, we then designed several efficient heuristics for the problem ; some of them run on PC-clusters in short (parallel) time. Our experiments show that these heuristics work very well in practice. The second problem we considered is the following : Given a set of species and their similarity data, reconstruct a phylogeny (also called evolutionary tree) so that species are close in the phylogeny if and only if they have high similarity. Assume that the similarity data are represented as a graph G=(V, E), where each vertex represents a species and two vertices are adjacent if they represent species of high similarity. The phylogeny reconstruction problem can then be abstracted as a graph-theoretic problem called the phylogenetic k-th root problem (PR_k), where k is a predetermined proximity threshold. We showed that the problem can be solved in linear time if the input data have no errors and the phylogeny to be constructed is of bounded degree. We also showed that the problem is NP-hard if the input data have errors (no matter the phylogeny to be constructed is of bounded degree or not). The third problem we considered is the problem of DNA sequence alignment with inversions and reversals. Previously, inversions and reversals had not been considered seriously in sequence alignment although there are real in practice ; the only known algorithm previously runs in O(n^2m^2) time and consumes O(n^2m^2) space, where n and m are the lengths of the two input sequences respectively. We designed a space-efficient algorithm for this problem which consumes only O(nm) space with the same amount of time. Our algorithm enables the computation for a pair of DNA sequences of length up to 10,000 to be carried out on an ordinary desktop computer. Less

Report

(4 results)

2004 Annual Research Report Final Research Report Summary
2003 Annual Research Report
2002 Annual Research Report

Research Products

(35 results)

All 2005 2004 2003 2002 Other

All Journal Article (20 results) Publications (15 results)

[Journal Article] More Reliable Protein NMR Peak Assignment via Improved 2-Interval Scheduling2005
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Journal of Computational Biology 12
  
  Pages: 129-146
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] More Reliable Protein NMR Peak Assignment via Improved 2-Interval Scheduling.2005
- Author(s)
  Zhi-Zhong Chen, T.Jiang, G.-H.Lin, R.Rizzi, J.Wen, D.Xu, Y.Xu
- Journal Title
  
  Journal of Computational Biology Vol.12
  
  Pages: 129-146
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] A Space Efficient Algorithm for Sequence Alignment with Inversions and Reversals2004
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Theoretical Computer Science 325
  
  Pages: 361-372
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Annual Research Report 2004 Final Research Report Summary
[Journal Article] A Space Efficient Algorithm for Sequence Alignment with Inversions and Reversals.2004
- Author(s)
  Zhi-Zhong Chen, Y.Gao, G.-H.Lin, R.Niewiadomski, Y.Wang, J.Wu
- Journal Title
  
  Theoretical Computer Science Vol.325
  
  Pages: 361-372
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Protein NMR Peak Assignment : Algorithms and Complexity.2004
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Proceedings of 8th World Multi-Conference on Systemics, Cybernetics and Informatics Vol.IX
  
  Pages: 7-10
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Disc Embeddings of Planar Graphs2004
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Algorithmica 38
  
  Pages: 539-576
- Related Report
  2004 Annual Research Report
[Journal Article] New Bounds on the Number of Edges in a k-Map Graph2004
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Lecture Notes in Computer Scienc 3106
  
  Pages: 319-328
- Related Report
  2004 Annual Research Report
[Journal Article] Computing Phylogenetic Roots with Bounded Degrees and Errors2003
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  SIAM Journal on Computing 32
  
  Pages: 864-879
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Approximation Algorithms for NMR Spectral Peak Assignment2003
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Theoretical Computer Science 299
  
  Pages: 211-229
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computational Assignment of Protein Backbone NMR Peaks by Efficient Bounding and Filtering2003
- Author(s)
  Guohui Lin
- Journal Title
  
  Journal of Bioinformatics and Computational Biology 1
  
  Pages: 387-410
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computing Phylogenetic Roots with Bounded Degrees and Errors.2003
- Author(s)
  Zhi-Zhong Chen, T.Jiang, G.-H.Lin
- Journal Title
  
  SIAM Journal on Computing Vol.32, No.4
  
  Pages: 864-879
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computational Assignment of Protein Backbone NMR Peaks by Efficient Bounding and Filtering.2003
- Author(s)
  G.Lin, D.Xu, Zhi-Zhong Chen, T.Jiang, J.Wen, Y.Xu
- Journal Title
  
  Journal of Bioinformatics and Computational Biology Vol.1
  
  Pages: 387-410
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Approximation Algorithms for NMR Spectral Peak Assignment.2003
- Author(s)
  Zhi-Zhong Chen, T.Jiang, G.-H.Lin, J.Wen, D.Xu, J.Xu, Y.Xu
- Journal Title
  
  Theoretical Computer Science Vol.299
  
  Pages: 211-229
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Better Approximation Algorithms for NMR Spectral Peak Assignment.2002
- Author(s)
  Zhi-Zhong Chen, T.Jiang, G.-H.Lin, J.Wen, D.Xu, Y.Xu
- Journal Title
  
  Proceedings of 2nd Workshop on Algorithms in Bioinformatics (WABI2002), Lecture Notes in Computer Science Vol.2452
  
  Pages: 82-96
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] The Longest Common Subsequence Problem for Sequences with Nested Arc Annotations.2002
- Author(s)
  G.-H.Lin, Zhi-Zhong Chen, T.Jiang, J.Wen
- Journal Title
  
  Journal of Computer and System Sciences Vol.65, No.3
  
  Pages: 465-480
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computing Bounded-Degree Phylogenetic Roots of Disconnected Graphs
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Journal of Algorithms (to appear)
- Description
  「研究成果報告書概要(和文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computing Bounded-Degree Phylogenetic Roots of Disconnected Graphs.
- Author(s)
  Zhi-Zhong Chen, T.Tsukiji
- Journal Title
  
  Journal of Algorithms (to appear)
- Description
  「研究成果報告書概要(欧文)」より
- Related Report
  2004 Final Research Report Summary
[Journal Article] Computing Bounded-Degree Phylogenetic Roots of Disconnected Graphs
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Journal of Algorithms (to appear)
- Related Report
  2004 Annual Research Report
[Journal Article] A Linear-Time Algorithm for 7-Coloring 1-Plane Graphs
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Algorithmica (to appear)
- Related Report
  2004 Annual Research Report
[Journal Article] More Reliable Protein NMR Peak Assignment via Improved 2-Interval Scheduling
- Author(s)
  Zhi-Zhong Chen
- Journal Title
  
  Journal of Computational Biology (to appear)
- Related Report
  2004 Annual Research Report
[Publications] Zhi-Zhong Chen: "A Space Efficient Algorithm for Sequence Alignment with Inversions"Theoretical Computer Science. (Accepted). (2004)
- Related Report
  2003 Annual Research Report
[Publications] Zhi-Zhong Chen: "Disc Embeddings of Planar Graphs"Algorithmica. 38・4. 539-576 (2004)
- Related Report
  2003 Annual Research Report
[Publications] Zhi-Zhong Chen: "More reliable protein NMR peak assignment via improved 2-interval scheduling"Lecture Notes in Computer Science. 2832. 580-592 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Zhi-Zhong Chen: "A linear-time algorithm for 7-coloring 1-planar graphs"Lecture Notes in Computer Science. 2747. 348-357 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Yong Gao: "A space efficient algorithm for sequence alignment with inversions"Lecture Notes in Computer Science. 2697. 57-67 (2003)
- Related Report
  2003 Annual Research Report
[Publications] Zhi-Zhong Chen: "Computing Phylogenetic Roots with Bounded Degrees and Errors"SIAM Journal on Computing. (In press).
- Related Report
  2002 Annual Research Report
[Publications] Guohui Lin: "Computational Assignment of Protein Backbone NMR Peaks by Efficient Bounding and Filtering"Journal of Bioinformatics and Computational Biology. (In press).
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Approximation Algorithms for NMR Spectral Peak Assignment"Theoretical Computer Science. (In press).
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Common-face Embeddings of Planar Graphs"SIAM Journal on Computing. 32・2. 408-434 (2003)
- Related Report
  2002 Annual Research Report
[Publications] Guohui Lin: "The Longest Common Subsequence Problem for Sequences with Nested Arc Annotations"Journal of Computer and System Sciences. 65・3. 465-480 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Tight Upper Bound on the Number of Edges in a Bipartite K_<3,3>-free or K_5-free Graph with an Application"Information Processing Letters. 84・3. 141-145 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Map Graphs"Journal of the ACM. 49・2. 127-138 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Finding Double Euler Trails of Planar Graphs in Linear Time"SIAM Journal on Computing. 31・4. 1255-1285 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Zhi-Zhong Chen: "Better Approximation Algorithms for NMR Spectral Peak Assignment"Lecture Notes in Computer Science. 2452. 82-96 (2002)
- Related Report
  2002 Annual Research Report
[Publications] Guohui Lin: "An Efficient Branch-and-Bound Algorithm for the Assignment of Protein Backbone MAR Peaks"Proceedings of the 2002 IEEE Computer Society Bioinformatics Conference. 165-174 (2002)
- Related Report
  2002 Annual Research Report

A Hybrid Approach to Computationally Hard Problems : Combining Approximation, Parallelization, and Randomization

Principal Investigator

CHEN Zhi-zhong Tokyo Denki University, Mathematical Sciences, Associate Professor, 理工学部, 助教授 (00242933)

¥1,600,000 (Direct Cost: ¥1,600,000)

Report

Research Products

[Journal Article] More Reliable Protein NMR Peak Assignment via Improved 2-Interval Scheduling2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] More Reliable Protein NMR Peak Assignment via Improved 2-Interval Scheduling.2005

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Space Efficient Algorithm for Sequence Alignment with Inversions and Reversals2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] A Space Efficient Algorithm for Sequence Alignment with Inversions and Reversals.2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] Protein NMR Peak Assignment : Algorithms and Complexity.2004

Author(s)

Journal Title

Description

Related Report

[Journal Article] Disc Embeddings of Planar Graphs2004

Author(s)

Journal Title

Related Report

[Journal Article] New Bounds on the Number of Edges in a k-Map Graph2004

Author(s)

Journal Title

Related Report

[Journal Article] Computing Phylogenetic Roots with Bounded Degrees and Errors2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Approximation Algorithms for NMR Spectral Peak Assignment2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Computational Assignment of Protein Backbone NMR Peaks by Efficient Bounding and Filtering2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Computing Phylogenetic Roots with Bounded Degrees and Errors.2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Computational Assignment of Protein Backbone NMR Peaks by Efficient Bounding and Filtering.2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Approximation Algorithms for NMR Spectral Peak Assignment.2003

Author(s)

Journal Title

Description

Related Report

[Journal Article] Better Approximation Algorithms for NMR Spectral Peak Assignment.2002

Author(s)

Journal Title

Description

Related Report

[Journal Article] The Longest Common Subsequence Problem for Sequences with Nested Arc Annotations.2002

Author(s)

Journal Title

Description

Related Report

[Journal Article] Computing Bounded-Degree Phylogenetic Roots of Disconnected Graphs