oalogo2  

AUTHOR(S):

Plamenka Borovska, Veska Gancheva

 

TITLE

Parallelization and Optimization of Multiple Biological Sequence Alignment Software Based on Social Behavior Model

pdf PDF

ABSTRACT

The huge amount of biological sequences accumulated in the world nucleotide and protein databases leads to the necessity of efficient tools for structural genomic and functional analysis. This scientific area requires powerful computing resources for exploring large sets of biological data. Multiple sequence alignment is an important method in the DNA and protein analysis, and is generally the alignment of three or more biological sequences of similar length. As a result of the processing, homology can be derived and the evolutionary relationships between the sequences can be explored. The goal of this paper is to propose parallelization and optimization of the multiple sequence alignment software MSA_BG in order to improve the performance, for the case study of the influenza virus sequences. The objective is code optimization, porting, scaling and performance evaluation of the parallel multiple sequence alignment software MSA_BG for Intel Xeon Phi (the MIC architecture). For this purpose a parallel multithreaded optimization including OpenMP has been implemented and verified. The experimental results show that the hybrid parallel implementation utilizing MPI and OpenMP provides considerably better performance than the original code.

KEYWORDS

Artificial Bee Colony, Bioinformatics, Hybrid Programming, High Performance Computing, Multiple Sequence Alignment, Parallel Programming, Performance

REFERENCES

[1] H. Carrillo and D. Lipman, “The Multiple Sequence Alignment Problem in Biology,” SIAM Journal of Applied Mathematics, vol. 48, no. 5, 1988, pp. 1073-1082.

[2] L. Wang and T. Jiang, "On the complexity of multiple sequence alignment," Journal of Computational Biology, vol. 1, no. 4, 1994, pp. 337–348, doi:10.1089/cmb.1994.1.337.

[3] W. Just, "Computational complexity of multiple sequence alignment with SP-score," Journal of Computational Biology, vol. 8, no. 6, 2001, pp. 615–23.

[4] S. Sze, Y. Lu, and Q. Yang, "A polynomial time solvable formulation of multiple sequence alignment," Journal of Computational Biology, vol. 13, no. 2, 2006, pp. 309–319, doi:10.1089/cmb.2006.13.309.

[5] EURORA – Configuration, http://www.hpc.cineca.it/content/eurora-user-guide#systemarchitecture.

[6] P. Borovska, V. Gancheva, N. Landzhev, Massively Parallel Algorithm for Multiple Biological Sequences Alignment, 36th International Conference on Telecommunications and Signal Processing (TSP), 2-4 July, 2013, Rome, Italy, pp. 638 - 642.

[7] D. Karaboga, “An Idea Based On Honey Bee Swarm for Numerical Optimization,” Technical Report-TR06, Erciyes University, Engineering Faculty, Computer Engineering Department, , mf.erciyes.edu.tr abc pub tr .pdf

[8] P. Borovska, V. Gancheva, Massively Parallel Algorithm for Multiple Sequence Alignment Based on Artificial Bee Colony, white paper, http://www.prace-ri.eu/IMG/pdf/wp114.pdf

[9] GenBank, http://www.ncbi.nlm.nih.gov/Genbank/

Cite this paper

Plamenka Borovska, Veska Gancheva. (2018) Parallelization and Optimization of Multiple Biological Sequence Alignment Software Based on Social Behavior Model. International Journal of Computers, 3, 69-74

 

cc.png
Copyright © 2018 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0