Skip to content
2000
Volume 13, Issue 3
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

Background: Most existing methods for comparing and analyzing DNA sequences use multiple sequence alignment (MSA) algorithms. However, the computation time required for MSA is usually very long and makes it impossible to analyze a large group of long DNA sequences. Objective: Here we propose a novel computational method to quickly characterize and compare DNA sequences. Method: We construct a new 2-dimensional (2D) graphical representation of DNA sequences based on the mathematical concept of joint probability. A dinucleotide is assigned by the product of the signed probability of the two nucleotides, which is totally independent of the choice of the species studied. Results: We perform similarity/dissimilarity analyses among three real DNA data sets, the first exon of the beta-globin gene of eleven animal species, ribulose bisphosphate carboxylase small chain (rbcS) gene of eleven species of flowering plants, and mitochondrial genome sequences of eleven mammal species, respectively. Conclusion: Our results coincide with existing biological analyses in the literature. We also compare our approach with MSA algorithm, which is much quicker and more effective.

Loading

Article metrics loading...

/content/journals/cbio/10.2174/1574893613666180305161928
2018-06-01
2025-06-24
Loading full text...

Full text loading...

/content/journals/cbio/10.2174/1574893613666180305161928
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test