Skip to content
2000
Volume 12, Issue 6
  • ISSN: 1570-193X
  • E-ISSN: 1875-6298

Abstract

The comparison of DNA sequences is the basic topic in computational biology and bioinformatics, helping in speculation about their previously ambiguous structure, function, and evolution relationship. In this article, we provide a novel DNA sequence comparison scheme by constructing feature vectors based on Markov chain and information entropy. A new measure, which is calculated as the entropy of K-string’s four one-step transition probabilities, is used to compose the feature vector to characterize DNA sequence. At the same time, we provide a novel concept to address the computation burden caused by the exponential growth of computation complexity when K grows in a traditional K-string model, which is named K-string list. The proposed scheme allows us to conduct similarity research and phylogenetic analysis on two real datasets, the first exon of 11 species’

Loading

Article metrics loading...

/content/journals/mroc/10.2174/1570193X13666151218191633
2015-12-01
2025-07-13
Loading full text...

Full text loading...

/content/journals/mroc/10.2174/1570193X13666151218191633
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test