High statistics block entropy measures of DNA sequences

Year: 1996

Authors: Lio P., Politi A., Buiatti M., Ruffo S.

Autors Affiliation: Dipartimento di Biologia Animale e Genetica, Università di Firenze, Via Romana 17, 50125 Firenze, Italy; Istituto Nazionale di Ottica, Largo E. Fermi 6, 50125 Firenze, Italy; Dipartimento di Energetica, Università di Firenze, Via S. Marta 3, 50139 Firenze, Italy; INFN, Sez. di Firenze Italy

Abstract: We have used an improved block-entropy measure in order to gain some further insights into the short-range correlations present in whole chromosomes of S. cerevisiae, viruses and organelles and very large genomic regions of E. coli. Although DNA sequences are largely inhomogeneous and word frequencies are unevenly distributed, the comparison of entire chromosomes and large genomic regions show a ’’bulk’’ composition homogeneity. This property suggests that biases in selection, directional mutational pressure and recombination processes act in homogenizing the base composition of the DNA molecules within a genome but their mode of action, relative impact and direction may vary in different organisms. The most interesting results appear to be the differences between the SW (C,G/A,T) and RY (A,G/C,T) two-letter alphabet entropies. Deviations from randomness in E. coli and S. cerevisiae sequences particularly concern SW dinucleotide frequencies and RY tetranucleotide frequencies.

Journal/Review: JOURNAL OF THEORETICAL BIOLOGY

Volume: 180 (2)      Pages from: 151  to: 160

KeyWords: Nucleotide-sequences; Codon Usage; Genome; Mutation; Recombination; Constraints; Stability; Evolution; Rna
DOI: 10.1006/jtbi.1996.0091

Citations: 29
data from “WEB OF SCIENCE” (of Thomson Reuters) are update at: 2024-11-17
References taken from IsiWeb of Knowledge: (subscribers only)
Connecting to view paper tab on IsiWeb: Click here
Connecting to view citations from IsiWeb: Click here