A statistical approach to the analysis of DNA sequences has been developed, which provides a confidence interval estimate for the proportion of naturally occurring amino acid polymorphisms that are selectively neutral. When applied to the gnd gene coding for 6-phosphogluconate dehydrogenase in a sample of seven natural isolates of Escherichia coli, the method indicates that the proportion of observed amino acid polymorphisms that are selectively neutral is unlikely to be greater than 49% (upper 95% confidence limit). On the other hand, the observations are also consistent with a model in which all of the observed amino acid substitutions are mildly deleterious with an average selection coefficient approximating 1.6 X 10(-7). Various models for the distribution of configurations at silent sites are also investigated.
A reference collection of 71 natural isolates of Escherichia coli (the ECOR collection) has been studied with respect to the distribution and abundance of transposable insertion sequences using DNA hybridization. The data include 1173 occurrences of six unrelated insertion sequences (IS1, IS2, IS3, IS4, IS5 and IS30). The number of insertion elements per strain, and the sizes of DNA restriction fragments containing them, is highly variable and can be used to discriminate even among closely related strains. The occurrence and abundance of pairs of unrelated insertion sequences are apparently statistically independent, but significant correlations result from stratifications in the reference collection. However, there is a highly significant positive association among the insertion sequences considered in the aggregate. Nine branching process models, which differ in assumptions regarding the regulation of transposition and the effect of copy number on fitness, have been evaluated with regard to their fit of the observed distributions. No single model fits all copy number distributions. The best models incorporate no regulation of transposition and a moderate to strong decrease in fitness with increasing copy number for IS1 and IS5, strong regulation of transposition and a negligible to weak decrease in fitness with increasing copy number for IS3, and less than strong regulation of transposition for IS2, IS4 and IS30.
A mutation in the white gene of Drosophila mauritiana that results from insertion of the transposable element mariner is genetically unstable in both germ cells and somatic cells. Somatic instability is indicated by the occurrence of animals having mosaic eyes with patches of pigmented cells on a peach-colored background. Normally uncommon, the frequency of mosaicism is so greatly enhanced in a particular mutant strain that virtually every animal in the strain is an eye-color mosaic. The molecular basis of the mosaicism is the excision of the mariner element from its location in the DNA of the white gene in somatic cells. The phenomenon results from a single dominant genetic factor located in chromosome 3. Genetic control over the excision of transposable elements may play a role in determining the persistence of transposable elements in the genome.
Studies of Escherichia coli under competition for lactose in chemostat cultures have been used to determine the selective effects of variation in the level of the beta-galactoside permease and the beta-galactosidase enzyme. The results determine the adaptive topography of these gene products relative to growth in limiting lactose and enable predictions concerning the selective effects of genetic variants found in natural populations. In the terms of metabolic control theory, the beta-galactosidase enzyme at wild-type-induced levels has a small control coefficient with respect to fitness (C = 0.018), and hence genetic variants resulting in minor changes in enzyme activity have disproportionately small effects on fitness. However, the apparent control coefficient of the beta-galactoside permease at wild-type-induced levels is large (C = 0.551), and hence even minor changes in activity affect fitness. Therefore, we predict that genetic polymorphisms in the lacZ gene are subject to less effective selection in natural populations than are those in the lacY gene. The beta-galactoside permease is also less efficient than might be expected, and possible forces resulting in selection for an intermediate optimum level of permease activity are considered. The selective forces that maintain the lactose operon in a regulated state in natural populations are also discussed.