On the abundance, amino acid composition, and evolutionary dynamics of low-complexity regions in proteins

Date Published:

Aug 15

Abstract:

Protein sequences frequently contain regions composed of a reduced number of amino acids. Despite their presence in about half of all proteins and their unusual prevalence in the malaria parasite Plasmodium falciparum, the function and evolution of such low-complexity regions (LCRs) remain unclear. Here we show that LCR abundance and amino acid composition depend largely, but not exclusively, on genomic A+T content and obey power-law growth dynamics. Further, our results indicate that LCRs are analogous to microsatellites in that DNA replication slippage and unequal crossover recombination are important molecular mechanisms for LCR expansion. We support this hypothesis by demonstrating that the size of LCR insertions/deletions among orthologous genes depends upon length. Moreover, we show that LCRs enable intra-exonic recombination in a key family of cell-surface antigens in P. falciparum and thus likely facilitate the generation of antigenic diversity. We conclude with a mechanistic model for LCR evolution that links the pattern of LCRs within P. falciparum to its high genomic A+T content and recombination rate.

Notes:

DePristo, Mark AZilversmit, Martine MHartl, Daniel LengResearch Support, Non-U.S. Gov'tNetherlands2006/06/30 09:00Gene. 2006 Aug 15;378:19-30. Epub 2006 May 11.

Last updated on 05/12/2015