Success of the global research agenda toward eradication of malaria will depend on development of new tools, including drugs, vaccines, insecticides and diagnostics. Genomic information, now available for the malaria parasites, their mosquito vectors, and human host, can be leveraged to both develop these tools and monitor their effectiveness. Although knowledge of genomic sequences for the malaria parasites, Plasmodium falciparum and Plasmodium vivax, have helped advance our understanding of malaria biology, simply knowing this sequence information has not yielded a plethora of new interventions to reduce the burden of malaria. Here we review and provide specific examples of how genomic information has increased our knowledge of parasite biology, focusing on P. falciparum malaria. We then discuss how population genetics can be applied toward the epidemiological and transmission-related goals outlined by the International Centers of Excellence for Malaria Research groups recently established by the National Institutes of Health. Finally, we propose genomics is a research area that can promote coordination and collaboration between various ICEMR groups, and that working together as a community can significantly advance the value of this information toward reduction of the global malaria burden.
Chimeric genes form through the combination of portions of existing coding sequences to create a new open reading frame. These new genes can create novel protein structures that are likely to serve as a strong source of novelty upon which selection can act. We have identified 14 chimeric genes that formed through DNA-level mutations in Drosophila melanogaster, and we investigate expression profiles, domain structures, and population genetics for each of these genes to examine their potential to effect adaptive evolution. We find that chimeric gene formation commonly produces mid-domain breaks and unites portions of wholly unrelated peptides, creating novel protein structures that are entirely distinct from other constructs in the genome. These new genes are often involved in selective sweeps. We further find a disparity between chimeric genes that have recently formed and swept to fixation versus chimeric genes that have been preserved over long periods of time, suggesting that preservation and adaptation are distinct processes. Finally, we demonstrate that chimeric gene formation can produce qualitative expression changes that are difficult to mimic through duplicate gene formation, and that extremely young chimeric genes (d(S) < 0.03) are more likely to be associated with selective sweeps than duplicate genes of the same age. Hence, chimeric genes can serve as an exceptional source of genetic novelty that can have a profound influence on adaptive evolution in D. melanogaster.
Antibiotic resistance can evolve through the sequential accumulation of multiple mutations. To study such gradual evolution, we developed a selection device, the 'morbidostat', that continuously monitors bacterial growth and dynamically regulates drug concentrations, such that the evolving population is constantly challenged. We analyzed the evolution of resistance in Escherichia coli under selection with single drugs, including chloramphenicol, doxycycline and trimethoprim. Over a period of approximately 20 days, resistance levels increased dramatically, with parallel populations showing similar phenotypic trajectories. Whole-genome sequencing of the evolved strains identified mutations both specific to resistance to a particular drug and shared in resistance to multiple drugs. Chloramphenicol and doxycycline resistance evolved smoothly through diverse combinations of mutations in genes involved in translation, transcription and transport. In contrast, trimethoprim resistance evolved in a stepwise manner, through mutations restricted to the gene encoding the enzyme dihydrofolate reductase (DHFR). Sequencing of DHFR over the time course of the experiment showed that parallel populations evolved similar mutations and acquired them in a similar order.
In many species, both morphological and molecular traits related to sex and reproduction evolve faster in males than in females. Ultimately, rapid male evolution relies on the acquisition of genetic variation associated with differential reproductive success. Many newly evolved genes are associated with novel functions that might enhance male fitness. However, functional evidence of the adaptive role of recently originated genes in males is still lacking. The Sperm dynein intermediate chain multigene family, which encodes a Sperm dynein intermediate chain presumably involved in sperm motility, originated from complex genetic rearrangements in the lineage that leads to Drosophila melanogaster within the last 5.4 million years since its split from Drosophila simulans. We deleted all the members of this multigene family resident on the X chromosome of D. melanogaster by chromosome engineering and found that, although the deletion does not result in a reduction of progeny number, it impairs the competence of the sperm in the presence of sperm from wild-type males. Therefore, the Sperm dynein intermediate chain multigene family contributes to the differential reproductive success among males and illustrates precisely how quickly a new gene function can be incorporated into the genetic network of a species.
Malaria is a deadly disease that causes nearly one million deaths each year. To develop methods to control and eradicate malaria, it is important to understand the genetic basis of Plasmodium falciparum adaptations to antimalarial treatments and the human immune system while taking into account its demographic history. To study the demographic history and identify genes under selection more efficiently, we sequenced the complete genomes of 25 culture-adapted P. falciparum isolates from three sites in Senegal. We show that there is no significant population structure among these Senegal sampling sites. By fitting demographic models to the synonymous allele-frequency spectrum, we also estimated a major 60-fold population expansion of this parasite population approximately 20,000-40,000 years ago. Using inferred demographic history as a null model for coalescent simulation, we identified candidate genes under selection, including genes identified before, such as pfcrt and PfAMA1, as well as new candidate genes. Interestingly, we also found selection against G/C to A/T changes that offsets the large mutational bias toward A/T, and two unusual patterns: similar synonymous and nonsynonymous allele-frequency spectra, and 18% of genes having a nonsynonymous-to-synonymous polymorphism ratio >1.
The study of the evolution of novel genes generally focuses on the formation of new coding sequences. However, equally important in the evolution of novel functional genes are the formation of regulatory regions that allow the expression of the genes and the effects of the new genes in the organism as well. Herein, we discuss the current knowledge on the evolution of novel functional genes, and we examine in more detail the youngest genes discovered. We examine the existing data on a very recent and rapidly evolving cluster of duplicated genes, the Sdic gene cluster. This cluster of genes is an excellent model for the evolution of novel genes, as it is very recent and may still be in the process of evolving.
Chromosomal inversions have been an enduring interest of population geneticists since their discovery in Drosophila melanogaster. Numerous lines of evidence suggest powerful selective pressures govern the distributions of polymorphic inversions, and these observations have spurred the development of many explanatory models. However, due to a paucity of nucleotide data, little progress has been made towards investigating selective hypotheses or towards inferring the genealogical histories of inversions, which can inform models of inversion evolution and suggest selective mechanisms. Here, we utilize population genomic data to address persisting gaps in our knowledge of D. melanogaster's inversions. We develop a method, termed Reference-Assisted Reassembly, to assemble unbiased, highly accurate sequences near inversion breakpoints, which we use to estimate the age and the geographic origins of polymorphic inversions. We find that inversions are young, and most are African in origin, which is consistent with the demography of the species. The data suggest that inversions interact with polymorphism not only in breakpoint regions but also chromosome-wide. Inversions remain differentiated at low levels from standard haplotypes even in regions that are distant from breakpoints. Although genetic exchange appears fairly extensive, we identify numerous regions that are qualitatively consistent with selective hypotheses. Finally, we show that In(1)Be, which we estimate to be approximately 60 years old (95% CI 5.9 to 372.8 years), has likely achieved high frequency via sex-ratio segregation distortion in males. With deeper sampling, it will be possible to build on our inferences of inversion histories to rigorously test selective models-particularly those that postulate that inversions achieve a selective advantage through the maintenance of co-adapted allele complexes.
Through rapid genetic adaptation and natural selection, the Plasmodium falciparum parasite--the deadliest of those that cause malaria--is able to develop resistance to antimalarial drugs, thwarting present efforts to control it. Genome-wide association studies (GWAS) provide a critical hypothesis-generating tool for understanding how this occurs. However, in P. falciparum, the limited amount of linkage disequilibrium hinders the power of traditional array-based GWAS. Here, we demonstrate the feasibility and power improvements gained by using whole-genome sequencing for association studies. We analyzed data from 45 Senegalese parasites and identified genetic changes associated with the parasites' in vitro response to 12 different antimalarials. To further increase statistical power, we adapted a common test for natural selection, XP-EHH (cross-population extended haplotype homozygosity), and used it to identify genomic regions associated with resistance to drugs. Using this sequence-based approach and the combination of association and selection-based tests, we detected several loci associated with drug resistance. These loci included the previously known signals at pfcrt, dhfr, and pfmdr1, as well as many genes not previously implicated in drug-resistance roles, including genes in the ubiquitination pathway. Based on the success of the analysis presented in this study, and on the demonstrated shortcomings of array-based approaches, we argue for a complete transition to sequence-based GWAS for small, low linkage-disequilibrium genomes like that of P. falciparum.
Although the Drosophila Y chromosome is degenerated, heterochromatic, and contains few genes, increasing evidence suggests that it plays an important role in regulating the expression of numerous autosomal and X-linked genes. Here we use 15 Y chromosomes originating from a single founder 550 generations ago to study the role of the Y chromosome in regulating rRNA gene transcription, position-effect variegation (PEV), and the link among rDNA copy number, global gene expression, and chromatin regulation. Based on patterns of rRNA gene transcription indicated by transcription of the retrotransposon R2 that specifically inserts into the 28S rRNA gene, we show that X-linked rDNA is silenced in males. The silencing of X-linked rDNA expression by the Y chromosome is consistent across populations and independent of genetic background. These Y chromosomes also vary more than threefold in rDNA locus size and cause dramatically different levels of PEV suppression. The degree of suppression is negatively associated with the number and fraction of rDNA units without transposon insertions, but not with total rDNA locus size. Gene expression profiling revealed hundreds of differentially expressed genes among these Y chromosome introgression lines, as well as a divergent global gene expression pattern between the low-PEV and high-PEV flies. Our findings suggest that the Y chromosome is involved in diverse phenomena related to transcriptional regulation including X-linked rDNA silencing and suppression of PEV phenotype. These results further expand our understanding of the role of the Y chromosome in modulating global gene expression, and suggest a link with modifications of the chromatin state.