Advanced search
Start date
Betweenand


Balancing selection in the human genome: biological relevance and deleterious consequence

Full text
Author(s):
Bárbara Domingues Bitarello
Total Authors: 1
Document type: Doctoral Thesis
Press: São Paulo.
Institution: Universidade de São Paulo (USP). Instituto de Biociências (IBIOC/SB)
Defense date:
Examining board members:
Diogo Meyer; Rodrigo Cogni; Carlos Eduardo Guerra Schrago; Maria Dulcetti Vibranovski
Advisor: Diogo Meyer
Abstract

Balancing selection is an evolutionary process that encompasses several mechanisms: heterozygote advantage, negative frequency dependent selection, selective pressure that fluctuates in time or in space, and some cases of pleiotropy. The study of these mechanisms .per se has been and still is a topic of great interest for evolutionary biologists, and has shaped the study of evolution throughout the last century. Before the proposition of the neutral theory of molecular evolution, it was believed that balancing selection was pervasive. The realization that much of the observed genetic diversity could be explained by neutral evolution thus motivated a better understanding of balancing selection as a selective regime capable of maintaining adaptive variants in populations. The study of balancing selection, in its early stages, was restricted to organisms that could be manipulated in the laboratory. With the advent of methods that allowed quantification of genetic variation - such as protein electrophoresis, small scale sequencing and genome-wide re-sequencing of thousands of individuals - human variation started to be actively studied and interpreted. Several studies have looked for signatures of natural selection - i.e., patterns of genomic variation that selective regimes leave in the genome - and evaluated their significance by comparing them to what would be expected under a strictly neutral scenario. Most of these efforts focused on the study of positive selection, thought of as the prime mechanism responsible for adaptive evolution. Only a few studies looked for signatures of balancing selection in the human genome. This is partially due to the paucity of powerful methods to detect its signatures. Moreover, previous studies either did not analyze data on genomic scale or focused primarily on protein-coding regions. Here, we describe a powerful and simple method to detect signatures of balancing selection. In humans, it outperforms other methods commonly used to detect such signatures and could in theory be used for other species, provided that its power is evaluated for each species through neutral simulations. Our method (\"Non-Central Deviation\", NCD) has two versions: NCD2, which requires polymorphism information on the ingroup species, as well as divergence information between the ingroup and an outgroup species, and NCD1, which only requires the ingroup information. Although NCD2 is more powerful for humans, NCD1 can be used for species that lack information from an outgroup. When applying NCD2 to human data, using chimpanzee as the outgroup, we found more than 200 protein-coding regions with strong signatures of balancing selection, only 1/3 of which had prior evidence for balancing selection. There was also an enrichment for several gene ontology categories, approximately half of which are related to immunity. We also found that among genes with evidence for balancing selection there was an excess of cases of preferential expression in specific tissues, such as \"adrenal\" and \"lung\", and an excess of genes with mono-allelic expression. Overall, we found that selected regions of the genome include both coding and regulatory sites. We failed to find a marked excess of balancing selection in regulatory regions, as reported in previous studies. Finally, we found an excess of nonsynonymous versus synonymous polymorphisms within the selected genes. Having documented the occurrence of balancing selection in the human genome and identified genes which were potential targets of this selective regime, we next investigated evolutionary consequences of this process. We hypothesized that balancing selection acting on a site reduces the efficiency with which purifying selection purges deleterious variants at nearby sites. This process is a consequence of how the dynamics of selection at one locus, mediated by linkage, can interfere with the frequencies of adjacent non-neutral sites. We tested this hypothesis by examining if the genes under balancing selection show an excess of deleterious variants with respect to expectations derived from the remainder of the genome. Using three different metrics to determine deleteriousness, we identified a significant excess of deleterious variants within balanced genes, and we show that this pattern cannot be attributed to confounding factors. This finding shows that together with the benefits associated with adaptive variation, balancing selection is increasing the burden of deleterious mutations in the human genome. Overall, our findings suggest that balancing selection likely maintains variation in a myriad of biological processes other than immunity and that it has been more common in the human genome than previously thought, affecting between 1-8% of human protein-coding genes, as well as a number of non-protein coding regions. Moreover, balancing selection appears to be important to human evolution not only because of its influence on fitness, but also because it has been an important force shaping current human genetic diversity and susceptibility to disease (AU)

FAPESP's process: 11/12500-2 - Maladaptation as a byproduct of adaptation: a genomic scale study
Grantee:Bárbara Domingues Bitarello
Support Opportunities: Scholarships in Brazil - Doctorate