Similar literature
Found 20 similar articles (search time: 62 ms)
1.
Runs of homozygosity (ROH) are widely used as predictors of whole-genome inbreeding levels in cattle. They identify regions that have an unfavorable effect on a phenotype when homozygous, but also identify the genes associated with traits of economic interest present in these regions. Here, the distribution of ROH islands and enriched genes within these regions in four dairy cattle breeds was investigated. Cinisara (71), Modicana (72), Reggiana (168) and Italian Holstein (96) individuals were genotyped using the 50K v2 Illumina BeadChip. The genomic regions most commonly associated with ROHs were identified by selecting the top 1% of the single nucleotide polymorphisms (SNPs) most commonly observed in the ROH of each breed. In total, 11 genomic regions were identified in Cinisara and Italian Holstein, and eight in Modicana and Reggiana, indicating an increased ROH frequency level. Generally, ROH islands differed between breeds. The most homozygous region (>45% of individuals with ROH) was found in Modicana on chromosome 6 within a quantitative trait locus affecting milk fat and protein concentrations. We identified between 126 and 347 genes within ROH islands, which are involved in multiple signaling and signal transduction pathways in a wide variety of biological processes. The gene ontology enrichment provided information on possible molecular functions, biological processes and cellular components under selection related to milk production, reproduction, immune response and resistance/susceptibility to infection and diseases. Thus, scanning the genome for ROH could be an alternative strategy to detect genomic regions and genes related to important economic traits.
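The "top 1% of SNPs most commonly observed in ROH" criterion described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the per-animal list of (start, end) ROH segments on a single chromosome, and the tie-breaking rule are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[int, int]  # hypothetical (start_bp, end_bp) of one ROH


def snp_roh_frequency(snp_positions: Sequence[int],
                      roh_per_animal: Sequence[Sequence[Segment]]) -> List[float]:
    """Fraction of animals whose ROH segments cover each SNP position."""
    n_animals = len(roh_per_animal)
    freqs = []
    for pos in snp_positions:
        # Count animals having at least one ROH that spans this SNP.
        covered = sum(
            any(start <= pos <= end for start, end in segments)
            for segments in roh_per_animal
        )
        freqs.append(covered / n_animals)
    return freqs


def top_fraction_snps(snp_positions: Sequence[int],
                      freqs: Sequence[float],
                      fraction: float = 0.01) -> List[int]:
    """Positions of the SNPs in the top `fraction` by ROH frequency.

    Assumption: ties are broken by position, and at least one SNP is kept.
    """
    k = max(1, int(len(freqs) * fraction))
    ranked = sorted(zip(freqs, snp_positions), key=lambda t: (-t[0], t[1]))
    return sorted(pos for _, pos in ranked[:k])
```

In this sketch, adjacent top-ranked SNPs would then be merged into candidate ROH islands; how neighbouring SNPs are merged is a further design choice that the abstract does not specify.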

2.
The intensive selection programs for milk made possible by mass artificial insemination increased the similarity among the genomes of North American (NA) Holsteins tremendously since the 1960s. This migration of elite alleles has caused certain regions of the genome to have runs of homozygosity (ROH) occasionally spanning millions of continuous base pairs at a specific locus. In this study, genome signatures of artificial selection in NA Holsteins born between 1953 and 2008 were identified by comparing changes in ROH between three distinct groups under different selective pressure for milk production. The ROH regions were also used to estimate the inbreeding coefficients. The comparisons of genomic autozygosity between groups selected or unselected since 1964 for milk production revealed significant differences with respect to overall ROH frequency and distribution. These results indicate selection has increased overall autozygosity across the genome, whereas the autozygosity in an unselected line has not changed significantly across most of the chromosomes. In addition, ROH distribution was more variable across the genomes of selected animals in comparison to a more even ROH distribution for unselected animals. Further analysis of genome-wide autozygosity changes and the association between traits and haplotypes identified more than 40 genomic regions under selection on several chromosomes (Chr) including Chr 2, 7, 16 and 20. Many of these selection signatures corresponded to quantitative trait loci for milk, fat, and protein yield previously found in contemporary Holsteins.

3.
The diversity of populations in domestic species offers great opportunities to study genome response to selection. The recently published Sheep HapMap dataset is a great example of characterization of the worldwide genetic diversity in sheep. In this study, we re-analyzed the Sheep HapMap dataset to identify selection signatures in worldwide sheep populations. Compared to previous analyses, we made use of statistical methods that (i) take account of the hierarchical structure of sheep populations, (ii) make use of linkage disequilibrium information and (iii) focus specifically on either recent or older selection signatures. We show that this allows pinpointing several new selection signatures in the sheep genome and distinguishing those related to modern breeding objectives from those related to earlier post-domestication constraints. The newly identified regions, together with the ones previously identified, reveal the extensive genome response to selection on morphology, color and adaptation to new environments.

4.
5.
Inbreeding has long been recognized as a primary cause of fitness reduction in both wild and domesticated populations. Consanguineous matings cause inheritance of haplotypes that are identical by descent (IBD) and result in homozygous stretches along the genome of the offspring. Size and position of regions of homozygosity (ROHs) are expected to correlate with genomic features such as GC content and recombination rate, but also direction of selection. Thus, ROHs should be non-randomly distributed across the genome. Therefore, demographic history may not fully predict the effects of inbreeding. The porcine genome has a relatively heterogeneous distribution of recombination rate, making Sus scrofa an excellent model to study the influence of both recombination landscape and demography on genomic variation. This study utilizes next-generation sequencing data for the analysis of genomic ROH patterns, using a comparative sliding window approach. We present an in-depth study of genomic variation based on three different parameters: nucleotide diversity outside ROHs, the number of ROHs in the genome, and the average ROH size. We identified an abundance of ROHs in all genomes of multiple pigs from commercial breeds and wild populations from Eurasia. Size and number of ROHs are in agreement with known demography of the populations, with population bottlenecks highly increasing ROH occurrence. Nucleotide diversity outside ROHs is high in populations derived from a large ancient population, regardless of current population size. In addition, we show an unequal genomic ROH distribution, with strong correlations of ROH size and abundance with recombination rate and GC content. Global gene content does not correlate with ROH frequency, but some ROH hotspots do contain positively selected genes in commercial lines and wild populations.
This study highlights the importance of the influence of demography and recombination on homozygosity in the genome to understand the effects of inbreeding.

6.

Background

The NCI-60 is a collection of tumor cell lines derived from a variety of human adult cancer tissue types and is commonly used for genetic analysis and screening of potential chemotherapeutic agents. We wanted to understand the contributions of specific mechanisms of genomic instability to the etiology of cancers represented by the NCI-60.

Results

We screened the NCI-60 for dysregulated homologous recombination by using the gene cluster instability (GCI) assay we pioneered, and for defects in base excision repair by sensitivity to 5-hydroxymethyl-2'-deoxyuridine (hmdUrd). We identified subsets of the NCI-60 lines that either displayed the characteristic molecular signature of GCI or were sensitive to hmdUrd. With the exception of the NCI-H23 lung cancer line, these phenotypes were not found to overlap. None of the lines examined in either subset exhibited significant changes in the frequency of sister chromatid exchanges (SCE), nor did any of them exhibit microsatellite instability (MSI) indicative of defects in DNA mismatch repair.

Conclusions

Gene cluster instability, sensitivity to hmdUrd and sister chromatid exchange are mechanistically distinct phenomena. Genomic instability in the NCI-60 appears to involve only one mechanism of instability for each individual cell line.

7.

Background

Recent developments in sequencing technology have facilitated widespread investigations of genomic variants, including continuous stretches of homozygous genomic regions. For cattle, a large proportion of these runs of homozygosity (ROH) are likely the result of inbreeding due to the accumulation of elite alleles from long-term selective breeding programs. In the present study, ROH were characterized in four cattle breeds with whole genome sequence data and the distribution of predicted functional variants was detected in ROH regions and across different ROH length classes.

Results

On average, 19.5% of the genome was located in ROH across four cattle breeds. There were an average of 715.5 ROH per genome with an average size of ~750 kbp, ranging from 10 (minimum size considered) to 49,290 kbp. There was a significant correlation between shared short ROH regions and regions putatively under selection (P < 0.001). By investigating the relationship between ROH and the predicted deleterious and non-deleterious variants, we gained insight into the distribution of functional variation in inbred (ROH) regions. Predicted deleterious variants were more enriched in ROH regions than predicted non-deleterious variants, which is consistent with observations in the human genome. We also found that enrichment of deleterious variants was significantly higher in short (<100 kbp) and medium (0.1 to 3 Mbp) ROH regions than in long (>3 Mbp) ROH regions (P < 0.001), which differs from what has been observed in the human genome.

Conclusions

This study illustrates the distribution of ROH and functional variants within ROH in cattle populations. These patterns are different from those in the human genome but consistent with the natural history of cattle populations, which is confirmed by the significant correlation between shared short ROH regions and regions putatively under selection. These findings contribute to understanding the effects of inbreeding and probably selection in shaping the distribution of functional variants in the cattle genome.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1715-x) contains supplementary material, which is available to authorized users.

8.
The availability of dense single nucleotide polymorphism (SNP) assays allows for the determination of autozygous segments based on runs of consecutive homozygous genotypes (ROH). The aim of the present study was to investigate the occurrence and distribution of ROH in 21 Italian sheep breeds using medium‐density SNP genotypes in order to characterize autozygosity and identify genomic regions that frequently appeared in ROH within individuals, namely ROH islands. After filtering, the final numbers of animals and SNPs retained for analyses were 502 and 46 277 respectively. A total of 12 302 ROH were identified. The mean number of ROH per breed ranged from 10.58 (Comisana) to 44.54 (Valle del Belice). The average length of ROH across breeds was 4.55 Mb and ranged from 3.85 Mb (Biellese) to 5.51 Mb (Leccese). Valle del Belice showed the highest value of inbreeding on the basis of ROH (FROH = 0.099), whereas Comisana showed the lowest (FROH = 0.016), and high standard deviation values revealed high variability in autozygosity levels within each breed. Differences also existed in the length of ROH. Analysis of the distribution of ROH according to their size showed that, for all breeds, the majority of the detected ROH were <10 Mb in length, with a few long ROH >25 Mb. The levels of ROH that we estimated here reflect the inbreeding history of the investigated sheep breeds. These results also highlight that ancient and recent inbreeding have had an impact on the genome of the Italian sheep breeds and suggest that several animals have experienced recent autozygosity events. Comisana and Bergamasca appeared as the least consanguineous breeds, whereas Barbaresca, Leccese and Valle del Belice showed ROH patterns typically produced by recent inbreeding. Moreover, within the genomic regions most commonly associated with ROH, several candidate genes were detected.
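The idea of calling ROH from "runs of consecutive homozygous genotypes" can be sketched in a few lines of Python. This is a deliberately simplified illustration, not the method used in the study: the genotype coding (0 and 2 homozygous, 1 heterozygous, anything else treated as missing), the function name and the thresholds are assumptions, and dedicated tools typically use sliding windows and tolerate a small number of heterozygous or missing calls.

```python
from typing import List, Sequence, Tuple


def find_roh(positions: Sequence[int],
             genotypes: Sequence[int],
             min_snps: int = 3,
             min_length_bp: int = 1) -> List[Tuple[int, int]]:
    """Return (start_bp, end_bp) of strict runs of homozygous genotypes.

    Assumed coding: 0 or 2 = homozygous; any other value ends the run.
    """
    segments = []
    run_start = None
    # A heterozygous sentinel is appended so the last run is always closed.
    for i, g in enumerate(list(genotypes) + [1]):
        if g in (0, 2):
            if run_start is None:
                run_start = i
            continue
        if run_start is not None:
            n_snps = i - run_start
            length = positions[i - 1] - positions[run_start]
            if n_snps >= min_snps and length >= min_length_bp:
                segments.append((positions[run_start], positions[i - 1]))
            run_start = None
    return segments
```

With medium-density SNP data, the minimum number of SNPs and the minimum length strongly affect which short runs are reported, so these two parameters would need to be chosen to match the marker density.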

9.
Correct annotation of the genetic relationships between samples is essential for population genomic studies, which could be biased by errors or omissions. To this end, we used identity-by-state (IBS) and identity-by-descent (IBD) methods to assess genetic relatedness of individuals within HapMap phase III data. We analyzed data from 1,397 individuals across 11 ethnic populations. Our results support previous studies (Pemberton et al., 2010; Kyriazopoulou-Panagiotopoulou et al., 2011) assessing unknown relatedness present within this population. Additionally, we present evidence for 1,657 novel pairwise relationships across 9 populations. Surprisingly, significant Cotterman's coefficients of relatedness K1 (IBD1) values were detected between pairs of known parents. Furthermore, significant K2 (IBD2) values were detected in 32 previously annotated parent-child relationships. Consistent with a hypothesis of inbreeding, regions of homozygosity (ROH) were identified in the offspring of related parents, of which a subset overlapped those reported in previous studies (Gibson et al. 2010; Johnson et al. 2011). In total, we inferred 28 inbred individuals with ROH that overlapped areas of relatedness between the parents and/or IBD2 sharing at a different genomic locus between a child and a parent. Finally, 8 previously annotated parent-child relationships had unexpected K0 (IBD0) values (resulting from a chromosomal abnormality or genotype error), and 10 previously annotated second-degree relationships along with 38 other novel pairwise relationships had unexpected IBD2 (indicating two separate paths of recent ancestry). These newly described types of relatedness may impact the outcome of previous studies and should inform the design of future studies relying on the HapMap Phase III resource.

10.
DNA base composition is a fundamental genome feature. However, the evolutionary pattern of base composition and its potential causes have not been well understood. Here, we report findings from comparative analysis of base composition at the whole-genome level across 2210 species, the polymorphic-site level across eight population comparison sets, and the mutation-site level in 12 mutation-tracking experiments. We first demonstrate that base composition follows the individual-strand base equality rule at the genome, chromosome and polymorphic-site levels. More intriguingly, clear separation of base-composition values calculated across polymorphic sites was consistently observed between basal and derived groups, suggesting common underlying mechanisms. Individuals in the derived groups show an A&T-increase/G&C-decrease pattern compared with the basal groups. Spontaneous and induced mutation experiments indicated these patterns of base composition change can emerge across mutation sites. With base-composition across polymorphic sites as a genome phenotype, genome scans with human 1000 Genomes and HapMap3 data identified a set of significant genomic regions enriched with Gene Ontology terms for DNA repair. For three DNA repair genes (BRIP1, PMS2P3 and TTDN), ENCODE data provided evidence for interaction between genomic regions containing these genes and regions containing the significant SNPs. Our findings provide insights into the mechanisms of genome evolution.

11.
An international effort is underway to generate a comprehensive haplotype map (HapMap) of the human genome represented by an estimated 300 000 to 1 million 'tag' single nucleotide polymorphisms (SNPs). Our analysis indicates that the current human SNP map is not sufficiently dense to support the HapMap project. For example, 24.6% of the genome currently lacks SNPs at the minimal density and spacing that would be required to construct even a conservative tag SNP map containing 300 000 SNPs. In an effort to improve the human SNP map, we identified 140 696 additional SNP candidates using a new bioinformatics pipeline. Over 51 000 of these SNPs mapped to the largest gaps in the human SNP map, leading to significant improvements in these regions. Our SNPs will be immediately useful for the HapMap project, and will allow for the inclusion of many additional genomic intervals in the final HapMap. Nevertheless, our results also indicate that additional SNP discovery projects will be required both to define the haplotype architecture of the human genome and to construct comprehensive tag SNP maps that will be useful for genetic linkage studies in humans.

12.
Genome-wide scanning for signals of recent positive selection is essential for a comprehensive and systematic understanding of human adaptation. Here, we present a genomic survey of recent local selective sweeps, especially aimed at those nearly or recently completed. A novel approach was developed for such signals, based on contrasting the extended haplotype homozygosity (EHH) profiles between populations. We applied this method to the genome single nucleotide polymorphism (SNP) data of both the International HapMap Project and Perlegen Sciences, and detected widespread signals of recent local selection across the genome, consisting of both complete and partial sweeps. A challenging problem of genomic scans of recent positive selection is to clearly distinguish selection from neutral effects, given the high sensitivity of the test statistics to departures from neutral demographic assumptions and the lack of a single, accurate neutral model of human history. We therefore developed a new procedure that is robust across a wide range of demographic and ascertainment models, one that indicates that certain portions of the genome clearly depart from neutrality. Simulations of positive selection showed that our tests have high power towards strong selection sweeps that have undergone fixation. Gene ontology analysis of the candidate regions revealed several new functional groups that might help explain some important interpopulation differences in phenotypic traits.

13.
14.
A newly discovered gammaretrovirus, termed XMRV, was recently reported to be present in the prostate cancer cell line CWR22Rv1. Using a combination of both immunohistochemistry with broadly-reactive murine leukemia virus (MLV) anti-sera and PCR, we determined if additional prostate cancer or other cell lines contain XMRV or MLV-related viruses. Our study included a total of 72 cell lines, which included 58 of the 60 human cancer cell lines used in anticancer drug screens and maintained at the NCI-Frederick (NCI-60). We have identified gammaretroviruses in two additional prostate cancer cell lines: LAPC4 and VCaP, and show that these viruses are replication competent. Viral genome sequencing identified the virus in LAPC4 and VCaP as nearly identical to another known xenotropic MLV, Bxv-1. We also identified a gammaretrovirus in the non-small-cell lung carcinoma cell line EKVX. Prostate cancer cell lines appear to have a propensity for infection with murine gammaretroviruses, and we propose that this may be in part due to cell line establishment by xenograft passage in immunocompromised mice. It is unclear if infection with these viruses is necessary for cell line establishment, or what confounding role they may play in experiments performed with these commonly used lines. Importantly, our results suggest a need for regular screening of cancer cell lines for retroviral "contamination", much like routine mycoplasma testing.

15.
Genome-wide patterns of homozygosity runs and their variation across individuals provide a valuable and often untapped resource for studying human genetic diversity and evolutionary history. Using genotype data at 577,489 autosomal SNPs, we employed a likelihood-based approach to identify runs of homozygosity (ROH) in 1,839 individuals representing 64 worldwide populations, classifying them by length into three classes (short, intermediate, and long) with a model-based clustering algorithm. For each class, the number and total length of ROH per individual show considerable variation across individuals and populations. The total lengths of short and intermediate ROH per individual increase with the distance of a population from East Africa, in agreement with similar patterns previously observed for locus-wise homozygosity and linkage disequilibrium. By contrast, total lengths of long ROH show large interindividual variations that probably reflect recent inbreeding patterns, with higher values occurring more often in populations with known high frequencies of consanguineous unions. Across the genome, distributions of ROH are not uniform, and they have distinctive continental patterns. ROH frequencies across the genome are correlated with local genomic variables such as recombination rate, as well as with signals of recent positive selection. In addition, long ROH are more frequent in genomic regions harboring genes associated with autosomal-dominant diseases than in regions not implicated in Mendelian diseases. These results provide insight into the way in which homozygosity patterns are produced, and they generate baseline homozygosity patterns that can be used to aid homozygosity mapping of genes associated with recessive diseases.

16.
Genetic and genomic studies highlight the substantial complexity and heterogeneity of human cancers and emphasize the general lack of therapeutics that can match this complexity. With the goal of expanding opportunities for drug discovery, we describe an approach that makes use of a phenotype-based screen combined with the use of multiple cancer cell lines. In particular, we have used the NCI-60 cancer cell line panel that includes drug sensitivity measures for over 40,000 compounds assayed on 59 independent cell lines. Targets are cancer-relevant phenotypes represented as gene expression signatures that are used to identify cells within the NCI-60 panel reflecting the signature phenotype and then connect to compounds that are selectively active against those cells. As a proof-of-concept, we show that this strategy effectively identifies compounds with selectivity to the RAS or PI3K pathways. We have then extended this strategy to identify compounds that have activity towards cells exhibiting the basal phenotype of breast cancer, a clinically-important breast cancer characterized as ER-, PR-, and Her2- that lacks viable therapeutic options. One of these compounds, Simvastatin, has previously been shown to inhibit breast cancer cell growth in vitro and importantly, has been associated with a reduction in ER-, PR- breast cancer in a clinical study. We suggest that this approach provides a novel strategy towards identification of therapeutic agents based on clinically relevant phenotypes that can augment the conventional strategies of target-based screens.

17.
Organisms are remarkably adapted to diverse environments by specialized metabolisms, morphology, or behaviors. To address the molecular mechanisms underlying environmental adaptation, we have utilized a Drosophila melanogaster line, termed "Dark-fly", which has been maintained in constant dark conditions for 57 years (1400 generations). We found that Dark-fly exhibited higher fecundity in dark than in light conditions, indicating that Dark-fly possesses some traits advantageous in darkness. Using next-generation sequencing technology, we determined the whole genome sequence of Dark-fly and identified approximately 220,000 single nucleotide polymorphisms (SNPs) and 4,700 insertions or deletions (InDels) in the Dark-fly genome compared to the genome of the Oregon-R-S strain, a control strain. Of the SNPs, 1.8% were classified as non-synonymous SNPs (nsSNPs: i.e., they alter the amino acid sequence of gene products). Among them, we detected 28 nonsense mutations (i.e., they produce a stop codon in the protein sequence) in the Dark-fly genome. These included genes encoding an olfactory receptor and a light receptor. We also searched for runs of homozygosity (ROH) as putative regions selected during the population history, and found 21 ROH regions in the Dark-fly genome. We identified 241 genes carrying nsSNPs or InDels in the ROH regions. These include a cluster of alpha-esterase genes that are involved in detoxification processes. Furthermore, analysis of structural variants in the Dark-fly genome showed the deletion of a gene related to fatty acid metabolism. Our results revealed unique features of the Dark-fly genome and provided a list of potential candidate genes involved in environmental adaptation.

18.
Estivill X, Armengol L. PLoS Genetics, 2007, 3(10): 1787-1799
Genome-wide association scans (GWASs) using single nucleotide polymorphisms (SNPs) have been completed successfully for several common disorders and have detected over 30 new associations. Considering the large sample sizes and genome-wide SNP coverage of the scans, one might have expected many of the common variants underpinning the genetic component of various disorders to have been identified by now. However, these studies have not evaluated the contribution of other forms of genetic variation, such as structural variation, mainly in the form of copy number variants (CNVs). Known CNVs account for over 15% of the assembled human genome sequence. Since CNVs are not easily tagged by SNPs, might have a wide range of copy number variability, and often fall in genomic regions not well covered by whole-genome arrays or not genotyped by the HapMap project, current GWASs have largely missed the contribution of CNVs to complex disorders. In fact, some CNVs have already been reported to show association with several complex disorders using candidate gene/region approaches, underpinning the importance of regions not investigated in current GWASs. This reveals the need for new generation arrays (some already in the market) and the use of tailored approaches to explore the full dimension of genome variability beyond the single nucleotide scale.

19.
This review presents a broader approach to the implementation and study of runs of homozygosity (ROH) in animal populations, focusing on identifying and characterizing ROH and their practical implications. ROH are continuous homozygous segments that are common in individuals and populations. The ability of these homozygous segments to give insight into a population's genetic events makes them a useful tool that can provide information about the demographic evolution of a population over time. Furthermore, ROH provide useful information about the genetic relatedness among individuals, helping to minimize the inbreeding rate and also helping to expose deleterious variants in the genome. The frequency, size and distribution of ROH in the genome are influenced by factors such as natural and artificial selection, recombination, linkage disequilibrium, population structure, mutation rate and inbreeding level. Calculating the inbreeding coefficient from molecular information from ROH (FROH) is more accurate for estimating autozygosity and for detecting both past and more recent inbreeding effects than are estimates from pedigree data (FPED). The better results of FROH suggest that FROH can be used to infer information about the history and inbreeding levels of a population in the absence of genealogical information. The selection of superior animals has produced large phenotypic changes and has reshaped the ROH patterns in various regions of the genome. Additionally, selection increases homozygosity around the target locus, and deleterious variants are seen to occur more frequently in ROH regions. Studies involving ROH are increasingly common and provide valuable information about how the genome's architecture can disclose a population's genetic background. 
By revealing the molecular changes in populations over time, genome‐wide information is crucial to understanding antecedent genome architecture and, therefore, to maintaining diversity and fitness in endangered livestock breeds.
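The ROH-based inbreeding coefficient (FROH) mentioned in this review is commonly defined as the total length of an individual's ROH divided by the length of the genome covered by markers. A minimal Python sketch under that definition follows; the function name, the (start, end) segment representation and the optional minimum-length filter are assumptions for illustration, and the review itself does not prescribe an implementation.

```python
from typing import Sequence, Tuple


def f_roh(roh_segments: Sequence[Tuple[int, int]],
          genome_length_bp: int,
          min_length_bp: int = 0) -> float:
    """F_ROH = (sum of ROH lengths) / (covered autosomal genome length).

    Only segments at least `min_length_bp` long are counted, so the same
    function can be reused per length class (an assumed convenience).
    """
    total = sum(
        end - start
        for start, end in roh_segments
        if end - start >= min_length_bp
    )
    return total / genome_length_bp
```

Restricting the sum to long segments gives an estimate weighted towards recent inbreeding, whereas including short segments also captures older autozygosity; the length cut-offs themselves vary between studies.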

20.
Malignant tumours are often characterised by significant rearrangement of the genome. This may be visible in the form of a deranged karyotype with both loss and gain of DNA sequences extending from chromosomal regions to whole chromosomes. In several tumour types, however, gross genomic derangements are minimal, and tumour cells contain one or more additional (supernumerary) chromosomes that may be unrecognisable in terms of a single origin. In this review we term such chromosomes cancer‐associated neochromosomes (CaNCs). In the absence of other identified genomic abnormalities, and because the CaNC is a common feature of the cancer type, it is hypothesised that the genetic alterations required for cell transformation are contained within its structure. In this review, we discuss the potential impact of modern genomic technologies on our understanding of the nature and causes of CaNC formation, which is central to several cancer types, exemplified here by well‐differentiated liposarcoma.
