首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 903 毫秒
1.
Wang J 《Genetics》2011,187(3):887-901
Knowledge of the genetic relatedness between individuals is important in many research areas in quantitative genetics, conservation genetics, forensics, evolution, and ecology. In the absence of pedigree records, relatedness can be estimated from genetic marker data using a number of estimators. These estimators, however, make the critical assumption of a large random mating population without genetic structures. The assumption is frequently violated in the real world where geographic/social structures or nonrandom mating usually lead to genetic structures. In this study, I investigated two approaches to the estimation of relatedness between a pair of individuals from a subpopulation due to recent common ancestors (i.e., relatedness is defined and measured with the current focal subpopulation as reference). The indirect approach uses the allele frequencies of the entire population with and without accounting for the population structure, and the direct approach uses the allele frequencies of the current focal subpopulation. I found by simulations that currently widely applied relatedness estimators are upwardly biased under the indirect approach, but can be modified to become unbiased and more accurate by using Wright's F(st) to account for population structures. However, the modified unbiased estimators under the indirect approach are clearly inferior to the unmodified original estimators under the direct approach, even when small samples are used in estimating both allele frequencies and relatedness.  相似文献   

2.
The identification of related and unrelated individuals from molecular marker data is often difficult, particularly when no pedigree information is available and the data set is large. High levels of relatedness or inbreeding can influence genotype frequencies and thus genetic marker evaluation, as well as the accurate inference of hidden genetic structure. Identification of related and unrelated individuals is also important in breeding programmes, to inform decisions about breeding pairs and translocations. We present Friends and Family, a Windows executable program with a graphical user interface that identifies unrelated individuals from a pairwise relatedness matrix or table generated in programs such as coancestry and genalex . Friends and Family outputs a list of samples that are all unrelated to each other, based on a user‐defined relatedness cut‐off value. This unrelated data set can be used in downstream analyses, such as marker evaluation or inference of genetic structure. The results can be compared to that of the full data set to determine the effect related individuals have on the analyses. We demonstrate one of the applications of the program: how the removal of related individuals altered the Hardy–Weinberg equilibrium test outcome for microsatellite markers in an empirical data set. Friends and Family can be obtained from https://github.com/DeondeJager/Friends-and-Family .  相似文献   

3.
Characterizing the genetic structure of worldwide populations is important for understanding human history and is essential to the design and analysis of genetic epidemiological studies. In this study, we examined genetic structure and distant relatedness and their effect on the extent of linkage disequilibrium (LD) and homozygosity in the founder population of Quebec (Canada). In the French Canadian founder population, such analysis can be performed using both genomic and genealogical data. We investigated genetic differences, extent of LD, and homozygosity in 140 individuals from seven sub-populations of Quebec characterized by different demographic histories reflecting complex founder events. Genetic findings from genome-wide single nucleotide polymorphism data were correlated with genealogical information on each of these sub-populations. Our genomic data showed significant population structure and relatedness present in the contemporary Quebec population, also reflected in LD and homozygosity levels. Our extended genealogical data corroborated these findings and indicated that this structure is consistent with the settlement patterns involving several founder events. This provides an independent and complementary validation of genomic-based studies of population structure. Combined genomic and genealogical data in the Quebec founder population provide insights into the effects of the interplay of two important sources of bias in genetic epidemiological studies, unrecognized genetic structure and cryptic relatedness.  相似文献   

4.
Both the ability to generate DNA data and the variety of analytical methods for conservation genetics are expanding at an ever-increasing pace. Analytical approaches are now possible that were unthinkable even five years ago due to limitations in computational power or the availability of DNA data, and this has vastly expanded the accuracy and types of information that may be gained from population genetic data. Here we provide a guide to recently developed methods for population genetic analysis, including identification of population structure, quantification of gene flow, and inference of demographic history. We cover both allele-frequency and sequence-based approaches, with a special focus on methods relevant to conservation genetic applications. Although classical population genetic approaches such as F st (and its derivatives) have carried the field thus far, newer, more powerful, methods can infer much more from the data, rely on fewer assumptions, and are appropriate for conservation genetic management when precise estimates are needed.  相似文献   

5.
Management of game ungulates alters population structure and habitat features, with potential effects on genetic structure. Here, we study 26 red deer (Cervus elaphus) populations in Spain. We used census data and habitat features as well as genetic information at 11 microsatellite markers from 717 individuals. We found that metapopulations presented a distribution associated with forest interruptions. Within metapopulations, fences did not have a significant effect on red deer genetic structure. The metapopulations we studied presented similar population structure, but they differed in habitat features and genetic structure. The metapopulation with higher resource availability showed a genetic structure pattern in which genetic relatedness between geographically close individuals was high while relatedness between geographically distant individuals was low. Contrarily, the metapopulation with lower resource availability presented a genetic structure pattern in which the genetic relatedness between individuals of different populations was independent of the geographic distance. We discuss the possible connection between resource availability and genetic structure. Finally, we did not find any population or environmental variable related to genetic differentiation within metapopulations.  相似文献   

6.
Effective conservation and management of pond‐breeding amphibians depends on the accurate estimation of population structure, demographic parameters, and the influence of landscape features on breeding‐site connectivity. Population‐level studies of pond‐breeding amphibians typically sample larval life stages because they are easily captured and can be sampled nondestructively. These studies often identify high levels of relatedness between individuals from the same pond, which can be exacerbated by sampling the larval stage. Yet, the effect of these related individuals on population genetic studies using genomic data is not yet fully understood. Here, we assess the effect of within‐pond relatedness on population and landscape genetic analyses by focusing on the barred tiger salamanders (Ambystoma mavortium) from the Nebraska Sandhills. Utilizing genome‐wide SNPs generated using a double‐digest RADseq approach, we conducted standard population and landscape genetic analyses using datasets with and without siblings. We found that reduced sample sizes influenced parameter estimates more than the inclusion of siblings, but that within‐pond relatedness led to the inference of spurious population structure when analyses depended on allele frequencies. Our landscape genetic analyses also supported different models across datasets depending on the spatial resolution analyzed. We recommend that future studies not only test for relatedness among larval samples but also remove siblings before conducting population or landscape genetic analyses. We also recommend alternative sampling strategies to reduce sampling siblings before sequencing takes place. Biases introduced by unknowingly including siblings can have significant implications for population and landscape genetic analyses, and in turn, for species conservation strategies and outcomes.  相似文献   

7.
The conservation and management of endangered species requires information on their genetic diversity, relatedness and population structure. The main genetic markers applied for these questions are microsatellites and single nucleotide polymorphisms (SNPs), the latter of which remain the more resource demanding approach in most cases. Here, we compare the performance of two approaches, SNPs obtained by restriction‐site‐associated DNA sequencing (RADseq) and 16 DNA microsatellite loci, for estimating genetic diversity, relatedness and genetic differentiation of three, small, geographically close wild brown trout (Salmo trutta) populations and a regionally used hatchery strain. The genetic differentiation, quantified as FST, was similar when measured using 16 microsatellites and 4,876 SNPs. Based on both marker types, each brown trout population represented a distinct gene pool with a low level of interbreeding. Analysis of SNPs identified half‐ and full‐siblings with a higher probability than the analysis based on microsatellites, and SNPs outperformed microsatellites in estimating individual‐level multilocus heterozygosity. Overall, the results indicated that moderately polymorphic microsatellites and SNPs from RADseq agreed on estimates of population genetic structure in moderately diverged, small populations, but RADseq outperformed microsatellites for applications that required individual‐level genotype information, such as quantifying relatedness and individual‐level heterozygosity. The results can be applied to other small populations with low or moderate levels of genetic diversity.  相似文献   

8.
Identifying population structure is one of the most common and important objectives of spatial analyses using population genetic data. Population structure is detected either by rejecting the null hypothesis of a homogenous distribution of genetic variation, or by estimating low migration rates. Issues arise with most current population genetic inference methods when the genetic divergence is low among putative populations. Low levels of genetic divergence may be as a result of either high ongoing migration or historic high migration but no current, ongoing migration. We direct attention to recent developments in the use of the tempo-spatial distribution of closely related individuals to detect population structure or estimate current migration rates. These 'kinship-based' approaches complement more traditional population-based genetic inference methods by providing a means to detect population structure and estimate current migration rates when genetic divergence is low. However, for kinship-based methods to become widely adopted, formal estimation procedures applicable to a range of species life histories are needed.  相似文献   

9.
Knowledge of how individuals are related is important in many areas of research, and numerous methods for inferring pairwise relatedness from genetic data have been developed. However, the majority of these methods were not developed for situations where data are limited. Specifically, most methods rely on the availability of population allele frequencies, the relative genomic position of variants and accurate genotype data. But in studies of non‐model organisms or ancient samples, such data are not always available. Motivated by this, we present a new method for pairwise relatedness inference, which requires neither allele frequency information nor information on genomic position. Furthermore, it can be applied not only to accurate genotype data but also to low‐depth sequencing data from which genotypes cannot be accurately called. We evaluate it using data from a range of human populations and show that it can be used to infer close familial relationships with a similar accuracy as a widely used method that relies on population allele frequencies. Additionally, we show that our method is robust to SNP ascertainment and applicable to low‐depth sequencing data generated using different strategies, including resequencing and RADseq, which is important for application to a diverse range of populations and species.  相似文献   

10.
Characterizing the spatial patterns of genetic diversity in human populations has a wide range of applications, from detecting genetic mutations associated with disease to inferring human history. Current approaches, including the widely used principal-component analysis, are not suited for the analysis of linked markers, and local and long-range linkage disequilibrium (LD) can dramatically reduce the accuracy of spatial localization when unaccounted for. To overcome this, we have introduced an approach that performs spatial localization of individuals on the basis of their genetic data and explicitly models LD among markers by using a multivariate normal distribution. By leveraging external reference panels, we derive closed-form solutions to the optimization procedure to achieve a computationally efficient method that can handle large data sets. We validate the method on empirical data from a large sample of European individuals from the POPRES data set, as well as on a large sample of individuals of Spanish ancestry. First, we show that by modeling LD, we achieve accuracy superior to that of existing methods. Importantly, whereas other methods show decreased performance when dense marker panels are used in the inference, our approach improves in accuracy as more markers become available. Second, we show that accurate localization of genetic data can be achieved with only a part of the genome, and this could potentially enable the spatial localization of admixed samples that have a fraction of their genome originating from a given continent. Finally, we demonstrate that our approach is resistant to distortions resulting from long-range LD regions; such distortions can dramatically bias the results when unaccounted for.  相似文献   

11.
We present a method, fastIBD, for finding tracts of identity by descent (IBD) between pairs of individuals. FastIBD can be applied to thousands of samples across genome-wide SNP data and is significantly more powerful for finding short tracts of IBD than existing methods for finding IBD tracts in such data. We show that fastIBD can detect facets of population structure that are not revealed by other methods. In the Wellcome Trust Case Control Consortium bipolar disorder case-control data, we find a genome-wide excess of IBD in case-case pairs of individuals compared to control-control pairs. We show that this excess can be explained by the geographical clustering of cases. We also show that it is possible to use fastIBD to generate highly accurate estimates of genome-wide IBD sharing between pairs of distant relatives. This is useful for estimation of relationship and for adjusting for relatedness in association studies. FastIBD is incorporated in the freely available Beagle software package.  相似文献   

12.
Studies of relatedness have been crucial in molecular ecology over the last decades. Good evidence of this is the fact that studies of population structure, evolution of social behaviours, genetic diversity and quantitative genetics all involve relatedness research. The main aim of this article was to review the most common graphical methods used in allele sharing studies for detecting and identifying family relationships. Both IBS‐ and IBD‐based allele sharing studies are considered. Furthermore, we propose two additional graphical methods from the field of compositional data analysis: the ternary diagram and scatterplots of isometric log‐ratios of IBS and IBD probabilities. We illustrate all graphical tools with genetic data from the HGDP‐CEPH diversity panel, using mainly 377 microsatellites genotyped for 25 individuals from the Maya population of this panel. We enhance all graphics with convex hulls obtained by simulation and use these to confirm the documented relationships. The proposed compositional graphics are shown to be useful in relatedness research, as they also single out the most prominent related pairs. The ternary diagram is advocated for its ability to display all three allele sharing probabilities simultaneously. The log‐ratio plots are advocated as an attempt to overcome the problems with the Euclidean distance interpretation in the classical graphics.  相似文献   

13.
Hardy OJ 《Molecular ecology》2003,12(6):1577-1588
A new estimator of the pairwise relatedness coefficient between individuals adapted to dominant genetic markers is developed. This estimator does not assume genotypes to be in Hardy-Weinberg proportions but requires a knowledge of the departure from these proportions (i.e. the inbreeding coefficient). Simulations show that the estimator provides accurate estimates, except for some particular types of individual pairs such as full-sibs, and performs better than a previously developed estimator. When comparing marker-based relatedness estimates with pedigree expectations, a new approach to account for the change of the reference population is developed and shown to perform satisfactorily. Simulations also illustrate that this new relatedness estimator can be used to characterize isolation by distance within populations, leading to essentially unbiased estimates of the neighbourhood size. In this context, the estimator appears fairly robust to moderate errors made on the assumed inbreeding coefficient. The analysis of real data sets suggests that dominant markers (random amplified polymorphic DNA, amplified fragment length polymorphism) may be as valuable as co-dominant markers (microsatellites) in studying microgeographic isolation-by-distance processes. It is argued that the estimators developed should find major applications, notably for conservation biology.  相似文献   

14.
Ritland K 《Molecular ecology》2000,9(9):1195-1204
This paper presents a perspective of how inferred relatedness, based on genetic marker data such as microsatellites or amplified fragment length polymorphisms (AFLPs), can be used to demonstrate quantitative genetic variation in natural populations. Variation at two levels is considered: among pairs of individuals within populations, and among pairs of subpopulations within a population. In the former, inferred pairwise relatedness, combined with trait measures, allow estimates of heritability 'in the wild'. In the latter, estimates of QST are obtained, in the absence of known heritabilities, via estimates of pairwise FST. Estimators of relatedness based on the 'Kronecker operator' are given. Both methods require actual variation of relationship, a rarely studied aspect of population structure, and not necessarily present. Some conditions for appropriate population structures in the wild are identified, in part through a review of recent studies.  相似文献   

15.
The estimation of quantitative genetic parameters in wild populations is generally limited by the accuracy and completeness of the available pedigree information. Using relatedness at genomewide markers can potentially remove this limitation and lead to less biased and more precise estimates. We estimated heritability, maternal genetic effects and genetic correlations for body size traits in an unmanaged long‐term study population of Soay sheep on St Kilda using three increasingly complete and accurate estimates of relatedness: (i) Pedigree 1, using observation‐derived maternal links and microsatellite‐derived paternal links; (ii) Pedigree 2, using SNP‐derived assignment of both maternity and paternity; and (iii) whole‐genome relatedness at 37 037 autosomal SNPs. In initial analyses, heritability estimates were strikingly similar for all three methods, while standard errors were systematically lower in analyses based on Pedigree 2 and genomic relatedness. Genetic correlations were generally strong, differed little between the three estimates of relatedness and the standard errors declined only very slightly with improved relatedness information. When partitioning maternal effects into separate genetic and environmental components, maternal genetic effects found in juvenile traits increased substantially across the three relatedness estimates. Heritability declined compared to parallel models where only a maternal environment effect was fitted, suggesting that maternal genetic effects are confounded with direct genetic effects and that more accurate estimates of relatedness were better able to separate maternal genetic effects from direct genetic effects. We found that the heritability captured by SNP markers asymptoted at about half the SNPs available, suggesting that denser marker panels are not necessarily required for precise and unbiased heritability estimates. Finally, we present guidelines for the use of genomic relatedness in future quantitative genetics studies in natural populations.  相似文献   

16.
The genomic era has led to an unprecedented increase in the availability of genome‐wide data for a broad range of taxa. Wildlife management strives to make use of these vast resources to enable refined genetic assessments that enhance biodiversity conservation. However, as new genomic platforms emerge, problems remain in adapting the usually complex approaches for genotyping of noninvasively collected wildlife samples. Here, we provide practical guidelines for the standardized development of reduced single nucleotide polymorphism (SNP) panels applicable for microfluidic genotyping of degraded DNA samples, such as faeces or hairs. We demonstrate how microfluidic SNP panels can be optimized to efficiently monitor European wildcat (Felis silvestris S.) populations. We show how panels can be set up in a modular fashion to accommodate informative markers for relevant population genetics questions, such as individual identification, hybridization assessment and the detection of population structure. We discuss various aspects regarding the implementation of reduced SNP panels and provide a framework that will allow both molecular ecologists and practitioners to help bridge the gap between genomics and applied wildlife conservation.  相似文献   

17.
Demographic processes directly affect patterns of genetic variation within contemporary populations as well as future generations, allowing for demographic inference from patterns of both present-day and past genetic variation. Advances in laboratory procedures, sequencing and genotyping technologies in the past decades have resulted in massive increases in high-quality genome-wide genetic data from present-day populations and allowed retrieval of genetic data from archaeological material, also known as ancient DNA. This has resulted in an explosion of work exploring past changes in population size, structure, continuity and movement. However, as genetic processes are highly stochastic, patterns of genetic variation only indirectly reflect demographic histories. As a result, past demographic processes need to be reconstructed using an inferential approach. This usually involves comparing observed patterns of variation with model expectations from theoretical population genetics. A large number of approaches have been developed based on different population genetic models that each come with assumptions about the data and underlying demography. In this article I review some of the key models and assumptions underlying the most commonly used approaches for past demographic inference and their consequences for our ability to link the inferred demographic processes to the archaeological and climate records.This article is part of the theme issue ‘Cross-disciplinary approaches to prehistoric demography’.  相似文献   

18.
Although a few hundred single nucleotide polymorphisms (SNPs) suffice to infer close familial relationships, high density genome-wide SNP data make possible the inference of more distant relationships such as 2nd to 9th cousinships. In order to characterize the relationship between genetic similarity and degree of kinship given a timeframe of 100–300 years, we analyzed the sharing of DNA inferred to be identical by descent (IBD) in a subset of individuals from the 23andMe customer database (n = 22,757) and from the Human Genome Diversity Panel (HGDP-CEPH, n = 952). With data from 121 populations, we show that the average amount of DNA shared IBD in most ethnolinguistically-defined populations, for example Native American groups, Finns and Ashkenazi Jews, differs from continentally-defined populations by several orders of magnitude. Via extensive pedigree-based simulations, we determined bounds for predicted degrees of relationship given the amount of genomic IBD sharing in both endogamous and ‘unrelated’ population samples. Using these bounds as a guide, we detected tens of thousands of 2nd to 9th degree cousin pairs within a heterogenous set of 5,000 Europeans. The ubiquity of distant relatives, detected via IBD segments, in both ethnolinguistic populations and in large ‘unrelated’ populations samples has important implications for genetic genealogy, forensics and genotype/phenotype mapping studies.  相似文献   

19.

Key message

Nineteen tuber quality traits in potato were phenotyped in 205 cultivars and 299 breeder clones. Association analysis using 3364 AFLP loci and 653 SSR-alleles identified QTL for these traits.

Abstract

Two association mapping panels were analysed for marker–trait associations to identify quantitative trait loci (QTL). The first panel comprised 205 historical and contemporary tetraploid potato cultivars that were phenotyped in field trials at two locations with two replicates (the academic panel). The second panel consisted of 299 potato cultivars and included recent breeds obtained from five Dutch potato breeding companies and reference cultivars (the industrial panel). Phenotypic data for the second panel were collected during subsequent clonal selection generations at the individual breeding companies. QTL were identified for 19 agro-morphological and quality traits. Two association mapping models were used: a baseline model without, and a more advanced model with correction for population structure and genetic relatedness. Correction for population structure and genetic relatedness was performed with a kinship matrix estimated from marker information. The detected QTL partly not only confirmed previous studies, e.g. for tuber shape and frying colour, but also new QTL were found like for after baking darkening and enzymatic browning. Pleiotropic effects could be discerned for several QTL.  相似文献   

20.
The northern hairy-nosed (NHN) wombat is perhaps Australia's most endangered mammal. Being fossorial and nocturnal as well as rare, NHN wombats are difficult to observe in the wild. Hence little is known of their social biology, such as their mating and dispersal systems. A hypothesis has been advanced that adult females of the species disperse post-breeding, leaving their young to inhabit the natal burrow. Female-biased dispersal is expected to result in higher relatedness amongst males in a burrow cluster than amongst females in a burrow cluster. The usefulness of a panel of microsatellite markers in estimating the relatedness structure, and in reconstructing pedigrees for, the sole known population of NHN wombats was assessed. Microsatellite genotypes at eight or nine loci were obtained from 58 of the 85 known individuals, and used to estimate pairwise individual relatedness using Queller & Goodnight's (1989) RELATEDNESS 4.2. Our analysis gave the unexpected result that both males and females were significantly more closely related to their same-sex burrow cluster mates than random, while opposite-sex animals sharing burrows were only slightly (nonsignificantly) more related than random. This raises the possibility of dispersal patterns which lead to association of same-sex relatives. The observed relatedness structure is not expected to make likely a high incidence of inbred matings, as close relatives of the opposite sex are not significantly associated in space. Parentage analysis was attempted using genetic exclusion and LOD likelihood ratios, but proved difficult because of low genetic variation, incomplete sampling of potential parents, and paucity of ecological data such as known mother/offspring pairs and ages of individuals.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号