Similar articles
 20 similar articles found
1.
The use of genome-wide single nucleotide polymorphism (SNP) data has recently proven useful in the study of human population structure. We have studied the internal genetic structure of the Swedish population using more than 350,000 SNPs from 1525 Swedes from all over the country genotyped on the Illumina HumanHap550 array. We have also compared them to 3212 worldwide reference samples, including Finns, northern Germans, British and Russians, based on the more than 29,000 SNPs that overlap between the Illumina and Affymetrix 250K Sty arrays. The Swedes - especially southern Swedes - were genetically close to the Germans and British, while their genetic distance to Finns was substantially greater. The overall structure within Sweden appeared clinal, and the substructure in the southern and middle parts was subtle. In contrast, the northern part of Sweden, Norrland, exhibited pronounced genetic differences both within the area and relative to the rest of the country. These distinctive genetic features of Norrland probably result mainly from isolation by distance and genetic drift caused by low population density. The internal structure within Sweden (FST = 0.0005 between provinces) was stronger than that in many Central European populations, although weaker than that observed, for instance, in Finland; importantly, it is of a magnitude that may hamper association studies with a moderate number of markers if cases and controls are not properly matched geographically. Overall, our results underline the potential of genome-wide data in analyzing substructure in populations that might otherwise appear relatively homogeneous, such as the Swedes.

2.
Knowledge of population structure and breed composition of a population can be advantageous for a number of reasons; these include designing optimal (cross)breeding strategies in order to maximise non-additive genetic effects, maintaining flockbook integrity by authenticating animals being registered and as a quality control measure in the genotyping process. The objectives of the present study were to 1) describe the population structure of 24 sheep breeds, 2) quantify the breed composition of both flockbook-recorded and crossbred animals using single nucleotide polymorphism BLUP (SNP-BLUP), and 3) quantify the accuracy of breed composition prediction from low-density genotype panels containing between 2000 and 6000 SNPs. In total, 9334 autosomal SNPs on 11 144 flockbook-recorded animals and 1172 crossbred animals were used. The population structure of all breeds was characterised by principal component analysis (PCA) as well as the pairwise breed fixation index (Fst). The total number of animals, all of which were purebred, included in the calibration population for SNP-BLUP was 2579 with the number of animals per breed ranging from 9 to 500. The remaining 9559 flockbook-recorded animals, composite breeds and crossbred animals represented the test population; three breeds were excluded from breed composition prediction. The breed composition predicted using SNP-BLUP with 9334 SNPs was considered the gold standard prediction. The pairwise breed Fst ranged from 0.040 (between the Irish Blackface and Scottish Blackface) to 0.282 (between the Border Leicester and Suffolk). Principal component analysis revealed that the Suffolk from Ireland and the Suffolk from New Zealand formed distinct, non-overlapping clusters. In contrast, the Texel from Ireland and that from New Zealand formed integrated, overlapping clusters. Composite animals such as the Belclare clustered close to their founder breeds (i.e., Finn, Galway, Lleyn and Texel).
When all 9334 SNPs were used to predict breed composition, an animal that had a majority breed proportion predicted to be ≥0.90 was defined as purebred for the present study. As the panel density decreased, the predicted breed proportion threshold used to identify animals as purebred also decreased (from ≥0.85 with 6000 SNPs to ≥0.60 with 2000 SNPs). In all, results from the study suggest that breed composition for purebred and crossbred animals can be determined with SNP-BLUP using ≥5000 SNPs.

3.
The existence of a large-scale population structure was investigated in Arabidopsis thaliana by studying patterns of polymorphism in a set of 71 European accessions. We used sequence polymorphism surveyed in 10 fragments of approximately 600 nucleotides and a set of nine microsatellite markers. Population structure was investigated using a model-based inference framework. Among the accessions studied, the presence of four groups was inferred using genetic data, without using prior information on the geographical origin of the accessions. Significant genetic isolation by geographical distance was detected at the group level, together with a geographical gradient in allelic richness across groups. These results are discussed with respect to the previously proposed scenario of postglacial colonization of Europe from putative glacial refugia. Finally, the contribution of the inferred structure to linkage disequilibrium among 171 pairs of essentially unlinked markers was also investigated. Linkage disequilibrium analysis revealed that significant associations detected in the whole sample were mainly due to genetic differentiation among the inferred groups. We discuss the implication of this finding for future association studies in A. thaliana.
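
As a rough illustration of the pairwise linkage disequilibrium analysis mentioned in this abstract, the sketch below computes the common r² statistic for two biallelic loci from phased haplotypes. The abstract does not say which LD statistic or test the authors used, so `ld_r2` and its 0/1 allele coding are assumptions made for this example. The count of 171 pairs matches 19 choose 2, which would follow if each of the 10 fragments and 9 microsatellites were treated as one marker (again an assumption).

```python
from collections import Counter
from math import comb


def ld_r2(haplotypes):
    """Return r^2 between two biallelic loci.

    `haplotypes` is a list of (a, b) tuples, one per sampled chromosome,
    with alleles coded 0 or 1 (hypothetical coding for this sketch).
    """
    n = len(haplotypes)
    counts = Counter(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n   # frequency of allele 1 at locus A
    p_b = sum(b for _, b in haplotypes) / n   # frequency of allele 1 at locus B
    p_ab = counts[(1, 1)] / n                 # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                      # coefficient of disequilibrium D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0                            # a monomorphic locus carries no LD signal
    return d * d / denom


# Number of unordered marker pairs among 19 markers (assumed 10 fragments + 9 microsatellites).
n_pairs = comb(19, 2)
```

Perfectly associated loci give r² = 1 and independent loci give r² = 0; in a structured sample, pooling differentiated groups can inflate r² even between unlinked markers, which is the effect the abstract describes.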

4.
5.
Highly informative genetic markers are essential for efficient management of cattle populations, as well as for food safety. After a decade of domination by microsatellite markers, a new type of genetic marker, the single nucleotide polymorphism (SNP), has recently appeared on the scene. In the present study, the exclusion power of both kinds of markers with regard to individual identification and parental analysis was directly compared in a Galloway cattle population. Seventeen bovine microsatellites were distributed in three incremental marker sets (10, 14 and 17 microsatellite markers) and used for cattle genotyping. A set of 43 bovine SNPs was used for genotyping the same cattle population. The accuracy of both kinds of markers in individual identification was evaluated using probability of identity estimations. These were 2.4 × 10⁻⁸ for the 10 microsatellite set, 2.3 × 10⁻¹¹ for the 14 microsatellite set, and 1.4 × 10⁻¹³ for the 17 microsatellite marker set. For the 43 SNP markers, the estimated probability of identity was 5.3 × 10⁻¹¹. The exclusion power of both kinds of markers in parental analysis was evaluated using paternity exclusion estimations and, in addition, by estimation of the parental exclusion probability in 18 Galloway family trios. Paternity exclusion was estimated to be over 99% for microsatellites, and approx. 98% for SNPs. Both the microsatellite and the SNP marker sets showed similar parental exclusion probabilities.
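
The probability of identity figures above can be made concrete with a small sketch. It uses the textbook formula for unrelated individuals under Hardy-Weinberg equilibrium, PI = Σ pᵢ⁴ + Σ (2pᵢpⱼ)² over allele pairs i < j, and multiplies across loci assuming independence. Whether the authors used exactly this estimator is not stated in the abstract; the function names are hypothetical.

```python
from itertools import combinations
from math import prod


def pi_locus(freqs):
    """Probability that two unrelated individuals share a genotype at one locus.

    `freqs` lists the allele frequencies at the locus (they should sum to 1).
    Assumes Hardy-Weinberg proportions.
    """
    homozygotes = sum(p ** 4 for p in freqs)
    heterozygotes = sum((2 * p * q) ** 2 for p, q in combinations(freqs, 2))
    return homozygotes + heterozygotes


def pi_combined(loci):
    """Multi-locus probability of identity, assuming independent loci."""
    return prod(pi_locus(freqs) for freqs in loci)


# A single SNP with both alleles at frequency 0.5 is the most informative
# biallelic case; a ten-allele microsatellite with even frequencies is far
# more informative per locus.
snp_pi = pi_locus([0.5, 0.5])
microsatellite_pi = pi_locus([0.1] * 10)
```

The per-locus gap between the two marker types is why, in the abstract, 43 SNPs land in the same range as 14 microsatellites.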

6.
Crossbreeding is an essential way of improving herd performance. However, parentage record errors are frequent, which lowers the accuracy of genetic parameter estimation and genetic evaluation. This study aims to build a single nucleotide polymorphism (SNP) panel with sufficient power for parentage testing in the crossbred population of Simmental and Holstein cattle. Direct sequencing of PCR products from pooled DNA, along with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry for genotyping the individuals, was applied. A panel comprising 50 highly informative SNPs for parentage analysis was developed in the crossbred population. The average minor allele frequency for SNPs was 0.43, and the cumulative probability of exclusion for single-parent and both-parent inference reached 0.99797 and 0.999999, respectively. The marker set for parentage verification was then used in a group of 81 trios with the aid of the likelihood-based parentage-assignment program of Cervus software. Reconfirmation with on-farm records showed that this 50-SNP system could provide sufficient and reliable information for parentage testing, with the parental errors for mother–offspring and sire–offspring being 8.6% and 18.5%, respectively. In conclusion, a low-cost and efficient set of SNPs for paternity testing in the Simmental and Holstein crossbred population is provided.
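
To show how a cumulative probability of exclusion builds up across a SNP panel, here is a minimal sketch. For a biallelic locus with allele frequencies p and q, an alleged parent is excluded without knowledge of the other parent only when parent and offspring are opposite homozygotes, which happens with probability 2p²q² under Hardy-Weinberg proportions. The abstract does not give the formula the authors used, so treating "single-parent" exclusion this way is an assumption, as are the function names.

```python
def pe_single_parent(p):
    """Exclusion probability at one biallelic SNP, other parent unknown.

    Exclusion requires opposite homozygotes (AA parent with BB offspring,
    or the reverse), each with probability p^2 * q^2.
    """
    q = 1.0 - p
    return 2.0 * p * p * q * q


def cumulative_pe(per_locus_pe):
    """Combine per-locus exclusion probabilities, assuming independent loci."""
    not_excluded = 1.0
    for pe in per_locus_pe:
        not_excluded *= 1.0 - pe
    return 1.0 - not_excluded


# Hypothetical panel: 50 SNPs that all sit at the reported average MAF of 0.43.
panel_pe = cumulative_pe([pe_single_parent(0.43)] * 50)
```

Under these simplifying assumptions the 50-SNP panel comes out near 0.998, the same order as the 0.99797 reported above; the real panel has varying allele frequencies, so an exact match is not expected.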

7.
We report 39 single nucleotide polymorphisms (SNPs) observed in 23 nuclear DNA sequences in coho salmon (Oncorhynchus kisutch). High‐throughput genotyping assays based on the 5′‐nuclease reaction were developed for 17 of these nuclear SNPs and for two previously published mitochondrial DNA SNPs. Minor allele frequency differences (Δq) among collections were between 5.2% and 51.2%, resulting in per locus FST estimates of 0.00–0.24 with an average of 0.09.
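
The link between an allele frequency difference Δq and a per-locus FST can be sketched for the simplest case of two equally weighted populations, using Wright's definition FST = (H_T − H_S) / H_T. For two populations this reduces to Δq² / (4·q̄·(1 − q̄)), where q̄ is the mean frequency. The abstract's estimates span several collections and the estimator is not named, so this two-population function is only an illustrative assumption.

```python
def fst_two_pops(q1, q2):
    """Wright's FST at one biallelic locus for two equally weighted populations."""
    q_bar = (q1 + q2) / 2.0
    h_total = 2.0 * q_bar * (1.0 - q_bar)           # expected heterozygosity, pooled
    if h_total == 0.0:
        return 0.0                                   # locus is fixed everywhere
    h_within = (2.0 * q1 * (1.0 - q1) + 2.0 * q2 * (1.0 - q2)) / 2.0
    return (h_total - h_within) / h_total


# A frequency difference of 0.2 centred on 0.5 gives FST = 0.2**2 / (4 * 0.25) = 0.04.
example_fst = fst_two_pops(0.4, 0.6)
```

Because the denominator depends on q̄, the same Δq yields a larger FST when the alleles are rare than when they are at intermediate frequency.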

8.
We report the characterization of 13 single nucleotide polymorphism (SNP) genotyping assays for chum salmon (Oncorhynchus keta). These assays are based on the 5′‐nuclease reaction and thus facilitate high‐throughput genotyping with minimal optimization time. Because data generated using these markers may be transported and combined across laboratories, SNPs offer the potential to reduce the amount of redundant work being done in mixture and migratory studies of chum salmon.

9.
The simultaneous analysis of multiple genomic loci is a powerful approach to studying the effects of population history and natural selection on patterns of genetic variation of a species. By surveying nucleotide sequence polymorphism at 334 randomly distributed genomic regions in 12 accessions of Arabidopsis thaliana, we examined whether a standard neutral model of nucleotide sequence polymorphism is consistent with observed data. The average nucleotide diversity was 0.0071 for total sites and 0.0083 for silent sites. Although levels of diversity are variable among loci, no correlation with local recombination rate was observed, but polymorphism levels were correlated for physically linked loci (<250 kb). We found that observed distributions of Tajima's D- and D/D(min)- and of Fu and Li's D-, D*- and F-, F*-statistics differed significantly from the expected distributions under a standard neutral model due to an excess of rare polymorphisms and high variances. Observed and expected distributions of Fay and Wu's H were not different, suggesting that demographic processes and not selection at multiple loci are responsible for the deviation from a neutral model. Maximum-likelihood comparisons of alternative demographic models like logistic population growth, glacial refugia, or past bottlenecks did not produce parameter estimates that were more consistent with observed patterns. However, exclusion of highly polymorphic "outlier loci" resulted in a fit to the logistic growth model. Various tests of neutrality revealed a set of candidate loci that may evolve under selection.
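
Of the neutrality statistics listed in this abstract, Tajima's D is the simplest to compute from summary data. The sketch below follows Tajima's (1989) published formula, taking the sample size n, the number of segregating sites S, and the mean number of pairwise differences as inputs. The function name and argument names are chosen for this example and are not taken from the study.

```python
from math import sqrt


def tajimas_d(n, seg_sites, mean_pairwise_diff):
    """Tajima's D from n sequences, S segregating sites and mean pairwise differences."""
    if n < 2 or seg_sites == 0:
        raise ValueError("need at least two sequences and one segregating site")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)
    # Difference between the two estimators of theta, scaled by its standard deviation.
    numerator = mean_pairwise_diff - seg_sites / a1
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return numerator / sqrt(variance)


# Illustrative input: 10 sequences, 16 segregating sites, 3.888889 mean differences.
d_value = tajimas_d(10, 16, 3.888889)
```

A negative value, as in this illustrative input, reflects an excess of rare variants relative to the neutral expectation, which is the pattern the abstract reports across loci.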

10.
Single nucleotide polymorphism (SNP) data can be used for parameter estimation via maximum likelihood methods as long as the way in which the SNPs were determined is known, so that an appropriate likelihood formula can be constructed. We present such likelihoods for several sampling methods. As a test of these approaches, we consider use of SNPs to estimate the parameter Theta = 4N_eμ (the scaled product of effective population size and per-site mutation rate), which is related to the branch lengths of the reconstructed genealogy. With infinite amounts of data, ML models using SNP data are expected to produce consistent estimates of Theta. With finite amounts of data the estimates are accurate when Theta is high, but tend to be biased upward when Theta is low. If recombination is present and not allowed for in the analysis, the results are additionally biased upward, but this effect can be removed by incorporating recombination into the analysis. SNPs defined as sites that are polymorphic in the actual sample under consideration (sample SNPs) are somewhat more accurate for estimation of Theta than SNPs defined by their polymorphism in a panel chosen from the same population (panel SNPs). Misrepresenting panel SNPs as sample SNPs leads to large errors in the maximum likelihood estimate of Theta. Researchers collecting SNPs should collect and preserve information about the method of ascertainment so that the data can be accurately analyzed.

11.
We developed algorithms that find a set of single nucleotide polymorphism (SNP) markers based on interval regularity, given either the number of SNPs to choose (m) or the desired interval (I), subject to minimum variance or minimum sum of squared deviations from I. In both cases, the number of all possible sets increases exponentially with respect to the number of input SNPs (n), but our algorithms find the minima with only O(n²) calculations and comparisons by elimination of redundancy.
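
The selection problem described here can be illustrated with a straightforward dynamic programme. This is not the authors' O(n²) algorithm, whose details the abstract does not give; it is a plainer O(m·n²) sketch that picks exactly m SNPs so that the sum of squared deviations of consecutive gaps from a target interval is minimal. The function name and the choice to fix both m and the interval are assumptions for this example.

```python
def pick_regular_snps(positions, m, interval):
    """Choose m positions whose consecutive gaps deviate least from `interval`.

    Returns (chosen_positions, total_squared_deviation).
    """
    pos = sorted(positions)
    n = len(pos)
    if not 1 <= m <= n:
        raise ValueError("m must be between 1 and the number of SNPs")
    inf = float("inf")
    # cost[k][j]: best cost of a k-SNP set whose last SNP is pos[j].
    cost = [[inf] * n for _ in range(m + 1)]
    prev = [[-1] * n for _ in range(m + 1)]
    for j in range(n):
        cost[1][j] = 0.0
    for k in range(2, m + 1):
        for j in range(k - 1, n):
            for i in range(k - 2, j):
                candidate = cost[k - 1][i] + (pos[j] - pos[i] - interval) ** 2
                if candidate < cost[k][j]:
                    cost[k][j] = candidate
                    prev[k][j] = i
    end = min(range(m - 1, n), key=lambda j: cost[m][j])
    # Walk the back-pointers to recover the chosen SNPs.
    chosen = []
    k, j = m, end
    while k >= 1:
        chosen.append(pos[j])
        j = prev[k][j]
        k -= 1
    return chosen[::-1], cost[m][end]


# Six hypothetical SNP positions; ask for 3 markers spaced about 10 units apart.
chosen, total = pick_regular_snps([0, 4, 10, 13, 20, 26], 3, 10)
```

Enumerating every subset would grow exponentially with n, as the abstract notes; reusing the best cost for each (count, last SNP) pair is what keeps the search polynomial.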

12.
The International HapMap Project and the arrival of technologies that type more than 100,000 SNPs in a single experiment have made genome-wide single nucleotide polymorphism (GW-SNP) assay a realistic endeavor. This has sparked considerable debate regarding the promise of GW-SNP typing to identify genetic association in disease. As has already been shown, this approach has the potential to localize common genetic variation underlying disease risk. The data provided by this technology also lend themselves to several other lines of investigation: autozygosity mapping in consanguineous families and outbred populations, direct detection of structural variation, admixture analysis, and other population genetic approaches. In this review we will discuss the potential uses and practical application of GW-SNP typing, including those above and beyond simple association testing.

13.
A high-throughput system for the measurement of recombination frequencies in the genetic model plant, Arabidopsis thaliana, is described. It is based on 21 mono-transgenic isogenic lines harboring antibiotic resistance genes on all five chromosomes. Recombination between pairs of gene insertions in repulsion phase that confer resistance against kanamycin (kan) and hygromycin (hyg) is determined by a phenotypic assay of progeny (DART: Double Antibiotic Resistance Technique). DART allows testing for the influence of numerous environmental and genetic factors, including candidate genes, on recombination frequencies in specific genomic regions as well as the entire genome. Its usefulness is demonstrated by investigating the effects of UV treatment, different temperature and phosphorus supply regimes, and sex on recombination frequencies for all five chromosomes of A. thaliana.

14.
We characterized 37 single nucleotide polymorphism (SNP) markers for eelgrass (Zostera marina). SNP markers were developed using existing EST (expressed sequence tag) libraries to locate polymorphic loci and develop primers from the functional expressed genes that are deposited in the ZOSTERA database (V1.2.1). SNP loci were genotyped using a single-base-extension approach, which facilitated high-throughput genotyping with minimal optimization time. These markers show a wide range of variability among 25 eelgrass populations and will be useful for population genetic studies including evaluation of population structure, historical demography, and phylogeography. Potential applications include haplotype inference of physically linked SNPs and identification of genes under selection for temperature and desiccation stress.

15.
Sequencing of parts of seven genes from Aedes aegypti collected in 16 Brazilian cities revealed the existence of 53 single nucleotide polymorphisms (SNPs), representing one SNP every 52 base pairs. From these 53 SNPs, we selected eight that are independent and highly polymorphic. We describe the use of these markers for differentiation of Brazilian populations of A. aegypti. These are the first SNPs developed for delineating population structure in A. aegypti, and will be a useful complement to epidemiological studies.

16.
We report the characterization of 15 new single nucleotide polymorphism markers for a threatened species, the chimpanzee (Pan troglodytes), developed using a targeted gene approach. These markers are derived from the Y chromosome and autosomal regions of the genome and show frequency differences between chimpanzee subspecies from central and western Africa. These single nucleotide polymorphism markers are the first to be designed for the genotyping of wild chimpanzee populations and will provide a useful addition to the genetic tools employed for the conservation management of this threatened species.

17.
We describe the three-dimensional structure of the product of Arabidopsis thaliana gene At5g66040.1 as determined by NMR spectroscopy. This protein is categorized as a single-domain sulfurtransferase and is annotated as a senescence-associated protein (sen1-like protein) and ketoconazole resistance protein (http://arabidopsis.org/info/genefamily/STR_genefamily.html). The sequence of At5g66040.1 is virtually identical to that of a protein from Arabidopsis found by others to confer ketoconazole resistance in yeast. Comparison of the three-dimensional structure with those in the Protein Data Bank revealed that At5g66040.1 contains an additional mobile beta-hairpin not found in other rhodaneses that may function in binding specific substrates. This represents the first structure of a single-domain plant sulfurtransferase. The enzymatically active cysteine-containing domain belongs to the CDC25 class of phosphatases, sulfide dehydrogenases, and stress proteins such as senescence specific protein 1 in plants, PspE and GlpE in bacteria, and cyanide and arsenate resistance proteins. Versions of this domain that lack the active site cysteine are found in other proteins, such as phosphatases, ubiquitin hydrolases, and sulfuryltransferases.

18.
Over 3.5 million expressed sequence tags from the major cereal taxa were used to electronically mine over 176 000 putative single nucleotide polymorphisms (SNPs). The density, distribution and degree of linkage between these SNPs were compared among the different taxa. The frequency of sequence polymorphism was lowest in diploid taxa (rice, barley and sorghum), intermediate in tetraploid maize and highest in allohexaploid wheat and octoploid sugarcane. SNPs were further categorized as either intravarietal (differences between gene family members and homoeologues) or varietal (differences between two varieties), and as either co-segregating or non-co-segregating with neighbouring polymorphisms. Varietal co-segregating SNPs represent the best candidates for molecular markers as they show variation between varieties and have a high probability of being validated, as sequencing errors are unlikely to co-segregate with one another. This elite class of SNPs was most abundant in barley and least abundant in wheat and rice. Despite the large number of observed sequence polymorphisms in allohexaploid wheat, only a fraction of those available are likely to make good molecular markers. In addition, we found that rice SNPs up to 10 kb apart were in linkage disequilibrium (LD), but that high levels of LD attributable to population structure confounded the tracking of LD over greater distances.

19.
Threonine synthase (TS) is a PLP-dependent enzyme that catalyzes the last reaction in the synthesis of threonine from aspartate. In plants, the methionine pathway shares the same substrate, O-phospho-L-homoserine (OPH), and TS is activated by S-adenosyl-methionine (SAM), a downstream product of methionine synthesis. This positive allosteric effect triggered by the product of another pathway is specific to plants. The crystal structure of Arabidopsis thaliana apo threonine synthase was solved at 2.25 Å resolution from triclinic crystals using MAD data from the selenomethionated protein. The structure reveals a four-domain dimer with a two-stranded beta-sheet arm protruding from one monomer onto the other. This domain swap could form a lever through which the allosteric effect is transmitted. The N-terminal domain (domain 1) has a unique fold and is partially disordered, whereas the structural core (domains 2 and 3) shares the functional domain of PLP enzymes of the same family. It also has similarities with SAM-dependent methyltransferases. Structure comparisons allowed us to propose potential sites for pyridoxal-phosphate and SAM binding on TS; they are close to regions that are disordered in the absence of these molecules.

20.
Although the human genome has been nearly completely sequenced, the functions and roles of the vast majority of the genes, and the influences of single nucleotide polymorphisms (SNPs) in these genes, are not entirely known. A modified mutation detection method was developed for large-scale cloning of possible SNPs between tumor and normal cells to facilitate the identification of genetic factors that are associated with cancer formation and progression. The method involves hybridization of restriction enzyme-cut chromosomal DNA, cleavage and modification of the sites of differences by enzymes, and differential cloning of sequence variations with a designed vector. Experimental validations of the presence and location of sequence variations in the isolated clones by PCR and DNA sequencing support the capability of this method in identifying sequence differences between tumor cells and normal cells.


Copyright © Beijing Qinyun Technology Development Co., Ltd.  Beijing ICP No. 09084417