首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A rapid, deterministic method (DET) based on a recursive algorithm and a stochastic method based on Markov Chain Monte Carlo (MCMC) for calculating identity-by-descent (IBD) matrices conditional on multiple markers were compared using stochastic simulation. Precision was measured by the mean squared error (MSE) of the relationship coefficients in predicting the true IBD relationships, relative to MSE obtained from using pedigree only. Comparisons were made when varying marker density, allele numbers, allele frequencies, and the size of full-sib families. The precision of DET was 75–99% relative to MCMC, but was not simply related to the informativeness of individual loci. For situations mimicking microsatellite markers or dense SNP, the precision of DET was ≥ 95% relative to MCMC. Relative precision declined for the SNP, but not microsatellites as marker density decreased. Full-sib family size did not affect the precision. The methods were tested in interval mapping and marker assisted selection, and the performance was very largely determined by the MSE. A multi-locus information index considering the type, number, and position of markers was developed to assess precision. It showed a marked empirical relationship with the observed precision for DET and MCMC and explained the complex relationship between relative precision and the informativeness of individual loci.  相似文献   

2.
Pedigree reconstruction using genotypic markers has become an important tool for the study of natural populations. The nonstandard nature of the underlying statistical problems has led to the necessity of developing specialized statistical and computational methods. In this article, a new version of pedigree reconstruction tools (PRT 2.0) is presented. The software implements algorithms proposed in Almudevar & Field (Journal of Agricultural Biological and Environmental Statistics, 4, 1999, 136) and Almudevar (Biometrics, 57, 2001a, 757) for the reconstruction of single generation sibling groups (SG). A wider range of enumeration algorithms is included, permitting improved computational performance. In particular, an iterative version of the algorithm designed for larger samples is included in a fully automated form. The new version also includes expanded simulation utilities, as well as extensive reporting, including half-sibling compatibility, parental genotype estimates and flagging of potential genotype errors. A number of alternative algorithms are described and demonstrated. A comparative discussion of the underlying methodologies is presented. Although important aspects of this problem remain open, we argue that a number of methodologies including maximum likelihood estimation (COLONY 1.2 and 2.0) and the set cover formulation (KINALYZER) exhibit undesirable properties in the sibling reconstruction problem. There is considerable evidence that large sets of individuals not genetically excluded as siblings can be inferred to be a true sibling group, but it is also true that unrelated individuals may be genetically compatible with a true sibling group by chance. Such individuals may be identified on a statistical basis. PRT 2.0, based on these sound statistical principles, is able to efficiently match or exceed the highest reported accuracy rates, particularly for larger SG. The new version is available at http://www.urmc.rochester.edu/biostat/people/faculty/almudevar.cfm.  相似文献   

3.
A method was described for calculating population statistics on relationship coefficients without using corresponding individual data. It relied on the structure of the inverse of the numerator relationship matrix between individuals under investigation and ancestors. Computation times were observed on simulated populations and were compared to those incurred with a conventional direct approach. The indirect approach turned out to be very efficient for multiplying the relationship matrix corresponding to planned matings (full design) by any vector. Efficiency was generally still good or very good for calculating statistics on these simulated populations. An extreme implementation of the method is the calculation of inbreeding coefficients themselves. Relative performances of the indirect method were good except when many full-sibs during many generations existed in the population.  相似文献   

4.
目的 研究构建基于共祖(identity-by-descent,IBD)片段算法预测远亲缘关系分析流程并评估预测准确性。方法 采用高密度单核苷酸多态性(single nucleotide polymorphism,SNP)芯片对253份家系样本进行检测,研究基于IBD片段算法的分析流程进行两两个体间亲缘关系预测,评估预测准确性。随机减少SNP位点,评估位点数对算法预测准确性的影响。结果 IBD片段算法预测1~7级亲缘关系平均置信区间准确率为94.72%,预测可信度为99.77%,6级及以上亲缘关系预测时出现假阴性。随着SNP数量减少,预测准确性会出现一定程度的下降。结论 IBD片段算法可用于7级以内亲缘关系的预测,该算法在群体遗传学、法医遗传学等领域有重要应用价值。  相似文献   

5.
Estimating dispersal—a key parameter for population ecology and management—is notoriously difficult. The use of pedigree assignments, aided by likelihood‐based software, has become popular to estimate dispersal rate and distance. However, the partial sampling of populations may produce false assignments. Further, it is unknown how the accuracy of assignment is affected by the genealogical relationships of individuals and is reflected by software‐derived assignment probabilities. Inspired by a project managing invasive American mink (Neovison vison), we estimated individual dispersal distances using inferred pairwise relationships of culled individuals. Additionally, we simulated scenarios to investigate the accuracy of pairwise inferences. Estimates of dispersal distance varied greatly when derived from different inferred pairwise relationships, with mother–offspring relationship being the shortest (average = 21 km) and the most accurate. Pairs assigned as maternal half‐siblings were inaccurate, with 64%–97% falsely assigned, implying that estimates for these relationships in the wild population were unreliable. The false assignment rate was unrelated to the software‐derived assignment probabilities at high dispersal rates. Assignments were more accurate when the inferred parents were older and immigrants and when dispersal rates between subpopulations were low (1% and 2%). Using 30 instead of 15 loci increased pairwise reliability, but half‐sibling assignments were still inaccurate (>59% falsely assigned). The most reliable approach when using inferred pairwise relationships in polygamous species would be not to use half‐sibling relationship types. Our simulation approach provides guidance for the application of pedigree inferences under partial sampling and is applicable to other systems where pedigree assignments are used for ecological inference.  相似文献   

6.
Pedigrees, depicting genealogical relationships between individuals, are important in several research areas. Molecular markers allow inference of pedigrees in wild species where relationship information is impossible to collect by observation. Marker data are analysed statistically using methods based on Mendelian inheritance rules. There are numerous computer programs available to conduct pedigree analysis, but most software is inflexible, both in terms of assumptions and data requirements. Most methods only accommodate monogamous diploid species using codominant markers without genotyping error. In addition, most commonly used methods use pairwise comparisons rather than a full-pedigree likelihood approach, which considers the likelihood of the entire pedigree structure and allows the simultaneous inference of parentage and sibship. Here, we describe colony, a computer program implementing full-pedigree likelihood methods to simultaneously infer sibship and parentage among individuals using multilocus genotype data. colony can be used for both diploid and haplodiploid species; it can use dominant and codominant markers, and can accommodate, and estimate, genotyping error at each locus. In addition, colony can carry out these inferences for both monoecious and dioecious species. The program is available as a Microsoft Windows version, which includes a graphical user interface, and a Macintosh version, which uses an R-based interface.  相似文献   

7.
Upland cotton (Gossypium hirsutum) is the world's largest source of natural fibre and dominates the global textile industry. Hybrid cotton varieties exhibit strong heterosis that confers high fibre yields, yet the genome‐wide effects of artificial selection that have influenced Upland cotton during its breeding history are poorly understood. Here, we resequenced Upland cotton genomes and constructed a variation map of an intact breeding pedigree comprising seven elite and 19 backbone parents. Compared to wild accessions, the 26 pedigree accessions underwent strong artificial selection during domestication that has resulted in reduced genetic diversity but stronger linkage disequilibrium and higher extents of selective sweeps. In contrast to the backbone parents, the elite parents have acquired significantly improved agronomic traits, with an especially pronounced increase in the lint percentage. Notably, identify by descent (IBD) tracking revealed that the elite parents inherited abundant beneficial trait segments and loci from the backbone parents and our combined analyses led to the identification of a core genomic segment which was inherited in the elite lines from the parents Zhong 7263 and Ejing 1 and that was strongly associated with lint percentage. Additionally, SNP correlation analysis of this core segment showed that a non‐synonymous SNP (A‐to‐G) site in a gene encoding the cell wall‐associated receptor‐like kinase 3 (GhWAKL3) protein was highly correlated with increased lint percentage. Our results substantially increase the valuable genomics resources available for future genetic and functional genomics studies of cotton and reveal insights that will facilitate yield increases in the molecular breeding of cotton.  相似文献   

8.
基于高密度SNP标记估计群体间遗传关联   总被引:1,自引:0,他引:1  
周子文  王雪  丁向东 《遗传》2021,(4):340-349
联合育种的准确性受到群体间遗传关联程度的影响.本研究通过比较基于系谱数据和基因组数据计算的群体遗传关联,探究高密度SNP标记在遗传关联估计中的应用前景.本研究同时使用了模拟数据和真实数据,采用6种不同的遗传关联计算方法,包括PEVD(prediction error variance of differences)、P...  相似文献   

9.
10.
An extraordinarily large number of single nucleotide polymorphisms (SNPs) are now available in humans as well as in other model organisms. Technological advancements may soon make it feasible to assay hundreds of SNPs in virtually any organism of interest. One potential application of SNPs is the determination of pairwise genetic relationships in populations without known pedigrees. Although microsatellites are currently the marker of choice for this purpose, the number of independently segregating microsatellite markers that can be feasibly assayed is limited. Thus, it can be difficult to distinguish reliably some classes of relationship (e.g. full-sibs from half-sibs) with microsatellite data alone. We assess, via Monte Carlo computer simulation, the potential for using a large panel of independently segregating SNPs to infer genetic relationships, following the analytical approach of Blouin et al. (1996). We have explored a 'best case scenario' in which 100 independently segregating SNPs are available. For discrimination among single-generation relationships or for the identification of parent-offspring pairs, it appears that such a panel of moderately polymorphic SNPs (minor allele frequency of 0.20) will provide discrimination power equivalent to only 16-20 independently segregating microsatellites. Although newly available analytical methods that can account for tight genetic linkage between markers will, in theory, allow improved estimation of relationships using thousands of SNPs in highly dense genomic scans, in practice such studies will only be feasible in a handful of model organisms. Given the comparable amount of effort required for the development of both types of markers, it seems that microsatellites will remain the marker of choice for relationship estimation in nonmodel organisms, at least for the foreseeable future.  相似文献   

11.
12.
BackgroundTo date, the cause of inflammatory bowel disease (IBD) remains a mystery. A balance between cell proliferation and apoptosis maintains intestinal tissue homeostasis. Dissociation-induced myosin-actin contraction results in stem cell apoptosis. This study aiming to evaluate the influence of the myosin heavy chain 9 (MYH9) gene single nucleotide polymorphisms (SNPs) on inflammatory bowel disease.Subjectsand methods: The study carried on eighty patients with IBD and seventy controls. All participants subjected to history taking, thorough physical examination, colonoscopy and laboratory investigations. Genotyping performed for rs4821480 and rs3752462 by SNP assay real-time PCR methods.ResultsOn analyzing rs3752462 CT and TT genotypes were significantly more frequent in IBD patients as compared to controls with 4.6 fold increase in the risk of IBD. While on analyzing rs4821480, The TG and GG genotypes have significant increased distribution among the IBD patients as compared to the controls with 5.3 fold increase in the risk of IBD and higher prevalence of GG genotype in patients with low hemoglobin level and higher BMI.ConclusionThe rs3752462 T allele and rs4821480 G allele of MYH9 are associated with more susceptibility to IBD.  相似文献   

13.
14.
该研究对高山杜鹃(Rhododendron L.)的总DNA提取方法进行了改良,然后综合利用高山杜鹃EST数据库和该实验室的马缨杜鹃高通量测序数据,引用已发表文献的SSR引物,从154对SSR引物中筛选出了26对多态性高、重复性好、条带清晰的SSR引物。再从中随机选择10对SSR引物进行荧光标记,对69份不同高山杜鹃的种质进行遗传多样性分析。结果表明,平均有效等位基因数6.959 2个;位点多态性信息含量(PIC)、观测杂合度(HO)、期望杂合度(HE)和Neis基因多样性(H)分别为0.795 2、0.543 5、0.826 5和0.820 2;杂交种间的非加权配对算术平均(UPGMA)聚类结果与其谱系分析结果基本一致,可实现对一些未知来源的育成品种资源进行祖先亲本类型的推测。  相似文献   

15.
The accelerating rate at which DNA sequence data are now generated by high‐throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short‐read sequences that are provided by these sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi‐allelic marker data from such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false‐positive rates by several orders of magnitude, relative to calling bi‐allelic SNPs, for two challenging analytical procedures, full‐sibling and single parent–offspring pair identification. We also show how the identification of half‐sibling pairs requires so much data that physical linkage becomes a consideration, and that most published studies that attempt to do so are dramatically underpowered. The advent of phased short‐read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.  相似文献   

16.
Recent investigations showed that anandamide, the main endogenous ligand of CB1 and CB2 cannabinoid receptors, possesses analgesic, antidepressant and anti-inflammatory effects. In the perspective to treat inflammatory bowel disease (IBD), our approach was to develop new selective CB2 receptor agonists without psychotropic side effects associated to CB1 receptors. In this purpose, a new series of 3-carboxamido-5-aryl-isoxazoles, never described previously as CB2 receptor agonists, was designed, synthesized and evaluated for their biological activity. The pharmacological results have identified great selective CB2 agonists with in vivo anti-inflammatory activity in a DSS-induced acute colitis mouse model.  相似文献   

17.
A number of procedures have been developed that allow the genetic parameters of natural populations to be estimated using relationship information inferred from marker data rather than known pedigrees. Three published approaches are available; the regression, pair‐wise likelihood and Markov Chain Monte Carlo (MCMC) sib‐ship reconstruction methods. These were applied to body weight and molecular data collected from the Soay sheep population of St. Kilda, which has a previously determined pedigree. The regression and pair‐wise likelihood approaches do not specify an exact pedigree and yielded unreliable heritability estimates, that were sensitive to alteration of the fixed effects. The MCMC method, which specifies a pedigree prior to heritability estimation, yielded results closer to those determined using the known pedigree. In populations of low average relationship, such as the Soay sheep population, determination of a reliable pedigree is more useful than indirect approaches that do not specify a pedigree.  相似文献   

18.
一个常染色体显性遗传白癜风家系   总被引:3,自引:0,他引:3  
本文报道了一白癜风家系,对其发病原因进行了探讨。作者认为该家系的白癜风为常染色体显性遗传。 Abstract:A family history of vitiligo was reported in this paper,and the reason causing disease was discussed.We think that the vitiligo in the family history is caused by autosomal dominant inheritance.  相似文献   

19.
圈养大熊猫的系谱分析   总被引:4,自引:0,他引:4  
应用Sparks Verl.4软件结合手工算法对现有的圈养大熊猫系谱进行了遗传分析。结果表明,圈养大熊猫群体规模小,漂变是导致遗传多样性丢失的主要因素。由于分散管理,现有群体正面临着近亲交配、种源枯竭的危险。因此,应统一遗传管理,加强各繁殖系之间的基因交流,使圈养大熊猫的遗传多样性能够保持在较高的水平之上。  相似文献   

20.
In the current research landscape, microbiota composition studies are of extreme interest, since it has been widely shown that resident microorganisms affect and shape the ecological niche they inhabit. This complex micro-world is characterized by different types of interactions. Understanding these relationships provides a useful tool for decoding the causes and effects of communities’ organizations. Next-Generation Sequencing technologies allow to reconstruct the internal composition of the whole microbial community present in a sample. Sequencing data can then be investigated through statistical and computational method coming from network theory to infer the network of interactions among microbial species.Since there are several network inference approaches in the literature, in this paper we tried to shed light on their main characteristics and challenges, providing a useful tool not only to those interested in using the methods, but also to those who want to develop new ones. In addition, we focused on the frameworks used to produce synthetic data, starting from the simulation of network structures up to their integration with abundance models, with the aim of clarifying the key points of the entire generative process.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号