共查询到20条相似文献,搜索用时 10 毫秒
1.
Background
Human genome contains millions of common single nucleotide polymorphisms (SNPs) and these SNPs play an important role in understanding the association between genetic variations and human diseases. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), thus it is not necessary to genotype all SNPs for association study. Many algorithms have been developed to find a small subset of SNPs called tag SNPs that are sufficient to infer all the other SNPs. Algorithms based on the r 2 LD statistic have gained popularity because r 2 is directly related to statistical power to detect disease associations. Most of existing r 2 based algorithms use pairwise LD. Recent studies show that multi-marker LD can help further reduce the number of tag SNPs. However, existing tag SNP selection algorithms based on multi-marker LD are both time-consuming and memory-consuming. They cannot work on chromosomes containing more than 100 k SNPs using length-3 tagging rules. 相似文献2.
Analyses of high-density SNPs in genetic studies have the potential problems of prohibitive genotyping costs and inflated false discovery rates. Current methods select subsets of representative SNPs (tagSNPs) using information either on potential biologic functionality of the SNPs or on the underlying linkage disequilibrium (LD) structure, but not both. Combining the two types of information may lead to more effective tagSNP selection. The proposed method combines both functional and LD information using a weighted factor analysis (WFA) model. The WFA was applied to the dense SNP collection from 129 genes sequenced by the SeattleSNPs Program for Genomic Application. TagSNPs selected by WFA were compared with those selected by an LD-based method. WFA allowed prioritization of SNPs that would otherwise share equivalent ranking due to underlying LD structure alone. Furthermore, WFA consistently included SNPs not selected by function or by LD alone. A literature review of a subset of genes revealed that SNPs selected by WFA were more likely represented in published reports. 相似文献
3.
An approach to the investigation of the evolution of quantitative traits on the basis of analysis of two-locus marginal systems dynamics has been developed. It has been shown that under stabilizing selection the "quasi-stationary" state is quickly reached and maintained continuously. The "quasi-stationary" state is characterized by small changes in allele frequencies and by linkage disequilibrium that significantly decreases genotypic variance. Equations defining the role of linkage disequilibrium in the stationary state of mutation-selection balance are derived. 相似文献
4.
Brian P Kinghorn 《遗传、选种与进化》2011,43(1):4
Background
Mate selection can be used as a framework to balance key technical, cost and logistical issues while implementing a breeding program at a tactical level. The resulting mating lists accommodate optimal contributions of parents to future generations, in conjunction with other factors such as progeny inbreeding, connection between herds, use of reproductive technologies, management of the genetic distribution of nominated traits, and management of allele/genotype frequencies for nominated QTL/markers.Methods
This paper describes a mate selection algorithm that is widely used and presents an extension that makes it possible to apply constraints on certain matings, as dictated through a group mating permission matrix.Results
This full algorithm leads to simpler applications, and to computing speed for the scenario tested, which is several hundred times faster than the previous strategy of penalising solutions that break constraints.Conclusions
The much higher speed of the method presented here extends the use of mate selection and enables implementation in relatively large programs across breeding units. 相似文献5.
SUMMARY: We have developed U-PRIMER, a primer design program, to compute a minimal primer set (MPS) for any given set of DNA sequences. The U-PRIMER algorithm, which uses automatic variable fixing and automatic redundant constraint elimination to tackle the binary integer programming problem associated with the MPS selection problem. The program has been tested successfully with 32 adipocyte development-related genes and 9 TB-specific genes to obtain their respective MPSs. AVAILABILITY: A free copy of U-PRIMER implemented in C++ programming language is available from http://www.u-vision-biotech.com 相似文献
6.
7.
Background
During the lifetime of a fermenter culture, the soil bacterium S. coelicolor undergoes a major metabolic switch from exponential growth to antibiotic production. We have studied gene expression patterns during this switch, using a specifically designed Affymetrix genechip and a high-resolution time-series of fermenter-grown samples.Results
Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis.Conclusions
Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in S. coelicolor, which will form the starting point for future attempts at engineering antibiotic production in a biotechnological setting. 相似文献8.
9.
Robert Lawrence Aaron G Day-Williams Richard Mott John Broxholme Lon R Cardon Eleftheria Zeggini 《BMC bioinformatics》2009,10(1):367-5
Background
A number of tools for the examination of linkage disequilibrium (LD) patterns between nearby alleles exist, but none are available for quickly and easily investigating LD at longer ranges (>500 kb). We have developed a web-based query tool (GLIDERS: Genome-wide LInkage DisEquilibrium Repository and Search engine) that enables the retrieval of pairwise associations with r2 ≥ 0.3 across the human genome for any SNP genotyped within HapMap phase 2 and 3, regardless of distance between the markers. 相似文献10.
J. P. Mueller J. W. James 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1983,65(1):25-30
Summary Selection for a character controlled by additive genes induces linkage disequilibrium which reduces the additive genetic variance usable for further selective gains. Additive x additive epistasis contributes to selection response through development of linkage disequilibrium between interacting loci. To investigate the relative importance of the two effects of linkage disequilibrium, formulae are presented and results are reported of simulations using models involving additive, additive x additive and dominance components. The results suggest that so long as epistatic effects are not large relative to additive effects, and the proportion of pairs of loci which show epistasis is not very high, the predominant effect of linkage disequilibrium will be to reduce the rate of selection response. 相似文献
11.
Background
A major QTL for fatness and growth, denoted FAT1, has previously been detected on pig chromosome 4q (SSC4q) using a Large White – wild boar intercross. Progeny that carried the wild boar allele at this locus had higher fat deposition, shorter length of carcass, and reduced growth. The position and the estimated effects of the FAT1 QTL for growth and fatness have been confirmed in a previous study. In order to narrow down the QTL interval we have traced the inheritance of the wild boar allele associated with high fat deposition through six additional backcross generations.Results
Progeny-testing was used to determine the QTL genotype for 10 backcross sires being heterozygous for different parts of the broad FAT1 region. The statistical analysis revealed that five of the sires were segregating at the QTL, two were negative while the data for three sires were inconclusive. We could confirm the QTL effects on fatness/meat content traits but not for the growth traits implying that growth and fatness are controlled by distinct QTLs on chromosome 4. Two of the segregating sires showed highly significant QTL effects that were as large as previously observed in the F2 generation. The estimates for the remaining three sires, which were all heterozygous for smaller fragments of the actual region, were markedly smaller. With the sample sizes used in the present study we cannot with great confidence determine whether these smaller effects in some sires are due to chance deviations, epistatic interactions or whether FAT1 is composed of two or more QTLs, each one with a smaller phenotypic effect. Under the assumption of a single locus, the critical region for FAT1 has been reduced to a 3.3 cM interval between the RXRG and SDHC loci.Conclusion
We have further characterized the FAT1 QTL on pig chromosome 4 and refined its map position considerably, from a QTL interval of 70 cM to a maximum region of 20 cM and a probable region as small as 3.3 cM. The flanking markers for the small region are RXRG and SDHC and the orthologous region of FAT1 in the human genome is located on HSA1q23.3 and harbors approximately 20 genes. Our strategy to further refine the map position of this major QTL will be i) to type new markers in our pigs that are recombinant in the QTL interval and ii) to perform Identity-By-Descent (IBD) mapping across breeds that have been strongly selected for lean growth. 相似文献12.
Twinning is a complex trait with negative impacts on health and reproduction, which cause economic loss in dairy production. Several twinning rate quantitative trait loci (QTL) have been detected in previous studies, but confidence intervals for QTL location are broad and many QTL are unreplicated. To identify genomic regions or genes associated with twinning rate, QTL analysis based on linkage combined with linkage disequilibrium (LLD) and individual marker associations was conducted across the genome using high-throughput single nucleotide polymorphism (SNP) genotypes. A total of 9919 SNP markers were genotyped with 200 sires and sons in 19 half-sib North American Holstein dairy cattle families. After SNPs were genotyped, informative markers were selected for genome-wide association tests and QTL searches. Evidence for twinning rate QTL was found throughout the genome. Thirteen markers significantly associated with twinning rate were detected on chromosomes 2, 5 and 14 ( P < 2.3 × 10−5 ). Twenty-six regions on fourteen chromosomes were identified by LLD analysis at P < 0.0007. Seven previously reported ovulation or twinning rate QTL were supported by results of single marker association or LLD analyses. Single marker association analysis and LLD mapping were complementary tools for the identification of putative QTL in this genome scan. 相似文献
13.
A simple and efficient algorithm for gene selection using sparse logistic regression 总被引:4,自引:0,他引:4
MOTIVATION: This paper gives a new and efficient algorithm for the sparse logistic regression problem. The proposed algorithm is based on the Gauss-Seidel method and is asymptotically convergent. It is simple and extremely easy to implement; it neither uses any sophisticated mathematical programming software nor needs any matrix operations. It can be applied to a variety of real-world problems like identifying marker genes and building a classifier in the context of cancer diagnosis using microarray data. RESULTS: The gene selection method suggested in this paper is demonstrated on two real-world data sets and the results were found to be consistent with the literature. AVAILABILITY: The implementation of this algorithm is available at the site http://guppy.mpe.nus.edu.sg/~mpessk/SparseLOGREG.shtml Supplementary Information: Supplementary material is available at the site http://guppy.mpe.nus.edu.sg/~mpessk/SparseLOGREG.shtml 相似文献
14.
The effect of selection and linkage on the decay of linkage disequilibrium, D, is investigated for a hierarchy of two-locus models. The method of analysis rests upon a qualitative classification of the dynamic of D under selection relative to the neutral dynamic. To eliminate the confounding effects of gene frequency change, the behavior of D is first studied with gene frequencies fixed at their invariant values. Second, the results are extended to certain special situations where gene frequencies are changing simultaneously.A wide variety of selection regimes can cause an acceleration of the rate of decay of D relative to the neutral rate. Specifically, the asymptotic rate of decay is always faster than the neutral rate in the neighborhood of a stable equilibrium point, when viabilities are additive or only one locus is selected. This is not necessarily the case for models in which there is nonzero additive epistasis. With multiplicative viabilities, decay is always accelerated near a stable boundary equilibrium, but decay is only faster near the stable central equilibrium (with
= 0) if linkage is sufficiently loose. In the symmetric viability model, decay may even be retarded near a stable boundary equilibrium. Decay is only accelerated near a stable corner equilibrium when the double homozygote is more fit than the double heterozygotes. Decay near a stable edge equilibrium may be retarded if there is loose linkage. With symmetric viabilities there is usually an acceleration of the decay process for gene frequencies near 1/2 when the central equilibrium (with
= 0) is stable. This is always the case when the sign of the epistasis is negative or zero.Conversely, the decay ofD is retarded in the neighborhood of a stable equilibrium in the multiplicative and symmetric viability models if any of the conditions above are violated. Near an unstable equilibrium of any of the models considered,D may either increase or decay at a rate slower than, equal to, or faster than the neutral rate. These analytic results are supplemented by numerical studies of the symmetric viability model. 相似文献
15.
MacLean CJ Martin RB Sham PC Wang H Straub RE Kendler KS 《American journal of human genetics》2000,66(3):1062-1075
Single-marker linkage-disequilibrium (LD) methods cannot fully describe disequilibrium in an entire chromosomal region surrounding a disease allele. With the advent of myriad tightly linked microsatellite markers, we have an opportunity to extend LD analysis from single markers to multiple-marker haplotypes. Haplotype analysis has increased statistical power to disclose the presence of a disease locus in situations where it correctly reflects the historical process involved. For maximum efficiency, evidence of LD ought to come not just from a single haplotype, which may well be rare, but in addition from many similar haplotypes that could have descended from the same ancestral founder but have been trimmed in succeeding generations. We present such an analysis, called the "trimmed-haplotype method." We focus on chromosomal regions that are small enough that disequilibrium in significant portions of them may have been preserved in some pedigrees and yet that contain enough markers to minimize coincidental occurrence of the haplotype in the absence of a disease allele: perhaps regions 1-2 cM in length. In general, we could have no idea what haplotype an ancestral founder carried generations ago, nor do we usually have a precise chromosomal location for the disease-susceptibility locus. Therefore, we must search through all possible haplotypes surrounding multiple locations. Since such repeated testing obliterates the sampling distribution of the test, we employ bootstrap methods to calculate significance levels. Trimmed-haplotype analysis is performed on family data in which genotypes have been assembled into haplotypes. It can be applied either to conventional parent-affected-offspring triads or to multiplex pedigrees. We present a method for summarizing the LD evidence, in any pedigree, that can be employed in trimmed-haplotype analysis as well as in other methods. 相似文献
16.
The analysis of systems involving many loci is important in population and quantitativegenetics. An important problem is the study of linkage disequilibrium (LD), a conceptrelevant in genome-enabled prediction of quantitative traits and in exploration ofmarker–phenotype associations. This article introduces a new estimator of a LDparameter (ρ2) that is much easier to compute than a maximumlikelihood (or Bayesian) estimate of a tetra-choric correlation. We examined theconjecture that the sampling distribution of the estimator of ρ2could be less frequency dependent than that of the estimator ofr2, a widely used metric for assessing LD. This was donevia an empirical evaluation of LD in 806 Holstein–Friesian cattle using 771single-nucleotide polymorphism (SNP) markers and of HapMap III data on 21 991 SNPs(chromosome 3) observed in 88 unrelated individuals from Tuscany. Also, 1600 haplotypesover a region of 1 Mb simulated under the coalescent were used to estimate LD usingthe two measures. Subsequently, a simulation study compared the new estimator with that ofr2 using several scenarios of LD and allelic frequencies.From these studies, it is concluded that ρ2 provides a usefulmetric for the study of LD as the distribution of its estimator is less frequencydependent than that of the standard estimator of r2. 相似文献
17.
A multi-locus QTL mapping method is presented, which combines linkage and linkage disequilibrium (LD) information and uses multitrait data. The method assumed a putative QTL at the midpoint of each marker bracket. Whether the putative QTL had an effect or not was sampled using Markov chain Monte Carlo (MCMC) methods. The method was tested in dairy cattle data on chromosome 14 where the DGAT1 gene was known to be segregating. The DGAT1 gene was mapped to a region of 0.04 cM, and the effects of the gene were accurately estimated. The fitting of multiple QTL gave a much sharper indication of the QTL position than a single QTL model using multitrait data, probably because the multi-locus QTL mapping reduced the carry over effect of the large DGAT1 gene to adjacent putative QTL positions. This suggests that the method could detect secondary QTL that would, in single point analyses, remain hidden under the broad peak of the dominant QTL. However, no indications for a second QTL affecting dairy traits were found on chromosome 14. 相似文献
18.
In a simulation study, different designs were compared for efficiency of fine-mapping of QTL. The variance component method for fine-mapping of QTL was used to estimate QTL position and variance components. The design of many families with small size gave a higher mapping resolution than a design with few families of large size. However, the difference is small in half sib designs. The proportion of replicates with the QTL positioned within 3 cM of the true position is 0.71 in the best design, and 0.68 in the worst design applied to 128 animals with a phenotypic record and a QTL explaining 25% of the phenotypic variance. The design of two half sib families each of size 64 was further investigated for a hypothetical population with effective size of 1000 simulated for 6000 generations with a marker density of 0.25 cM and with marker mutation rate 4 × 10-4 per generation. In mapping using bi-allelic markers, 42~55% of replicated simulations could position QTL within 0.75 cM of the true position whereas this was higher for multi allelic markers (48~76%). The accuracy was lowest (48%) when mutation age was 100 generations and increased to 68% and 76% for mutation ages of 200 and 500 generations, respectively, after which it was about 70% for mutation ages of 1000 generations and older. When effective size was linearly decreasing in the last 50 generations, the accuracy was decreased (56 to 70%). We show that half sib designs that have often been used for linkage mapping can have sufficient information for fine-mapping of QTL. It is suggested that the same design with the same animals for linkage mapping should be used for fine-mapping so gene mapping can be cost effective in livestock populations. 相似文献
19.
Sham PC Ao SI Kwan JS Kao P Cheung F Fong PY Ng MK 《Bioinformatics (Oxford, England)》2007,23(1):129-131
We have developed an online program, WCLUSTAG, for tag SNP selection that allows the user to specify variable tagging thresholds for different SNPs. Tag SNPs are selected such that a SNP with user-specified tagging threshold C will have a minimum R2 of C with at least one tag SNP. This flexible feature is useful for researchers who wish to prioritize genomic regions or SNPs in an association study. AVAILABILITY: The online WCLUSTAG program is available at http://bioinfo.hku.hk/wclustag/ 相似文献
20.
Identification of positive selection signatures in pigs by comparing linkage disequilibrium variances 下载免费PDF全文
Selection affects the patterns of linkage disequilibrium (LD) around the site of a beneficial allele with an increase in LD among the hitchhiking alleles. Comparing the differences in regional LD between pig populations could help to identify putative genomic regions with potential adaptations for economic traits. In this study, using Illumina Porcine SNP60K BeadChip genotyping data from 207 Chinese indigenous, 117 South American village and 408 Large White pigs, we estimated the variation of genome‐wide LD between populations using the varld program. The top 0.1% standardized VarLD scores were used as a criterion for all comparisons, and compared with LD blocks, a total of four selection signatures on Sus scrofa chromosome (SSC) 7, 9, 13 and 14 were identified in all populations. These signatures overlapped with quantitative trait loci for linoleic acid content, age at puberty, number of muscle fibers per unit area, hip structure and body weight traits in pigs. Among them, one of the signatures (56.5–56.6 Mb on SSC7) in Large White pigs harbored the ADAMTSL3 gene, which is known to affect body length. The findings of this study seem to point toward recent selection in different pig populations. Further investigations are encouraged to confirm the selection signatures detected by varld in the present study. 相似文献