期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium

Guimei Liu Yue Wang Limsoon Wong 《BMC bioinformatics》2010,11(1):66

Background

Human genome contains millions of common single nucleotide polymorphisms (SNPs) and these SNPs play an important role in understanding the association between genetic variations and human diseases. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), thus it is not necessary to genotype all SNPs for association study. Many algorithms have been developed to find a small subset of SNPs called tag SNPs that are sufficient to infer all the other SNPs. Algorithms based on the r ² LD statistic have gained popularity because r ² is directly related to statistical power to detect disease associations. Most of existing r ² based algorithms use pairwise LD. Recent studies show that multi-marker LD can help further reduce the number of tag SNPs. However, existing tag SNP selection algorithms based on multi-marker LD are both time-consuming and memory-consuming. They cannot work on chromosomes containing more than 100 k SNPs using length-3 tagging rules. 相似文献

2.

A novel method combining linkage disequilibrium information and imputed functional knowledge for tagSNP selection

Rochat RH de las Fuentes L Stormo G Davila-Roman VG Gu CC 《Human heredity》2007,64(4):243-249

Analyses of high-density SNPs in genetic studies have the potential problems of prohibitive genotyping costs and inflated false discovery rates. Current methods select subsets of representative SNPs (tagSNPs) using information either on potential biologic functionality of the SNPs or on the underlying linkage disequilibrium (LD) structure, but not both. Combining the two types of information may lead to more effective tagSNP selection. The proposed method combines both functional and LD information using a weighted factor analysis (WFA) model. The WFA was applied to the dense SNP collection from 129 genes sequenced by the SeattleSNPs Program for Genomic Application. TagSNPs selected by WFA were compared with those selected by an LD-based method. WFA allowed prioritization of SNPs that would otherwise share equivalent ranking due to underlying LD structure alone. Furthermore, WFA consistently included SNPs not selected by function or by LD alone. A literature review of a subset of genes revealed that SNPs selected by WFA were more likely represented in published reports. 相似文献

3.

Stabilizing selection and linkage disequilibrium

L A Zhivotovski? S Iu Gavrilets 《Genetika》1990,26(2):222-231

An approach to the investigation of the evolution of quantitative traits on the basis of analysis of two-locus marginal systems dynamics has been developed. It has been shown that under stabilizing selection the "quasi-stationary" state is quickly reached and maintained continuously. The "quasi-stationary" state is characterized by small changes in allele frequencies and by linkage disequilibrium that significantly decreases genotypic variance. Equations defining the role of linkage disequilibrium in the stationary state of mutation-selection balance are derived. 相似文献

4.

An algorithm for efficient constrained mate selection

Brian P Kinghorn 《遗传、选种与进化》2011,43(1):4

Background

Mate selection can be used as a framework to balance key technical, cost and logistical issues while implementing a breeding program at a tactical level. The resulting mating lists accommodate optimal contributions of parents to future generations, in conjunction with other factors such as progeny inbreeding, connection between herds, use of reproductive technologies, management of the genetic distribution of nominated traits, and management of allele/genotype frequencies for nominated QTL/markers.

Methods

This paper describes a mate selection algorithm that is widely used and presents an extension that makes it possible to apply constraints on certain matings, as dictated through a group mating permission matrix.

Results

This full algorithm leads to simpler applications, and to computing speed for the scenario tested, which is several hundred times faster than the previous strategy of penalising solutions that break constraints.

Conclusions

The much higher speed of the method presented here extends the use of mate selection and enables implementation in relatively large programs across breeding units. 相似文献

5.

An efficient algorithm for minimal primer set selection

Hsieh MH Hsu WC Chiu SK Tzeng CM 《Bioinformatics (Oxford, England)》2003,19(2):285-286

SUMMARY: We have developed U-PRIMER, a primer design program, to compute a minimal primer set (MPS) for any given set of DNA sequences. The U-PRIMER algorithm, which uses automatic variable fixing and automatic redundant constraint elimination to tackle the binary integer programming problem associated with the MPS selection problem. The program has been tested successfully with 32 adipocyte development-related genes and 9 TB-specific genes to obtain their respective MPSs. AVAILABILITY: A free copy of U-PRIMER implemented in C++ programming language is available from http://www.u-vision-biotech.com 相似文献

6.

Supervised learning-based tagSNP selection for genome-wide disease classifications

Liu Q Yang J Chen Z Yang MQ Sung AH Huang X 《BMC genomics》2008,9(Z1):S6

相似文献

7.

Computer simulation of marker-assisted selection utilizing linkage disequilibrium

W. Zhang C. Smith 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1992,83(6-7):813-820

相似文献

8.

Detecting positive selection from genome scans of linkage disequilibrium

Chad D Huff Henry C Harpending Alan R Rogers 《BMC genomics》2010,11(1):1-9

Background

During the lifetime of a fermenter culture, the soil bacterium S. coelicolor undergoes a major metabolic switch from exponential growth to antibiotic production. We have studied gene expression patterns during this switch, using a specifically designed Affymetrix genechip and a high-resolution time-series of fermenter-grown samples.

Results

Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis.

Conclusions

Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in S. coelicolor, which will form the starting point for future attempts at engineering antibiotic production in a biotechnological setting. 相似文献

9.

GLIDERS - A web-based search engine for genome-wide linkage disequilibrium between HapMap SNPs

Robert Lawrence Aaron G Day-Williams Richard Mott John Broxholme Lon R Cardon Eleftheria Zeggini 《BMC bioinformatics》2009,10(1):367-5

Background

A number of tools for the examination of linkage disequilibrium (LD) patterns between nearby alleles exist, but none are available for quickly and easily investigating LD at longer ranges (>500 kb). We have developed a web-based query tool (GLIDERS: Genome-wide LInkage DisEquilibrium Repository and Search engine) that enables the retrieval of pairwise associations with r² ≥ 0.3 across the human genome for any SNP genotyped within HapMap phase 2 and 3, regardless of distance between the markers. 相似文献

10.

Effect on linkage disequilibrium of selection for a quantitative character with epistasis

J. P. Mueller J. W. James 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1983,65(1):25-30

Summary Selection for a character controlled by additive genes induces linkage disequilibrium which reduces the additive genetic variance usable for further selective gains. Additive x additive epistasis contributes to selection response through development of linkage disequilibrium between interacting loci. To investigate the relative importance of the two effects of linkage disequilibrium, formulae are presented and results are reported of simulations using models involving additive, additive x additive and dominance components. The results suggest that so long as epistatic effects are not large relative to additive effects, and the proportion of pairs of loci which show epistasis is not very high, the predominant effect of linkage disequilibrium will be to reduce the rate of selection response. 相似文献

11.

A simple and efficient algorithm for gene selection using sparse logistic regression 总被引：4，自引：0，他引：4

Shevade SK Keerthi SS 《Bioinformatics (Oxford, England)》2003,19(17):2246-2253

MOTIVATION: This paper gives a new and efficient algorithm for the sparse logistic regression problem. The proposed algorithm is based on the Gauss-Seidel method and is asymptotically convergent. It is simple and extremely easy to implement; it neither uses any sophisticated mathematical programming software nor needs any matrix operations. It can be applied to a variety of real-world problems like identifying marker genes and building a classifier in the context of cancer diagnosis using microarray data. RESULTS: The gene selection method suggested in this paper is demonstrated on two real-world data sets and the results were found to be consistent with the literature. AVAILABILITY: The implementation of this algorithm is available at the site http://guppy.mpe.nus.edu.sg/~mpessk/SparseLOGREG.shtml Supplementary Information: Supplementary material is available at the site http://guppy.mpe.nus.edu.sg/~mpessk/SparseLOGREG.shtml 相似文献

12.

Genome-wide scan for bovine twinning rate QTL using linkage disequilibrium

E.-S. Kim P. J. Berger B. W. Kirkpatrick 《Animal genetics》2009,40(3):300-307

Twinning is a complex trait with negative impacts on health and reproduction, which cause economic loss in dairy production. Several twinning rate quantitative trait loci (QTL) have been detected in previous studies, but confidence intervals for QTL location are broad and many QTL are unreplicated. To identify genomic regions or genes associated with twinning rate, QTL analysis based on linkage combined with linkage disequilibrium (LLD) and individual marker associations was conducted across the genome using high-throughput single nucleotide polymorphism (SNP) genotypes. A total of 9919 SNP markers were genotyped with 200 sires and sons in 19 half-sib North American Holstein dairy cattle families. After SNPs were genotyped, informative markers were selected for genome-wide association tests and QTL searches. Evidence for twinning rate QTL was found throughout the genome. Thirteen markers significantly associated with twinning rate were detected on chromosomes 2, 5 and 14 ( P < 2.3 × 10⁻⁵). Twenty-six regions on fourteen chromosomes were identified by LLD analysis at P < 0.0007. Seven previously reported ovulation or twinning rate QTL were supported by results of single marker association or LLD analyses. Single marker association analysis and LLD mapping were complementary tools for the identification of putative QTL in this genome scan. 相似文献

13.

Volume measures for linkage disequilibrium

Yuguo Chen Chia-Ho Lin Chiara Sabatti 《BMC genetics》2006,7(1):1-8

Background

A major QTL for fatness and growth, denoted FAT1, has previously been detected on pig chromosome 4q (SSC4q) using a Large White – wild boar intercross. Progeny that carried the wild boar allele at this locus had higher fat deposition, shorter length of carcass, and reduced growth. The position and the estimated effects of the FAT1 QTL for growth and fatness have been confirmed in a previous study. In order to narrow down the QTL interval we have traced the inheritance of the wild boar allele associated with high fat deposition through six additional backcross generations.

Results

Progeny-testing was used to determine the QTL genotype for 10 backcross sires being heterozygous for different parts of the broad FAT1 region. The statistical analysis revealed that five of the sires were segregating at the QTL, two were negative while the data for three sires were inconclusive. We could confirm the QTL effects on fatness/meat content traits but not for the growth traits implying that growth and fatness are controlled by distinct QTLs on chromosome 4. Two of the segregating sires showed highly significant QTL effects that were as large as previously observed in the F₂ generation. The estimates for the remaining three sires, which were all heterozygous for smaller fragments of the actual region, were markedly smaller. With the sample sizes used in the present study we cannot with great confidence determine whether these smaller effects in some sires are due to chance deviations, epistatic interactions or whether FAT1 is composed of two or more QTLs, each one with a smaller phenotypic effect. Under the assumption of a single locus, the critical region for FAT1 has been reduced to a 3.3 cM interval between the RXRG and SDHC loci.

Conclusion

We have further characterized the FAT1 QTL on pig chromosome 4 and refined its map position considerably, from a QTL interval of 70 cM to a maximum region of 20 cM and a probable region as small as 3.3 cM. The flanking markers for the small region are RXRG and SDHC and the orthologous region of FAT1 in the human genome is located on HSA1q23.3 and harbors approximately 20 genes. Our strategy to further refine the map position of this major QTL will be i) to type new markers in our pigs that are recombinant in the QTL interval and ii) to perform Identity-By-Descent (IBD) mapping across breeds that have been strongly selected for lean growth. 相似文献

14.

Rates of decay of linkage disequilibrium under two-locus models of selection

Marjorie A. Asmussen Michael T. Clegg 《Journal of mathematical biology》1982,14(1):37-70

The effect of selection and linkage on the decay of linkage disequilibrium, D, is investigated for a hierarchy of two-locus models. The method of analysis rests upon a qualitative classification of the dynamic of D under selection relative to the neutral dynamic. To eliminate the confounding effects of gene frequency change, the behavior of D is first studied with gene frequencies fixed at their invariant values. Second, the results are extended to certain special situations where gene frequencies are changing simultaneously.A wide variety of selection regimes can cause an acceleration of the rate of decay of D relative to the neutral rate. Specifically, the asymptotic rate of decay is always faster than the neutral rate in the neighborhood of a stable equilibrium point, when viabilities are additive or only one locus is selected. This is not necessarily the case for models in which there is nonzero additive epistasis. With multiplicative viabilities, decay is always accelerated near a stable boundary equilibrium, but decay is only faster near the stable central equilibrium (with = 0) if linkage is sufficiently loose. In the symmetric viability model, decay may even be retarded near a stable boundary equilibrium. Decay is only accelerated near a stable corner equilibrium when the double homozygote is more fit than the double heterozygotes. Decay near a stable edge equilibrium may be retarded if there is loose linkage. With symmetric viabilities there is usually an acceleration of the decay process for gene frequencies near 1/2 when the central equilibrium (with = 0) is stable. This is always the case when the sign of the epistasis is negative or zero.Conversely, the decay ofD is retarded in the neighborhood of a stable equilibrium in the multiplicative and symmetric viability models if any of the conditions above are violated. Near an unstable equilibrium of any of the models considered,D may either increase or decay at a rate slower than, equal to, or faster than the neutral rate. These analytic results are supplemented by numerical studies of the symmetric viability model. 相似文献

15.

The trimmed-haplotype test for linkage disequilibrium

下载免费PDF全文

MacLean CJ Martin RB Sham PC Wang H Straub RE Kendler KS 《American journal of human genetics》2000,66(3):1062-1075

Single-marker linkage-disequilibrium (LD) methods cannot fully describe disequilibrium in an entire chromosomal region surrounding a disease allele. With the advent of myriad tightly linked microsatellite markers, we have an opportunity to extend LD analysis from single markers to multiple-marker haplotypes. Haplotype analysis has increased statistical power to disclose the presence of a disease locus in situations where it correctly reflects the historical process involved. For maximum efficiency, evidence of LD ought to come not just from a single haplotype, which may well be rare, but in addition from many similar haplotypes that could have descended from the same ancestral founder but have been trimmed in succeeding generations. We present such an analysis, called the "trimmed-haplotype method." We focus on chromosomal regions that are small enough that disequilibrium in significant portions of them may have been preserved in some pedigrees and yet that contain enough markers to minimize coincidental occurrence of the haplotype in the absence of a disease allele: perhaps regions 1-2 cM in length. In general, we could have no idea what haplotype an ancestral founder carried generations ago, nor do we usually have a precise chromosomal location for the disease-susceptibility locus. Therefore, we must search through all possible haplotypes surrounding multiple locations. Since such repeated testing obliterates the sampling distribution of the test, we employ bootstrap methods to calculate significance levels. Trimmed-haplotype analysis is performed on family data in which genotypes have been assembled into haplotypes. It can be applied either to conventional parent-affected-offspring triads or to multiplex pedigrees. We present a method for summarizing the LD evidence, in any pedigree, that can be employed in trimmed-haplotype analysis as well as in other methods. 相似文献

16.

An evaluation of a novel estimator of linkage disequilibrium

D Gianola S Qanbari H Simianer 《Heredity》2013,111(4):275-285

The analysis of systems involving many loci is important in population and quantitativegenetics. An important problem is the study of linkage disequilibrium (LD), a conceptrelevant in genome-enabled prediction of quantitative traits and in exploration ofmarker–phenotype associations. This article introduces a new estimator of a LDparameter (ρ²) that is much easier to compute than a maximumlikelihood (or Bayesian) estimate of a tetra-choric correlation. We examined theconjecture that the sampling distribution of the estimator of ρ²could be less frequency dependent than that of the estimator ofr², a widely used metric for assessing LD. This was donevia an empirical evaluation of LD in 806 Holstein–Friesian cattle using 771single-nucleotide polymorphism (SNP) markers and of HapMap III data on 21 991 SNPs(chromosome 3) observed in 88 unrelated individuals from Tuscany. Also, 1600 haplotypesover a region of 1 Mb simulated under the coalescent were used to estimate LD usingthe two measures. Subsequently, a simulation study compared the new estimator with that ofr² using several scenarios of LD and allelic frequencies.From these studies, it is concluded that ρ² provides a usefulmetric for the study of LD as the distribution of its estimator is less frequencydependent than that of the standard estimator of r². 相似文献

17.

The efficiency of designs for fine-mapping of quantitative trait loci using combined linkage disequilibrium and linkage

Sang Hong Lee Julius HJ van der Werf 《遗传、选种与进化》2004,36(2):145-161

In a simulation study, different designs were compared for efficiency of fine-mapping of QTL. The variance component method for fine-mapping of QTL was used to estimate QTL position and variance components. The design of many families with small size gave a higher mapping resolution than a design with few families of large size. However, the difference is small in half sib designs. The proportion of replicates with the QTL positioned within 3 cM of the true position is 0.71 in the best design, and 0.68 in the worst design applied to 128 animals with a phenotypic record and a QTL explaining 25% of the phenotypic variance. The design of two half sib families each of size 64 was further investigated for a hypothetical population with effective size of 1000 simulated for 6000 generations with a marker density of 0.25 cM and with marker mutation rate 4 × 10^-4per generation. In mapping using bi-allelic markers, 42~55% of replicated simulations could position QTL within 0.75 cM of the true position whereas this was higher for multi allelic markers (48~76%). The accuracy was lowest (48%) when mutation age was 100 generations and increased to 68% and 76% for mutation ages of 200 and 500 generations, respectively, after which it was about 70% for mutation ages of 1000 generations and older. When effective size was linearly decreasing in the last 50 generations, the accuracy was decreased (56 to 70%). We show that half sib designs that have often been used for linkage mapping can have sufficient information for fine-mapping of QTL. It is suggested that the same design with the same animals for linkage mapping should be used for fine-mapping so gene mapping can be cost effective in livestock populations. 相似文献

18.

A haplotype-based algorithm for multilocus linkage disequilibrium mapping of quantitative trait loci with epistasis 总被引：6，自引：0，他引：6

Lou XY Casella G Littell RC Yang MC Johnson JA Wu R 《Genetics》2003,163(4):1533-1548

For tightly linked loci, cosegregation may lead to nonrandom associations between alleles in a population. Because of its evolutionary relationship with linkage, this phenomenon is called linkage disequilibrium. Today, linkage disequilibrium-based mapping has become a major focus of recent genome research into mapping complex traits. In this article, we present a new statistical method for mapping quantitative trait loci (QTL) of additive, dominant, and epistatic effects in equilibrium natural populations. Our method is based on haplotype analysis of multilocus linkage disequilibrium and exhibits two significant advantages over current disequilibrium mapping methods. First, we have derived closed-form solutions for estimating the marker-QTL haplotype frequencies within the maximum-likelihood framework implemented by the EM algorithm. The allele frequencies of putative QTL and their linkage disequilibria with the markers are estimated by solving a system of regular equations. This procedure has significantly improved the computational efficiency and the precision of parameter estimation. Second, our method can detect marker-QTL disequilibria of different orders and QTL epistatic interactions of various kinds on the basis of a multilocus analysis. This can not only enhance the precision of parameter estimation, but also make it possible to perform whole-genome association studies. We carried out extensive simulation studies to examine the robustness and statistical performance of our method. The application of the new method was validated using a case study from humans, in which we successfully detected significant QTL affecting human body heights. Finally, we discuss the implications of our method for genome projects and its extension to a broader circumstance. The computer program for the method proposed in this article is available at the webpage http://www.ifasstat.ufl.edu/genome/~LD. 相似文献

19.

Combining functional and linkage disequilibrium information in the selection of tag SNPs

Sham PC Ao SI Kwan JS Kao P Cheung F Fong PY Ng MK 《Bioinformatics (Oxford, England)》2007,23(1):129-131

We have developed an online program, WCLUSTAG, for tag SNP selection that allows the user to specify variable tagging thresholds for different SNPs. Tag SNPs are selected such that a SNP with user-specified tagging threshold C will have a minimum R2 of C with at least one tag SNP. This flexible feature is useful for researchers who wish to prioritize genomic regions or SNPs in an association study. AVAILABILITY: The online WCLUSTAG program is available at http://bioinfo.hku.hk/wclustag/ 相似文献

20.

Extended linkage disequilibrium surrounding the hemoglobin E variant due to malarial selection

下载免费PDF全文

Ohashi J Naka I Patarapotikul J Hananantachai H Brittenham G Looareesuwan S Clark AG Tokunaga K 《American journal of human genetics》2004,74(6):1198-1208

The hemoglobin E variant (HbE; ( beta )26Glu-->Lys) is concentrated in parts of Southeast Asia where malaria is endemic, and HbE carrier status has been shown to confer some protection against Plasmodium falciparum malaria. To examine the effect of natural selection on the pattern of linkage disequilibrium (LD) and to infer the evolutionary history of the HbE variant, we analyzed biallelic markers surrounding the HbE variant in a Thai population. Pairwise LD analysis of HbE and 43 surrounding biallelic markers revealed LD of HbE extending beyond 100 kb, whereas no LD was observed between non-HbE variants and the same markers. The inferred haplotype network suggests a single origin of the HbE variant in the Thai population. Forward-in-time computer simulations under a variety of selection models indicate that the HbE variant arose 1,240-4,440 years ago. These results support the conjecture that the HbE mutation occurred recently, and the allele frequency has increased rapidly. Our study provides another clear demonstration that a high-resolution LD map across the human genome can detect recent variants that have been subjected to positive selection. 相似文献