共查询到20条相似文献,搜索用时 9 毫秒
1.
David Quigley 《BMC bioinformatics》2015,16(1)
Background
Gene expression microarrays measure the levels of messenger ribonucleic acid (mRNA) in a sample using probe sequences that hybridize with transcribed regions. These probe sequences are designed using a reference genome for the relevant species. However, most model organisms and all humans have genomes that deviate from their reference. These variations, which include single nucleotide polymorphisms, insertions of additional nucleotides, and nucleotide deletions, can affect the microarray’s performance. Genetic experiments comparing individuals bearing different population-associated single nucleotide polymorphisms that intersect microarray probes are therefore subject to systemic bias, as the reduction in binding efficiency due to a technical artifact is confounded with genetic differences between parental strains. This problem has been recognized for some time, and earlier methods of compensation have attempted to identify probes affected by genome variants using statistical models. These methods may require replicate microarray measurement of gene expression in the relevant tissue in inbred parental samples, which are not always available in model organisms and are never available in humans.Results
By using sequence information for the genomes of organisms under investigation, potentially problematic probes can now be identified a priori. However, there is no published software tool that makes it easy to eliminate these probes from an annotation. I present equalizer, a software package that uses genome variant data to modify annotation files for the commonly used Affymetrix IVT and Gene/Exon platforms. These files can be used by any microarray normalization method for subsequent analysis. I demonstrate how use of equalizer on experiments mapping germline influence on gene expression in a genetic cross between two divergent mouse species and in human samples significantly reduces probe hybridization-induced bias, reducing false positive and false negative findings.Conclusions
The equalizer package reduces probe hybridization bias from experiments performed on the Affymetrix microarray platform, allowing accurate assessment of germline influence on gene expression. 相似文献2.
Validation and extension of an empirical Bayes method for SNP calling on Affymetrix microarrays 下载免费PDF全文
Multiple algorithms have been developed for the purpose of calling single nucleotide polymorphisms (SNPs) from Affymetrix microarrays. We extend and validate the algorithm CRLMM, which incorporates HapMap information within an empirical Bayes framework. We find CRLMM to be more accurate than the Affymetrix default programs (BRLMM and Birdseed). Also, we tie our call confidence metric to percent accuracy. We intend that our validation datasets and methods, refered to as SNPaffycomp, serve as standard benchmarks for future SNP calling algorithms. 相似文献
3.
High-resolution analysis of chromosomal imbalances using the Affymetrix 10K SNP genotyping chip 总被引:5,自引:0,他引:5
Array-based comparative genome hybridization is a powerful tool for detecting chromosomal imbalances at high resolution. However, the design and setup of such arrays are time consuming and expensive and thus worthwhile only when large numbers of arrays will be processed. To provide a feasible solution, we have developed an algorithm that renders the publicly available Affymetrix 10K SNP genotyping chip useful for high-resolution analysis of chromosomal imbalances. We have used our newly developed algorithm to analyze data from Affymetrix 10K chips that were hybridized with DNA probes from a variety of different sources, such as primary tumors, cell lines, and blood from patients with unbalanced translocations. In summary, we were able to (i) demonstrate the capability of our method by reproduction of published and unpublished data obtained with alternative methods and (ii) identify novel imbalances that were not shown before. 相似文献
4.
Background
Affymetrix microarrays are used by many laboratories to generate gene expression profiles. Generally, only large differences (> 1.7-fold) between conditions have been reported. Computational methods to reduce inter-array variability might be of value when attempting to detect smaller differences. We examined whether inter-array variability could be reduced by using data based on the Affymetrix algorithm for pairwise comparisons between arrays (ratio method) rather than data based on the algorithm for analysis of individual arrays (signal method). Six HG-U95A arrays that probed mRNA from young (21–31 yr old) human muscle were compared with six arrays that probed mRNA from older (62–77 yr old) muscle.Results
Differences in mean expression levels of young and old subjects were small, rarely > 1.5-fold. The mean within-group coefficient of variation for 4629 mRNAs expressed in muscle was 20% according to the ratio method and 25% according to the signal method. The ratio method yielded more differences according to t-tests (124 vs. 98 differences at P < 0.01), rank sum tests (107 vs. 85 differences at P < 0.01), and the Significance Analysis of Microarrays method (124 vs. 56 differences with false detection rate < 20%; 20 vs. 0 differences with false detection rate < 5%). The ratio method also improved consistency between results of the initial scan and results of the antibody-enhanced scan.Conclusion
The ratio method reduces inter-array variance and thereby enhances statistical power.5.
Rigaill G Hupé P Almeida A La Rosa P Meyniel JP Decraene C Barillot E 《Bioinformatics (Oxford, England)》2008,24(6):768-774
MOTIVATION: Affymetrix SNP arrays can be used to determine the DNA copy number measurement of 11 000-500 000 SNPs along the genome. Their high density facilitates the precise localization of genomic alterations and makes them a powerful tool for studies of cancers and copy number polymorphism. Like other microarray technologies it is influenced by non-relevant sources of variation, requiring correction. Moreover, the amplitude of variation induced by non-relevant effects is similar or greater than the biologically relevant effect (i.e. true copy number), making it difficult to estimate non-relevant effects accurately without including the biologically relevant effect. RESULTS: We addressed this problem by developing ITALICS, a normalization method that estimates both biological and non-relevant effects in an alternate, iterative manner, accurately eliminating irrelevant effects. We compared our normalization method with other existing and available methods, and found that ITALICS outperformed these methods for several in-house datasets and one public dataset. These results were validated biologically by quantitative PCR. AVAILABILITY: The R package ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) has been submitted to Bioconductor. 相似文献
6.
Algorithms for large-scale genotyping microarrays 总被引:7,自引:0,他引:7
Liu WM Di X Yang G Matsuzaki H Huang J Mei R Ryder TB Webster TA Dong S Liu G Jones KW Kennedy GC Kulp D 《Bioinformatics (Oxford, England)》2003,19(18):2397-2403
MOTIVATION: Analysis of many thousands of single nucleotide polymorphisms (SNPs) across whole genome is crucial to efficiently map disease genes and understanding susceptibility to diseases, drug efficacy and side effects for different populations and individuals. High density oligonucleotide microarrays provide the possibility for such analysis with reasonable cost. Such analysis requires accurate, reliable methods for feature extraction, classification, statistical modeling and filtering. RESULTS: We propose the modified partitioning around medoids as a classification method for relative allele signals. We use the average silhouette width, separation and other quantities as quality measures for genotyping classification. We form robust statistical models based on the classification results and use these models to make genotype calls and calculate quality measures of calls. We apply our algorithms to several different genotyping microarrays. We use reference types, informative Mendelian relationship in families, and leave-one-out cross validation to verify our results. The concordance rates with the single base extension reference types are 99.36% for the SNPs on autosomes and 99.64% for the SNPs on sex chromosomes. The concordance of the leave-one-out test is over 99.5% and is 99.9% higher for AA, AB and BB cells. We also provide a method to determine the gender of a sample based on the heterozygous call rate of SNPs on the X chromosome. See http://www.affymetrix.com for further information. The microarray data will also be available from the Affymetrix web site. AVAILABILITY: The algorithms will be available commercially in the Affymetrix software package. 相似文献
7.
Yang CH Cheng YH Yang CH Chuang LY 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2012,9(3):837-845
Polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) is useful in small-scale basic research studies of complex genetic diseases that are associated with single nucleotide polymorphism (SNP). Designing a feasible primer pair is an important work before performing PCR-RFLP for SNP genotyping. However, in many cases, restriction enzymes to discriminate the target SNP resulting in the primer design is not applicable. A mutagenic primer is introduced to solve this problem. GA-based Mismatch PCR-RFLP Primers Design (GAMPD) provides a method that uses a genetic algorithm to search for optimal mutagenic primers and available restriction enzymes from REBASE. In order to improve the efficiency of the proposed method, a mutagenic matrix is employed to judge whether a hypothetical mutagenic primer can discriminate the target SNP by digestion with available restriction enzymes. The available restriction enzymes for the target SNP are mined by the updated core of SNP-RFLPing. GAMPD has been used to simulate the SNPs in the human SLC6A4 gene under different parameter settings and compared with SNP Cutter for mismatch PCR-RFLP primer design. The in silico simulation of the proposed GAMPD program showed that it designs mismatch PCR-RFLP primers. The GAMPD program is implemented in JAVA and is freely available at http://bio.kuas.edu.tw/gampd/. 相似文献
8.
9.
High-throughput SNP genotyping 总被引:5,自引:0,他引:5
Whole genome approaches using single nucleotide polymorphism (SNP) markers have the potential to transform complex disease genetics and expedite pharmacogenetics research. This has led to a requirement for high-throughput SNP genotyping platforms. Development of a successful high-throughput genotyping platform depends on coupling reliable assay chemistry with an appropriate detection system to maximise efficiency with respect to accuracy, speed and cost. Current technology platforms are able to deliver throughputs in excess of 100 000 genotypes per day, with an accuracy of >99%, at a cost of 20-30 cents per genotype. In order to meet the demands of the coming years, however, genotyping platforms need to deliver throughputs in the order of one million genotypes per day at a cost of only a few cents per genotype. In addition, DNA template requirements must be minimised such that hundreds of thousands of SNPs can be interrogated using a relatively small amount of genomic DNA. As such, it is predicted that the next generation of high-throughput genotyping platforms will exploit large-scale multiplex reactions and solid phase assay detection systems. 相似文献
10.
Genotyping and annotation of Affymetrix SNP arrays 总被引:1,自引:0,他引:1
In this paper we develop a new method for genotyping Affymetrix single nucleotide polymorphism (SNP) array. The method is based on (i) using multiple arrays at the same time to determine the genotypes and (ii) a model that relates intensities of individual SNPs to each other. The latter point allows us to annotate SNPs that have poor performance, either because of poor experimental conditions or because for one of the alleles the probes do not behave in a dose–response manner. Generally, our method agrees well with a method developed by Affymetrix. When both methods make a call they agree in 99.25% (using standard settings) of the cases, using a sample of 113 Affymetrix 10k SNP arrays. In the majority of cases where the two methods disagree, our method makes a genotype call, whereas the method by Affymetrix makes a no call, i.e. the genotype of the SNP is not determined. By visualization it is indicated that our method is likely to be correct in majority of these cases. In addition, we demonstrate that our method produces more SNPs that are in concordance with Hardy–Weinberg equilibrium than the method by Affymetrix. Finally, we have validated our method on HapMap data and shown that the performance of our method is comparable to other methods. 相似文献
11.
de Lambert B Chaix C Charreyre MT Martin T Aigoui A Perrin-Rubens A Pichot C Mandrand B 《Analytical biochemistry》2008,373(2):229-238
Polymer-oligonucleotide conjugates were synthesized from the amphiphilic block copolymer poly(tert-butylacrylamide-b-(N-acryloylmorpholine-co-N-acryloxysuccinimide)) using an original solid-phase DNA synthesis strategy. This method provided conjugates highly functionalized with oligonucleotides throughout the polymer chain. After purification, block copolymer-oligonucleotide conjugates were spotted on a multidetection microarray system developed by Apibio using a standard nanodroplet piezo inkjet spotting technique to develop the oligosorbent assay (OLISA). Two genotyping models (HLA-DQB1 and platelet glycoproteins [GPs]), which are particularly difficult to study with standard systems, were evaluated. For both models, block copolymer-oligonucleotide conjugates used as capture probes amplified the responses of in vitro diagnostic assays. The detection limit reached by using conjugates was estimated at 15 pM for a 219-bp DNA target (HLA-DQB1 model). Moreover, single nucleotide polymorphism was detected in the platelet GPs genotyping model. The use of polymer conjugates led to a significant improvement in both sensitivity and specificity of standard hybridization assays even when applied to complex biological models. 相似文献
12.
MOTIVATION: DNA copy number aberrations are frequently found in different types of cancer. Recent developments of microarray-based approaches have broadened the knowledge on number and structure of such aberrations. High-density single nucleotide polymorphism (SNP) microarrays provide an extremely high resolution with up to 500,000 SNPs per genome. Owing to the enormous amount of data the detection of common aberrations in large datasets is a great challenge. We describe a novel open source software tool--IdeogramBrowser--which was specifically designed for use with the Affymetrix SNP arrays. It provides an interactive karyotypic visualization of multiple aberration profiles and direct links to GeneCards. Visualization of consensus regions together with gene representation allows the explorative assessment of the data. AVAILABILITY: IdeogramBrowser and its source code are freely available under a creative commons license and can be obtained from http://www.informatik.uni-ulm.de/ni/staff/HKestler/ideo/. IdeogramBrowser is a platform independent Java application. 相似文献
13.
Association mapping aimed at identifying DNA polymorphisms that contribute to variation in complex traits entails genotyping a large number of single-nucleotide polymorphisms (SNPs) in a very large panel of individuals. Few technologies, however, provide inexpensive high-throughput genotyping. Here, we present an efficient approach developed specifically for genotyping large fixed panels of diploid individuals. The cost-effective, open-source nature of our methodology may make it particularly attractive to those working in nonmodel systems. 相似文献
14.
Quantitative evaluation by minisequencing and microarrays reveals accurate multiplexed SNP genotyping of whole genome amplified DNA 总被引:8,自引:3,他引:8 下载免费PDF全文
Whole genome amplification (WGA) procedures such as primer extension preamplification (PEP) or multiple displacement amplification (MDA) have the potential to provide an unlimited source of DNA for large-scale genetic studies. We have performed a quantitative evaluation of PEP and MDA for genotyping single nucleotide polymorphisms (SNPs) using multiplex, four-color fluorescent minisequencing in a microarray format. Forty-five SNPs were genotyped and the WGA methods were evaluated with respect to genotyping success, signal-to-noise ratios, power of genotype discrimination, yield and imbalanced amplification of alleles in the MDA product. Both PEP and MDA products provided genotyping results with a high concordance to genomic DNA. For PEP products the power of genotype discrimination was lower than for MDA due to a 2-fold lower signal-to-noise ratio. MDA products were indistinguishable from genomic DNA in all aspects studied. To obtain faithful representation of the SNP alleles at least 0.3 ng DNA should be used per MDA reaction. We conclude that the use of WGA, and MDA in particular, is a highly promising procedure for producing DNA in sufficient amounts even for genome wide SNP mapping studies. 相似文献
15.
Homer N Szelinger S Redman M Duggan D Tembe W Muehling J Pearson JV Stephan DA Nelson SF Craig DW 《PLoS genetics》2008,4(8):e1000167
We use high-density single nucleotide polymorphism (SNP) genotyping microarrays to demonstrate the ability to accurately and robustly determine whether individuals are in a complex genomic DNA mixture. We first develop a theoretical framework for detecting an individual's presence within a mixture, then show, through simulations, the limits associated with our method, and finally demonstrate experimentally the identification of the presence of genomic DNA of specific individuals within a series of highly complex genomic mixtures, including mixtures where an individual contributes less than 0.1% of the total genomic DNA. These findings shift the perceived utility of SNPs for identifying individual trace contributors within a forensics mixture, and suggest future research efforts into assessing the viability of previously sub-optimal DNA sources due to sample contamination. These findings also suggest that composite statistics across cohorts, such as allele frequency or genotype counts, do not mask identity within genome-wide association studies. The implications of these findings are discussed. 相似文献
16.
17.
Finocchiaro G Parise P Minardi SP Alcalay M Müller H 《Bioinformatics (Oxford, England)》2004,20(18):3670-3672
SUMMARY: GenePicker allows efficient analysis of Affymetrix gene expression data performed in replicate, through definition of analysis schemes, data normalization, t-test/ANOVA, Change-Fold Change-analysis and yields lists of differentially expressed genes with high confidence. Comparison of noise and signal analysis schemes allows determining a signal-to-noise ratio in a given experiment. Change Call, Fold Change and Signal mean ratios are used in the analysis. While each parameter alone yields gene lists that contain up to 30% false positives, the combination of these parameters nearly eliminates the false positives as verified by northern blotting, quantitative PCR in numerous independent experiments as well as by the analysis of spike-in data. AVAILABILITY: http://www.ifom-firc.it/RESEARCH/Appl_Bioinfo/tools.html. SUPPLEMENTARY INFORMATION: http://www.ifom-firc.it/RESEARCH/Appl_Bioinfo/tools.html. 相似文献
18.
High-throughput SNP genotyping platforms use automated genotype calling algorithms to assign genotypes. While these algorithms work efficiently for individual platforms, they are not compatible with other platforms, and have individual biases that result in missed genotype calls. Here we present data on the use of a second complementary SNP genotype clustering algorithm. The algorithm was originally designed for individual fluorescent SNP genotyping assays, and has been optimized to permit the clustering of large datasets generated from custom-designed Affymetrix SNP panels. In an analysis of data from a 3K array genotyped on 1,560 samples, the additional analysis increased the overall number of genotypes by over 45,000, significantly improving the completeness of the experimental data. This analysis suggests that the use of multiple genotype calling algorithms may be advisable in high-throughput SNP genotyping experiments. The software is written in Perl and is available from the corresponding author. 相似文献
19.
A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot effectively carry out with one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. 相似文献
20.
Roded Sharan Jens Gramm Zohar Yakhini Amir Ben-Dor 《Journal of computational biology》2005,12(5):514-533
Association studies in populations relate genomic variation among individuals with medical condition. Key to these studies is the development of efficient and affordable genotyping techniques. Generic genotyping assays are independent of the target SNPs and offer great flexibility in the genotyping process. Efficient use of such assays calls for identifying sets of SNPs that can be interrogated in parallel under constraints imposed by the genotyping technology. In this paper, we study problems arising in the design of genotyping experiments using generic assays. Our problem formulation deals with two main factors that affect the genotyping cost: the number of assays used and the number of PCR reactions required for sample preparation. We prove that the resulting computational problems are hard, but provide approximate and heuristic solutions to these problems. Our algorithmic approach is based on recasting the multiplexing problems as partitioning and packing problems on a bipartite graph. We tested our algorithmic approaches on an extensive collection of synthetic data and on data that was simulated using real SNP sequences. Our results show that the algorithms achieve near-optimal designs in many cases and demonstrate the applicability of generic assays to SNP genotyping. 相似文献