首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Copy number variants (CNVs) are currently defined as genomic sequences that are polymorphic in copy number and range in length from 1000 to several million base pairs. Among current array-based CNV detection platforms, long-oligonucleotide arrays promise the highest resolution. However, the performance of currently available analytical tools suffers when applied to these data because of the lower signal:noise ratio inherent in oligonucleotide-based hybridization assays. We have developed wuHMM, an algorithm for mapping CNVs from array comparative genomic hybridization (aCGH) platforms comprised of 385 000 to more than 3 million probes. wuHMM is unique in that it can utilize sequence divergence information to reduce the false positive rate (FPR). We apply wuHMM to 385K-aCGH, 2.1M-aCGH and 3.1M-aCGH experiments comparing the 129X1/SvJ and C57BL/6J inbred mouse genomes. We assess wuHMM's performance on the 385K platform by comparison to the higher resolution platforms and we independently validate 10 CNVs. The method requires no training data and is robust with respect to changes in algorithm parameters. At a FPR of <10%, the algorithm can detect CNVs with five probes on the 385K platform and three on the 2.1M and 3.1M platforms, resulting in effective resolutions of 24 kb, 2–5 kb and 1 kb, respectively.  相似文献   

2.
We consider the problem of sequence reconstruction in sequencing-by-hybridization in the presence of spectrum errors. As suggested by intuition, and reported in the literature, false-negatives (i.e., missing spectrum probes) are by far the leading cause of reconstruction failures. In a recent paper we have described an algorithm, called "threshold-theta", designed to recover from false negatives. This algorithm is based on overcompensating for missing extensions by allowing larger reconstruction subtrees. We demonstrated, both analytically and with simulations, the increasing effectiveness of the approach as the parameter theta grows, but also pointed out that for larger error rates the size of the extension trees translates into an unacceptable computational burden. To obviate this shortcoming, in this paper we propose an adaptive approach which is both effective and efficient. Effective, because for a fixed value of theta it performs as well as its single-threshold counterpart, efficient because it exhibits substantial speed-ups over it. The idea is that, for moderate error rates a small fraction of the target sequence can be involved in error recovery; thus, expectedly the remainder of the sequence is reconstructible by the standard noiseless algorithm, with the provision to switch to operation with increasingly higher thresholds after detecting failure. This policy generates interesting and complex interplays between fooling probes and false negatives. These phenomena are carefully analyzed for random sequences and the results are found to be in excellent agreement with the simulations. In addition, the experimental algorithmic speed-ups of the multithreshold approach are explained in terms of the interaction amongst the different threshold regimes.  相似文献   

3.
Sequence-specific oligonucleotide probes play a crucial role in hybridization techniques including PCR, DNA microarray and RNA interference. Once the entire genome becomes the search space for target genes/genomic sequences, however, cross-hybridization to non-target sequences becomes a problem. Large gene families with significant similarity among family members, such as the P450s, are particularly problematic. Additionally, accurate single nucleotide polymorphism (SNP) detection depends on probes that can distinguish between nearly identical sequences. Conventional oligonucleotide probes that are perfectly matched to target genes/genomic sequences are often unsuitable in such cases. Carefully designed mismatches can be used to decrease cross-hybridization potential, but implementing all possible mismatch probes is impractical. Our study provides guidelines for designing non-perfectly matched DNA probes to target DNA sequences as desired throughout the genome. These guidelines are based on the analysis of hybridization data between perfectly matched and non-perfectly matched DNA sequences (single-point or double-point mutated) calculated in silico. Large changes in hybridization temperature predicted by these guidelines for non-matched oligonucleotides fit independent experimental data very well. Applying the guidelines to find oligonucleotide microarray probes for P450 genes, we confirmed the ability of our point mutation method to differentiate the individual genes in terms of thermodynamic calculations of hybridization and sequence similarity.  相似文献   

4.
A method for mapping nonisotopically labeled probes to human metaphase chromosomes that can be used with laser scanning confocal microscopy has been developed. Only a limited number of wavelengths are available from the argon ion lasers used in most commercial instruments and therefore a method that allowed the visualization of bands on human chromosomes stained with propidium iodide and, simultaneously, the detection of hybridization signals using FITC-labeled antibodies was developed. The confocal microscope was used to map single-copy probes to chromosome bands and the positions of the probes on the R-banded chromosomes corresponded to map positions previously determined on Hoechst 33258-stained chromosomes (G-banded). A comparison of confocal imaging of single-copy hybridization signals with conventional fluorescence microscopy and high-sensitivity video cameras revealed little difference in sensitivity but greater resolution of chromosome bands with the confocal microscope. The polymerase chain reaction was used to prepare nonisotopically labeled probes for in situ hybridization and to amplify Alu and KpnI family repeats from cloned DNA to be used to suppress hybridization of these repeat sequences so that a cosmid probe could be mapped to a chromosome band.  相似文献   

5.
The emergence of drug-resistant strains of Mycobacterium tuberculosis is a serious public health problem. Many of the specific gene mutations that cause drug resistance in M. tuberculosis are point mutations. We are developing a PCR-peptide nucleic acid (PNA)-based ELISA as a diagnostic method to recognize point mutations in genes associated with isoniazid and rifampin resistance in M. tuberculosis. Specific point mutation-containing sequences and wild-type sequences of cloned mycobacterial genes were PCR-amplified, denatured, and hybridized with PNA probes bound to microplate wells. Using 15-base PNA probes, we established the hybridization temperatures (50 degrees C-55 degrees C) and other experimental conditions suitable for detecting clinically relevant point mutations in the katG and rpoB genes. Hybridization of PCR-amplified sequences that contained these point mutations with complementary mutation-specific PNAs resulted in significant increases in ELISA response compared with hybridization using wild-type-specific PNAs. Conversely, PCR-amplified wild-type sequences hybridized much more efficiently with wild-type PNAs than with the mutation-specific PNAs. Using the M. tuberculosis cloned genes and PCR-PNA-ELISA format developed here, M. tuberculosis sequences containing point mutations associated with drug resistance can be identified in less than 24 h.  相似文献   

6.
Fluorescence in situ hybridization (FISH) has impacted profoundly on our knowledge of the in situ ecophysiology and biodiversity of bacteria in natural communities. However, it has many technical challenges including the possibility of false positives from the binding of probes to non‐target rRNA sequences. We show here that probe target sites containing single‐base insertions or deletions can lead to false FISH positives, the result of hybridization with a bulge around the missing base. Experimental and in silico data suggest this situation occurs at a surprisingly high frequency. The existence of such sites is not currently considered during most FISH probe design processes. We describe software to identify potential non‐target sites resulting from single‐base insertions or deletions in rRNA sequences. This software also provides an estimate of the FISH probe hybridization efficiency to these sites.  相似文献   

7.
In this paper, we describe the detection of bacterial cell-extracted 16S ribosomal RNA (rRNA) using an emerging technology, surface plasmon resonance (SPR) imaging of DNA arrays. Surface plasmon resonance enables detection of molecular interactions on surfaces in response to changes in the index of refraction, therefore eliminating the need for a fluorescent or radioactive label. A variation of the more common SPR techniques, SPR imaging enables detection from multiple probes in a reusable array format. The arrays developed here contain DNA probes (15-21 bases) designed to be complementary to 16S rRNA gene sequences of Escherichia coli and Bacillus subtilis as well as to a highly conserved sequence found in rRNAs from most members of the domain Bacteria. We report species-specific hybridization of cell-extracted total RNA and in vitro transcribed 16S rRNA to oligonucleotide probes on SPR arrays. We tested multiple probe sequences for each species, and found that success or failure of hybridization was dependent upon probe position in the 16S rRNA molecule. It was also determined that one of the probes intended to bind 16S rRNA also bound an unknown protein. The amount of binding to these probes was quantified with SPR imaging. A detection limit of 2 micro g ml-1 was determined for fragmented E. coli total cellular RNA under the experimental conditions used. These results indicate the feasibility of using SPR imaging for 16S rRNA identification and encourage further development of this method for direct detection of other RNA molecules.  相似文献   

8.

Background  

The detection of enriched DNA or RNA fragments by tiling microarrays has become more and more popular. These microarrays contain a high number of small probes covering genomic loci. However, to achieve high coverage the probe sequences cannot be selected for their hybridization properties. The affinity of the probes towards their targets varies in a sequence-dependent manner. In order to remove this bias a number of approaches have been developed and shown to increase the detection of enriched DNA or RNA fragments. However, these approaches also employ a peak detection algorithm that is different from the one used previously. Thus, it seems possible that the enhancement of detection is due to the peak detection algorithm rather than the sequence-dependent normalization.  相似文献   

9.
The previously developed magnetic-capture hybridization technique employing bacterial magnetic particles was applied to discriminate between Atlantic and Pacific subspecies of the northern bluefin tuna (Thunnus thynnus) using specific DNA sequences. Nucleotide sequences of a 925-bp fragment (ATCO) flanking the mitochondrial ATPase and cytochrome oxidase subunit III genes in these two subspecies were compared. Two regions having single-nucleotide and three-nucleotide differences between the subspecies were adopted to design DNA probes (NR1, 21-mer; NR2, 29-mer), and two internal primer sets were designed to amplify DNA fragments containing these regions. The DNA probes were immobilized on bacterial magnetic particles via streptavidin-biotin conjugation and subjected to magnetic-capture hybridization with the digoxigenin-labeled fragments amplified using the internal primers. The luminescence intensities of DNA on bacterial magnetic particles obtained by hybridization between the probes and the complementary fragments were higher than those obtained by hybridization with noncomplementary fragments. These data suggest that this system employing DNA on bacterial magnetic particles may be useful for discrimination of these two subspecies by recognizing a single-nucleotide difference. Received January 17, 2000; accepted January 28, 2000.  相似文献   

10.

Background  

DNA microarrays contain thousands of different probe sequences represented on their surface. These are designed in such a way that potential cross-hybridization reactions with non-target sequences are minimized. However, given the large number of probes, the occurrence of cross hybridization events cannot be excluded. This problem can dramatically affect the data quality and cause false positive/false negative results.  相似文献   

11.
探针设计是SARS病毒再测序DNA微阵列制作的关键步骤,为了保证探针的杂交条件尽可能一致,采用了作者提出的两种等长变覆盖的探针设计方法,即基于Tm距离的算法和遗传算法。针对SAILS病毒基因组中的两段特异序列设计了一组探针,并与等长移位法和变长变覆盖法的设计结果进行了比较。等长变覆盖法得到的探针集在探针长度一致的情况下,探针的Tm值有较小的标准差和变化范围。结果表明,等长变覆盖法得到的探针具有更好的杂交条件一致性。  相似文献   

12.
Bacterial diversity is central to ecosystem sustainability and soil biological function, for which the role of roots is important. The high-throughput analysis potential of taxonomic microarray should match the breadth of bacterial diversity. Here, the power of this technology was evidenced through methodological verifications and analysis of maize rhizosphere effect based on a 16S rRNA-based microarray developed from the prototype of H. Sanguin et al. (Environ. Microbiol. 8:289-307, 2006). The current probe set was composed of 170 probes (41 new probes in this work) that targeted essentially the Proteobacteria. Cloning and sequencing of 16S rRNA amplicons were carried out on maize rhizosphere and bulk soil DNA. All tested clones that had a perfect match with corresponding probes were positive in the hybridization experiment. The hierarchically nested probes were reliable, but the level of taxonomic identification was variable, depending on the probe set specificity. The comparison of experimental and theoretical hybridizations revealed 0.91% false positives and 0.81% false negatives. The microarray detection threshold was estimated at 0.03% of a given DNA type based on DNA spiking experiments. A comparison of the maize rhizosphere and bulk soil hybridization results showed a significant rhizosphere effect, with a higher predominance of Agrobacterium spp. in the rhizosphere, as well as a lower prevalence of Acidobacteria, Bacteroidetes, Verrucomicrobia, and Planctomycetes, a new taxon of interest in soil. In addition, well-known taxonomic groups such as Sphingomonas spp., Rhizobiaceae, and Actinobacteria were identified in both microbial habitats with strong hybridization signals. The taxonomic microarray developed in the present study was able to discriminate and characterize bacterial community composition in related biological samples, offering extensive possibilities for systematic exploration of bacterial diversity in ecosystems.  相似文献   

13.
Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin (‘mismapping’) and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.  相似文献   

14.
Cross-hybridization of repetitive sequences in genomic and expression arrays is reported to be suppressed with repeat-blocking nucleic acids (Cot-1 DNA). Contrary to expectation, we demonstrated that Cot-1 also enhanced non-specific hybridization between probes and genomic targets. When added to target DNA, Cot-1 enhanced hybridization (2.2- to 3-fold) to genomic probes containing conserved repetitive elements. In addition to repetitive sequences, Cot-1 was found to be enriched for linked single copy (sc) sequences. Adventitious association between these sequences and probes distort quantitative measurements of the probes hybridized to desired genomic targets. Quantitative microarray hybridization studies using Cot-1 DNA are also susceptible to these effects, especially for probes that map to genomic regions containing conserved repetitive sequences. Hybridization measurements with such probes are less reproducible in the presence of Cot-1 than for probes derived from sc regions or regions containing divergent repeat elements, a finding with significant ramifications for genomic and expression microarray studies. We mitigated the requirement for Cot-1 either by hybridizing with computationally defined sc probes lacking repeats or by substituting synthetic repetitive elements complementary to sequences in genomic probes.  相似文献   

15.
We have developed a low-density oligonucleotide-based micro-array where 5'-end-tethered capture probe sequences were derived from Primer Express software. The capture probes represent hydrolysis probe sequences devoid of any fluorochromes and were shown to retain hybridization binding specificity to their amplicons; hybridization specificity was retained independent to probe sequences. This procedure allowed the specificity of each capture probe to be verified using real-time polymerase chain reaction (PCR) in the presence of nucleic acid sequences typically expected to be present within a sample and therefore has reduced possibility of nonspecific hybridization when used in a micro-array format. We propose that specificity-validated probes are applied to form a micro-array for the purpose of general target screening, with incumbent multiparallelization and cost and time savings. However, if required, the subset of probe sequences of interest can be used for quantitative assessment in conventional real-time PCR.  相似文献   

16.
The microarray approach has been proposed for high throughput analysis of the microbial community by providing snapshots of the microbial diversity under different environmental conditions. For this purpose, a prototype of a 16S rRNA-based taxonomic microarray was developed and evaluated for assessing bacterial community diversity. The prototype microarray is composed of 122 probes that target bacteria at various taxonomic levels from phyla to species (mostly Alphaproteobacteria). The prototype microarray was first validated using bacteria in pure culture. Differences in the sequences of probes and potential target DNAs were quantified as weighted mismatches (WMM) in order to evaluate hybridization reliability. As a general feature, probes having a WMM > 2 with target DNA displayed only 2.8% false positives. The prototype microarray was subsequently tested with an environmental sample, which consisted of an Agrobacterium-related polymerase chain reaction amplicon from a maize rhizosphere bacterial community. Microarray results were compared to results obtained by cloning-sequencing with the same DNA. Microarray analysis enabled the detection of all 16S rRNA gene sequences found by cloning-sequencing. Sequences representing only 1.7% of the clone library were detected. In conclusion, this prototype 16S rRNA-based taxonomic microarray appears to be a promising tool for the analysis of Alphaproteobacteria in complex ecosystems.  相似文献   

17.
目的 研制并初步评估问号钩端螺旋体(简称钩体)赖型赖株的基因组DNA芯片。方法 利用Primegens引物设计软件筛选出问号钩体赖型赖株全基因组中的特异性基因进行引物设计。对成功设计出相应引物的3 290个基因用聚合酶链反应方法进行扩增,以纯化后的产物点样制备芯片。并用双色荧光杂交策略对芯片质量进行了初步平估。结果 共获得3 290个基因产物用于点样。参考株自身杂交实验结果表明:该芯片有较高的点一致性、信噪比和较低的假阳性率。结论 成功制备了包含问号钩体赖型赖株3 290个目的基因的基因组DNA芯片,并可用于基于该芯片的问号钩体比较基因组学的研究。  相似文献   

18.
Nonradioactive in situ hybridization techniques are becoming increasingly important tools for rapid analysis of the topological organization of DNA and RNA sequences within cells. Prerequisite for further advances with these techniques are multiple labeling and detection systems for different probes. Here we summarize our results with a recently developed labeling and detection system. The DNA probe for in situ hybridization is modified with digoxigenin-labeled deoxyuridine-triphosphate. Digoxigenin is linked to dUTP via an 11-atom linear spacer (Dig-[11]-dUTP). Labeled DNA probes were hybridized in situ to chromosome preparations. The hybridization signal was detected using digoxigenin-specific antibodies covalently coupled to enzyme markers (alkaline phosphatase or peroxidase) or to fluorescent dyes. Color reactions catalyzed by the enzymes resulted in precipitates located on the chromosomes at the site of probe hybridization. This was verified by hybridizing DNA probes of known chromosomal origin. The signals were analyzed by bright field, reflection contrast and fluorescence microscopy. The results indicate that the new technique gives strong signals and can also be used in combination with other systems (e.g., biotin) to detect differently labeled DNA probes on the same metaphase plate.  相似文献   

19.
MOTIVATION: The phylogenetic structure of the bacterial world has been intensively studied by comparing sequences of 16S ribosomal RNA (16S rRNA). This database of sequences is now widely used to design probes for the detection of specific bacteria or groups of bacteria one at a time. The success of such methods reflects the fact that there are local sequence segments that are highly characteristic of particular organisms or groups of organisms. It is not clear, however, the extent to which such signature sequences exist in the 16S rRNA dataset. A better understanding of the numbers and distribution of highly informative oligonucleotide sequences may facilitate the design of hybridization arrays that can characterize the phylogenetic position of an unknown organism or serve as the basis for the development of novel approaches for use in bacterial identification. RESULTS: A computer-based algorithm that characterizes the extent to which any individual oligonucleotide sequence in 16S rRNA is characteristic of any particular bacterial grouping was developed. A measure of signature quality, Q(s), was formulated and subsequently calculated for every individual oligonucleotide sequence in the size range of 5-11 nucleotides and for 15mers with reference to each cluster and subcluster in a 929 organism representative phylogenetic tree. Subsequently, the perfect signature sequences were compared to the full set of 7322 sequences to see how common false positives were. The work completed here establishes beyond any doubt that highly characteristic oligonucleotides exist in the bacterial 16S rRNA sequence dataset in large numbers. Over 16,000 15mers were identified that might be useful as signatures. Signature oligonucleotides are available for over 80% of the nodes in the representative tree.  相似文献   

20.
Sequencing by hybridization is a method for reconstructing a DNA sequence based on its k-mer content. This content, called the spectrum of the sequence, can be obtained from hybridization with a universal DNA chip. However, even with a sequencing chip containing all 4(9) 9-mers and assuming no hybridization errors, only about 400-bases-long sequences can be reconstructed unambiguously. Drmanac et al. (1989) suggested sequencing long DNA targets by obtaining spectra of many short overlapping fragments of the target, inferring their relative positions along the target, and then computing spectra of subfragments that are short enough to be uniquely recoverable. Drmanac et al. do not treat the realistic case of errors in the hybridization process. In this paper, we study the effect of such errors. We show that the probability of ambiguous reconstruction in the presence of (false negative) errors is close to the probability in the errorless case. More precisely, the ratio between these probabilities is 1 + O(p = (1 - p)(4). 1 = d) where d is the average length of subfragments, and p is the probability of a false negative. We also obtain lower and upper bounds for the probability of unambiguous reconstruction based on an errorless spectrum. For realistic chip sizes, these bounds are tighter than those given by Arratia et al. (1996). Finally, we report results on simulations with real DNA sequences, showing that even in the presence of 50% false negative errors, a target of cosmid length can be recovered with less than 0.1% miscalled bases.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号