共查询到20条相似文献,搜索用时 0 毫秒
1.
2.
3.
Raphael B Liu LT Varghese G 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2004,1(2):91-94
Buhler and Tompa (2002) introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases. 相似文献
4.
Z Sedlacek D S Konecki R Siebenhaar P Kioschis A Poustka 《Nucleic acids research》1993,21(15):3419-3425
An essential requirement in the analysis of genomes is the identification of functionally important sequence elements, which are often evolutionarily conserved. We describe here the development of a novel procedure for the selective isolation of conserved sequences which is based on hybridization of PCR-amplifiable DNA fragments from the whole genome of one species to biotinylated DNA from a genomic region of another species. The interspecies DNA hybrids are immobilized and the PCR-amplifiable DNA fragments are eluted, amplified and after further hybridization-amplification rounds cloned. This method was used for the generation of sublibraries of conserved sequences from mouse and pig DNA from regions corresponding to cosmids from the human Xq28 region. Mouse and pig homologs of sequences containing exons of known human genes, as well as exons from novel genes have been identified. 相似文献
5.
A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences 总被引:3,自引:0,他引:3
MOTIVATION: One of the most interesting features of genomes (both coding and non-coding regions) is the presence of relatively short tandemly repeated DNA sequences known as tandem repeats (TRs). We developed a new PC-based stand-alone software analysis program, combining sequence motif searches with keywords such as organs, tissues, cell lines or development stages for finding exact, inexact and compound, TRs. Tandem Repeats Analyzer 1.5 (TRA) has several advanced repeat search parameters/options over other repeat finder programs as it does not only accept GenBank, FASTA and expressed sequence tag (EST) sequence files but also does analysis of multifiles with multisequences. Advanced user-defined parameters/options let the researchers use different motif lengths search criteria for varying motif lengths simultaneously. The outputs show statistical results to be evaluated by the user. The discovery of TRs in ESTs could be useful for both gene mapping and association studies and discovering TRs located in coding regions of important genes that are expressed under various conditions of environment, stress, organ, tissue and development stage. RESULTS: In this paper, we demonstrated applications of TRA using 175 899 ESTs sequences for three Arabidopsis spp. downloaded from GenBank. The EST-SSRs/ESTs ratios were found 43.1%, 15.3% and 2.34% in A.lyrata, A.thaliana and A.halleri, respectively. Analysis revealed that organs, tissues and development stages possessed different amounts of repeats and repeat compositions. This indicated that the distribution of TRs among the tissues or organs may not be random differing from the untranscribed repeats found in genomes. AVAILABILITY: The program can be obtained free by anonymous FTP from ftp.akdeniz.edu.tr/Araclar/TRA. 相似文献
6.
Direct amplification of minisatellite-region DNA with VNTR core sequences in the genus Oryza 总被引:3,自引:0,他引:3
Z. Zhou P. J. Bebeli D. J. Somers J. P. Gustafson 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》1997,95(5-6):942-949
A polymerase chain reaction (PCR) application, involving the directed amplification of minisatellite-region DNA (DAMD) with
several minisatellite core sequences as primers, was used to detect genetic variation in 17 species of the genus Oryza and several rice cultivars (O. sativa L.). The electrophoretic analysis of DAMD-PCR products showed high levels of variation between different species and little
variation between different cultivars of O. sativa. Polymorphisms were also found between accessions within a species, and between individual plants within an accession of
several wild species. The DAMD-PCR yielded genome-specific banding patterns for the species studied. Several DAMD-PCR-generated
DNA fragments were cloned and characterized. One clone was capable of detecting multiple fragments and revealed individual-specific
hybridization banding patterns using genomic DNA from wild species as well as rice cultivars. A second clone detected only
a single polymorphic locus, while a third clone expressed a strong genome specificity by Southern analysis. The results demonstrated
that DAMD-PCR is potentially useful for species and genome identification in Oryza. The DAMD-PCR technique also allows for the isolation of informative molecular probes to be utilized in DNA fingerprinting
and genome identification in rice.
Received: 1 October 1996 / Accepted: 25 April 1997 相似文献
7.
8.
Yang CH Chou PJ Luo ZL Chou IC Chang JC Cheng CC Martin CR Waring MJ Sheh L 《Bioorganic & medicinal chemistry》2003,11(15):3279-3288
Two dodecapeptide amines: (WPRK)(3)NH(2)[WR-12] and (YPRK)(3)NH(2)[YR-12], and a 30-mer polypeptide amide (SP-30) were synthesized by solid-phase peptide methodology. DNase I footprinting studies on a 117-mer DNA showed that WR-12 and YR-12 bind selectively to DNA sequences in a manner similar to SP-30 which has a repeating SPK(R)K sequence. The most distinctive blockages seen with all three peptides occur at positions 26-30, 21-24 and 38-45 around sequences 5'-GAATT-3', 5'-TAAT-3' and 5'-AAAACGAC-3', respectively. However, it appears that YR-12 is better able to extend its recognition site to include CG pairs than is SP-30. At low concentrations YR-12 was able to induce enhanced rates of DNase I cleavage at regions surrounding some of its binding sites. To obtain further quantitative data supplementary to the footprinting work, equilibrium binding experiments were performed in which the binding of the two peptides to six decanucleotide duplexes was compared. Scatchard analyses indicated that WR-12 may be more selective for oligomers containing runs of consecutive purines or pyrimidines. On the other hand, YR-12 binds better to d(CTTAGACGTC)- d(GACGTCTAAG) than to the other oligomer duplexes, denoting selectivity for evenly distributed C/G and A/T sequences. 相似文献
9.
Chin F Leung HC 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2008,5(1):110-119
The problem of discovering novel motifs of binding sites is important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (position weight matrix (PWM) or position specific scoring matrix (PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding sites of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of discovering the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif finding software, such as Weeder, MEME and AlignACE, fail. 相似文献
10.
Direct determination of DNA nucleotide sequences: structure of a fragment of bacteriophage phiX172 DNA 总被引:28,自引:0,他引:28
Methods for the direct determination of nucleotide sequences in DNA have been developed and used to determine the complete primary structure of a fragment of bacteriophage φX174 DNA which is 48 residues in length. This fragment was liberated from φX DNA by digestion at low temperature and high ionic strength with the T4 phage-induced endonuclease IV. The fragment was redigested with endonuclease IV under vigorous conditions and the products fractionated two-dimensionally providing a characteristic endonuclease IV “fingerprint” of the fragment. The Burton (Burton &; Petersen, 1960) depurination reaction was used to characterize the redigestion products and identify the pyrimidine residues at their 5′ and 3′ termini. These oligonucleotide products were then fully sequenced by partial exonuclease digestion with spleen and snake venom phosphodiesterase and analysis of the fractionated digests by base composition, depurination, and 5′ end-group analysis using exonuclease I. Rules for the interpretation of two-dimensional fingerprints of partial exonuclease digests, which rapidly provide sequence information by simple inspection, were also deduced. To derive the complete structure of the fragment, the fully sequenced oligonucleotides were ordered by characterizing large, overlapping, partial endonuclease IV digestion products by means of the depurination reaction. The sequencing methods described are general and may be used for the direct determination of the primary structures of other fragments of DNA. 相似文献
11.
Direct detection of unamplified DNA from pathogenic mycobacteria using DNA-derivatized gold nanoparticles 总被引:1,自引:0,他引:1
Emmanouil Liandris Maria Gazouli Margarita Andreadou Mirjana omor Nadica Abazovic Leonardo A. Sechi John Ikonomopoulos 《Journal of microbiological methods》2009,78(3):260-264
Mycobacterial infections have a high economic, human and animal health impact. Herein, we present the development of a colorimetric method that relies on the use of gold nanoparticles for fast and specific detection of Mycobacterium spp. dispensing with the need for DNA amplification. The result can be recorded by visual and/or spectrophotometric comparison of solutions before and after acid induced AuNP-probe aggregation. The presence of a complementary target prevents aggregation and the solution remains pink, whereas in the opposite event it turns to purple. The application of the proposed method on isolated bacteria produced positive results with the mycobacterial isolates and negative with the controls. The minimum detection limit of the assay was defined at 18.75 ng of mycobacterial DNA diluted in a sample-volume of 10 μl. In order to obtain an indication of the method's performance on clinical samples we applied the optimized assay to the detection of Mycobacterium avium subsp. paratuberculosis DNA in faeces, in comparison with real-time PCR. The concordance of the two methods with connection to real-time PCR positive and negative sample was defined respectively as 87.5% and 100%. The proposed method could be used as a highly specific and sensitive screening tool for the detection of mycobacteria directly from clinical samples in a very simple manner, without the need of high-cost dedicated equipment. The technology described here, may develop into a platform that could accommodate detection of many bacterial species and could be easily adapted for high throughput and expedite screening of samples. 相似文献
12.
DNA sequences required for translational frameshifting in production of the transposase encoded by IS1. 总被引:7,自引:0,他引:7
Summary The transposase encoded by insertion sequence IS 1 is produced from two out-of-phase reading frames (insA and B-insB) by translational frameshifting, which occurs within a run of six adenines in the –1 direction. To determine the sequence essential for frameshifting, substitution mutations were introduced within the region containing the run of adenines and were examined for their effects on frameshifting. Substitutions at each of three (2nd, 3rd and 4th) adenine residues in the run, which are recognized by tRNALys reading insA, caused serious defects in frameshifting, showing that the three adenine residues are essential for frameshifting. The effects of substitution mutations introduced in the region flanking the run of adenines and in the secondary structures located downstream were, however, small, indicating that such a region and structures are not essential for frameshifting. Deletion of a region containing the termination codon of insA caused a decrease in -galactosidase activity specified by the lacZ fusion plasmid in frame with B-insB. Exchange of the wild-type termination codon of insA for a different one or introduction of an additional termination codon in the region upstream of the native termination codon caused an increase in -galactosidase activity, indicating that the termination codon in insA affects the efficiency of frameshifting. 相似文献
13.
14.
15.
F Bailly C Bailly M J Waring J P Hénichart 《Biochemical and biophysical research communications》1992,184(2):930-937
The sequence selectivity of binding to DNA by an acridine-linked peptide ligand has been investigated by means of footprinting methodologies. The ligand conjugates an anilino-acridine intercalating chromophore with the potentially minor groove binder octapeptide SPKKSPKK. This basic peptide corresponds to a highly conserved DNA recognition motif found in histone H1 and several other nonhistone proteins. Three complementary techniques using DNase I, hydroxyl radicals and osmium tetroxide as sequencing probes have been employed to evaluate both the sequence specificity of binding and the drug-induced conformational changes in DNA. The results converge to demonstrate the AT-selectivity and support a model in which the peptide moiety lies in the minor groove. DNA-binding sites of the conjugate are restricted to a few alternating AT-sequences proximal to GC-rich regions. Binding to homooligomeric runs of A and T is clearly disfavoured by the hybrid whereas such sequences represent preferred binding sites for the unsubstituted basic peptide. These differences reflect the influence of the anilino-acridine chromophore, which evidently contributes to the DNA recognition process allowing the peptide only to contact defined DNA sequences. 相似文献
16.
Noncanonical parallel-stranded DNA double helices (ps-DNA) comprising natural nucleotide sequences are usually second in stability to antiparallel-stranded (aps) canonical DNA structures, which ensures reliable cell functioning. However, recent data indicate a possible role of ps-DNA in DNA loops or in trinucleotide repeats connected with neurodegenerative diseases. The review surveys recent studies on the effect of nucleotide sequence on preference of one or other type of DNA duplex. (1) Ps-DNA with mixed AT/GC composition was found to have conformational and thermodynamic properties drastically different from those of Watson-Crick double helix. Its stability depends strongly on the specific sequence in a manner peculiar to the ps double helix, because of the energy disadvantage of the AT/GC contacts. The AT/GC boundary facilitated flipping of A and T out of the ps double helix. Proton acceptor groups of bases are exposed into the both grooves of the ps-DNA and are accessible to solvent and ligands, including proteins. (2) DNA regions containing natural minor bases isoguanine and isomethylcytosine were shown to form ps-DNA with transAT-, trans isoGC, and trans iso5meCG pairs exceeding in stability a related aps duplex. (3) Nucleotide sequence dG(GT)4G from yeast telomeres and microsatellites was demonstrated to form novel ps-DNA with GG and TT base pairing. Unlike d(GT)n and d(GnTm) sequences able to form quadruplexes, the dG(GT)4G sequence formed no alternative double- or multistranded structures in a wide range of experimental conditions, thus suggesting that the nucleotide context governs the observed structural polymorphism of the d(GT)n sequence. The possible biological role of ps-DNA and the prospects of its study are discussed. 相似文献
17.
The helix-turn-helix DNA binding motif 总被引:98,自引:0,他引:98
18.
Direct cloning of specific genomic DNA sequences in plasmid libraries following fragment enrichment. 总被引:16,自引:3,他引:16
下载免费PDF全文

We describe a simple method to directly clone any DNA fragment for which a flanking restriction enzyme map is known. Genomic DNA is digested with multiple enzymes cutting outside the fragment to be cloned, selected by electroelution from an agarose gel, and cloned directly into a plasmid vector. It is only necessary to screen 10-1000 colonies and recombinant DNA is ready for immediate molecular analysis without further subcloning. The use of this technique is demonstrated for the cloning of a sequence from within the human alpha-globin complex that was previously shown to be "unclonable" in bacteriophage and cosmid vectors and which is a multiallelic general genetic marker, as well as both beta-globin alleles from an individual with beta-thalassaemia. 相似文献
19.