Similar literature
 Found 20 similar articles (search time: 31 ms)
1.
DNA and protein sequence comparisons are performed by a number of computational algorithms. Most of these algorithms search for the alignment of two sequences that optimizes some alignment score. It is an important problem to assess the statistical significance of a given score. In this paper we use newly developed methods for Poisson approximation to derive estimates of the statistical significance of k-word matches on a diagonal of a sequence comparison. We require at least q of the k letters of the words to match, where 0 < q ≤ k. The distribution of the number of matches on a diagonal is approximated, as well as the distribution of the order statistics of the sizes of clumps of matches on the diagonal. These methods provide an easily computed approximation of the distribution of the longest exact matching word between sequences. The methods are validated using comparisons of vertebrate and E. coli protein sequences. In addition, we compare two HLA class II transplantation antigens by this method and contrast the results with a dynamic programming approach. Several open problems are outlined in the last section. This work was supported by grants DMS 90-05833 from NSF and GM 36230 from NIH.
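The quantity whose significance the abstract assesses, the number of k-word matches on one diagonal with at least q of k letters agreeing, can be counted directly. The following is a minimal sketch under stated assumptions: the function name, the `offset` convention (position i of the first sequence is compared with position i + offset of the second), and the brute-force window sum are all illustrative choices, not the authors' method, and the Poisson approximation itself is not reproduced here.

```python
def kword_matches_on_diagonal(a, b, offset, k, q):
    """Count k-word positions on one diagonal where at least q of k letters match.

    Hypothetical helper for illustration: the diagonal pairs a[i] with
    b[i + offset]; names and conventions are assumptions, not from the paper.
    """
    if not 0 < q <= k:
        raise ValueError("require 0 < q <= k")
    # Range of i for which both a[i] and b[i + offset] exist.
    start = max(0, -offset)
    end = min(len(a), len(b) - offset)
    # Letter-by-letter match indicators along the diagonal.
    hits = [a[i] == b[i + offset] for i in range(start, end)]
    count = 0
    # Slide a window of length k and count windows with >= q matching letters.
    for s in range(len(hits) - k + 1):
        if sum(hits[s:s + k]) >= q:
            count += 1
    return count
```

Setting q equal to k counts exact k-word matches; lowering q admits words with up to k - q mismatches.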

2.
A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in Escherichia coli. These sequences show 96–98% homology to each other and are about 85% homologous to that of common carp. In addition, an attempt has been made for the first time to describe a 3-D model of the fish GH protein.

3.
Well-known dynamic programming algorithms exist for comparing two finite sequences in O(N^2) time and storage, where N is the common sequence length. Extensions to the comparison of M finite sequences require O((2N)^M) time and storage, making such algorithms difficult even for M = 3. A simple generalization of the sequences makes it possible to obtain some results about the geometry of sequence alignments. These ideas suggest heuristic approaches to problems of comparing several sequences. If M sequences are known to be related by a binary tree, they can be aligned in O(MN^2) time and O(N^2 + NM) storage. This work was supported by a grant from the System Development Foundation.
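The quadratic-time pairwise comparison mentioned above can be sketched as a standard global-alignment recurrence. This is a minimal illustration, assuming a simple linear gap penalty and arbitrary scoring values (`match`, `mismatch`, `gap` are assumptions, not taken from the paper). It returns only the score, so it keeps two rows rather than the full O(N^2) table that a traceback would need.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best global alignment score of a and b under a linear gap penalty.

    Illustrative scoring parameters; runs in O(len(a) * len(b)) time.
    """
    # Row 0: aligning the empty prefix of a against prefixes of b (all gaps).
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            # Either align a[i-1] with b[j-1], or open a gap in one sequence.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr[j] = max(diag, prev[j] + gap, curr[j - 1] + gap)
        prev = curr
    return prev[len(b)]
```

Extending the same recurrence naively to M sequences fills an M-dimensional table, which is where the exponential cost quoted in the abstract comes from.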

4.
Twelve fungi, namely Alternaria alternata, Aspergillus flavus, A. niger, A. ochraceus, Actinomucor repens, Capnodium spp., Curvularia lunata, Fusarium pallidoroseum, F. solani, F. verticillioides, Penicillium citrinum and Rhizopus stolonifer, were recorded from samples of Aegle marmelos, Aesculus indica, Buchanania lanzan and Pinus gerardiana. In the case of Prunus amygdalus, only R. stolonifer was recorded. A significant variation in the pattern of mycoflora incidence was observed in terms of source and season. Fungal infestation in most of the substrates was found to be highest during the monsoon. Aflatoxins were the most common mycotoxins elaborated by different isolates of A. flavus obtained from A. marmelos, B. lanzan and P. gerardiana. The amount of aflatoxins produced by the toxigenic isolates of A. flavus was in the range of traces to 0.9–26.0 μg/ml in A. marmelos, 0.8–17.5 μg/ml in P. gerardiana and 0.65–13.2 μg/ml in B. lanzan. The percentage toxigenicity was comparatively lower in the isolates of other mycotoxigenic fungi. Aflatoxins were detected in almost all the samples analyzed for mycotoxin contamination. However, traces of zearalenone were detected in A. marmelos. The concentration of aflatoxin B1 was in the range of 0.13–0.75 μg/g in A. marmelos, 0.09–0.60 μg/g in P. gerardiana and 0.01–0.20 μg/g in B. lanzan. Mycotoxins were not detected in Aesculus indica and Prunus amygdalus.

5.
A complete set of candidate disease resistance (R) genes encoding nucleotide-binding sites (NBSs) was identified in the genome sequence of japonica rice (Oryza sativa L. var. Nipponbare). These putative R genes were characterized with respect to structural diversity, phylogenetic relationships and chromosomal distribution, and compared with those in Arabidopsis thaliana. We found 535 NBS-coding sequences, including 480 non-TIR (Toll/IL-1 receptor) NBS-LRR (Leucine Rich Repeat) genes. TIR NBS-LRR genes, which are common in A. thaliana, have not been identified in the rice genome. The number of non-TIR NBS-LRR genes in rice is 8.7 times higher than that in A. thaliana, and they account for about 1% of all predicted ORFs in the rice genome. Some 76% of the NBS genes were located in 44 gene clusters or in 57 tandem arrays, and 16 apparent gene duplications were detected in these regions. Phylogenetic analyses based on both NBS and N-terminal regions classified the genes into about 200 groups, but no deep clades were detected, in contrast to the two distinct clusters found in A. thaliana. The structural and genetic diversity that exists among NBS-LRR proteins in rice is remarkable, and suggests that diversifying selection has played an important role in the evolution of R genes in this agronomically important species. (Supplemental material is available online.) Communicated by R. Hagemann. The first three authors contributed equally to this work.

6.
Given two independent sequences of letters, we seek the probability distribution of the length of the longest matching word. This word can be in different positions in the two sequences and we consider both perfect and nearly perfect matching. We derive bounds and approximations for the probability and compare them with other bounds and approximations. The results can be applied to DNA sequences in molecular biology and generalized matching between two independent random sequences.
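The statistic studied in this abstract, the longest word occurring in both sequences at possibly different positions, can be computed for a given pair with a simple dynamic program. The sketch below is an illustration only: the function name is hypothetical, it handles the perfect-matching case alone (nearly perfect matching is not covered), and it says nothing about the probability bounds the paper derives.

```python
def longest_matching_word(a, b):
    """Return the longest word that occurs contiguously in both a and b.

    Illustrative helper; ties are resolved in favour of the earliest
    occurrence found while scanning a from left to right.
    """
    best_len, best_end = 0, 0
    # prev[j] = length of the common word ending at a[i-2] and b[j-1].
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the match that ended one letter earlier in both.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len, best_end = curr[j], i
        prev = curr
    return a[best_end - best_len:best_end]
```

Running this on many pairs of simulated random sequences would give an empirical distribution of the match length to set beside an analytical approximation.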

7.
In Pseudomonas aeruginosa, the products of the xcp genes are required for the secretion of exoproteins across the outer membrane. Despite structural conservation of the Xcp components, secretion of exoproteins via the Xcp pathway is generally not found in heterologous organisms. To study the specificity of this protein secretion pathway, the xcp genes of another fluorescent pseudomonad, the plant growth-promoting Pseudomonas putida strain WCS358, were cloned and characterized. Nucleotide sequence analysis revealed the presence of at least five genes, i.e., xcpP, Q, R, S, and T, with homology to xcp genes of P. aeruginosa. Unlike the genetic organization in P. aeruginosa, where the xcp cluster consists of two divergently transcribed operons, the xcp genes in P. putida are all oriented in the same direction, and probably comprise a single operon. Upstream of xcpP in P. putida, an additional open reading frame, with no homolog in P. aeruginosa, was identified, which possibly encodes a lipoprotein. Mutational inactivation of xcp genes in P. putida did not affect secretion, indicating that no proteins are secreted via the Xcp system under the growth conditions tested, and that an alternative secretion system is operative. To obtain some insight into the secretory pathway involved, the amino acid sequence of the N-terminus of the major extracellular protein was determined. The protein could be identified as flagellin. Mutations in the xcpQ and R genes of P. aeruginosa could not be complemented by introduction of the corresponding xcp genes of P. putida. However, expression of a hybrid XcpR protein, composed of the N-terminal one-third of P. aeruginosa XcpR and the C-terminal two-thirds of P. putida XcpR, did restore protein secretion in a P. aeruginosa xcpR mutant.

8.
This paper reports on the expression and posttranslational modifications of the catalytic subunits of pol α and pol δ from the fission yeast Schizosaccharomyces pombe. Okadaic acid treatment of S. pombe spheroplasts in amounts known to inhibit phosphatases 1 and 2A resulted in decreased proteolysis of both pol α and pol δ. Computer analysis of pol α and pol δ sequences confirmed the presence of consensus motifs for protein phosphorylation. Indirect immunofluorescence microscopy of S. pombe cells showed nuclear location of both proteins in wild-type cells. However, whereas cells transformed with a vector expressing pol α produced a clear increase of the nuclear signal, no increase was detectable in cells transformed with pol δ. This observation suggests the existence of a mechanism limiting the concentration of pol δ in the cell. Constitutive expression of S. pombe pol δ in E. coli was possible only with vectors containing truncated forms of its gene, indicating a toxic effect of pol δ on E. coli growth.

9.
A 6.5 kb DNA fragment containing the gene (thrC) encoding threonine synthase, the last enzyme of the threonine biosynthetic pathway, has been cloned from the DNA of Bacillus sp. ULM1 by complementation of Escherichia coli and Brevibacterium lactofermentum thrC auxotrophs. Complementation studies showed that the thrB gene (encoding homoserine kinase) is found downstream from the thrC gene, and analysis of nucleotide sequences indicated that the hom gene (encoding homoserine dehydrogenase) is located upstream of the thrC gene. The organization of this cluster of genes is similar to the Bacillus subtilis threonine operon (hom-thrC-thrB). A 1.9 kb BclI fragment from the Bacillus sp. ULM1 DNA insert that complemented thrC mutations both in E. coli and in corynebacteria was sequenced, and an ORF encoding a protein of 351 amino acids was found, corresponding to a protein of 37462 Da. The thrC gene showed a low G+C content (39.4%) and the encoded threonine synthase is very similar to the B. subtilis enzyme. Expression of the 1.9 kb BclI DNA fragment in E. coli minicells resulted in the formation of a 37 kDa protein. The upstream region of this gene shows promoter activity in E. coli but not in corynebacteria. A peptide sequence, including a lysine that is known to bind the pyridoxal phosphate cofactor, is conserved in all threonine synthase sequences and also in the threonine and serine dehydratase genes. Amino acid comparison of nine threonine synthases revealed evolutionary relationships between different groups of bacteria. Dedicated to Dr. J. Spížek on the occasion of his 60th birthday.

10.
Rhamnogalacturonase was purified from culture filtrate of Aspergillus aculeatus after growth in medium with sugar-beet pulp as carbon source. Purified protein was used to raise antibodies in mice, and with the antiserum obtained a gene coding for rhamnogalacturonase (rhgA) was isolated from a λ cDNA expression library. The cloned rhgA gene has an open reading frame of 1320 base pairs encoding a protein of 440 amino acids with a predicted molecular mass of 45 962 Da. The protein contains a potential signal peptidase cleavage site behind Gly-18 and three potential sites for N-glycosylation. Limited homology with A. niger polygalacturonase amino acid sequences is found. A genomic clone of rhgA was isolated from a recombinant phage λ genomic library. Comparison of the genomic and cDNA sequences revealed that the coding region of the gene is interrupted by three introns. Furthermore, amino acid sequences of four different peptides, derived from purified A. aculeatus rhamnogalacturonase, were also found in the deduced amino acid sequence of rhgA. A. aculeatus strains overexpressing rhamnogalacturonase were obtained by cotransformation using either the A. niger pyrA gene or the A. aculeatus pyrA gene as selection marker. For expression of rhamnogalacturonase in A. awamori, the A. awamori pyrA gene was used as selection marker. Degradation patterns of modified hairy regions, determined by HPLC, show the recombinant rhamnogalacturonase to be active, and the enzyme was found to have a positive effect in the apple hot-mash liquefaction process.

11.
This paper describes a rapid method of constructing homologous recombinant baculovirus in E. coli with PCR-amplified fragments. With this method, the traditional steps of constructing a transfer vector are omitted. The method is based on the phage λ Red system, which can promote recombination between homologous fragments longer than 36 bp. Taking HaSNPV as an example, this paper describes the rapid recombination process, using the chloramphenicol resistance gene (Cm^R) to replace orf135 in the HaSNPV genome. A pair of 60 bp primers was synthesized, in which 40 bp were homologous to each end of orf135 and the remaining 20 bp were homologous to each end of Cm^R. Using these primers, a linear fragment containing the complete Cm^R gene between 40 bp homologous arms of orf135 was generated by PCR, with the plasmid pKD3, which contains Cm^R, as the template. The linear fragment was transformed into E. coli containing the bacterial artificial chromosome of HaSNPV, and, with the help of a plasmid expressing λ recombinase, the recombinants in which the homologous replacement had taken place were selected by chloramphenicol resistance. This method greatly shortens the process of constructing recombinant baculovirus, since the process is performed in E. coli and does not require transfer vectors. It can be further used for gene replacement and gene deletion in other large viral genomes.

12.
The Nor-loci of polyploid wheats and their putative diploid progenitor species were assayed by probing isolated nuclear DNA with ribosomal DNA spacer sequences (spacer rDNA sequences, isolated by cloning) from the Nor-loci of genomes B (Triticum aestivum), G (T. timopheevi), B (syn. S, T. speltoides), A (T. monococcum) and V (Dasypyrum villosum). DNA samples for analysis were digested with the restriction endonuclease TaqI and assayed by DNA-DNA hybridization under standard (37°C) and high stringency (64°C) conditions. The assay procedure emphasized differences between the divergent spacer sequences of the polyploid species and allowed relative homologies to the respective sequences in diploid species to be established. The studies indicated that T. timopheevi and T. speltoides contain different sets of spacer rDNA sequences which were readily distinguishable and, in the case of T. timopheevi, assigned to Nor-loci on different chromosomes. This contrasts with the spacer rDNA sequences of the major Nor-loci on chromosomes 1B and 6B in T. aestivum, which were difficult to distinguish and were deduced to contain very similar sequences. Among the diploid progenitor species, only the spacer rDNA from T. speltoides shared close homology with polyploid wheat species. One Nor-locus in T. timopheevi (on chromosome 6G) did not show close homology with any of the rDNA spacer probes available. The data suggest that T. speltoides was the origin of some Nor-loci for both the T. timopheevi and T. turgidum lines of tetraploid wheats. The possibility that the 6G Nor-locus in T. timopheevi may have derived from an unknown diploid species by introgressive hybridization is discussed. The spacer rDNA sequence probe from T. monococcum shared good homology with some accessions of D. villosum and a line of T. dicoccoides; the implications of this finding for the evolution of present-day wheats are discussed.

13.
Two subtracted cDNA libraries of Dunaliella salina (Volvocales, Chlorophyceae) under different hyperosmotic shock conditions were constructed using the suppression subtractive hybridization (SSH) method. The mRNA isolated from algae grown without stress was used as a “driver”, and the mRNAs isolated from algae 16 h (short-term treatment) or 7 d (long-term treatment) after salt stress were used as “testers”. The differentially expressed cDNA fragments in D. salina under salt stress were identified by screening these 2 libraries. Two cDNA fragments, D27 and D114, were identified from clones pL27 and pL114 after the long-term treatment. Three cDNA fragments, D21, D39, and D88, were identified from clones pSh21, pSh39, and pSh88 after the short-term treatment. The homology analysis revealed that D27 was highly similar (91%) to subunit V of the PS I reaction center in Chlamydomonas reinhardtii. D21 was similar to fructose-1,6-diphosphate aldolase (78.4%). After searching GenBank with the sequences of D39, D88, and D114, no similar sequences were found. Northern analysis revealed that the expression levels of all 5 cDNAs were increased significantly after salt stress. This means that SSH can be used in cloning differentially expressed cDNAs in D. salina under salt stress. The expression of D27, D21, and D88 was de novo induced by salt stress, and the expression of D114 and D39 was increased from a relatively lower level; this indicates that all 5 cDNAs might exert an influence on the alga under hyperosmotic shock.

14.
The main difference between LTR retrotransposons and retroviruses is the presence of the envelope (env) gene in the latter, downstream of the pol gene. The env gene is involved in their infectious capacity. Here we report the presence of env-like sequences in the genome of Quercus suber (cork oak), one of the most economically important Portuguese species. These gene sequences were isolated through DNA amplification between RNaseH conserved motifs and the 3′ LTR, based on the structure of copia retrotransposons. Phylogenetic analysis revealed that almost all the clones isolated are clustered with Cyclops-2, a Ty3-gypsy element identified in Pisum sativum, except one clustered with gypsy and copia retroelements found in different species. This suggests the existence of a potential ancestral sequence of the env gene, prior to the separation of Ty3-gypsy and Ty1-copia retrotransposons. Additionally, the isolated env-like sequences showed 26–39% homology with env-like sequences characterized in viruses. The origin of env-like sequences in retrotransposons from host plant taxa is discussed.

15.
We present a new, practical algorithm to resolve the experimental data in restriction site analysis, which is a common technique for mapping DNA. Specifically, we assert that multiple digestions with a single restriction enzyme can provide sufficient information to identify the positions of the restriction sites with high probability. The motivation for the new approach comes from combinatorial results on the number of mutually homeometric sets in one dimension, where two sets of n points are homeometric if the multisets of n(n−1)/2 distances they determine are the same. Since experimental data contain errors, we propose algorithms for reconstructing sets from noisy interpoint distances, including the possibility of missing fragments. We analyse the performance of these algorithms under a reasonable probability distribution, establishing a relative error limit of r = Θ(1/n^2) beyond which our technique becomes infeasible. Through simulations, we establish that our technique is robust enough to reconstruct data with relative errors of up to 7.0% in the measured fragment lengths for typical problems, which appears sufficient for certain biological applications.
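The definition of homeometric sets given above translates directly into a comparison of distance multisets. The sketch below is only an illustration of that definition, with hypothetical function names; it assumes exact integer positions and does not attempt the noisy reconstruction that the paper addresses.

```python
from collections import Counter
from itertools import combinations


def distance_multiset(points):
    """Multiset of the n(n-1)/2 pairwise distances among the given points."""
    return Counter(abs(p - q) for p, q in combinations(points, 2))


def are_homeometric(x, y):
    """True if the two point sets determine the same multiset of distances."""
    return distance_multiset(x) == distance_multiset(y)
```

For example, `are_homeometric([0, 1, 4, 10, 12, 17], [0, 1, 8, 11, 13, 17])` holds even though neither set is a shift or mirror image of the other, which is why interpoint distances alone do not always determine site positions uniquely.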

16.
Summary. We have analyzed and compared the amino acid sequences of the type 4 fimbrial subunits from Pseudomonas aeruginosa, Moraxella bovis, M. nonliquefaciens, Bacteroides nodosus, Neisseria gonorrhoeae, and N. meningitidis. We propose a consensus sequence for the highly conserved amino-terminal regions of these proteins. In the variable regions, a domain corresponding to an epitope common to N. gonorrhoeae and N. meningitidis fimbriae is conserved, both in sequence and in environment, in fimbrial subunits from B. nodosus. The subunits from M. bovis and P. aeruginosa do not show any homologies to this sequence. In all of the subunits, the carboxy-terminal half of the molecule consists of a series of fairly hydrophobic domains. The last three domains, two of which include the cysteines of the disulfide bridge in N. gonorrhoeae, P. aeruginosa, and M. bovis, are more or less conserved in sequence in all of the proteins, including that of B. nodosus. We propose that these conserved hydrophobic regions, which have the potential to form a series of beta-sheets, form a structural framework around which more variable hydrophilic sequences determining immunological profile are arranged. The evolutionary relationships of the contemporary proteins and the distribution of type 4 fimbriae are also discussed.

17.
Reassociation kinetics of Daucus carota and Petroselinum crispum (Apiaceae), and Datura innoxia (Solanaceae) are presented. Hybridization of ³H-labelled DNA of two carrot cultivars indicates strong qualitative homologies of DNA sequences; nevertheless, certain quantitative differences in some Cot regions seem to exist. However, homologous sequences of Daucus DNA with DNA of Datura and, surprisingly, even with DNA of Petroselinum are very restricted: between 8% in the repeated regions and ca. 7–9% in the unique regions.

18.
In sweet cherry (Prunus avium L.), the S4′ haplotype, characterized by a self-incompatibility (SI) defect in pollen, is self-compatible and is derived from the self-incompatible S4 haplotype by x-ray mutagenesis. SFBs (S haplotype-specific F-box protein genes) have been found to associate with the pollen determinant of SI. This report identifies the SFB4′ of the self-compatible S4′ haplotype. Alignment of the sequences of SFB4′ and SFB4 by the BLAST program revealed a 4-bp deletion, TTTA, in SFB4′. The sequence polymorphism generated by the TTTA deletion in SFB4′ was exploited to develop a simple molecular marker specific for detecting the S4′ but not the S4 haplotype. The marker specific to the S4′ haplotype can be visualized directly on an agarose gel, so it can be immediately applied to a marker-assisted cherry-breeding program. Thus, this work provides a practical molecular marker for cherry breeding. An erratum to this article is available.

19.
Summary. The presence of the ‘group specific protein antigen’ in T. zuelzerae is demonstrated. From the results of complement fixation and gel diffusion tests it can be concluded that this organism and the Reiter strain of T. pallidum differ in their polysaccharide fractions.

20.
Plasmids without an origin of replication, but bearing the URA3 gene of Saccharomyces cerevisiae as a selective marker for transformation, are shown to replicate autonomously in Hansenula polymorpha, indicating that parts of the S. cerevisiae URA3 gene can fulfil an autonomous replication and stabilization function in H. polymorpha. Such plasmids, replicated in low copy number in monomeric conformation, could be rescued in E. coli, and showed a low mitotic stability under selective and non-selective conditions. Selective propagation of such transformants, however, led to the integration of plasmid sequences into the H. polymorpha genome. The integration event usually occurred in high copy number (approx. 30–50) at a single non-homologous site of the genome. The plasmid sequences were found to be present in tandem array and stable under non-selective conditions. In contrast, the use of the homologous URA3 gene under similar conditions led to low-copy-number transformants.
