Similar literature
 20 similar articles found; search time 31 ms
1.
2.
The nucleotide sequence running from the genetic left end of bacteriophage T7 DNA to within the coding sequence of gene 4 is given, except for the internal coding sequence for the gene 1 protein, which has been determined elsewhere. The sequence presented contains nucleotides 1 to 3342 and 5654 to 12,100 of the approximately 40,000 base-pairs of T7 DNA. This sequence includes: the three strong early promoters and the termination site for Escherichia coli RNA polymerase; eight promoter sites for T7 RNA polymerase; six RNAase III cleavage sites; the primary origin of replication of T7 DNA; the complete coding sequences for 13 previously known T7 proteins, including the anti-restriction protein, protein kinase, DNA ligase, the gene 2 inhibitor of E. coli RNA polymerase, single-strand DNA binding protein, the gene 3 endonuclease, and lysozyme (which is actually an N-acetylmuramyl-L-alanine amidase); the complete coding sequences for eight potential new T7-coded proteins; and two apparently independent initiation sites that produce overlapping polypeptide chains of gene 4 primase. More than 86% of the first 12,100 base-pairs of T7 DNA appear to be devoted to specifying amino acid sequences for T7 proteins, and the arrangement of coding sequences and other genetic elements is very efficient. There is little overlap between coding sequences for different proteins, but junctions between adjacent coding sequences are typically close, the termination codon for one protein often overlapping the initiation codon for the next. For almost half of the potential T7 proteins, the sequence in the messenger RNA that can interact with 16S ribosomal RNA in initiation of protein synthesis is part of the coding sequence for the preceding protein. The longest non-coding region, about 900 base-pairs, is at the left end of the DNA. The right half of this region contains the strong early promoters for E. coli RNA polymerase and the first RNAase III cleavage site.
The left end contains the terminal repetition (nucleotides 1 to 160), followed by a striking array of repeated sequences (nucleotides 175 to 340) that might have some role in packaging the DNA into phage particles, and an A·T-rich region (nucleotides 356 to 492) that contains a promoter for T7 RNA polymerase, and which might function as a replication origin.

3.
Adipose differentiation-related protein (ADFP) is important for regulation of lipid metabolism and insulin secretion in beta-cells. In this study, we investigated polymorphisms within the caprine ADFP gene and determined its relationship with production traits. As there was no sequence information available for the caprine ADFP gene, we generated DNA sequence data and examined the genomic organisation. The caprine ADFP gene is organised into 7 exons and 6 introns that span approximately 8.7 kbp and is transcribed into mRNA containing 1353 bp of sequence coding for a protein of 450 amino acids. The protein sequences showed substantial similarity (71–99%) to orthologues from cattle, human and mouse. We identified polymorphisms in the sequences using DNA sequencing, PCR-RFLP and forced PCR-RFLP methods. Seven single nucleotide polymorphisms (SNPs) were identified using samples from 4 different goat populations consisting of 1408 healthy and unrelated individuals. Six haplotypes involving the 7 SNPs from the caprine ADFP gene were identified and their effects on production traits were analysed. Haplotype 6 had the highest haplotype frequency and was highly significantly associated with chest circumference and milk yield in the analysed populations. The results of this study suggest that the ADFP gene is a strong candidate gene affecting production traits and may be used for marker-assisted selection and management in Chinese dairy goat breeding programmes.

4.
5.
6.
The Ti plasmid of the Agrobacterium vitis nopaline-type strain AB4 was subcloned and mapped. Several regions of the 157 kb Ti plasmid are similar or identical to parts of the A. vitis octopine/cucumopine (o/c)-type Ti plasmids, and other regions are homologous to the nopaline-type Ti plasmid pTiC58. The T-DNA of pTiAB4 is a chimaeric structure of recent origin: the left part is 99.2% homologous to the left part of the TA-DNA of the o/c-type Ti plasmids, while the right part is 97.1% homologous to the right part of an unusual nopaline T-DNA recently identified in strain 82.139, a biotype II strain from wild cherry. The 3′ non-coding regions of the ipt genes from pTiAB4 and pTi82.139 are different from those of other ipt genes and contain a 62 bp fragment derived from the coding sequence of an ipt gene of unknown origin. A comparison of different ipt gene sequences indicates that the corresponding 62 bp sequence within the coding region of the AB4 ipt gene has been modified during the course of its evolution, apparently by sequence transfer from the 62 bp sequence in the 3′ non-coding region. In pTi82.139 the original coding region of the ipt gene has remained largely unmodified. The pTiAB4 6b gene differs from its pTi82.139 counterpart by the lack of a 12 bp repeat in the 3′ part of the coding sequence. This leads to the loss of four glutamic acid residues from a series of ten. In spite of these differences, the ipt and 6b genes of pTiAB4 are functional. Our results provide new insight into the evolution of Agrobacterium Ti plasmids and confirm the remarkable plasticity of these genetic elements. Possible implications for the study of bacterial phylogeny are discussed.

7.
The major heat shock protein of 70,000 Mr in Drosophila melanogaster is encoded by two variant gene types located, respectively, at the chromosomal sites 87A7 and 87C1. We present the DNA sequence of a complete hsp70 gene of the 87A7 type and of the adjacent regions from both variants, extending to 1.2 × 10³ bases upstream from the start of the messenger coding region. We find an untranslated region of 250 nucleotides at the 5′ end of the messenger coding sequence in both variants. There is only one open reading frame which allows coding of a 70,000 Mr protein within the 87A7 variant, as found for an 87C1 variant (Ingolia et al., 1980). We observe 4.2% nucleotide divergence between these two variants with complete conservation of the reading frame. There is a conserved sequence of 355 nucleotides in front of each hsp70 gene, which is 85% homologous between the two variants. The presence of the same sequence element in γ, in front of the αβ heat shock genes (R. W. Hackett & J. T. Lis, personal communication) suggests that this element contains the regulatory signals for the coordinate expression of both the hsp70 and the αβ heat shock genes. Finally we find a very A + T-rich sequence of 150 base-pairs which is highly conserved (91.8%) 0.6 × 10³ bases upstream from two hsp70 gene variants.

8.
Actinomyces naeslundii and Actinomyces oris are members of the oral biofilm. Their identification using 16S rRNA sequencing is problematic and better achieved by comparison of metG partial sequences. A. oris is more abundant and more frequently isolated than A. naeslundii. We used a multi-locus sequence typing approach to investigate the genotypic diversity of these species and assigned A. naeslundii (n = 37) and A. oris (n = 68) isolates to 32 and 68 sequence types (ST), respectively. Neighbor-joining and ClonalFrame dendrograms derived from the concatenated partial sequences of 7 house-keeping genes identified at least 4 significant subclusters within A. oris and 3 within A. naeslundii. The strain collection we had investigated was an under-representation of the total population since at least 3 STs composed of single strains may represent discrete clusters of strains not well represented in the collection. The integrity of these subclusters was supported by the sequence analysis of fimP and fimA, genes coding for the type 1 and 2 fimbriae, respectively. An A. naeslundii subcluster was identified with both fimA and fimP genes and these strains were able to bind to MUC7 and statherin, while all other A. naeslundii strains possessed only fimA and did not bind to statherin. An A. oris subcluster harboured a fimA gene similar to that of Actinomyces odontolyticus but no detectable fimP, and failed to bind significantly to either MUC7 or statherin. These data are evidence of extensive genotypic and phenotypic diversity within the species A. oris and A. naeslundii, but the status of the subclusters identified here will require genome comparisons before their phylogenic position can be unequivocally established.

9.
We have isolated and sequenced two maize genomic clones that are homologous to the Drosophila hsp70 gene. One of the maize hsp70 clones contains the entire hsp70 coding region and 81 nucleotides of the 5' nontranslated sequence. The predicted amino acid sequence for this maize protein is 68% homologous to the hsp70 of Drosophila. The second maize hsp70 clone contains only part of the coding sequence and 1.1 kb of the 5' flanking sequence. This 5' flanking sequence contains two sequences homologous to the consensus heat-shock-element sequence. Both maize genes are thermally inducible and each contains an intron in the same position as that of the heat-shock-cognate gene, hsc1, of Drosophila. The presence of an intron in the maize genes is a distinguishing feature in that no other thermally inducible hsp70 genes described to date contain an intron. We have constructed a hybrid hsp70 gene containing the entire hsp70 coding sequence with an intron, and 1.1 kb of the 5' flanking sequence. We demonstrate that this hybrid gene is thermally inducible in a transgenic petunia plant and that the gene is expressed from its own promoter.

10.
A transposable element has been isolated from the industrially important fungus Aspergillus niger (strain N402). The element was identified as an insertion sequence within the coding region of the nitrate reductase gene. It had inserted at a TA site and appeared to have duplicated the target site upon insertion. The isolated element was found to be 4798 bp in length and contained 37-bp inverted, imperfect, terminal repeats (ITRs). The sequence of the central region of the element revealed an open reading frame (designated ORF1) which showed similarity, at the amino acid level, to the transposase of the Tc1/mariner class of DNA transposons. Another sequence within the central region of the element showed similarity to the 3′ coding and downstream untranslated region of the amyA gene of A. niger. Sequence homology and structural features indicate that this element, which has been named Ant1 (A. niger transposon 1), is related to the Tc1/mariner group of DNA transposons. Ant1 is apparently present as a single copy in strain N402 of A. niger.

11.
The single gene Le1, coding for soybean seed lectin, was compared to le1, a naturally occurring mutant allele containing a 3.4 kb insertion within its coding region. Le1 is devoid of introns and produces a 1.0 kb mRNA. It codes for a signal sequence of 32 amino acids and a mature protein of 253 amino acids. With the exception of six single-base substitutions, the coding and flanking sequences in le1 are identical with those in the uninterrupted gene. The insertion termini are imperfect inverted repeats flanked by a 3 bp duplication of lectin target DNA. Inverted repeats within the lectin gene are located symmetrically with respect to the insertion site and are homologous to a region of the insertion termini. These molecular traits conform with the structural aspects of transposable elements in other organisms and imply some degree of site specificity.

12.
The nucleotide sequences of the entire multigene family, comprising six genes, that encodes the Rubisco small subunit (rbcS) in Mesembryanthemum crystallinum (common ice plant) were determined. Five of the genes are arranged in a tandem array spanning 20 kb, while the sixth gene is not closely linked to this array. The mature small subunit coding regions are highly conserved and encode four distinct polypeptides of equal lengths with up to five amino acid differences distinguishing individual genes. The transit peptide coding regions are more divergent in both amino acid sequence and length, encoding five distinct peptide sequences that range from 55 to 61 amino acids in length. Each of the genes has two introns located at conserved sites within the mature peptide-coding regions. The first introns are diverse in sequence and length, ranging from 122 bp to 1092 bp. Five of the six second introns are highly conserved in sequence and length. Two genes, rbcS-4 and rbcS-5, are identical at the nucleotide level starting from 121 bp upstream of the ATG initiation codon to 9 bp downstream of the stop codon, including the sequences of both introns, indicating recent gene duplication and/or gene conversion. Functionally important regulatory elements identified in rbcS promoters of other species are absent from the upstream regions of all but one of the ice plant rbcS genes. Relative expression levels were determined for the rbcS genes and indicate that they are differentially expressed in leaves.

13.
Next generation sequencing technologies have accelerated the rate at which whole genome sequencing (WGS) data is acquired. The sheer volume of data generated by WGS requires computational annotation to define potential coding regions and chromosomal features. The accuracy of genomic annotation is thus limited by the power of the computational algorithm and the sequence coverage provided by the raw data. Sequencing of the New Zealand White (NZW) rabbit has been performed to a 7× depth of coverage, leaving large gaps in coverage and potential errors within the draft assembly. In the present study, we have resequenced the collagen type I, alpha 1 (Col1A1) gene of Oryctolagus cuniculus (n = 8). We have characterized the full length cDNA, identified splicing errors within the reference sequence, and identified single nucleotide polymorphisms within the gene. These data underscore the need for a higher resolution assembly of the rabbit genome to advance research in this important large animal model.

14.
A pseudogene, ψnad7, which has significant sequence similarity (66.7% amino acid identity) with the bovine nuclear gene for a 49 kDa subunit of the NADH dehydrogenase (NADH:ubiquinone oxidoreductase, EC 1.6.99.3), has been identified on the mitochondrial genome of the liverwort Marchantia polymorpha. The predicted coding region, which includes six termination codons, is actively transcribed into RNA molecules of 16 and 9.6 kb in length, but RNA splicing products were not detected in the liverwort mitochondria. Genomic DNA blot analysis and RNA blot analysis using poly(A)+ RNA suggest that a structurally related nuclear gene encodes the mitochondrial ND7 polypeptide. These results imply that this ψnad7 is a relic of a gene transfer event from the mitochondrial genome into the nuclear genome during mitochondrial evolution in M. polymorpha.

15.
Ghd7 is an important rice gene that has a major effect on several agronomic traits, including yield. To reveal the origin of Ghd7 and sequence evolution of this locus, we performed a comparative sequence analysis of the Ghd7 orthologous regions from ten diploid Oryza species, Brachypodium distachyon, sorghum and maize. Sequence analysis demonstrated high gene collinearity across the genus Oryza and a disruption of collinearity among non-Oryza species. In particular, Ghd7 was not present in orthologous positions except in Oryza species. The Ghd7 regions were found to have low gene densities and high contents of repetitive elements, and the sizes of orthologous regions varied tremendously. The large transposable element contents resulted in a high frequency of pseudogenization and gene movement events surrounding the Ghd7 loci. Annotation information and cytological experiments have indicated that Ghd7 is a heterochromatic gene. Ghd7 orthologs were identified in B. distachyon, sorghum and maize by phylogenetic analysis; however, the positions of orthologous genes differed dramatically as a consequence of gene movements in grasses. Rather, we identified sequence remnants of gene movement of Ghd7 mediated by illegitimate recombination in the B. distachyon genome.

16.
17.
18.
The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and, because of its stable persistence through generations, it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in a few metazoan organisms. More frequently, it has been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of the 18S rRNA gene from a single reniform nematode (RN), Rotylenchulus reniformis, labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that the RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN, and documents the specific base variation between the two variants, and hypothesizes on simultaneous co-existence of these two variants for this gene.

19.
Gene, 1998, 207(1): 25-32
The sequence of the chicken interferon-γ (ifn-γ) gene was determined, one of the first non-mammalian cytokine gene structures to be elucidated. Initial genomic clones were amplified from chicken genomic DNA and were used to isolate a cosmid clone covering the entire gene for sequencing. The exon:intron structure of chicken ifn-γ is very similar to those of its mammalian homologues, with the exception of the third intron, which is markedly shorter in the chicken. The first exon contains the 5′ UTR, the signal sequence and the first 22 aa of the mature protein. The remainder of the coding region lies in exons 2–4. Exon 4 also encodes the stop codon and the 3′ UTR, including two possible polyadenylation signals. A number of potential regulatory sequences similar to those found in mammals have been identified, in the promoter, in each intron and in the 3′ UTR. In the promoter, these include the TATAATA- and CCAT-boxes, a consensus GATA motif in the reverse orientation and a potential NF-κB binding site. Other regulatory elements identified in the promoters of mammalian ifn-γ genes are absent. Internal to the gene structure, regulatory sequences identified include elements found in the DNase I hypersensitivity region of the first intron of the human ifn-γ gene and several potential NF-κB binding sites. The 3′ UTR contains an AT-rich sequence, including nine repeats of the 'instability' motif ATTTA. As in mammals, chicken ifn-γ is a single copy gene. The gene is highly conserved, with no polymorphisms yet identified using either RFLP or SSCP in the coding region. However, promoter sequence polymorphisms between different inbred lines of chickens have been identified, with possible links to disease resistance.

20.