首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
We have analyzed a sequence of approximately 70 base pairs (bp) that shows a high degree of similarity to sequences present in the non-coding regions of a number of human and other mammalian genes. The sequence was discovered in a fragment of human genomic DNA adjacent to an integrated hepatitis B virus genome in cells derived from human hepatocellular carcinoma tissue. When one of the viral flanking sequences was compared to nucleotide sequences in GenBank, more than thirty human genes were identified that contained a similar sequence in their non-coding regions. The sequence element was usually found once or twice in a gene, either in an intron or in the 5' or 3' flanking regions. It did not share any similarities with known short interspersed nucleotide elements (SINEs) or presently known gene regulatory elements. This element was highly conserved at the same position within the corresponding human and mouse genes for myoglobin and N-myc, indicating evolutionary conservation and possible functional importance. Preliminary DNase I footprinting data suggested that the element or its adjacent sequences may bind nuclear factors to generate specific DNase I hypersensitive sites. The size, structure, and evolutionary conservation of this sequence indicates that it is distinct from other types of short interspersed repetitive elements. It is possible that the element may have a cis-acting functional role in the genome.  相似文献   

3.
4.
5.
6.
7.
8.
In our ongoing project to deduce the nucleotide sequence of Arabidopsis thaliana chromosome 5, non-redundant P1 and TAC clones have been sequenced on the basis of the fine physical map, and as of January, 2000, the sequences of 16.6 Mb representing approximately 60% of chromosome 5 have been accumulated and released at our web site. Along with the sequence determination, structural features of the sequenced regions have been analyzed by applying a variety of computer programs, and we already predicted a total of 2697 potential protein coding genes in the 11,166,130 bp regions, which are covered by 159 P1 and TAC clones. In this paper, we describe the structural features of the 3,076,755 bp regions covered by newly analyzed 60 P1 and TAC clones. A total of 715 potential protein coding genes were identified, giving an average density of the genes identified of 1 gene per 4001 bp. Introns were observed in 80% of the genes, and the average number per gene and the average length of the introns were 4.5 and 147 bp, respectively. These sequence features are nearly identical to those in our latest report in which the data were compiled based on a new standard of gene assignment including the computer-predicted hypothetical genes. The regions also contained 12 tRNA genes when searched by similarity to reported tRNA genes and the tRNA scan-SE program. The sequence data and information on the potential genes are available through the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/kaos/.  相似文献   

9.
The invertible P-DNA segment in the chromosome of Escherichia coli.   总被引:14,自引:2,他引:12       下载免费PDF全文
In the chromosome of many strains of Escherichia coli K12 the excisable element e14 is found, which contains an invertible DNA region. This invertible P region, and the gene responsible for the inversion (pin) were cloned, together with other e14 sequences. The element e14 contains a gene which kills the host cell. This can be repressed by a function also coded by e14. The kil and repressor genes as well as the attachment site of the element were mapped in different regions of the element. The invertible segment and pin gene were sequenced. The invertible segment is 1794 bp long, and contains one large internal open reading frame of 879 bp and reading frames which overlap the end pont of the invertible segment. Although pin highly homologous to gin of phage Mu, neither the genetic organization of the P segment nor the sequence of the putative proteins resemble the invertible G segment of phage Mu (which codes for genes involved in tail fiber assembly). The complete DNA sequences of both invertible segments were screened for homology. No resemblance was found. The P segment is flanked by inverted repeat sequences of 16 bp. Comparison of these with related inversion systems points out that the recombination site maps probably within a 2-bp region. This cross-over site is contained within a short palindromic sequence (AAACC AA GGTTT) which is more or less conserved in the recombination sites of all related DNA invertases.  相似文献   

10.
The complex gene regulatory networks governed by growth factor signaling are still poorly understood. In order to accelerate the rate of progress in uncovering these networks, we explored the usefulness of interspecies sequence comparison (phylogenetic footprinting) to identify conserved growth factor response elements. The promoter regions of two direct target genes of Bone Morphogenetic Protein (BMP) signaling in Xenopus, Xvent2 and XId3, were compared with the corresponding human and/or mouse counterparts to identify conserved sequences. A comparison between the Xenopus and human Vent2 promoter sequences revealed a highly conserved 21 bp sequence that overlaps the previously reported Xvent2 BMP response element (BRE). Reporter gene assays using Xenopus animal pole ectodermal explants (animal caps) revealed that this conserved 21 bp BRE is both necessary and sufficient for BMP responsiveness. We combine the same phylogenetic footprinting approach with luciferase assays to identify a highly conserved 49 bp BMP responsive region in the Xenopus Id3 promoter. GFP reporters containing multimers of either the Xvent2 or XId3 BREs appear to recapitulate endogenous BMP signaling activity in transgenic Xenopus embryos. Comparison of the Xvent2 and the XId3 BRE revealed core sequence features that are both necessary and sufficient for BMP responsiveness: a Smad binding element (SBE) and a GC-rich element resembling an OAZ binding site. Based on these findings, we have implemented genome scanning to identify over 100 additional putative target genes containing 2 or more BRE-like sequences which are conserved between human and mouse. RT-PCR and in situ analyses revealed that this in silico approach can effectively be used to identify potential BMP target genes.  相似文献   

11.
A repetitive element from the genome of Leptospira interrogans serovar hardjo type hardjo-bovis ('L. hardjo-bovis') was identified, cloned and sequenced. Similar sequences were shown by hybridization to be encoded by a further eight of 32 other leptospiral serovars tested. An undefined number of repetitive elements were located in the L. hardjo-bovis genome; sequence degeneracy of the elements was observed and no significant open reading frames were identified within the AT-rich (60%) 1467 bp repetitive element. The termini encoded a GC-rich 8 bp repeat motif and two variants showed rearrangements centred on these motifs. The nucleotide sequences of the chromosomal regions flanking the repetitive elements were determined but showed no similarities, with one exception which had a GAAC repeat directly adjacent to both termini. Similar hybridization patterns were shown by Southern transfers of L. hardjo-bovis total genomic digests probed with the repetitive element. Oligonucleotide primer pairs designed from sequences internal to the repetitive element and adjacent chromosomal regions were used in polymerase chain reaction experiments. With one primer pair all L. hardjo-bovis isolates, but no other serovar, gave identical amplified products. Evidence that the repetitive element may have derived from an acquired insertion sequence that is now inactive and chromosomally fixed is discussed.  相似文献   

12.
13.
Sixty-five TAC (transformation-competent artificial chromosomes) clones were selected from a genomic library of Lotus japonicus accession MG-20 based on the sequence information of expressed sequences tags (ESTs), cDNA and gene information, and their nucleotide sequences were determined. The average insert size of the TAC clone was approximately 100 kb, and the total length of the sequenced regions in this study is 6,556,100 bp. Together with the nucleotide sequences of 56 TAC clones previously reported, the regions sequenced so far total 12,029,295 bp. By comparison with the sequences in protein and EST databases and by analysis with computer programs for gene modeling, a total of 711 potential protein-encoding genes with known or predicted functions, 239 gene segments and 90 pseudogenes were identified in the newly sequenced regions. The average gene density assigned so far was 1 gene/9140 bp. The average length of the assigned genes was 2.6 kb, which is considerably larger than that assigned in the Arabidopsis thaliana genome (1.9 kb for 6451 genes). Introns were identified in approximately 73% of the potential genes, and the average number and length of the introns per gene were 3.4 and 377 bp, respectively. Simple sequence repeat length polymorphism (SSLP) or derived cleaved amplified polymorphic sequence (dCAPS) markers were generated based on the nucleotide sequences of the genomic clones obtained, and each clone was mapped onto the linkage map using the F2 mapping population derived from a cross of two accessions of L. japonicus, Gifu B-129 and Miyakojima MG-20. The sequence data, gene information and mapping information are available through the World Wide Web at http://www.kazusa.or.jp/lotus/.  相似文献   

14.
The nucleotide sequences of the 5'-flanking regions of the duplicated Amy genes in eight sibling species belonging to the melanogaster species subgroup are analyzed. In Drosophila melanogaster, a region of about 450 bp immediately upstream of the translation initiation site of the two paralogous genes (the proximal and distal genes) has sequence similarities. However, we could not detect any significant sequence similarity in the region more upstream than -450. This result indicates that the coding regions of the ancestral Amy gene were duplicated together with 450 bp of the 5'-flanking region as one unit. Multiple alignment of these 450-bp sequences in the proximal and distal genes of all eight species revealed a mosaic pattern of highly conserved and divergent regions. The conserved regions included almost all the putative regulatory elements identified in previous analyses of the sequences. A phylogenetic analysis of the aligned sequences shows that these 450-bp sequences are clustered into the proximal and the distal groups. As a whole, the divergence between groups in this region is very large in contrast to that in the coding regions. Based on the divergence between groups, the 450-bp region is divided into two subregions. We found that the ratios of the divergence between groups to that within groups differ in the two subregions. From these observations, we discuss a possibility of positive selection acting on the subregion immediately upstream of the Amy coding region to cause divergence of regulatory elements of the paralogous genes.   相似文献   

15.
Approximately 1200 nucleotides of sequence data from the promoter and 5'-flanking regions of each of three pea (Pisum sativum L.) legumin genes (legA, legB and legC) are presented. The promoter regions of all three genes were found to be identical including the "TATA box", and "CAAT box', and sequences showing homology to the SV40 enhancers. The legA sequence begins to diverge from the others about 300bp from the start codon, whereas the other two genes remain identical for another 550bp. The regions of partial homology exhibit deletions or insertions and some short, comparatively well conserved sequences. The significance of these features is discussed in terms of evolutionary mechanisms and their possible functional roles. The legC gene contains a region that may potentially form either of two mutually exclusive stem-loop structures, one of which has a stem 42bp long, which suggests that it could be fairly stable. We suggest that a mechanism of switching between such alternative structures may play some role in gene control or may represent the insertion of a transposable element.  相似文献   

16.
17.
Two sequences of major histocompatibility complex (MHC) regions in the domestic cat, 2.976 and 0.362 Mbps, which were separated by an ancient chromosome break (55-80 MYA) and followed by a chromosomal inversion were annotated in detail. Gene annotation of this MHC was completed and identified 183 possible coding regions, 147 human homologues, possible functional genes and 36 pseudo/unidentified genes) by GENSCAN and BLASTN, BLASTP RepeatMasker programs. The first region spans 2.976 Mbp sequence, which encodes six classical class II antigens (three DRA and three DRB antigens) lacking the functional DP, DQ regions, nine antigen processing molecules (DOA/DOB, DMA/DMB, TAPASIN, and LMP2/LMP7,TAP1/TAP2), 52 class III genes, nineteen class I genes/gene fragments (FLAI-A to FLAI-S). Three class I genes (FLAI-H, I-K, I-E) may encode functional classical class I antigens based on deduced amino acid sequence and promoter structure. The second region spans 0.362 Mbp sequence encoding no class I genes and 18 cross-species conserved genes, excluding class I, II and their functionally related/associated genes, namely framework genes, including three olfactory receptor genes. One previously identified feline endogenous retrovirus, a baboon retrovirus derived sequence (ECE1) and two new endogenous retrovirus sequences, similar to brown bat endogenous retrovirus (FERVmlu1, FERVmlu2) were found within a 140 Kbp interval in the middle of class I region. MHC SNPs were examined based on comparisons of this BAC sequence and MHC homozygous 1.9x WGS sequences and found that 11,654 SNPs in 2.84 Mbp (0.00411 SNP per bp), which is 2.4 times higher rate than average heterozygous region in the WGS (0.0017 SNP per bp genome), and slightly higher than the SNP rate observed in human MHC (0.00337 SNP per bp).  相似文献   

18.
Tnr1 (235 bp long) is a transposable element in rice. Polymerase chain reactions (PCRs) done with a primer(s) that hybridizes to terminal inverted repeat sequences (TIRs) of Tnr1 detected new Tnr1 members with one or two insertions in rice genomes. Six identified insertion sequences (Tnr4, Tnr5, Tnr11, Tnr12, Tnr13 and RIRE9) did not have extensive homology to known transposable elements, rather they had structural features characteristic of transposable elements. Tnr4 (1767 bp long) had imperfect 64-bp TIRs and appeared to generate duplication of a 9-bp sequence at the target site. However, the TIR sequences were not homologous to those of known transposable elements, indicative that Tnr4 is a new transposable element. Tnr5 (209 bp long) had imperfect 46-bp TIRs and appeared to generate duplication of sequence TTA like that of some elements of the Tourist family. Tnr11 (811 bp long) had 73-bp TIRs with significant homology to those of Tnr1 and Stowaway and appeared to generate duplication of sequence TA, indicative that Tnr11 is a transposable element of the Tnr1/Stowaway family. Tnr12 (2426 bp long) carried perfect 9-bp TIRs, which began with 5'-CACTA- -3' from both ends and appeared to generate duplication of a 3-bp target sequence, indicative that Tnr12 is a transposable element of the En/Spm family. Tnr13 (347 bp long) had 31-bp TIRs and appeared to generate duplication of an 8-bp target sequence. Two sequences, one the transposon-like element Crackle, had partial homology in the Tnr13 ends. All five insertions appear to be defective elements derived from autonomous ones encoding the transposase gene. All had characteristic tandem repeat sequences which may be recognized by transposase. The sixth insertion sequence, named RIRE9 (3852 bp long), which begins with 5'-TG- -3' and ends with 5'- -CA-3', appeared to generate duplication of a 5-bp target sequence. These and other structural features indicate that this insertion is a solo LTR (long terminal repeat) of a retrotransposon. The transposable elements described above could be identified as insertions into Tnr1, which do not deleteriously affect the growth of rice cells.  相似文献   

19.
20.
Estrogen induces the expression of three vitellogenin genes in chicken hepatocytes. To survey the vitellogenin III (VTGIII) gene region for possible distal regulatory sequences, we identified tissue-specific hypersensitive (HS) sites within a 45 kb chromatin region spanning this gene. Five constitutive HS sites were found to mark the VTGIII gene region in hormone-naive hepatocytes. Strikingly, the constitutive HS site located 5.5 kb upstream of the VTGIII gene and a previously identified HS site located within the coordinately regulated VTGII gene mapped to nearly identical copies of a 72 bp sequence. Moreover, it would appear that there has been evolutionary pressure to retain specifically this 72 bp of VTGII-like sequence near the VTGIII gene subsequent to the VTGIII and VTGII genes becoming unlinked approximately 16 Myr ago. Two additional sets of HS sites were induced in the VTGIII gene region in response to estrogen. One set mapped immediately upstream of the gene in the vicinity of what we show to be a functional estrogen response element (ERE). The other induced HS site mapped 7.5 kb upstream of the gene. This far-upstream region was sequenced and was found to contain two imperfect ERE consensus sequences spaced 88 bp apart. In transient expression assays neither of these individual imperfect ERE sequences was functional, but a fragment spanning both sequences behaved as a strong ERE. In contrast to this synergism between imperfect ERE sequences, the presence of an NF-1 binding site 23 bp away from the more distal imperfect ERE sequence was not sufficient to render the latter a functional ERE in our assays.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号