Similar articles
 20 similar articles found
1.
The complete sequence of the Bacillus phage phi 29 right early region   Total citations: 10, self-citations: 0, citations by others: 10
K J Garvey  H Yoshikawa  J Ito 《Gene》1985,40(2-3):301-309

2.
3.
4.

Background

Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp.

Results

We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis of the mitochondrial genomes of pepper and tobacco, both members of the Solanaceae, revealed extensive DNA rearrangements and poor conservation of non-coding DNA. Between the pepper lines, the FS4401 and Jeju mitochondrial DNAs contained the same complement of protein-coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, a large number of small sequence segments that were absent from, or found at different locations in, the Jeju mitochondrial genome were combined. The incorporation of repeats and the overlap of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in the evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes.

Conclusions

Although a large portion of the sequence context was shared by the mitochondrial genomes of CMS and male-fertile pepper lines, extensive genome rearrangements were detected. CMS candidate genes were located on the edges of highly rearranged CMS-specific DNA regions and near repeat sequences. These characteristics were also detected among CMS-associated genes in other species, implying that a common mechanism might be involved in the evolution of CMS-associated genes.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-561) contains supplementary material, which is available to authorized users.

5.
Two loci in the human genome, chromosomes 4q12–q21 and 17q11.2, contain clusters of CXC and CC chemokine subfamily genes, respectively. Since mice appear to contain fewer chemokine genes than humans, numerous gene duplications might have occurred in each locus of the human genome. Here we describe the genomic organization of the human pulmonary and activation-regulated CC chemokine (PARC), also known as DC-CK1 and AMAC-1. Despite high sequence similarity to a CC chemokine macrophage inflammatory protein-1α (MIP-1α)/LD78α, PARC is chemotactic for lymphocytes and not for monocytes and does not share its receptor with MIP-1α. Analyses of the BAC clones containing the human PARC gene indicated that the gene is located most closely to MIP-1α (HGMW-approved symbol SCYA3) and MIP-1β (HGMW-approved symbol SCYA4) on chromosome 17q11.2. Dot-plot comparison suggested that the PARC gene had been generated by fusion of two MIP-1α-like genes with deletion and selective usage of exons. Base changes accumulated before and after the fusion might have adapted the gene to a new function. Since there are variably duplicated copies of the MIP-1α gene called LD78β (HGMW-approved symbol SCYA3L) in the vicinity of the MIP-1α gene, the locus surrounding the MIP-1α gene seems to be a “hot spring” that continuously produces new family genes. This evidence provides a new model, duplication and fusion, of the molecular basis for diversity within a gene family.

6.
The central gene cluster of chromosome III was one of the first regions to be sequenced by the Caenorhabditis elegans genome project. We have performed an essential gene analysis on the left part of this cluster, in the region around dpy-17III balanced by the duplication sDp3. We isolated 151 essential gene mutations and characterized them with regard to their arrest stages. To facilitate positioning of these mutations, we generated six new deficiencies that, together with preexisting chromosomal rearrangements, subdivide the region into 14 zones. The 151 mutations were mapped into these zones. They define 112 genes, of which 110 were previously unidentified. Thirteen of the zones have been anchored to the physical sequence by polymerase chain reaction deficiency mapping. Of the 112 essential genes mapped, 105 are within these 13 zones. They span 4.2 Mb of nucleotide sequence. From the nucleotide sequence data, 920 genes are predicted. From a Poisson distribution of our mutations, we predict that 234 of the genes will be essential genes. Thus, the 105 genes constitute 45% of the estimated number of essential genes in the physically defined zones and between 2 and 5% of all essential genes in C. elegans. Received: 23 April 1998 / Accepted: 18 August 1998

7.
The GPX2 gene codes for GSHPx-GI, a glutathione peroxidase whose mRNA is readily detectable in the gastrointestinal tract. Although GPX2 is a single gene in humans, there are two genes in the mouse genome with homology to GPX2. By analyzing a panel of mouse interspecies DNA from the Jackson Laboratory's backcross resource, we have chromosomally mapped these two genes. One was mapped to the central region of mouse chromosome 12 between D12Mit4 and D12Mit5, near fos and Tgfb3. This region is homologous to human 14q24.1, where human GPX2 has been mapped, and most likely represents the functional mouse Gpx2 gene. The other Gpx2-like gene was mapped to mouse chromosome 7 between Pcsk3 and Hbb. We have isolated the latter gene from a P1 phage library. Its pseudogene nature is revealed by the sequence analysis: (a) it is intronless; (b) it has a single nucleotide deletion in the coding region; and (c) it has a poly(A) tail at its 3′-untranslated region.

8.
A whole-genome duplication in the ray-finned fish lineage has been supported by the analyses of the genome sequence of the Japanese pufferfish, Fugu rubripes. Recently, the genome sequence of a second teleost fish, the freshwater pufferfish, Tetraodon nigroviridis, was completed. Comparisons of long-range synteny between the Tetraodon and human genomes provided additional evidence for the whole-genome duplication in the ray-finned fish lineage. In the present study, we conducted phylogenetic analysis of the Tetraodon and human proteins to identify ray-finned fish lineage-specific (‘fish-specific’) duplicate genes in the Tetraodon genome. Our analyses provide evidence for 1087 well-defined fish-specific duplicate genes in Tetraodon. We also analyzed the Fugu proteome that was predicted in the recent Fugu genome assembly, and identified 346 duplicate genes in addition to the 425 duplicates previously identified. We estimated the ages of duplicate genes using the molecular clock. The ages of duplicate genes in the two pufferfishes independently support a large-scale gene duplication around 380–400 Myr ago. In addition, a burst of recent gene duplications was evident in the Tetraodon lineage. These findings provide further evidence for a whole-genome duplication early in the evolution of ray-finned fishes, and suggest that independent gene duplications have occurred recently in the Tetraodon lineage.

9.
The chromosomal assignments of an expressed β-tubulin gene and two related sequences have been determined by Southern blot analysis of DNA from a panel of human × Chinese hamster somatic cell hybrids cleaved with Hind III or EcoR I. Probes containing the 3′ untranslated regions of the expressed gene M40 and of pseudogene 21β were used to localize the M40 sequence (gene symbol TUBB) to chromosome 6 region 6p21 → 6pter, the 21β pseudogene (TUBBP1) to chromosome 8 region 8q21 → 8pter and a third related sequence (TUBBP2) to chromosome 13. Asynteny of expressed genes and related processed pseudogenes has now been demonstrated for several gene families.

10.
11.
《Genomics》1995,29(3)
By using primers complementary to the rat βB1 crystallin gene sequence, we amplified exons 5 and 6 of the orthologous human gene (CRYBB1). The amplified human segments displayed greater than 88% sequence homology to the corresponding rat and bovine sequences. CRYBB1 was assigned to the group 5 region in 22q11.2–q12.1 by hybridizing the exon 6 PCR product to somatic cell hybrids containing defined portions of human chromosome 22. The exon 5 and exon 6 PCR products of CRYBB1 were used to localize, by interspecific backcross mapping, the mouse gene (Crybb1) to the central portion of chromosome 5. Three other β crystallin genes (βB2(−1), βB3, and βA4) have previously been mapped to the same regions in human and mouse. We demonstrate that the βB1 and βA4 crystallin genes are very closely linked in the two species. These assignments complete the mapping and identification of the human and mouse homologues of the major β crystallin genes that are expressed in the bovine lens.

12.
Full-length coding sequences of two novel human cadherin cDNAs were obtained by sequence analysis of several EST clones and 5′ and 3′ rapid amplification of cDNA ends (RACE) products. Exons for a third cDNA sequence were identified in a public-domain human genomic sequence, and the coding sequence was completed by 3′ RACE. One of the sequences (CDH7L1, HGMW-approved gene symbol CDH7) is so similar to the chicken cadherin-7 gene that we consider it to be the human orthologue. In contrast, the published partial sequence of human cadherin-7 is identical to our second cadherin sequence (CDH7L2), for which we propose CDH19 as the new name. The third sequence (CDH7L3, HGMW-approved gene symbol CDH20) is almost identical to the mouse “cadherin-7” cDNA. According to phylogenetic analysis, this mouse cadherin-7 and the human homologue presented here are most likely the orthologues of Xenopus F-cadherin. These novel human genes, CDH7, CDH19, and CDH20, are localized on chromosome 18q22–q23, distal to both the gene CDH2 (18q11) encoding N-cadherin and the locus of the six desmosomal cadherin genes (18q12). Based on genetic linkage maps, this genomic region is close to the region to which Paget's disease was linked. Interestingly, the expression patterns of these three closely related cadherins are strikingly different.

13.
Mapping‐by‐sequencing analyses have largely required a complete reference sequence and employed whole genome re‐sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re‐sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early‐flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene‐rich regions of hexaploid bread wheat to design a 110‐Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo‐chromosomes derived from the capture probe target sequence, with a long‐range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval.

14.
We compare the 5S gene structure from nine Drosophila species. New sequence data (5S genes of D. melanogaster, D. mauritiana, D. sechellia, D. yakuba, D. erecta, D. orena, and D. takahashii) and already-published data (5S genes of D. melanogaster, D. simulans, and D. teissieri) are used in these comparisons. We show that four regions within the Drosophila 5S genes display distinct rates of evolution: the coding region (120 bp), the 5′-flanking region (54–55 bp), the 3′-flanking region (21–22 bp), and the internal spacer (149–206 bp). Intra- and interspecific heterogeneity is due mainly to insertions and deletions of 6–17-bp oligomers. These small rearrangements could be generated by fork slippages during replication and could produce rapid sequence divergence in a limited number of steps. Correspondence to: M. Wegnez

15.
Casuarina equisetifolia (C. equisetifolia), a conifer‐like angiosperm with resistance to typhoons and tolerance of stress, is mainly cultivated in the coastal areas of Australasia. These traits make C. equisetifolia a valuable model for studying genes associated with secondary growth and stress‐tolerance traits. However, the genome sequence has been unavailable, and therefore wood‐associated growth rate and stress resistance remain largely unexplored at the molecular level. We therefore constructed a high‐quality draft genome sequence of C. equisetifolia by a combination of Illumina second‐generation sequencing reads and Pacific Biosciences single‐molecule real‐time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and has a scaffold N50 of 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA‐seq data, generated 29 827 annotated protein‐coding genes and 1983 non‐coding genes. Furthermore, we found that repetitive sequences account for one‐third of the genome assembly. Here we also construct a genome‐wide map of DNA modifications, including the two novel forms N6‐methyladenine (6mA) and N4‐methylcytosine (4mC), at single‐nucleotide resolution using SMRT sequencing. Interestingly, we found that 17% of 6mA‐modified genes and 15% of 4mC‐modified genes also included alternative splicing events. Finally, we investigated cellulose‐, hemicellulose‐, and lignin‐related genes, which were associated with secondary growth and contained different DNA modifications. The high‐quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.
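
The scaffold N50 quoted in this abstract is a standard assembly statistic: the length L such that scaffolds of length L or longer together cover at least half of the assembly. The abstract does not say which tool computed it, so the following is only a minimal sketch of the usual definition; the function name `n50` and the toy lengths are illustrative assumptions, not data from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Hypothetical helper: walks the scaffolds from longest to shortest and
    returns the length at which the running total first reaches half of
    the summed assembly size.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy scaffold lengths (made up for illustration only).
toy_scaffolds = [2, 2, 2, 3, 3, 4, 8, 8]
toy_n50 = n50(toy_scaffolds)
```

For the toy list the total is 32, and the two longest scaffolds (8 + 8) already cover half of it, so the N50 is 8.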

16.
Tandem stretches of guanines can associate in hydrogen-bonded arrays to form G-quadruplexes, which are stabilized by K+ ions. Using computational methods, we searched for G-Quadruplex Sequence (GQS) patterns in the model plant species Arabidopsis thaliana. We found ∼1200 GQS with a G3 repeat sequence motif, most of which are located in the intergenic region. Using a Markov modeled genome, we determined that GQS are significantly underrepresented in the genome. Additionally, we found ∼43 000 GQS with a G2 repeat sequence motif; notably, 80% of these were located in genic regions, suggesting that these sequences may fold at the RNA level. Gene Ontology functional analysis revealed that GQS are overrepresented in genes encoding proteins of certain functional categories, including enzyme activity. Conversely, GQS are underrepresented in other categories of genes, notably those for non-coding RNAs such as tRNAs and rRNAs. We also find that genes that are differentially regulated by drought are significantly more likely to contain a GQS. CD-detected K+ titrations performed on representative RNAs verified formation of quadruplexes at physiological K+ concentrations. Overall, this study indicates that GQS are present at unique locations in Arabidopsis and that folding of RNA GQS may play important roles in regulating gene expression.
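
The pattern search described in this abstract can be illustrated with a regular expression. The abstract does not give the exact motif definition, so the sketch below assumes a commonly used form: four runs of at least `min_g` guanines separated by loops of 1 to `max_loop` nucleotides. The function names, the loop limit of 7, and the test sequence are assumptions for illustration, not the authors' pipeline, and only the given strand is scanned.

```python
import re


def gqs_pattern(min_g=3, max_loop=7):
    """Compile an assumed GQS motif: G-run, then three (loop + G-run) units."""
    run = f"G{{{min_g},}}"                  # e.g. G{3,}
    loop = f"[ACGTU]{{1,{max_loop}}}"       # e.g. [ACGTU]{1,7}
    return re.compile(f"{run}(?:{loop}{run}){{3}}", re.IGNORECASE)


def find_gqs(seq, min_g=3, max_loop=7):
    """Return (start, end, matched text) for non-overlapping hits in seq."""
    pattern = gqs_pattern(min_g, max_loop)
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(seq)]


# Made-up sequence containing one G3-type motif.
hits = find_gqs("TTGGGAGGGTGGGCGGGTT")
```

Calling `find_gqs(seq, min_g=2)` would instead search for the G2-type motif; a genome-wide scan would also need to cover the reverse strand (C-rich runs), which this sketch leaves out.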

17.
Comparative evolutionary analyses of gene families among divergent lineages can provide information on the order and timing of major gene duplication events and evolution of gene function. Here we investigate the evolutionary history of the α-globin gene family in mammals by isolating and characterizing α-like globin genes from an Australian marsupial, the tammar wallaby, Macropus eugenii. Sequence and phylogenetic analyses indicate that the tammar α-globin family consists of at least four genes including a single adult-expressed gene (α), two embryonic/neonatally expressed genes (ζ and ζ′), and θ-globin, each orthologous to the respective α-, ζ-, and θ-globin genes of eutherian mammals. The results suggest that the θ-globin lineage arose by duplication of an ancestral adult α-globin gene and had already evolved an unusual promoter region, atypical of all known α-globin gene promoters, prior to the divergence of the marsupial and eutherian lineages. Evolutionary analyses, using a maximum likelihood approach, indicate that θ-globin has evolved under strong selective constraints in both marsupials and the lineage leading to human θ-globin, suggesting a long-term functional status. Overall, our results indicate that at least a four-gene cluster consisting of three α-like and one β-like globin genes linked in the order 5′–ζ–α–θ–ω–3′ existed in the common ancestor of marsupials and eutherians. However, results are inconclusive as to whether the two tammar ζ-globin genes arose by duplication prior to the radiation of the marsupial and eutherian lineages, with maintenance of exon sequences by gene conversion, or more recently within marsupials. Reviewing Editor: Dr. John Oakeshott

18.
V Paces  C Vlcek  P Urbánek  Z Hostomsky 《Gene》1986,44(1):115-120
We have sequenced the rightmost 2079 bp of the Bacillus subtilis phage PZA genome. This region encompasses the right early region. We compared it with the homologous region of phage phi 29. Six open reading frames (ORFs) were found in this region of PZA and one of them was assigned to gene 17. Analysis of putative ribosome-binding sites and comparison with phi 29 ORFs indicate that at least some of the remaining ORFs could encode proteins. The corresponding genes have not so far been identified by genetic methods. Promoter candidates in the right early region of PZA were found and compared to phi 29 promoters. The sequenced region together with previously determined sequences [Paces et al., Gene 38 (1985) 45-56 and 44 (1986) 107-114] completes the entire 19,366-bp sequence of phage PZA genome.

19.
Yuan Y  Li Q  Kong L  Yu H 《Molecular biology reports》2012,39(2):1287-1292
Molluscs in general, and bivalves in particular, exhibit an extraordinary degree of mitochondrial gene order variation when compared with other metazoans. The complete mitochondrial genome of Solen grandis (Bivalvia: Solenidae) was determined using long-PCR and genome walking techniques. The entire mitochondrial genome sequence of S. grandis is 16,784 bp in length, and contains 36 genes including 12 protein-coding genes (atp8 is absent), 2 ribosomal RNAs, and 22 tRNAs. All genes are encoded on the same strand. Compared with other species, it bears a novel gene order. Besides these, we find a peculiar non-coding region of 435 bp with a microsatellite-like (TA)12 element, poly-structures and many hairpin structures. In contrast to the available heterodont mitochondrial genomes from GenBank, the complete mtDNA of S. grandis has the shortest cox3 gene, and the longest atp6, nad4, nad5 genes.

20.
A molecular understanding of porcine reproduction is of biological interest and economic importance. Our Midwest Consortium has produced cDNA libraries containing the majority of genes expressed in major female reproductive tissues, and we have deposited into public databases 21,499 expressed sequence tag (EST) gene sequences from the 3′ end of clones from these libraries. These sequences represent 10,574 different genes, based on sequence comparison among these data, and comparison with existing porcine ESTs and genes indicates that as many as 4652 of these EST clusters are novel. In silico analysis identified sequences that are expressed in specific pig tissues or organs and confirmed the broad expression in pig for many genes ubiquitously expressed in human tissues. Furthermore, we have developed computer software to identify sequence similarity of these pig genes with their human counterparts, and to extract the mapping information of these human homologues from genome databases. We demonstrate the utility of this software for comparative mapping by localizing 61 genes on the porcine physical map for Chromosomes (Chrs) 5, 10, and 14. The following Accession numbers were assigned to our deposited sequences: BF701840 – BF704551, BF708383, BF708386 – BF713604, BG322266 – BG322271, BI398567 – BI405235, BQ597354 – BQ605166.
