首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Sequencing of a contiguous 215-kb interval of Triticum monococcum showed the presence of five genes in the same order as in previously sequenced colinear barley and rice BACs. Gene 2 was in the same orientation in wheat and rice but inverted in barley. Gene density in this region was 1 gene per 43 kb and the ratio of physical to genetic distance was estimated to be 2,700 kb cM–1. Twenty more-or-less intact retrotransposons were found in the intergenic regions, covering at least 70% of the sequenced region. The insertion times of 11 retrotransposons were less than 5 million years ago and were consistent with their nested structure. Five new families of retroelements and the first full-length elements for two additional retrotransposon families were discovered in this region. Significantly higher values of GC content were observed for Triticeae BACs compared with rice BACs. Relative enrichment or depletion of certain dinucleotides was observed in the comparison of introns, exons and retrotransposons. A higher proportion of transitions in CG and CNG sites that are targets for cytosine methylation was observed in retrotransposons (76%) than in introns (37%). These results showed that the wheat genome is a complex mixture of different sequence elements, but with general patterns of content and interspersion that are similar to those seen in maize and barley. Electronic Publication  相似文献   

2.
3.
We have constructed a map of the duplicated regions of Kluyveromyces lactis subtelomeres. Seven out of 12 subtelomeres contain an almost identical 9 kb long segment starting from the end. This segment is bordered by a long terminal repeat element. Two of the subtelomeres share sequence similarity that extends over a total of 20 kb. The other subtelomeres also contain duplicated regions of 1-6 kb. Nonduplicated regions contain unique genes and genes from paralog families. All duplicated segments are in the same orientation with respect to the telomere, probably as a result of genetic exchange. We map the only two copies of retrotransposons in the genome, in subtelomeres. Low-complexity gene sequences that encode threonine- and serine-rich peptides are associated with the subtelomeres of K. lactis, as in Saccharomyces cerevisiae. The ubiquity of these sequences in hemiascomycete genomes, and the propensity they have to encode proteins with extracellular localization, make these genes ideal candidates for fast evolving 'contingency' genes involved in the adaptation of a species to its environment.  相似文献   

4.
Retrotransposon-based molecular markers are a powerful tool for mapping and diversity studies. The scarcity of retrotransposon long terminal repeat (LTR) sequences limits the application of retrotransposon-based molecular marker systems. Here, we isolated two novel complete Ty1-copia retrotransposons (CTcrm1 and CTcrm2) in apple using a genome walking strategy. The CTcrm retrotransposons are nearly 5 kb long, and they have all the features of Ty1-copia retrotransposons. The differences in gene organization and nucleotide sequence length between the CTcrm retrotransposons and other reported complete retrotransposons in apple showed that CTcrm1 and CTcrm2 are the first two distinct complete Ty1-copia retrotransposons in the apple genome. To investigate the potential utility of the two retrotransposons as molecular markers, primers complementary to the CTcrm LTRs were designed to develop sequence-specific amplification polymorphism markers for discriminating bud sports of Fuji apple. Multiple polymorphisms corresponding to CTcrm1 and CTcrm2 were detected and could easily be used to discriminate bud sports from their Fuji progenitor, as well as from each other.  相似文献   

5.
6.
7.
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.  相似文献   

8.
The complete sequence of Musa acuminata bacterial artificial chromosome (BAC) clones is presented and, consequently, the first analysis of the banana genome organization. One clone (MuH9) is 82,723 bp long with an overall G+C content of 38.2%. Twelve putative protein-coding sequences were identified, representing a gene density of one per 6.9 kb, which is slightly less than that previously reported for Arabidopsis but similar to rice. One coding sequence was identified as a partial M. acuminata malate synthase, while the remaining sequences showed a similarity to predicted or hypothetical proteins identified in genome sequence data. A second BAC clone (MuG9) is 73,268 bp long with an overall G+C content of 38.5%. Only seven putative coding regions were discovered, representing a gene density of only one gene per 10.5 kb, which is strikingly lower than that of the first BAC. One coding sequence showed significant homology to the soybean ribonucleotide reductase (large subunit). A transition point between coding regions and repeated sequences was found at approximately 45 kb, separating the coding upstream BAC end from its downstream end that mainly contained transposon-like sequences and regions similar to known repetitive sequences of M. acuminata. This gene organization resembles Gramineae genome sequences, where genes are clustered in gene-rich regions separated by gene-poor DNA containing abundant transposons.Communicated by J.S. Heslop-Harrison  相似文献   

9.
To better understand genome structure and the expression of α/β-gliadin multigenes in hexaploid wheat, bacterial artificial chromosome (BAC) clones containing α/β-gliadin genes from the three loci, Gli-A2, Gli-B2, and Gli-D2, were screened. Based on their restriction fragment patterns, we selected five BAC clones, namely, two clones for Gli-A2, two clones for Gli-B2, and one clone for Gli-D2, to fully sequence. Approximately 200 kb was sequenced for each locus. In total, twelve α/β-gliadin intact genes and four pseudogenes were found, and retrotransposons or other transposons existed in each BAC clone. Dot-plot analysis revealed the pattern of genome segmental duplication within each BAC. We calculated time since duplication of each set of α/β-gliadin genes and insertion of retrotransposons. Duplication of all adjacent genes within the same BAC clone took place before or after allotetrapolyploidization, but duplication of certain genes occurred before diploid differentiation of wheat species. Retrotransposons were also inserted before and after the segmental duplication events. Furthermore, translocation of α/β-gliadin genes from chromosomes 1 to 6 apparently occurred before the diversification of various wheat genomes. Duplication of genome segments containing α/β-gliadin genes and retrotransposons were brought about through unequal crossing-over or saltatory replication and α/β-gliadin genes per se were duplicated without any recombination events. Out of twelve intact α/β-gliadin genes detected from their sequences, nine were expressed, although their patterns of expression were distinct. Since they have similar cis-elements and promoter structures, the mechanisms underlying their distinct gene expression and possible applications are discussed.  相似文献   

10.
This paper describes the structure of a 9.2-kb repeat unit of DNA, which represents one-secalin gene and spacer sequence located at theSec-1 locus on the short arm of chromosome 1 of rye. The gene units at theSec-1 locus comprise 1.1 kb representing the gene and 8.1 kb of spacer sequence separating the genes. A sequence comparison of nine genes and their promoter regions from theSec-1 locus, reveals that there is greater variation within the coding sequence than there is within the promoter regions. The gene sequence variation is discussed in terms of the size variation seen for the-secalin proteins in rye species. The results include a comparison of promoter sequences from members of the Triticeae to examine the degree of conservation between other seed storage protein genes.  相似文献   

11.
LTR retrotransposons may be important contributors to host gene evolution because they contain regulatory and coding signals. In an effort to assess the possible contribution of LTR retrotransposons to C. elegans gene evolution, we searched upstream and downstream of LTR retrotransposon sequences for the presence of predicted genes. Sixty-three percent of LTR retrotransposon sequences (79/124) are located within 1 kb of a gene or within gene boundaries. Most gene-retrotransposon associations were located along the chromosome arms. Our results are consistent with the hypothesis that LTR retrotransposons have contributed to the structural and/or regulatory evolution of genes in C. elegans.  相似文献   

12.
13.
A bacterial-artificial-chromosome (BAC) clone from the genome of Triticum tauschii, the D-genome ancestor of hexaploid bread wheat, was sequenced and the presence of the two paralogous x- and y-type high-molecular-weight (HMW) glutenin genes of the Glu-D1 locus was confirmed. These two genes occur in the same orientation, are 51,893 bp apart, and the separating DNA includes a 31,000-bp cluster of retrotransposons. A second retrotransposon cluster of 32,000 bp follows the x-type HMW-glutenin gene region. Each HMW-glutenin gene is found within a region of mainly unique DNA sequence which includes multiple additional genes including an active endosperm globulin gene not previously reported in the Triticeae family, a leucine-rich-repeat (LRR) type gene truncated at the 5′ end of the BAC, a kinase gene of unknown activity, remnants of a paralogous second globulin gene, and genes similar to two hypothetical rice genes. The newly identified globulin genes are assigned to a locus designated Glo-2. Comparison to available orthologous regions of the wheat A and B genomes show rapid sequence divergences flanking the HMW-glutenin genes, and the absence of two hypothetical and unknown genes found 5′ to the B-genome x-type ortholog. The region surrounding the Glu-D1 locus is similar to other reported Triticeae BAC sequences; i.e. small gene islands separated by retrotransposon clusters. Electronic Publication  相似文献   

14.
We sequenced five BAC clones of Brassica oleracea doubled haploid ‘Early Big' broccoli containing major genes in the aliphatic glucosinolate pathway, and comparatively analyzed them with similar sequences in A. thaliana and B. rapa. Additionally, we included in the analysis published sequences from three other B. oleracea BAC clones and a contig of this species corresponding to segments in A. thaliana chromosomes IV and V. A total of 2,946 kb of B. oleracea, 1,069 kb of B. rapa sequence and 2,607 kb of A. thaliana sequence were compared and analyzed. We found conserved collinearity for gene order and content restricted to specific chromosomal segments, but breaks in collinearity were frequent resulting in gene absence likely not due to gene loss but rearrangements. B. oleracea has the lowest gene density of the three species, followed by B. rapa. The genome expansion of the Brassica species, B. oleracea in particular, is due to larger introns and gene spacers resulting from frequent insertion of DNA transposons and retrotransposons. These findings are discussed in relation to the possible origin and evolution of the Brassica genomes. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

15.
The 441-bp DNA segment in a PCR-amplified fragment from Oryza sativa cv. IR36 was found to have a sequence with features characteristic of LTRs of retroelements, which was named RIRE2 (Rice retroelement #2) and further analyzed. Cloning and sequencing analyses of the DNA segments connected to LTR-like sequence showed that RIRE2 has a long internal region almost 10 kb long that is flanked by LTR-like sequences. This internal region carries a primer binding site (PBS) and polypurine tract (PPT) which are necessary for cDNA synthesis of retroelements. The PBS sequence is complementary to the 3' end region of tRNA(Arg). The internal region has an rt gene homologous to that of gypsy-type retrotransposons, evidence that RIRE2 is indeed a retrotransposon related to gypsy from Drosophila. RIRE2 has an extra sequence more than 4 kb long in the region downstream of gag-pol. Phylogenetic analysis of the putative amino-acid sequences of the rt gene as well as the int gene showed that RIRE2 is related to a group of gypsy-type retrotransposons of a large size that include Grande1-4 of teosinte, Tat4-1 and Athila1-1 of Arabidopsis thaliana, and Cyclops-2 of pea, but distantly related to any other group of gypsy-type retrotransposons, including RIRE3 and RIRE8 of rice. RIRE2 and Grande1-4 had the highest homology in the gag-pol region, but the nucleotide sequences of the LTR regions differed. Both elements had significant homology in the middle area of the extra regions downstream of gag-pol, in which they had an open reading frame encoding a protein with no known function on the opposite strand from that coding for gag-pol.  相似文献   

16.
Triticeae species (including wheat, barley and rye) have huge and complex genomes due to polyploidization and a high content of transposable elements (TEs). TEs are known to play a major role in the structure and evolutionary dynamics of Triticeae genomes. During the last 5 years, substantial stretches of contiguous genomic sequence from various species of Triticeae have been generated, making it necessary to update and standardize TE annotations and nomenclature. In this study we propose standard procedures for these tasks, based on structure, nucleic acid and protein sequence homologies. We report statistical analyses of TE composition and distribution in large blocks of genomic sequences from wheat and barley. Altogether, 3.8 Mb of wheat sequence available in the databases was analyzed or re-analyzed, and compared with 1.3 Mb of re-annotated genomic sequences from barley. The wheat sequences were relatively gene-rich (one gene per 23.9 kb), although wheat gene-derived sequences represented only 7.8% (159 elements) of the total, while the remainder mainly comprised coding sequences found in TEs (54.7%, 751 elements). Class I elements [mainly long terminal repeat (LTR) retrotransposons] accounted for the major proportion of TEs, in terms of sequence length as well as element number (83.6% and 498, respectively). In addition, we show that the gene-rich sequences of wheat genome A seem to have a higher TE content than those of genomes B and D, or of barley gene-rich sequences. Moreover, among the various TE groups, MITEs were most often associated with genes: 43.1% of MITEs fell into this category. Finally, the TRIM and copia elements were shown to be the most active TEs in the wheat genome. The implications of these results for the evolution of diploid and polyploid wheat species are discussed. Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

17.
18.
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole‐genome shotgun (WGS) approach, without the use of costly and time‐consuming methods, such as fosmid or BAC clone‐based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS‐FLX technology were assembled into contigs and combined with shorter Illumina sequence reads from paired‐end and mate‐pair libraries of different insert sizes, to build scaffolds. Errors were corrected and gaps filled with Illumina paired‐end reads and contaminants detected, resulting in a total of 17 910 scaffolds (>2 kb) corresponding to 1.34 Gb. Fifty per cent of the assembly was accounted for by 1468 scaffolds (N50 of 260 kb). Initial comparison with the phylogenetically related Prunus persica gene model indicated that genes for 84.6% of the proteins present in peach (mean protein coverage of 90.5%) were present in our assembly. The second and third steps in this project are genome annotation and the assignment of scaffolds to the oak genetic linkage map. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement, the oak genome data have been released into public sequence repositories in advance of publication. In this presubmission paper, the oak genome consortium describes its principal lines of work and future directions for analyses of the nature, function and evolution of the oak genome.  相似文献   

19.
Continuous genomic sequence has been previously determined for the swine leukocyte antigen (SLA) class I region from the TNF gene cluster at the border between the major histocompatibility complex (MHC) class III and class I regions to the UBD gene at the telomeric end of the classical class I gene cluster (SLA-1 to SLA-5, SLA-9, SLA-11). To complete the genomic sequence of the entire SLA class I genomic region, we have analyzed the genomic sequences of two BAC clones carrying a continuous 237,633-bp-long segment spanning from the TRIM15 gene to the UBD gene located on the telomeric side of the classical SLA class I gene cluster. Fifteen non-class I genes, including the zinc finger and the tripartite motif (TRIM) ring-finger-related family genes and olfactory receptor genes, were identified in the 238-kilobase (kb) segment, and their location in the segment was similar to their apparent human homologs. In contrast, a human segment (alpha block) spanning about 375 kb from the gene ETF1P1 and from the HLA-J to HLA-F genes was absent from the 238-kb swine segment. We conclude that the gene organization of the MHC non-class I genes located in the telomeric side of the classical SLA class I gene cluster is remarkably similar between the swine and the human segments, although the swine lacks a 375-kb segment corresponding to the human alpha block. The nucleotide sequence data reported in this paper have been submitted to DDBJ, EMBL, and GenBank databases under accession numbers AB158486 and AB158487  相似文献   

20.
Sequencing and annotation of a contiguous stretch of genomic DNA (112.3 kb) from the oomycete plant pathogen Phytophthora infestans revealed the order, spacing and genomic context of four members of the elicitin (inf) gene family. Analysis of the GC content at the third codon position (GC3) of six genes encoded in the region, and a set of randomly selected coding regions as well as random genomic regions, showed that a high GC3 value is a general feature of Phytophthora genes that can be exploited to optimize gene prediction programs for Phytophthora species. At least one-third of the annotated 112.3-kb P. infestans sequence consisted of transposons or transposon-like elements. The most prominent were four Tc3/gypsy and Tc1/copia type retrotransposons and three DNA transposons that belong to the Tc1/mariner, Pogo and PiggyBac groups, respectively. Comparative analysis of other available genomic sequences suggests that transposable elements are highly heterogeneous and ubiquitous in the P. infestans genome.Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号