首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Arabidopsis thaliana has a relatively small genome of approximately 130 Mb containing about 10% repetitive DNA. Genome sequencing studies reveal a gene-rich genome, predicted to contain approximately 25000 genes spaced on average every 4.5 kb. Between 10 to 20% of the predicted genes occur as clusters of related genes, indicating that local sequence duplication and subsequent divergence generates a significant proportion of gene families. In addition to gene families, repetitive sequences comprise individual and small clusters of two to three retroelements and other classes of smaller repeats. The clustering of highly repetitive elements is a striking feature of the A. thaliana genome emerging from sequence and other analyses.  相似文献   

2.
Higher order chromatin structures in maize and Arabidopsis.   总被引:1,自引:0,他引:1       下载免费PDF全文
A L Paul  R J Ferl 《The Plant cell》1998,10(8):1349-1359
We are investigating the nature of plant genome domain organization by using DNase I- and topoisomerase II-mediated cleavage to produce domains reflecting higher order chromatin structures. Limited digestion of nuclei with DNase I results in the conversion of the >800 kb genomic DNA to an accumulation of fragments that represents a collection of individual domains of the genome created by preferential cleavage at super-hypersensitive regions. The median size of these fragments is approximately 45 kb in maize and approximately 25 kb in Arabidopsis. Hybridization analyses with specific gene probes revealed that individual genes occupy discrete domains within the distribution created by DNase I. The maize alcohol dehydrogenase Adh1 gene occupies a domain of 90 kb, and the maize general regulatory factor GRF1 gene occupies a domain of 100 kb in length. Arabidopsis Adh was found within two distinct domains of 8.3 and 6.1 kb, whereas an Arabidopsis GRF gene occupies a single domain of 27 kb. The domains created by topoisomerase II-mediated cleavage are identical in size to those created by DNase I. These results imply that the genome is not packaged by means of a random gathering of the genome into domains of indiscriminate length but rather that the genome is gathered into specific domains and that a gene consistently occupies a discrete physical section of the genome. Our proposed model is that these large organizational domains represent the fundamental structural loop domains created by attachment of chromatin to the nuclear matrix at loop basements. These loop domains may be distinct from the domains created by the matrix attachment regions that typically flank smaller, often functionally distinct sections of the genome.  相似文献   

3.
A "gene-island" sequencing strategy has been developed that expedites the targeted acquisition of orthologous gene sequences from related species for comparative genome analysis. A 152-kb bacterial artificial chromosome (BAC) clone from sorghum (Sorghum bicolor) encoding phytochrome A (PHYA) was fully sequenced, revealing 16 open reading frames with a gene density similar to many regions of the rice (Oryza sativa) genome. The sequences of genes in the orthologous region of the maize (Zea mays) and rice genomes were obtained using the gene-island sequencing method. BAC clones containing the orthologous maize and rice PHYA genes were identified, sheared, subcloned, and probed with the sorghum PHYA-containing BAC DNA. Sequence analysis revealed that approximately 75% of the cross-hybridizing subclones contained sequences orthologous to those within the sorghum PHYA BAC and less than 25% contained repetitive and/or BAC vector DNA sequences. The complete sequence of four genes, including up to 1 kb of their promoter regions, was identified in the maize PHYA BAC. Nine orthologous gene sequences were identified in the rice PHYA BAC. Sequence comparison of the orthologous sorghum and maize genes aided in the identification of exons and conserved regulatory sequences flanking each open reading frame. Within genomic regions where micro-colinearity of genes is absolutely conserved, gene-island sequencing is a particularly useful tool for comparative analysis of genomes between related species.  相似文献   

4.
Lilly JW  Havey MJ 《Genetics》2001,159(1):317-328
Closely related cucurbit species possess eightfold differences in the sizes of their mitochondrial genomes. We cloned mitochondrial DNA (mtDNA) fragments showing strong hybridization signals to cucumber mtDNA and little or no signal to watermelon mtDNA. The cucumber mtDNA clones carried short (30-53 bp), repetitive DNA motifs that were often degenerate, overlapping, and showed no homology to any sequences currently in the databases. On the basis of dot-blot hybridizations, seven repetitive DNA motifs accounted for >13% (194 kb) of the cucumber mitochondrial genome, equaling >50% of the size of the Arabidopsis mitochondrial genome. Sequence analysis of 136 kb of cucumber mtDNA revealed only 11.2% with significant homology to previously characterized mitochondrial sequences, 2.4% to chloroplast DNA, and 15% to the seven repetitive DNA motifs. The remaining 71.4% of the sequence was unique to the cucumber mitochondrial genome. There was <4% sequence colinearity surrounding the watermelon and cucumber atp9 coding regions, and the much smaller watermelon mitochondrial genome possessed no significant amounts of cucumber repetitive DNAs. Our results demonstrate that the expanded cucumber mitochondrial genome is in part due to extensive duplication of short repetitive sequences, possibly by recombination and/or replication slippage.  相似文献   

5.
The maize genome contains a helitron insertion   总被引:8,自引:0,他引:8       下载免费PDF全文
The maize mutation sh2-7527 was isolated in a conventional maize breeding program in the 1970s. Although the mutant contains foreign sequences within the gene, the mutation is not attributable to an interchromosomal exchange or to a chromosomal inversion. Hence, the mutation was caused by an insertion. Sequences at the two Sh2 borders have not been scrambled or mutated, suggesting that the insertion is not caused by a catastrophic reshuffling of the maize genome. The insertion is large, at least 12 kb, and is highly repetitive in maize. As judged by hybridization, sorghum contains only one or a few copies of the element, whereas no hybridization was seen to the Arabidopsis genome. The insertion acts from a distance to alter the splicing of the sh2 pre-mRNA. Three distinct intron-bearing maize genes were found in the insertion. Of most significance, the insertion bears striking similarity to the recently described DNA helicase-bearing transposable elements termed HELITRONS: Like Helitrons, the inserted sequence of sh2-7527 is large, lacks terminal repeats, does not duplicate host sequences, and was inserted between a host dinucleotide AT. Like Helitrons, the maize element contains 5' TC and 3' CTRR termini as well as two short palindromic sequences near the 3' terminus that potentially can form a 20-bp hairpin. Although the maize element lacks sequence information for a DNA helicase, it does contain four exons with similarity to a plant DEAD box RNA helicase. A second Helitron insertion was found in the maize genomic database. These data strongly suggest an active Helitron in the present-day maize genome.  相似文献   

6.
Most higher plants have complex genomes containing large quantities of repetitive DNA interspersed with low-copy-number sequences. Many of these repetitive DNAs are mobile and have homology to RNAs in various cell types. This can make it difficult to identify the genes in a long chromosomal continuum. It was decided to use genic sequence conservation and grass genome co-linearity as tools for gene identification. A bacterial artificial chromosome (BAC) clone containing sorghum genomic DNA was selected using a maize Adh1 probe. The 165 kb sorghum BAC was tested for hybridization to a set of clones representing the contiguous 280 kb of DNA flanking maize Adh1. None of the repetitive maize DNAs hybridized, but most of the low-copy-number sequences did. A low-copy-number sequence that did cross-hybridize was found to be a gene, while one that did not was found to be a low-copy-number retrotransposon that was named Reina. Regions of cross-hybridization were co-linear between the two genomes, but closer together in the smaller sorghum genome. These results indicate that local genomic cross-referencing by hybridization of orthologous clones can be an efficient and rapid technique for gene identification and studies of genome organization.  相似文献   

7.
Evolution of the genome size in eukaryotes is often affected by changes in the noncoding sequences, for which insertions and deletions (indels) of small nucleotide sequences and amplification of repetitive elements are considered responsible. In this study, we compared the genomic DNA sequences of two kinds of fish, medaka (Oryzias latipes) and fugu (Takifugu rubripes), which show two-fold difference in the genome size (800 Mb vs. 400 Mb). We selected a contiguous DNA sequence of 790 kb from the medaka chromosome LG22 (linkage group 22), and made a precise comparison with the sequence (387 kb) of the corresponding region of Takifugu. The sequence of 178 kb in total was aligned common between two fishes, and the remaining sequences (612 kb for medaka and 209 kb for fugu) were found abundant in various repetitive elements including many types of unclassified low copy repeats, all of which accounted for more than a half (54%) of the genome size difference. Furthermore, we identified a significant difference in the length ratio of the unaligned sequences that locate between the aligned sequences (USBAS), particularly after eliminating known repetitive elements. These USBAS with no repetitive elements (USBAS-nr) located within the intron and intergenic region. These results strongly indicated that amplification of repetitive elements and compilation of indels are major driving forces to facilitate changes in the genome size.  相似文献   

8.
The genomes of flowering plants vary in size from about 0.1 to over 100 gigabase pairs (Gbp), mostly because of polyploidy and variation in the abundance of repetitive elements in intergenic regions. High-quality sequences of the relatively small genomes of Arabidopsis (0.14 Gbp) and rice (0.4 Gbp) have now been largely completed. The sequencing of plant genomes that have a more representative size (the mean for flowering plant genomes is 5.6 Gbp) has been seen as a daunting task, partly because of their size and partly because of the numerous highly conserved repeats. Nevertheless, creative strategies and powerful new tools have been generated recently in the plant genetics community, so that sequencing large plant genomes is now a realistic possibility. Maize (2.4-2.7 Gbp) will be the first gigabase-size plant genome to be sequenced using these novel approaches. Pilot studies on maize indicate that the new gene-enrichment, gene-finishing and gene-orientation technologies are efficient, robust and comprehensive. These strategies will succeed in sequencing the gene-space of large genome plants, and in locating all of these genes and adjacent sequences on the genetic and physical maps.  相似文献   

9.
A bacterial artificial chromosome (BAC) library has been established for Arabidopsis thaliana (ecotype Col-0) covering about seven haploid nuclear genome equivalents. This library, called the Institut für Genbiologische Forschung (IGF) BAC library, consists of 10?752 recombinant clones carrying inserts (generated by partial EcoRI digestion) of an average size of about 100?kb in a modified BAC vector, pBeloBAC-Kan. Hybridization with organellar DNA and nuclear repetitive DNA elements revealed the presence of 1.1% clones with mitochondrial DNA, 0.2% clones with plastid DNA, 3.2% clones with the 180?bp paracentromeric repeat, 1.6% clones with 5S rDNA, and 10.8% clones with the 18S-25S rDNA repeat. With its extensive genome coverage, its rather uniformly sized inserts (80?kb?<85% <120?kb) and low contamination with organellar DNA, this library provides an excellent resource for A. thaliana genomic mapping, map-based gene cloning, and genome sequencing.  相似文献   

10.
G Matassi  R Melis  K C Kuo  G Macaya  C W Gehrke  G Bernardi 《Gene》1992,122(2):239-245
Methylation was investigated in compositional fractions of nuclear DNA preparations (50-100 kb in size) from five plants (onion, maize, rye, pea and tobacco), and was found to increase from GC-poor to GC-rich fractions. This methylation gradient showed different patterns in different plants and appears, therefore, to represent a novel, characteristic genome feature which concerns the noncoding, intergenic sequences that make up the bulk of the plant genomes investigated and mainly consist of repetitive sequences. The structural and functional implications of these results are discussed.  相似文献   

11.
The X, Y and ovalbumin genes, which are found within a 40 kb region of the chicken genome, are all expressed in oviduct under steroid hormone control, and share some sequence homologies. We have now cloned the complete X gene and have analyzed its structure. It codes for two RNA species, X and X′; both are coded by eight exons and appear to differ only by the size of their 3′ untranslated region, X′ RNA being 1400 nucleotides longer than X RNA. The striking similarity in the number and length of the exons which constitute the X, Y or ovalbumin genes establishes that they have evolved from a common ancestor gene by duplication events. Comparison of selected regions of the X and ovalbumin genes indicates that the exon sequences coding for protein and the location of the splice junctions have been well-conserved. The introns and the 3′ untranslated exonic sequences have diverged much more rapidly. Four regions of apparently unrelated repetitive sequences are found both outside the X gene and within it (in two introns and in the sequence coding for the 3′ untranslated part of X′RNA). The intragenic repetitive sequences have no counterpart in the ovalbumin and Y genes.  相似文献   

12.
13.
The small genome size (740 Mb), short life cycle (3 months) and high economic importance as a food crop legume make chickpea (Cicer arietinum L.) an important system for genomics research. Although several genetic linkage maps using various markers and genomic tools have become available, sequencing efforts and their use are limited in chickpea genomic research. In this study, we explored the genome organization of chickpea by sequencing approximately 500 kb from 11 BAC clones (three representing ascochyta blight resistance QTL1 (ABR-QTL1) and eight randomly selected BAC clones). Our analysis revealed that these sequenced chickpea genomic regions have a gene density of one per 9.2 kb, an average gene length of 2,500 bp, an average of 4.7 exons per gene, with an average exon and intron size of 401 and 316 bp, respectively, and approximately 8.6% repetitive elements. Other features analyzed included exon and intron length, number of exons per gene, protein length and %GC content. Although there are reports on high synteny among legume genomes, the microsynteny between the 500 kb chickpea and available Medicago truncatula genomic sequences varied depending on the region analyzed. The GBrowse-based annotation of these BACs is available at http://www.genome.ou.edu/plants_totals.html . We believe that our work provides significant information that supports a chickpea genome sequencing effort in the future.  相似文献   

14.
15.
Comparative genomics of Arabidopsis and maize: prospects and limitations   总被引:3,自引:0,他引:3  
Brendel V  Kurtz S  Walbot V 《Genome biology》2002,3(3):reviews1005.1-reviews10056
The completed Arabidopsis genome seems to be of limited value as a model for maize genomics. In addition to the expansion of repetitive sequences in maize and the lack of genomic micro-colinearity, maize-specific or highly-diverged proteins contribute to a predicted maize proteome of about 50,000 proteins, twice the size of that of Arabidopsis.  相似文献   

16.
Organization and sequence of the human alpha-lactalbumin gene.   总被引:10,自引:1,他引:9       下载免费PDF全文
  相似文献   

17.
Sequence composition, organization, and evolution of the core Triticeae genome   总被引:13,自引:0,他引:13  
We investigated the composition and the basis of genome expansion in the core Triticeae genome using Aegilops tauschii, the D-genome donor of bread wheat. We sequenced an unfiltered genomic shotgun (trs) and a methylation-filtration (tmf) library of A. tauschii, and analyzed wheat expressed sequence tags (ESTs) to estimate the expression of genes and transposable elements (TEs). The sampled D-genome sequences consisted of 91.6% repetitive elements, 2.5% known genes, and 5.9% low-copy sequences of unknown function. TEs constituted 68.2% of the D-genome compared with 50% in maize and 14% in rice. The DNA transposons constituted 13% of the D-genome compared with 2% in maize. TEs were methylated unevenly within and among elements and families, and most were transcribed which contributed to genome expansion in the core Triticeae genome. The copy number of a majority of repeat families increased gradually following polyploidization. Certain TE families occupied discrete chromosome territories. Nested insertions and illegitimate recombination occurred extensively between the TE families, and a majority of the TEs contained internal deletions. The GC content varied significantly among the three sequence sets examined ranging from 42% in tmf to 46% in trs and 52% in the EST. Based on enrichment of genic sequences, methylation-filtration offers one option, although not as efficient as in maize, for isolating gene-rich regions from the large genome of wheat.  相似文献   

18.
Maize inbred line 77Ht2 contains agriculturally important genes and has been widely used in corn breeding in China. A bacterial artificial chromosome (BAC) library of 77Ht2 has been constructed in order to identify useful genes and to facilitate the study of the maize genome. The library contains 175104 clones with an average insert size of 57 kb and represents about 4 maize haploid genome equivalents. Characterization of the library showed less than 0.5% of clones to not contain large inserts. Significant contamination of chloroplast and mitochondria DNA was not detected. BAC clones (152 arrays) were stored in 96 microtiter plates, with each well containing 12 clones. This is the first maize BAC library constructed in China. It is well suited for map-based cloning of maize genes and genome physical mapping.  相似文献   

19.
20.
The architecture of grass genomes varies on multiple levels. Large long terminal repeat retrotransposon clusters occupy significant portions of the intergenic regions, and islands of protein-encoding genes are interspersed among the repeat clusters. Hence, advanced assembly techniques are required to obtain completely finished genomes as well as to investigate gene and transposable element distributions. To characterize the organization and distribution of repeat clusters and gene islands across large grass genomes, we present 961- and 594-kb contiguous sequence contigs associated with the rf1 (for restorer of fertility1) locus in the near-centromeric region of maize (Zea mays) chromosome 3. We present two methods for computational finishing of highly repetitive bacterial artificial chromosome clones that have proved successful to close all sequence gaps caused by transposable element insertions. Sixteen repeat clusters were observed, ranging in length from 23 to 155 kb. These repeat clusters are almost exclusively long terminal repeat retrotransposons, of which the paleontology of insertion varies throughout the cluster. Gene islands contain from one to four predicted genes, resulting in a gene density of one gene per 16 kb in gene islands and one gene per 111 kb over the entire sequenced region. The two sequence contigs, when compared with the rice (Oryza sativa) and sorghum (Sorghum bicolor) genomes, retain gene colinearity of 50% and 71%, respectively, and 70% and 100%, respectively, for high-confidence gene models. Collinear genes on single gene islands show that while most expansion of the maize genome has occurred in the repeat clusters, gene islands are not immune and have experienced growth in both intragene and intergene locations.Genome sequencing of the maize (Zea mays) genome is nearing completion (Bennetzen et al., 2001; Chandler and Brendel, 2002; Wessler, 2006); it is the largest and most difficult-to-assemble plant genome sequenced to date. Maize is an important economic, agricultural, industrial, and research crop; however, with a genome close to the size of the human genome (2.8 Gb) and its high percentage of repetitive elements, acquiring the maize genome seemed a daunting task. Approximately 67% of the genome is made up of transposable elements (TEs; Haberer et al., 2005; Kronmiller and Wise, 2008), increasing the difficulty of assembly (Rabinowicz and Bennetzen, 2006). Much exploratory work has gone into isolating and sequencing just the gene areas and ignoring the repetitive regions, both by methylation filtration (Rabinowicz et al., 1999; Palmer et al., 2003; Whitelaw et al., 2003) and high-C0t (Whitelaw et al., 2003; Yuan et al., 2003) systems, which have assisted researchers with selecting only genic regions to sequence. These methods have captured a majority of the maize genic sequence (Fu et al., 2005), but they still have the potential to miss important regions. The current genome-sequencing project aims to capture the entire gene set of maize, including regulatory regions. However, the current strategy will not provide a fully assembled genome but rather assembled bacterial artificial chromosome (BAC) contigs ordered and orientated to provide complete gene regions that are adjacent to potentially incomplete TE clusters.The landscape of the maize genome provides an interesting challenge for both sequencing and subsequent annotation. A high density of long terminal repeat (LTR) retrotransposons has had a direct effect on the genome size of many plant genomes, including maize (SanMiguel et al., 1996; Bennetzen et al., 2005; Hawkins et al., 2006; Piegu et al., 2006). Besides expanding genome size, LTR retrotransposons can have an impact on evolution of the species (Kidwell and Lisch, 2000). LTR retrotransposon insertions tend to form nested clusters (SanMiguel and Bennetzen, 1998), which are separated by small regions of several genes. Large nested repeat clusters consist of TE insertions inside TE sequences, expanding the repeat cluster and breaking up the sequence of the TEs found within, hindering repeat and gene annotation and increasing the difficulty of assembly. However, full sequence completion of the repetitive regions can be of great benefit to understanding the evolutionary history of the maize genome. LTR retrotransposons can provide an estimated time since insertion by calculating the divergence of their LTRs (Kimura, 1980; Ma and Bennetzen, 2004), and carefully sequenced assemblies of nested repeat clusters can help to illustrate their expansion, proliferation, and evolution across the genome (Kronmiller and Wise, 2008).Previous studies of large contiguous regions of maize have provided a general view of the landscape of the genome. Unfinished sequence totaling 7.8 Mb from chromosome 1 and 6.6 Mb from chromosome 9 shows a gene density of one gene per 33 and 27 kb, respectively (Bruggmann et al., 2006). BAC contigs ranging in size from 126 to 405 kb show a gene density of one gene per 19 kb and genes found in small groups between large repeat clusters (Brunner et al., 2005). Genome-wide analysis of maize BACs has painted a different picture: while gene density of 100 random BACs at one gene per 44 kb was similar to the above results, genes were not observed in tight clusters (Haberer et al., 2005). When investigating gene-specific areas of maize, this dichotomy of gene density is also seen. Analysis of gene-rich regions such as the 22-kd α-zein gene family on maize chromosome 4 reveals a high density of genes, with one gene observed per 10 kb over 346 kb (Song et al., 2001). The Adh1 locus on maize chromosome 1 contains two genes across 280 kb, or one gene per 140 kb. Perhaps the only message learned here is that the gene density across the maize genome varies to a great degree, and large contiguous sequenced regions can begin to capture the true diversity of maize chromosome architecture.In order to characterize large contiguous regions of maize sequence, we identified and sequenced two B73 BAC contigs from the centromeric region of chromosome 3. These contigs of 961 and 594 kb correspond to contigs 117 and 119, respectively, on maize WebFPC (Wei et al., 2007) and span regions associated with the rf1 (for restorer of fertility1) locus for Texas (T) cytoplasmic male sterility (cmsT; Duvick et al., 1961; Wise et al., 1996). As a foundation for the isolation of the Rf1 locus, four rf1 male-sterile mutants were recovered from a screen of 123,500 flowering plants (Wise et al., 1996). A 5.5-kb Mu1-hybridizing EcoRI restriction fragment was identified that cosegregated with the rf1-m3207 allele. Sequences from this fragment were hybridized to a Rf1 cDNA library, and probes designed from the identified cDNA, p6140-1 (Wise et al., 1999), were found to cosegregate with the rf1 locus in a recombinant population selected from over 10,000 progeny.Using probes designed off the 5.5-kb cosegregating restriction fragment and the p6140-1 cDNA, we have identified two BAC contigs spanning the rf1 locus. Sixteen BACs were sequenced to completion to provide high-quality finished sequence. Here, we present two methods for computational finishing of highly repetitive grass genomes, which were successfully utilized to close 11 TE-induced gaps. Sixteen nested repeat clusters were found, each spanning as much as 155 kb and containing a variety of LTR retrotransposon types and ages of insertion. Genes are found tightly clustered, showing a density rate of one gene per 16 kb within gene islands. Finally, comparative analysis with rice (Oryza sativa) and sorghum (Sorghum bicolor) shows that while many genes are retained across all three species, genes have both been lost and translocated across the genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号