首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We investigated the compositional and structural differences in sequences derived from different fractions of wheat genomic DNA obtained using methylation filtration and Cot fractionation. Comparative analysis of these sequences revealed large compositional and structural variations in terms of GC content, different structural elements including repeat sequences (e.g., transposable elements and simple sequence repeats),protein coding genes, and non-coding RNA genes. A correlation between methylation status [determined on the basis of selective inclusion/exclusion in methylation-filtered (MF) library]of different repeat elements and expression level was observed. The expression levels were determined by comparing MF sequences with expressed sequence tags (ESTs) available in the public domain. Only a limited overlap among MF,high Cot (HC), and ESTs was observed, suggesting that these sequences may largely either represent the low-copy non-transcribed sequences or include genes with low expression levels. Thus, these results indicated a need to study MF and HC sequences along with ESTs to fully appreciate complexity of wheat gene space.  相似文献   

2.
Germination of wheat seeds results in small changes of the GC content of total DNA (from 47.5 to 49.0 mole %): at the same time the amount of 5-methylcytosine in seeds 10 hours after wetting and at day 3 of germination significantly decrease (from 6.0 to 5.4 and 5.2 mole %, respectively). The wheat genome is methylated in non-uniform fashion: moderute repeats (less than a hundred copies, interval Cot = 0,12 . 10(2)-420) possess the maximal amount of 5-methylcytosine, while the unique sequences (Cot greater than 420) have the lowest 5-methylcytosine content. Methylation of highly reiterated sequences (Cot less than 0,8 . 10(-2) is similar to that of the total DNA. At day 3 of germination the amount of 5-methycytosine in all DNA fractions is lower as compared with these fractions isolated from DNA of dormant seeds. This is probably due to (1) diminution in the amount of reiterated sequences with high 5-methylcytosine content and (2) to lowering of DNA methylation level in germinating seeds. Changes in DNA methylation may be associated with the regulation of gene activity in the differentiating plant cells at various stages of ontogenesis.  相似文献   

3.
The fractions of unique (Cot less than 405), moderately (Cot=0.13--405) and highly reiterated (Cot less than 0--0.13) sequences were isolated from DNA of wheat seeds and 3 day old seedlings, and GC content, amount of 5-methylcytosine and its distribution among various pyrimidine isostichs in the fractions isolated were studied. Different in Cot value DNA fractions from seeds or from seedlings are similar in GC content and in all other characteristics studied. Seed DNA differs from DNA of seedlings in the content of pyrimidine isostichs from the respective fractions of reiterated sequences. Pronounced differences in the amount of pyridmidine clusters with various base composition in the corresponding fractions of DNA from seeds and seedlings were found. These differences in the frequencies of respective pyrimidine clusters from DNA of seeds and seedlings may be considered as being a result of changes in the molecular population of wheat DNA on germination. The seed and seedling DNA differ significantly in the 5-methylcytosine content in the respective pyrimidine isostichs isolated from unique sequences. In the seedling DNA some other nucleotide sequences are to be methylated as compared to DNA of dormat seeds. Thus, on germination some changes occur in DNA methylation as well as in the genome organization.  相似文献   

4.
曾少华  刘迪  王瑛 《遗传》2009,31(8):799-808
高等植物的基因组大小差异十分巨大, 在大基因组植物的基因组中, 各种重复序列占据了基因组中相当大一部分, 而低拷贝或单拷贝的基因序列仅占了很少一部分。对于大基因组物种而言, 大量的重复序列给基因组的研究工作带来很大困难, 使得大规模获得基因信息成为一项很有挑战性的工作。目前, 在基因组范围内富集基因的方法有cDNA文库、甲基化过滤文库、高Cot值文库、转座子标签富集法等。文章综述了这几种方法的技术原理和特性, 结合近年来国内外运用甲基化过滤技术的研究进展, 探讨了根据不同研究材料和研究目标, 如何高效选择适合的方法或者方法的组合。  相似文献   

5.
Although the nuclear genome of banana (Musa spp.) is relatively small (1C approximately 610 Mbp for M. acuminata), the results obtained from other sequenced genomes suggest that more than half of the banana genome may be composed of repetitive and non-coding DNA sequences. Knowledge of repetitive DNA can facilitate mapping of important traits, phylogenetic studies, BAC-based physical mapping, and genome sequencing/annotation. However, only a few repetitive DNA sequences have been characterized in banana. In this work, we used DNA reassociation kinetics to isolate the highly repeated fraction of the banana genome (M. acuminata 'Calcutta 4'). Two libraries, one prepared from Cot 相似文献   

6.
Sequence composition, organization, and evolution of the core Triticeae genome   总被引:13,自引:0,他引:13  
We investigated the composition and the basis of genome expansion in the core Triticeae genome using Aegilops tauschii, the D-genome donor of bread wheat. We sequenced an unfiltered genomic shotgun (trs) and a methylation-filtration (tmf) library of A. tauschii, and analyzed wheat expressed sequence tags (ESTs) to estimate the expression of genes and transposable elements (TEs). The sampled D-genome sequences consisted of 91.6% repetitive elements, 2.5% known genes, and 5.9% low-copy sequences of unknown function. TEs constituted 68.2% of the D-genome compared with 50% in maize and 14% in rice. The DNA transposons constituted 13% of the D-genome compared with 2% in maize. TEs were methylated unevenly within and among elements and families, and most were transcribed which contributed to genome expansion in the core Triticeae genome. The copy number of a majority of repeat families increased gradually following polyploidization. Certain TE families occupied discrete chromosome territories. Nested insertions and illegitimate recombination occurred extensively between the TE families, and a majority of the TEs contained internal deletions. The GC content varied significantly among the three sequence sets examined ranging from 42% in tmf to 46% in trs and 52% in the EST. Based on enrichment of genic sequences, methylation-filtration offers one option, although not as efficient as in maize, for isolating gene-rich regions from the large genome of wheat.  相似文献   

7.
8.
Low-molecular-weight glutenin subunits (LMW-GSs) are encoded by a multi-gene family and are essential for determining the quality of wheat flour products, such as bread and noodles. However, the exact role or contribution of individual LMW-GS genes to wheat quality remains unclear. This is, at least in part, due to the difficulty in characterizing complete sequences of all LMW-GS gene family members in bread wheat. To identify full-length LMW-GS genes, a polymerase chain reaction (PCR)-based method was established, consisting of newly designed conserved primers and the previously developed LMW-GS gene molecular marker system. Using the PCR-based method, 17 LMW-GS genes were identified and characterized in Xiaoyan 54, of which 12 contained full-length sequences. Sequence alignments showed that 13 LMW-GS genes were identical to those found in Xiaoyan 54 using the genomic DNA library screening, and the other four full-length LMW-GS genes were first isolated from Xiaoyan 54. In Chinese Spring, 16 unique LMW-GS genes were isolated, and 13 of them contained full-length coding sequences. Additionally, 16 and 17 LMW-GS genes in Dongnong 101 and Lvhan 328 (chosen from the micro-core collections of Chinese germplasm), respectively, were also identified. Sequence alignments revealed that at least 15 LMW-GS genes were common in the four wheat varieties, and allelic variants of each gene shared high sequence identities (>95%) but exhibited length polymorphism in repetitive regions. This study provides a PCR-based method for efficiently identifying LMW-GS genes in bread wheat, which will improve the characterization of complex members of the LMW-GS gene family and facilitate the understanding of their contributions to wheat quality.  相似文献   

9.
Brachypodium is well suited as a model system for temperate grasses because of its compact genome and a range of biological features. In an effort to develop resources for genome research in this emerging model species, we constructed 2 bacterial artificial chromosome (BAC) libraries from an inbred diploid Brachypodium distachyon line, Bd21, using restriction enzymes HindIII and BamHI. A total of 73,728 clones (36,864 per BAC library) were picked and arrayed in 192,384-well plates. The average insert size for the BamHI and HindIII libraries is estimated to be 100 and 105 kb, respectively, and inserts of chloroplast origin account for 4.4% and 2.4%, respectively. The libraries individually represent 9.4- and 9.9-fold haploid genome equivalents with combined 19.3-fold genome coverage, based on a genome size of 355 Mb reported for the diploid Brachypodium, implying a 99.99% probability that any given specific sequence will be present in each library. Hybridization of the libraries with 8 starch biosynthesis genes was used to empirically evaluate this theoretical genome coverage; the frequency at which these genes were present in the library clones gave an estimated coverage of 11.6- and 19.6-fold genome equivalents. To obtain a first view of the sequence composition of the Brachypodium genome, 2185 BAC end sequences (BES) representing 1.3 Mb of random genomic sequence were compared with the NCBI GenBank database and the GIRI repeat database. Using a cutoff expectation value of E<10-10, only 3.3% of the BESs showed similarity to repetitive sequences in the existing database, whereas 40.0% had matches to the sequences in the EST database, suggesting that a considerable portion of the Brachypodium genome is likely transcribed. When the BESs were compared with individual EST databases, more matches hit wheat than maize, although their EST collections are of a similar size, further supporting the close relationship between Brachypodium and the Triticeae. Moreover, 122 BESs have significant matches to wheat ESTs mapped to individual chromosome bin positions. These BACs represent colinear regions containing the mapped wheat ESTs and would be useful in identifying additional markers for specific wheat chromosome regions.  相似文献   

10.
The role of DNAs in the nuclear binding of dexamethasone-receptor complexes (DRC) was studied. The cytosolic receptors from rat liver have a sedimentation coefficient of about 7S, the Stock's radius--of about 50 A and possess a high affinity to dexamethasone (Kas = 2,6 X 10(8) M-1). Their capacity is 3 X 10(-13) and 5.5--7.0 X 10(-12) mole of dexamethasone per mg cytosolic protein and mg DNA, respectively. DRC has the ability to bind to the nuclei of rat liver. DRC binding to nuclei is increased approximately 3-fold by temperature activation of cytosol. The nuclear acceptor sites are saturated at the level of 16.2 pmoles of bound DRC per mg nuclear DNA. Free DNA has the ability to compete with nuclei for binding with DRC. Temperature-activated DRC can bind both with homo- and heterologous DNAs. Secondary DRC-DNA complexes were isolated by means of gel filtration on Sepharose 4B. Thermal denaturation of DNA decreases its ability to bind DRC approximately 2-fold. DNAs of a similar nucleotide composition, i.e. DNA from rat liver (GC = 43 mole%) and DNA from Photobacterium belozerskii (GC = 44 mole%), have a close DRC-binding ability. At the same time, these DNAs bind about 1.5-fold less DRC, as compared to DNA from Pseudomonas aeruginosa (GC = 67 mole%) and about 1.5-fold more, than does DNA from T2 phage (GC = 35 mole%). Thus the positive correlation between the GC composition of DNA and its DRC-binding ability was established. Unique sequences (Cot greater than 600) bind several times less DRC than the reiterated sequences (also denaturated) (Cot = O--600) of rate liver DNA. Thus, DNA can be considered as a nuclear acceptor of DRC. It is assumed, that DRC is able to recognise in DNA certain short GC-rich sequences, distributed in the rate genome in a non-random fashion.  相似文献   

11.
Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene‐enrichment strategy, we have compared assemblies using methyl‐filtered (MF) and unfiltered (UF) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA‐seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single‐nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop.  相似文献   

12.
High-Cot sequence analysis of the maize genome   总被引:10,自引:0,他引:10  
Higher eukaryotic genomes, including those from plants, contain large amounts of repetitive DNA that complicate genome analysis. We have developed a technique based on DNA renaturation which normalizes repetitive DNA, and thereby allows a more efficient outcome for full genome shotgun sequencing. The data indicate that sequencing the unrenatured outcome of a Cot experiment, otherwise known as High-Cot DNA, enriches genic sequences by more than fourfold in maize, from 5% for a random library to more than 20% for a High-Cot library. Using this approach, we predict that gene discovery would be greater than 95% and that the number of sequencing runs required to sequence the full gene space in maize would be at least fourfold lower than that required for full-genome shotgun sequencing.  相似文献   

13.
14.
Mapping‐by‐sequencing analyses have largely required a complete reference sequence and employed whole genome re‐sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re‐sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early‐flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene‐rich regions of hexaploid bread wheat to design a 110‐Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo‐chromosomes derived from the capture probe target sequence, with a long‐range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval.  相似文献   

15.
Genomic resources have recently been developed for a number of species of Fagaceae, with the purpose of identifying the genetic factors underlying the adaptation of these long-lived, biologically predominant, commercially and ecologically important species to their environment. The sequencing of genomes of the size of the oak genome (740 Mb/C) is now becoming both possible and affordable due to breakthroughs in sequencing technology. However, an understanding of the composition and structure of the oak genome is required before launching a sequencing initiative. We constructed random (Rd) and hypomethylated (Hp) genomic libraries for pedunculate oak (Quercus robur) and carried out a sample sequencing of 2.33 and 2.36 Mb of shotgun DNA from the Rd and Hp libraries, respectively, to provide a first insight into the repetitive element and gene content of the oak genome. We found striking similarities between Rd sequences and previously analyzed BAC end sequences of pedunculate oak, with a similar percentage of known repeat elements (5.56%), an almost identical simple sequence repeat density (i.e., 29 SSRs per 100 kb), an identical profile of SSR motifs (in descending order of frequency—dinucleotide, pentanucleotide, trinucleotide, tetranucleotide, and hexanucleotide motifs). Conversely, the Hp fraction was, as expected, enriched in nuclear genes (2.44-fold enrichment). This enrichment was associated with a lower frequency of retrotransposons than for Rd sequences. We also identified twice as many SSR motifs in the Rd library as in the Hp library. This work provides useful information before opening a new chapter in oak genome sequencing.  相似文献   

16.
The genome of bread wheat is hexaploid and contains 1.6 x 10 10 bp of DNA, of which more than 80% is repetitive sequences. Its size and complexity represent a challenge for the isolation of agronomically important genes, for which we frequently know only their position on the genetic map. Recently, new genomic resources and databases from genome projects have simplified the molecular analysis of the wheat genome. The first genes to be isolated from wheat by map-based cloning include three resistance genes against the fungal diseases powdery mildew and leaf rust. In this review, we will describe the approaches and resources that have contributed to this progress, and discuss genomic strategies that will simplify positional cloning in wheat in the near future.  相似文献   

17.
ABSTRACT: BACKGROUND: Polyploidization is considered one of the main mechanisms of plant genome evolution. The presence of multiple copies of the same gene reduces selection pressure and permits sub-functionalization and neo-functionalization leading to plant diversification, adaptation and speciation. In bread wheat, polyploidization and the prevalence of transposable elements resulted in massive gene duplication and movement. As a result, the number of genes which are non-collinear to genomes of related species seems markedly increased in wheat. RESULTS: We used new-generation sequencing (NGS) to generate sequence of a Mb-sized region from wheat chromosome arm 3DS. Sequence assembly of 24 BAC clones resulted in two scaffolds of 1,264,820 and 333,768 bases. The sequence was annotated and compared to the homoeologous region on wheat chromosome 3B and orthologous loci of Brachypodium distachyon and rice. Among 39 coding sequences in the 3DS scaffolds, 32 have a homoeolog on chromosome 3B. In contrast, only fifteen and fourteen orthologs were identified in the corresponding regions in rice and Brachypodium, respectively. Interestingly, five pseudogenes were identified among the non-collinear coding sequences at the 3B locus, while none was found at the 3DS locus. CONCLUSION: Direct comparison of two Mb-sized regions of the B and D genomes of bread wheat revealed similar rates of non-collinear gene insertion in both genomes with a majority of gene duplications occurring before their divergence. Relatively low proportion of pseudogenes was identified among non-collinear coding sequences. Our data suggest that the pseudogenes did not originate from insertion of non-functional copies, but were formed later during the evolution of hexaploid wheat. Some evidence was found for gene erosion along the B genome locus.  相似文献   

18.
The high-quality rice genome sequence is serving as a reference for comparative genome analysis in crop plants, especially cereals. However, early comparisons with bread wheat showed complex patterns of conserved synteny (gene content) and colinearity (gene order). Here, we show the presence of ancient duplicated segments in the progenitor of wheat, which were first identified in the rice genome. We also show that single-copy (SC) rice genes, those representing unique matches with wheat expressed sequence tag (EST) unigene contigs in the whole rice genome, show more than twice the proportion of genes mapping to syntenic wheat chromosome as compared to the multicopy (MC) or duplicated rice genes. While 58.7% of the 1,244 mapped SC rice genes were located in single syntenic wheat chromosome groups, the remaining 41.3% were distributed randomly to the other six non-syntenic wheat groups. This could only be explained by a background dispersal of genes in the genome through transposition or other unknown mechanism. The breakdown of rice–wheat synteny due to such transpositions was much greater near the wheat centromeres. Furthermore, the SC rice genes revealed a conserved primordial gene order that gives clues to the origin of rice and wheat chromosomes from a common ancestor through polyploidy, aneuploidy, centromeric fusions, and translocations. Apart from the bin-mapped wheat EST contigs, we also compared 56,298 predicted rice genes with 39,813 wheat EST contigs assembled from 409,765 EST sequences and identified 7,241 SC rice gene homologs of wheat. Based on the conserved colinearity of 1,063 mapped SC rice genes across the bins of individual wheat chromosomes, we predicted the wheat bin location of 6,178 unmapped SC rice gene homologs and validated the location of 213 of these in the telomeric bins of 21 wheat chromosomes with 35.4% initial success. This opens up the possibility of directed mapping of a large number of conserved SC rice gene homologs in wheat. Overall, only 46.4% of these SC genes code for proteins with known functional domains; the remaining 53.6% have unknown function, and hence, represent an important, but yet, under explored category of genes. Electronic supplementary material Supplementary material is available for this article at and is accessible for authorized users.  相似文献   

19.
M Turner  Y Mukai  P Leroy  B Charef  R Appels  S Rahman 《Génome》1999,42(6):1242-1250
The grain softness proteins or friabilins are known to be composed of three main components: puroindoline a, puroindoline b, and GSP-1. cDNAs for GSP-1 have previously been mapped to group-5 chromosomes and their location on chromosome 5D is closely linked to the grain hardness (Ha) locus of hexaploid wheat. A genomic DNA clone containing the GSP-1 gene (wGSP1-A1) from hexaploid wheat has been identified by fluorescent in situ hybridization as having originated from the distal end of the short arm of chromosome 5A. A genomic clone containing the gene (wGSP1-D1) was also isolated from Aegilops tauschii, the donor of the D genome to bread wheat. There are no introns in the GSP-1 genes, and there is high sequence identity between wGSP1-A1 and wGSP1-D1 up to 1 kb 5' and 300 bp 3' to wGSP1-D1. However, regions further upstream and downstream of wGSP1-D1 share no significant sequence identity to corresponding sequences in wGSP1-A1. These regions therefore identified potentially valuable sequences for tracing the Ha locus through assaying polymorphic DNA sequences. The sequence from 300 to 500 bp 3' to wGSP1-D1 (wGSP1-D13) was mapped to the Ha locus in a mapping population. wGSP1-D13 was also tightly linked to genes for puroindoline a and puroindoline b which have been previously mapped to be at the Ha locus. In addition wGSP1-D13 was used to detect RFLPs between near isogenic soft and hard Falcon lines and in a random selection of soft and hard wheats.  相似文献   

20.
Totipotent cDNA libraries representative of all the potentially expressed sequences in a genome would be of great benefit to gene expression studies. Here, we report on an innovative method for creating such a library for durum wheat (Triticum turgidum L. var. durum) and its application for gene discovery. The use of suitable quantities of 5-azacytidine during the germination phase induced the demethylation of total DNA, and the resulting seedlings potentially express all of the genes present in the genome. A new wheat microarray consisting of 4925 unigenes was developed from the totipotent cDNA library and used to screen for genes that may contribute to differences in the disease resistance of two near-isogenic lines, the durum wheat cultivar Latino and the line 5BIL-42, which are respectively susceptible and resistant to powdery mildew. Fluorescently labeled cDNA was prepared from the RNA of seedlings of the two near-isogenic wheat lines after infection with a single powdery mildew isolate under controlled conditions in the greenhouse. Hybridization to the microarray identified six genes that were differently expressed in the two lines. Four of the sequences could be assigned putative functions based on their similarity to known genes in public databases. Physical mapping of the six genes localized them to two regions of the genome: the centromeric region of chromosome 5B, where the Pm36 resistance gene was previously localized, and chromosome 6B.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号