首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The ultimate goal of genome research on the model flowering plant Arabidopsis thaliana is the identification of all of the genes and understanding their functions. A major step towards this goal, the genome sequencing project, is nearing completion; however, functional studies of newly discovered genes have not yet kept up to this pace. Recent progress in large-scale insertional mutagenesis opens new possibilities for functional genomics in Arabidopsis. The number of T-DNA and transposon insertion lines from different laboratories will soon represent insertions into most Arabidopsis genes. Vast resources of gene knockouts are becoming available that can be subjected to different types of reverse genetics screens to deduce the functions of the sequenced genes.  相似文献   

2.
In many cases, the unprecedented availability of data provided by high-throughput sequencing has shifted the bottleneck from a data availability issue to a data interpretation issue, thus delaying the promised breakthroughs in genetics and precision medicine, for what concerns Human genetics, and phenotype prediction to improve plant adaptation to climate change and resistance to bioagressors, for what concerns plant sciences. In this paper, we propose a novel Genome Interpretation paradigm, which aims at directly modeling the genotype-to-phenotype relationship, and we focus on A. thaliana since it is the best studied model organism in plant genetics. Our model, called Galiana, is the first end-to-end Neural Network (NN) approach following the genomes in/phenotypes out paradigm and it is trained to predict 288 real-valued Arabidopsis thaliana phenotypes from Whole Genome sequencing data. We show that 75 of these phenotypes are predicted with a Pearson correlation ≥0.4, and are mostly related to flowering traits. We show that our end-to-end NN approach achieves better performances and larger phenotype coverage than models predicting single phenotypes from the GWAS-derived known associated genes. Galiana is also fully interpretable, thanks to the Saliency Maps gradient-based approaches. We followed this interpretation approach to identify 36 novel genes that are likely to be associated with flowering traits, finding evidence for 6 of them in the existing literature.  相似文献   

3.
The most frequently used method to identify mutations induced by a commonly used mutagen, EMS (ethyl methane sulfonate), in Arabidopsis thaliana has been map-based cloning. The first step of this method is crossing a mutant with a plant of another accession as it requires polymorphisms between accessions for linkage analysis. Therefore, to perform the method routinely, it is greatly preferred to use accession combinations between which enough polymorphisms are already known. Further, it requires laborious examination of a large number of F? recombinants using many markers to detect each polymorphism. After linkage analysis narrows down the chromosomal region containing the causal mutation, sequencing candidate genes one by one within the region is necessary until the mutation is finally identified. Overall, this method is generally time-consuming and labor intensive, and it becomes harder when multiple loci are involved in phenotypes. A few recent reports showed that causal mutations induced by EMS could be identified by deep-sequencing technologies with less labor compared with the conventional method when mutants were generated in the Arabidopsis reference Columbia background whose genome organization is well known. Here we report that we succeeded in rapid identification of EMS-induced causal mutations in a non-reference accession background, whose whole genome sequence is not publicly available, using one round of whole genome sequencing. Moreover, in our case, we could monitor the causal locus and the transgenic reporter locus simultaneously, implying that this methodology could theoretically be applicable to analyzing even complex traits. We describe the pipeline of this methodology and discuss its characteristics.  相似文献   

4.
Gardening the genome: DNA methylation in Arabidopsis thaliana   总被引:13,自引:0,他引:13  
DNA methylation has two essential roles in plants and animals - defending the genome against transposons and regulating gene expression. Recent experiments in Arabidopsis thaliana have begun to address crucial questions about how DNA methylation is established and maintained. One cardinal insight has been the discovery that DNA methylation can be guided by small RNAs produced through RNA-interference pathways. Plants and mammals use a similar suite of DNA methyltransferases to propagate DNA methylation, but plants have also developed a glycosylase-based mechanism for removing DNA methylation, and there are hints that similar processes function in other organisms.  相似文献   

5.
Ultraconserved elements, sequences with 100% identity with no insertions or deletions between genomes, have been found in both vertebrate and invertebrate genomes; whether plant genomes contain ultraconserved elements, however, is unknown. We consequently compared the genomes of Arabidopsis thaliana and rice, which diverged about 200 million years ago, and identified 25 ultraconserved elements that are longer than 100 bp. Similar to those previously found, ultraconserved elements in plants tend to occur in clusters and locate at noncoding regions; nevertheless, they have many distinct features. For instance, the longest ultraconserved element between the 2 plant genomes is 1491 bp, much longer than the longest one (779 bp) between the human and rodent genomes. Some biological implications are discussed, but the functions of these plant ultraconserved elements and the reasons why they are practically frozen during the evolution of millions of years remain a mystery.  相似文献   

6.
7.
The proportion of non-tandem duplicated loci detected by DNA hybridization and the segregation of RFLPs using 90 independent randomly isolated cDNA probes was estimated by segregation analysis to be 17%. The 14 cDNA probes showing duplicate loci in progeny derived from a cross between Arabidopsis-thaliana ecotypes Columbia x Landsberg erecta detected an average of 3.6 loci per probe (ranging from 2 to 6). The 50 loci detected with these 14 probes were arranged on a genetic map of 587 cM and assigned to the five A. Thaliana chromosomes. An additional duplicated locus was detected in progeny from a cross between Landsberg erecta x Niederzenz. The majority of duplicated loci were on different chromosomes, and when linkage between duplicate locus pairs was detected, these loci were always separated by at least 15 cM. When partial nucleotide sequence data were compared with GENBANK databases, the identities of 2 cDNA clones which recognized duplicate unlinked sequences in the A. Thaliana genome were determined to encode a chlorophyll a/b-binding protein and a beta-tubulin. Of the 8 loci carrying beta-tubulin genes 6 were placed on the genetic map. These results imply that gene duplication has been an important factor in the evolution of the Arabidopsis genome.  相似文献   

8.
9.
Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value < or = 10(-30)) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5' to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5' noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks.  相似文献   

10.
The Arabidopsis thaliana genome sequence provides a catalogue of reference genes that can be used for comparative analysis of other species thereby facilitating map-based cloning in economically important crops. We made use of a coffee bacterial artificial chromosome (BAC) contig linked to the SH3 leaf rust resistance gene to assess microsynteny between coffee (Coffea arabica L.) and Arabidopsis. Microsynteny was revealed and the matching counterparts to C. arabica contigs were seen to be scattered throughout four different syntenic segments of Arabidopsis on chromosomes (Ath) I, III, IV and V. Coffee BAC filter hybridizations were performed using coffee putative conserved orthologous sequences to Arabidopsis predicted genes located on the different Arabidopsis syntenic regions. The coffee BAC contig related to the SH3 region was successfully consolidated and later on validated by fingerprinting. Furthermore, the anchoring markers appeared in same order on the coffee BAC contigs and in all Arabidopsis segments with the exception of a single inversion on AtIII and AtIV Arabidopsis segments. However, the SH3 coffee region appears to be closer to the ancestral genome segment (before the divergence of Arabidopsis and coffee) than any of the duplicated counterparts in the present-day Arabidopsis genome. The genome duplication events at the origin of its Arabidopsis counterparts occurred most probably after the separation (i.e. 94 million years ago) of Euasterid (Coffee) and Eurosid (Arabidopsis).  相似文献   

11.
Nearly 4 years after launching the International Rice Genome Sequencing Project (IRGSP), the rice genome sequence is almost completed. This is the second plant genome after Arabidopsis thaliana and one expect that it is more representative of other cereal genomes. Indeed, no more than 4 sequences have been independently reported as a result of a tough competition between economy, politics and media. The efficiency and impact of this way of managing a large scale project is questionable. This paper reports the various phases in sequencing rice genome as well as what we start to learn.  相似文献   

12.
Mestres-Ortega D  Meyer Y 《Gene》1999,240(2):307-316
Screening of cDNA libraries at low stringency and complete sequencing of EST clones with homology to thioredoxins allowed us to characterize five new prokaryotic type Arabidopsis thaliana thioredoxins. All present N-terminal extensions with characteristics of transit peptides. Four are clustered in a phylogenetic tree with the chloroplastic thioredoxin m from red and green algae and higher plants, and their transit peptides have typical characteristics of chloroplastic transit peptides. One is clearly divergent and defines a new prokaryotic thioredoxin type that we have named thioredoxin x. Its transit peptide sequence presents characteristics of both chloroplastic and mitochondrial transit peptides. The five corresponding genes are expressed at different levels, but mostly in green tissues and in in-vitro cultivated cells.  相似文献   

13.
14.
Small nucleolar RNAs (snoRNAs) are noncoding RNAs that direct 2′-O-methylation or pseudouridylation on ribosomal RNAs or spliceosomal small nuclear RNAs. These modifications are needed to modulate the activity of ribosomes and spliceosomes. A comprehensive repertoire of snoRNAs is needed to expand the knowledge of these modifications. The sequences corresponding to snoRNAs in 18–26-nt small RNA sequencing data have been rarely explored and remain as a hidden treasure for snoRNA annotation. Here, we showed the enrichment of small RNAs at Arabidopsis snoRNA termini and developed a computational approach to identify snoRNAs on the basis of this characteristic. The approach successfully uncovered the full-length sequences of 144 known Arabidopsis snoRNA genes, including some snoRNAs with improved 5′- or 3′-end annotation. In addition, we identified 27 and 17 candidates for novel box C/D and box H/ACA snoRNAs, respectively. Northern blot analysis and sequencing data from parallel analysis of RNA ends confirmed the expression and the termini of the newly predicted snoRNAs. Our study especially expanded on the current knowledge of box H/ACA snoRNAs and snoRNA species targeting snRNAs. In this study, we demonstrated that the use of small RNA sequencing data can increase the complexity and the accuracy of snoRNA annotation.  相似文献   

15.
16.
Various physiological imbalances lead to reactive oxygen species (ROS) overproduction and/or increases in lipoxygenase (LOX) activities, both events ending in lipid peroxidation of polyunsaturated fatty acids (PUFAs). Besides the quantification of such a process, the development of tools is necessary in order to allow the identification of the primary cause of its development and localization. A biochemical method assessing 9 LOX, 13 LOX and ROS-mediated peroxidation of membrane-bound and free PUFAs has been improved. The assay is based on the analysis of hydroxy fatty acids derived from PUFA hydroperoxides by both the straight and chiral phase high-performance liquid chromatography. Besides the upstream products of peroxidation of the 18:2 and 18:3 PUFAs, products coming from the 16:3 were characterized and their steady-state level quantified. Moreover, the observation that the relative amounts of the ROS-mediated peroxidation isomers of 18:3 were constant in leaves allowed us to circumvent the chiral analyses for the discrimination and quantification of 9 LOX, 13 LOX and ROS-mediated processes in routine experiments. The methodology has been successfully applied to decipher lipid peroxidation in Arabidopsis leaves submitted to biotic and abiotic stresses. We provide evidence of the relative timing of enzymatic and non-enzymatic lipid peroxidation processes. The 13 LOX pathway is activated early whatever the nature of the stress, leading to the peroxidation of chloroplast lipids. Under cadmium stress, the 9 LOX pathway added to the 13 LOX one. ROS-mediated peroxidation was mainly driven by light and always appeared as a late process.  相似文献   

17.
FtsH(Filamentation temperature-sensitive H)是一种广泛存在于原核生物和真核生物中的ATP依赖型金属蛋白酶。同源性分析表明,在拟南芥和水稻基因组中分别有12个和9个ftsH基因。ftsH基因在染色体上的分布有明显的偏爱性,如拟南芥的1、2、5号染色体和水稻的1、5号染色体。亚细胞定位分析表明,所有FtsH蛋白均定位于叶绿体或线粒体中。系统进化分析表明,21个FtsH蛋白成员可分为8个类群,其中AtFtsH12在水稻中没有发现种间同源物。每个类群成员的蛋白序列高度保守,种内同源物显示出大于80%的相似性,而种间同源物的相似性也大于70%。类群内的同源基因并非平行进化产生的,拟南芥基因组中进化出AtftsH1/5、AtftsH2/8、AtftsH3/10和AtftsH7/9共4个同源基因对,而水稻基因组中只有OsftsH3/8和OsftsH4/5两个同源基因对。每一类群中的成员在基因外显子-内含子边界分布上表现出高度保守性,在蛋白功能结构域的可变残基上具有偏爱性,而内含子在碱基组成和序列长度上表现出广泛的变异。拟南芥和水稻ftsH基因家族的比较分析为其他物种ftsH基因的特...  相似文献   

18.
19.
In Arabidopsis the NRT2.1 gene encodes a main component of the root high-affinity nitrate uptake system (HATS). Its regulation has been thoroughly studied showing a strong correlation between NRT2.1 expression and HATS activity. Despite its central role in plant nutrition, nothing is known concerning localization and regulation of NRT2.1 at the protein level. By combining a green fluorescent protein fusion strategy and an immunological approach, we show that NRT2.1 is mainly localized in the plasma membrane of root cortical and epidermal cells, and that several forms of the protein seems to co-exist in cell membranes (the monomer and at least one higher molecular weight complex). The monomer is the most abundant form of NRT2.1, and seems to be the one involved in NO(3)(-) transport. It strictly requires the NAR2.1 protein to be expressed and addressed at the plasma membrane. No rapid changes in NRT2.1 abundance were observed in response to light, sucrose, or nitrogen treatments that strongly affect both NRT2.1 mRNA level and HATS activity. This suggests the occurrence of post-translational regulatory mechanisms. One such mechanism could correspond to the cleavage of NRT2.1 C terminus, which results in the presence of both intact and truncated proteins in the plasma membrane.  相似文献   

20.
I A P Parkin  D J Lydiate  M Trick 《Génome》2002,45(2):356-366
This study describes a comprehensive comparison of chromosome 5 of the model crucifer Arabidopsis with the genome of its amphidiploid crop relative Brassica napus and introduces the use of in silico sequence homology to identify conserved loci between the two species. A region of chromosome 5, spanning 8 Mb, was found in six highly conserved copies in the B. napus genome. A single inversion appeared to be the predominant rearrangement that had separated the two lineages leading to the formation of Arabidopsis chromosome 5 and its homologues in B. napus. The observed results could be explained by the fusion of three ancestral genomes with strong similarities to modern-day Arabidopsis to generate the constituent diploid genomes of B. napus. This supports the hypothesis that the diploid Brassica genomes evolved from a common hexaploid ancestor. Alignment of the genetic linkage map of B. napus with the genomic sequence of Arabidopsis indicated that for specific regions a genetic distance of 1 cM in B. napus was equivalent to 285 Kb of Arabidopsis DNA sequence. This analysis strongly supports the application of Arabidopsis as a tool in marker development, map-based gene cloning, and candidate gene identification for the larger genomes of Brassica crop species.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号