Similar literature
1.
2.
3.
4.
5.
6.
7.
8.

Background

Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However, a more direct strategy is desired for the increasing number of sequenced mammalian genomes of which some, such as the pig, are relevant as disease models and production animals.

Results

We present a comprehensive annotation of structured RNAs in the pig genome. Combining sequence and structure similarity search as well as class-specific methods, we obtained a conservative set with a total of 3,391 structured RNA loci, of which 1,011 and 2,314, respectively, hold strong sequence and structure similarity to structured RNAs in existing databases. The RNA loci cover 139 cis-regulatory element loci, 58 lncRNA loci, 11 conflicts of annotation, and 3,183 ncRNA genes. The ncRNA genes comprise 359 miRNAs, 8 ribozymes, 185 rRNAs, 638 snoRNAs, 1,030 snRNAs, 810 tRNAs and 153 ncRNA genes not belonging to the aforementioned classes. When running the pipeline on a locally shuffled version of the genome, we obtained no matches at the highest confidence level. Additional analysis of RNA-seq data from a pooled library from 10 different pig tissues added another 165 miRNA loci, yielding an overall annotation of 3,556 structured RNA loci. This annotation represents our best effort at making an automated annotation. To further enhance the reliability, 571 of the 3,556 structured RNAs were manually curated by methods depending on the RNA class, while 1,581 were declared as pseudogenes. We further created a multiple alignment of pig against 20 representative vertebrates, from which RNAz predicted 83,859 de novo RNA loci with conserved RNA structures. Of the RNAz predictions, 528 overlapped with the homology-based annotation or novel miRNAs. We further present a substantial synteny analysis which includes 1,004 lineage-specific de novo RNA loci and 4 ncRNA loci in the known annotation specific for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog).

Conclusions

We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70 and the complete annotation is available at http://rth.dk/resources/rnannotator/susscr102/version1.02.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-459) contains supplementary material, which is available to authorized users.
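
As a quick consistency check on the locus counts quoted in the Results of this abstract, the class breakdowns can be summed and compared with the reported totals. This is only a sketch: the dictionary keys and variable names are illustrative labels, not identifiers from the authors' pipeline, and the numbers are copied from the abstract text.

```python
# Locus counts as quoted in the abstract's Results section.
locus_classes = {
    "cis-regulatory elements": 139,
    "lncRNA": 58,
    "annotation conflicts": 11,
    "ncRNA genes": 3183,
}

# Breakdown of the ncRNA genes by class, as quoted in the abstract.
ncrna_genes = {
    "miRNA": 359,
    "ribozyme": 8,
    "rRNA": 185,
    "snoRNA": 638,
    "snRNA": 1030,
    "tRNA": 810,
    "other": 153,
}

novel_mirna_loci = 165  # miRNA loci added from the RNA-seq analysis

conservative_total = sum(locus_classes.values())
overall_total = conservative_total + novel_mirna_loci

# The class breakdown should add up to the quoted ncRNA gene count,
# and the totals should match the figures given in the text.
assert sum(ncrna_genes.values()) == locus_classes["ncRNA genes"]
assert conservative_total == 3391
assert overall_total == 3556
print(conservative_total, overall_total)
```

The per-class counts add up to the 3,183 ncRNA genes, the four locus categories add up to the conservative set of 3,391, and adding the 165 RNA-seq-derived miRNA loci gives the overall 3,556.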

9.

Background

Previous studies in Drosophila and mammals have revealed levels of long non-coding RNA (lncRNA) sequence conservation that are intermediate between neutrally evolving and protein-coding sequence. These analyses compared conservation between species that diverged up to 75 million years ago. However, analysis of sequence polymorphisms within a species' population can provide an understanding of essentially contemporaneous selective constraints that are acting on lncRNAs and can quantify the deleterious effect of mutations occurring within these loci.

Results

We took advantage of polymorphisms derived from the genome sequences of 163 Drosophila melanogaster strains and 174 human individuals to calculate the distribution of fitness effects of single nucleotide polymorphisms (SNPs) occurring within intergenic lncRNAs and compared this to distributions for SNPs present within putatively neutral or protein-coding sequences. Our observations show that in D. melanogaster there is a significant excess of rare variants within intergenic lncRNAs relative to neutrally evolving sequences, whereas selection on human intergenic lncRNAs appears to be effectively neutral. Approximately 30% of mutations within these fruit fly lncRNAs are estimated to be weakly deleterious.

Conclusions

These contrasting results can be attributed to the large difference in effective population sizes between the two species. Our results suggest that while the sequences of lncRNAs will be well conserved across insect species, such loci in mammals will accumulate greater proportions of deleterious changes through genetic drift.

10.

Background

Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown.

Results

To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains.

Conclusion

HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.

11.

Background

Epigenetic modifications, such as cytosine methylation, are inherited in plant species and may occur in response to biotic or abiotic stress, affecting gene expression without changing genome sequence. Laguncularia racemosa, a mangrove species, occurs in naturally contrasting habitats where it is subjected daily to salinity and nutrient variations leading to morphological differences. This work aims at unraveling how CpG-methylation variation is distributed among individuals from two nearby habitats, at a riverside (RS) or near a salt marsh (SM), with different environmental pressures and how this variation is correlated with the observed morphological variation.

Principal Findings

Significant differences were observed in morphological traits such as tree height, tree diameter, leaf width and leaf area between plants from RS and SM locations, resulting in smaller plants and smaller leaf size in SM plants. Methyl-Sensitive Amplified Polymorphism (MSAP) was used to assess genetic and epigenetic (CpG-methylation) variation in L. racemosa genomes from these populations. SM plants were hypomethylated (14.6% of loci had methylated samples) in comparison to RS (32.1% of loci had methylated samples). Within-population diversity was significantly greater for epigenetic than genetic data in both locations, but SM also had less epigenetic diversity than RS. Frequency-based (GST) and multivariate (βST) methods that estimate population structure showed significantly greater differentiation among locations for epigenetic than genetic data. Co-Inertia analysis, exploring jointly the genetic and epigenetic data, showed that individuals with similar genetic profiles presented divergent epigenetic profiles that were characteristic of the population in a particular environment, suggesting that CpG-methylation changes may be associated with environmental heterogeneity.

Conclusions

In spite of significant morphological dissimilarities, individuals of L. racemosa from salt marsh and riverside presented little genetic but abundant DNA methylation differentiation, suggesting that epigenetic variation in natural plant populations has an important role in helping individuals to cope with different environments.

12.
13.

Background

Non-coding RNAs (ncRNAs) have important functional roles in the cell: for example, they regulate gene expression by establishing stable joint structures with target mRNAs via complementary sequence motifs. Sequence motifs are also important determinants of the structure of ncRNAs. Although ncRNAs are abundant, discovering novel ncRNAs in genome sequences has proven to be a hard task; in particular, past attempts at ab initio ncRNA search have mostly failed, with the exception of tools that can identify microRNAs.

Methodology/Principal Findings

We present a very general ab initio ncRNA gene finder that exploits differential distributions of sequence motifs between ncRNAs and background genome sequences.

Conclusions/Significance

Our method, once trained on a set of ncRNAs from a given species, can be applied to the genome sequences of other organisms to find not only ncRNAs homologous to those in the training set but also others that potentially belong to novel (and perhaps unknown) ncRNA families. Availability: http://compbio.cs.sfu.ca/taverna/smyrna
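
The abstract above does not spell out how the differential motif distributions are used, so the following is only a minimal sketch of the general idea and not the authors' method. It assumes, purely for illustration, that motifs are fixed-length k-mers and that a candidate sequence is scored by its average log-odds of k-mer frequency in known ncRNAs versus background sequence. The function names `kmer_counts`, `log_odds_table` and `score` are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def kmer_counts(seqs, k):
    """Count every overlapping k-mer in a list of sequences (U treated as T)."""
    counts = Counter()
    for s in seqs:
        s = s.upper().replace("U", "T")
        for i in range(len(s) - k + 1):
            counts[s[i:i + k]] += 1
    return counts


def log_odds_table(ncrna_seqs, background_seqs, k=3, pseudocount=1.0):
    """Log-odds of each k-mer's frequency in ncRNAs versus background."""
    pos = kmer_counts(ncrna_seqs, k)
    bg = kmer_counts(background_seqs, k)
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Pseudocounts keep unseen k-mers from producing log(0) or division by zero.
    pos_total = sum(pos[m] for m in kmers) + pseudocount * len(kmers)
    bg_total = sum(bg[m] for m in kmers) + pseudocount * len(kmers)
    return {
        m: math.log(((pos[m] + pseudocount) / pos_total)
                    / ((bg[m] + pseudocount) / bg_total))
        for m in kmers
    }


def score(seq, table, k=3):
    """Average log-odds over the k-mers of seq; positive means ncRNA-like."""
    seq = seq.upper().replace("U", "T")
    n = len(seq) - k + 1
    if n <= 0:
        return 0.0
    # k-mers containing ambiguous bases (e.g. N) contribute a neutral 0.0.
    return sum(table.get(seq[i:i + k], 0.0) for i in range(n)) / n


# Toy training data, invented for illustration only.
toy_ncrnas = ["GGGCGGGC", "GGGCCGGG"]
toy_background = ["ATATATAT", "TATATATA"]
toy_table = log_odds_table(toy_ncrnas, toy_background, k=2)
```

With this toy table, a GC-rich window such as `score("GGGC", toy_table, k=2)` gets a positive score and an AT-rich window such as `score("ATAT", toy_table, k=2)` gets a negative one. A real gene finder would need far larger training sets, a principled score threshold, and most likely a richer motif model than plain k-mers.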

14.
15.
16.
17.
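Prediction and functional analysis of long non-coding RNAs in wheat
Some genes in organisms are transcribed into RNA but do not encode proteins; these transcripts are known as long non-coding RNAs (lncRNAs). They participate in the epigenetic regulation of genes, a process that is important for the growth and development of both animals and plants. So far, however, few lncRNAs have been identified and studied in plants. To investigate the functions of lncRNAs in plants, this study established an lncRNA identification program based on wheat full-length cDNAs. From 6,162 wheat full-length cDNAs, 231 lncRNAs were identified, and two novel miRNAs were identified among them, indicating that lncRNAs can exert their function by serving as miRNA precursors. In addition, sequence enrichment analysis identified three conserved regulatory elements in the wheat lncRNAs. The results suggest that wheat lncRNAs may interact with other molecules, such as proteins or DNA, and thereby take part in regulating wheat growth and development. These results are of considerable significance for further study of the functions and mechanisms of action of lncRNAs in plants.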

18.

Background

The Gadd45 proteins play important roles in growth control, maintenance of genomic stability, DNA repair, and apoptosis. Recently, Gadd45 proteins have also been implicated in epigenetic gene regulation by promoting active DNA demethylation. Gadd45 proteins have sequence homology with the L7Ae/L30e/S12e RNA binding superfamily of ribosomal proteins, which raises the question of whether they may interact directly with nucleic acids.

Principal Findings

Here we show that Gadd45a binds RNA but not single- or double-stranded DNA or methylated DNA in vitro. Sucrose density gradient centrifugation experiments demonstrate that Gadd45a is present in high molecular weight particles, which are RNase sensitive. Gadd45a displays RNase-sensitive colocalization in nuclear speckles with the RNA helicase p68 and the RNA binding protein SC35. A K45A point mutant defective in RNA binding was still active in DNA demethylation. This suggests that RNA binding is not absolutely essential for demethylation of an artificial substrate. A point mutation at G39 impaired RNA binding, nuclear speckle localization and DNA demethylation, emphasizing its relevance for Gadd45a function.

Significance

The results implicate RNA in Gadd45a function and suggest that Gadd45a is associated with a ribonucleoprotein particle.

19.
20.