首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
4.

Background

Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However, a more direct strategy is desired for the increasing number of sequenced mammalian genomes of which some, such as the pig, are relevant as disease models and production animals.

Results

We present a comprehensive annotation of structured RNAs in the pig genome. Combining sequence and structure similarity search as well as class specific methods, we obtained a conservative set with a total of 3,391 structured RNA loci of which 1,011 and 2,314, respectively, hold strong sequence and structure similarity to structured RNAs in existing databases. The RNA loci cover 139 cis-regulatory element loci, 58 lncRNA loci, 11 conflicts of annotation, and 3,183 ncRNA genes. The ncRNA genes comprise 359 miRNAs, 8 ribozymes, 185 rRNAs, 638 snoRNAs, 1,030 snRNAs, 810 tRNAs and 153 ncRNA genes not belonging to the here fore mentioned classes. When running the pipeline on a local shuffled version of the genome, we obtained no matches at the highest confidence level. Additional analysis of RNA-seq data from a pooled library from 10 different pig tissues added another 165 miRNA loci, yielding an overall annotation of 3,556 structured RNA loci. This annotation represents our best effort at making an automated annotation. To further enhance the reliability, 571 of the 3,556 structured RNAs were manually curated by methods depending on the RNA class while 1,581 were declared as pseudogenes. We further created a multiple alignment of pig against 20 representative vertebrates, from which RNAz predicted 83,859 de novo RNA loci with conserved RNA structures. 528 of the RNAz predictions overlapped with the homology based annotation or novel miRNAs. We further present a substantial synteny analysis which includes 1,004 lineage specific de novo RNA loci and 4 ncRNA loci in the known annotation specific for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog).

Conclusions

We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70 and the complete annotation is available at http://rth.dk/resources/rnannotator/susscr102/version1.02.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-459) contains supplementary material, which is available to authorized users.  相似文献   

5.

Background

Previous studies in Drosophila and mammals have revealed levels of long non-coding RNAs (lncRNAs) sequence conservation that are intermediate between neutrally evolving and protein-coding sequence. These analyses compared conservation between species that diverged up to 75 million years ago. However, analysis of sequence polymorphisms within a species'' population can provide an understanding of essentially contemporaneous selective constraints that are acting on lncRNAs and can quantify the deleterious effect of mutations occurring within these loci.

Results

We took advantage of polymorphisms derived from the genome sequences of 163 Drosophila melanogaster strains and 174 human individuals to calculate the distribution of fitness effects of single nucleotide polymorphisms occurring within intergenic lncRNAs and compared this to distributions for SNPs present within putatively neutral or protein-coding sequences. Our observations show that in D.melanogaster there is a significant excess of rare frequency variants within intergenic lncRNAs relative to neutrally evolving sequences, whereas selection on human intergenic lncRNAs appears to be effectively neutral. Approximately 30% of mutations within these fruitfly lncRNAs are estimated as being weakly deleterious.

Conclusions

These contrasting results can be attributed to the large difference in effective population sizes between the two species. Our results suggest that while the sequences of lncRNAs will be well conserved across insect species, such loci in mammals will accumulate greater proportions of deleterious changes through genetic drift.  相似文献   

6.
7.

Background

Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown.

Results

To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains.

Conclusion

HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.  相似文献   

8.
9.
10.
11.
小麦长链非编码RNA的预测及功能分析   总被引:1,自引:0,他引:1       下载免费PDF全文
生物体有部分基因被转录成RNA,但是不编码相应蛋白质,称为长链非编码RNA(lncRNA)。它们参与基因的表观调控,这一过程对动物、植物的生长发育都有重要作用,但是,目前植物中发现和研究的lncRNA较少。为了研究lncRNA在植物中的功能,本研究建立了基于小麦全长cDNA的lncRNA识别程序。从6162条小麦全长cDNA中发现了231条lncRNAs,并从中鉴定出两个新miRNAs,这表明lncRNAs可以通过形成miRNAs前体基因形成其功能。此外,通过序列富集分析,我们从小麦lncRNAs中鉴定出三个保守的调控元件,结果显示小麦lncRNAs可能通过和其它蛋白质或DNA等分子作用,进而参与小麦生长、发育等过程的调控,这些结果对进一步研究植物体内的lncRNA的功能和作用机制具有重要意义。  相似文献   

12.

Background

Non-coding RNAs (ncRNAs) have important functional roles in the cell: for example, they regulate gene expression by means of establishing stable joint structures with target mRNAs via complementary sequence motifs. Sequence motifs are also important determinants of the structure of ncRNAs. Although ncRNAs are abundant, discovering novel ncRNAs on genome sequences has proven to be a hard task; in particular past attempts for ab initio ncRNA search mostly failed with the exception of tools that can identify micro RNAs.

Methodology/Principal Findings

We present a very general ab initio ncRNA gene finder that exploits differential distributions of sequence motifs between ncRNAs and background genome sequences.

Conclusions/Significance

Our method, once trained on a set of ncRNAs from a given species, can be applied to a genome sequences of other organisms to find not only ncRNAs homologous to those in the training set but also others that potentially belong to novel (and perhaps unknown) ncRNA families. Availability: http://compbio.cs.sfu.ca/taverna/smyrna  相似文献   

13.
14.
15.
16.
17.
18.

Introduction

In addition to the well-known short noncoding RNAs such as microRNAs (miRNAs), increasing evidence suggests that long noncoding RNAs (lncRNAs) act as key regulators in a wide aspect of biologic processes. Dysregulated expression of lncRNAs has been demonstrated being implicated in a variety of human diseases. However, little is known regarding the role of lncRNAs with regards to intervertebral disc degeneration (IDD). In the present study we aimed to determine whether lncRNAs are differentially expressed in IDD.

Methods

An lncRNA-mRNA microarray analysis of human nucleus pulposus (NP) was employed. Bioinformatics prediction was also applied to delineate the functional roles of the differentially expressed lncRNAs. Several lncRNAs and mRNAs were chosen for quantitative real-time PCR (qRT-PCR) validation.

Results

Microarray data profiling indicated that 116 lncRNAs (67 up and 49 down) and 260 mRNAs were highly differentially expressed with an absolute fold change greater than ten. Moreover, 1,052 lncRNAs and 1,314 mRNAs were differentially expressed in the same direction in at least four of the five degenerative samples with fold change greater than two. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis for the differentially expressed mRNAs indicated a number of pathways, such as extracellular matrix (ECM)-receptor interaction. A coding-noncoding gene co-expression (CNC) network was constructed for the ten most significantly changed lncRNAs. Annotation terms of the coexpressed mRNAs were related to several known degenerative alterations, such as chondrocyte differentiation. Moreover, lncRNAs belonging to a particular subgroup were identified. Functional annotation for the corresponding nearby coding genes showed that these lncRNAs were mainly associated with cell migration and phosphorylation. Interestingly, we found that Fas-associated protein factor-1 (FAF1), which potentiates the Fas-mediated apoptosis and its nearby enhancer-like lncRNA RP11-296A18.3, were highly expressed in the degenerative discs. Subsequent qRT-PCR results confirmed the changes.

Conclusions

This is the first study to demonstrate that aberrantly expressed lncRNAs play a role in the development of IDD. Our study noted that up-regulated RP11-296A18.3 highly likely induced the over-expression of FAF1, which eventually promoted the aberrant apoptosis of disc cells. Such findings further broaden the understanding of the etiology of IDD.

Electronic supplementary material

The online version of this article (doi:10.1186/s13075-014-0465-5) contains supplementary material, which is available to authorized users.  相似文献   

19.
长非编码RNA(lnc RNA)是长度大于200 bp的一类非编码蛋白的RNA,因其在基因组中含量巨大以及重要的生物学功能引起了学术界的广泛关注.基因组印记是一种表观遗传现象,lnc RNAs通过建立靶基因的印记而发挥重要的生物功能.基因组印记可以用来研究lnc RNAs在转录和转录后水平调控基因表达的分子机制.本文选取6个印记机制研究比较透彻的印记区域,包括Kcnq1/Cdkn1c、Igf2r/Airn、Prader-Willi(PWS)/Angelman(AS)、Snurf/Snrpn、Dlk1-Dio3和H19/Igf2.通过介绍包括基因间lnc RNAs(H19、Ipw和Meg3)、反义lnc RNAs(Kcnq1ot1、Airn、Ube3a-ATS)和增强子lnc RNAs(IG-DMR e RNAs)在内的3种类型lnc RNAs在印记调控中的作用,从而了解lnc RNAs通过顺式或(/和)反式作用多种机制调控亲本特异性靶基因的表达.了解印记基因簇中lnc RNAs的作用方式将有助于我们揭示lnc RNAs在整个基因组中的作用机制.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号