首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
4.
5.

Background

Transposable elements form a significant proportion of eukaryotic genomes. Recently, Lexa et al. (Nucleic Acids Res 42:968-978, 2014) reported that plant long terminal repeat (LTR) retrotransposons often contain potential quadruplex sequences (PQSs) in their LTRs and experimentally confirmed their ability to adopt four-stranded DNA conformations.

Results

Here, we searched for PQSs in human retrotransposons and found that PQSs are specifically localized in the 3’-UTR of LINE-1 elements, in LTRs of HERV elements and are strongly accumulated in specific regions of SVA elements. Circular dichroism spectroscopy confirmed that most PQSs had adopted monomolecular or bimolecular guanine quadruplex structures. Evolutionarily young SVA elements contained more PQSs than older elements and their propensity to form quadruplex DNA was higher. Full-length L1 elements contained more PQSs than truncated elements; the highest proportion of PQSs was found inside transpositionally active L1 elements (PA2 and HS families).

Conclusions

Conservation of quadruplexes at specific positions of transposable elements implies their importance in their life cycle. The increasing quadruplex presence in evolutionarily young LINE-1 and SVA families makes these elements important contributors toward present genome-wide quadruplex distribution.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1032) contains supplementary material, which is available to authorized users.  相似文献   

6.

Background

Genome-scale “-omics” measurements are challenging to benchmark due to the enormous variety of unique biological molecules involved. Mixtures of previously-characterized samples can be used to benchmark repeatability and reproducibility using component proportions as truth for the measurement. We describe and evaluate experiments characterizing the performance of RNA-sequencing (RNA-Seq) measurements, and discuss cases where mixtures can serve as effective process controls.

Results

We apply a linear model to total RNA mixture samples in RNA-seq experiments. This model provides a context for performance benchmarking. The parameters of the model fit to experimental results can be evaluated to assess bias and variability of the measurement of a mixture. A linear model describes the behavior of mixture expression measures and provides a context for performance benchmarking. Residuals from fitting the model to experimental data can be used as a metric for evaluating the effect that an individual step in an experimental process has on the linear response function and precision of the underlying measurement while identifying signals affected by interference from other sources. Effective benchmarking requires well-defined mixtures, which for RNA-Seq requires knowledge of the post-enrichment ‘target RNA’ content of the individual total RNA components. We demonstrate and evaluate an experimental method suitable for use in genome-scale process control and lay out a method utilizing spike-in controls to determine enriched RNA content of total RNA in samples.

Conclusions

Genome-scale process controls can be derived from mixtures. These controls relate prior knowledge of individual components to a complex mixture, allowing assessment of measurement performance. The target RNA fraction accounts for differential selection of RNA out of variable total RNA samples. Spike-in controls can be utilized to measure this relationship between target RNA content and input total RNA. Our mixture analysis method also enables estimation of the proportions of an unknown mixture, even when component-specific markers are not previously known, whenever pure components are measured alongside the mixture.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1912-7) contains supplementary material, which is available to authorized users.  相似文献   

7.

Background

Piwi-interacting RNAs (piRNAs) are a recently discovered class of small non-coding RNAs whose best-understood function is to repress mobile element (ME) activity in animal germline. To date, nearly all piRNA studies have been conducted in model organisms and little is known about piRNA diversity, target specificity and biological function in human.

Results

Here we performed high-throughput sequencing of piRNAs from three human adult testis samples. We found that more than 81% of the ~17 million putative piRNAs mapped to ~6,000 piRNA-producing genomic clusters using a relaxed definition of clusters. A set of human protein-coding genes produces a relatively large amount of putative piRNAs from their 3’UTRs, and are significantly enriched for certain biological processes, suggestive of non-random sampling by the piRNA biogenesis machinery. Up to 16% of putative piRNAs mapped to a few hundred annotated long non-coding RNA (lncRNA) genes, suggesting that some lncRNA genes can act as piRNA precursors. Among major ME families, young families of LTR and endogenous retroviruses have a greater association with putative piRNAs than other MEs. In addition, piRNAs preferentially mapped to specific regions in the consensus sequences of several ME (sub)families and some piRNA mapping peaks showed patterns consistent with the “ping-pong” cycle of piRNA targeting and amplification.

Conclusions

Overall our data provide a comprehensive analysis and improved annotation of human piRNAs in adult human testes and shed new light into the relationship of piRNAs with protein-coding genes, lncRNAs, and mobile genetic elements in human.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-545) contains supplementary material, which is available to authorized users.  相似文献   

8.
9.
10.

Background

Identifying sequence-structure motifs common to two RNAs can speed up the comparison of structural RNAs substantially. The core algorithm of the existent approach ExpaRNA solves this problem for a priori known input structures. However, such structures are rarely known; moreover, predicting them computationally is no rescue, since single sequence structure prediction is highly unreliable.

Results

The novel algorithm ExpaRNA-P computes exactly matching sequence-structure motifs in entire Boltzmann-distributed structure ensembles of two RNAs; thereby we match and fold RNAs simultaneously, analogous to the well-known “simultaneous alignment and folding” of RNAs. While this implies much higher flexibility compared to ExpaRNA, ExpaRNA-P has the same very low complexity (quadratic in time and space), which is enabled by its novel structure ensemble-based sparsification. Furthermore, we devise a generalized chaining algorithm to compute compatible subsets of ExpaRNA-P’s sequence-structure motifs. Resulting in the very fast RNA alignment approach ExpLoc-P, we utilize the best chain as anchor constraints for the sequence-structure alignment tool LocARNA. ExpLoc-P is benchmarked in several variants and versus state-of-the-art approaches. In particular, we formally introduce and evaluate strict and relaxed variants of the problem; the latter makes the approach sensitive to compensatory mutations. Across a benchmark set of typical non-coding RNAs, ExpLoc-P has similar accuracy to LocARNA but is four times faster (in both variants), while it achieves a speed-up over 30-fold for the longest benchmark sequences (≈400nt). Finally, different ExpLoc-P variants enable tailoring of the method to specific application scenarios. ExpaRNA-P and ExpLoc-P are distributed as part of the LocARNA package. The source code is freely available at http://www.bioinf.uni-freiburg.de/Software/ExpaRNA-P.

Conclusions

ExpaRNA-P’s novel ensemble-based sparsification reduces its complexity to quadratic time and space. Thereby, ExpaRNA-P significantly speeds up sequence-structure alignment while maintaining the alignment quality. Different ExpaRNA-P variants support a wide range of applications.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0404-0) contains supplementary material, which is available to authorized users.  相似文献   

11.

Background

Next-Generation Sequencing (NGS) is revolutionizing molecular epidemiology by providing new approaches to undertake whole genome sequencing (WGS) in diagnostic settings for a variety of human and veterinary pathogens. Previous sequencing protocols have been subject to biases such as those encountered during PCR amplification and cell culture, or are restricted by the need for large quantities of starting material. We describe here a simple and robust methodology for the generation of whole genome sequences on the Illumina MiSeq. This protocol is specific for foot-and-mouth disease virus (FMDV) or other polyadenylated RNA viruses and circumvents both the use of PCR and the requirement for large amounts of initial template.

Results

The protocol was successfully validated using five FMDV positive clinical samples from the 2001 epidemic in the United Kingdom, as well as a panel of representative viruses from all seven serotypes. In addition, this protocol was successfully used to recover 94% of an FMDV genome that had previously been identified as cell culture negative. Genome sequences from three other non-FMDV polyadenylated RNA viruses (EMCV, ERAV, VESV) were also obtained with minor protocol amendments. We calculated that a minimum coverage depth of 22 reads was required to produce an accurate consensus sequence for FMDV O. This was achieved in 5 FMDV/O/UKG isolates and the type O FMDV from the serotype panel with the exception of the 5′ genomic termini and area immediately flanking the poly(C) region.

Conclusions

We have developed a universal WGS method for FMDV and other polyadenylated RNA viruses. This method works successfully from a limited quantity of starting material and eliminates the requirement for genome-specific PCR amplification. This protocol has the potential to generate consensus-level sequences within a routine high-throughput diagnostic environment.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-828) contains supplementary material, which is available to authorized users.  相似文献   

12.
13.

Background

We profiled the expression of circulating microRNAs (miRNAs) in mice using Illumina small RNA deep sequencing in order to identify the miRNAs that may potentially be used as biomarkers to distinguish between gram-negative and gram-positive bacterial infections.

Results

Recombinant-specific gram-negative pathogen Escherichia coli (Xen14) and gram-positive pathogen Staphylococcus aureus (Xen29) were used to induce bacterial infection in mice at a concentration of 1 × 108 bacteria/100 μL of phosphate buffered saline (PBS). Small RNA libraries generated from the serum of mice after exposure to PBS, Xen14, Xen29, and Xen14 + Xen29 via the routes of subcutaneous injection (I), cut wound (C), or under grafted skin (S) were analyzed using an Illumina HiSeq2000 Sequencer. Following exposure to gram-negative bacteria alone, no differentially expressed miRNA was found in the injection, cut, or skin graft models. Exposure to mixed bacteria induced a similar expression pattern of the circulating miRNAs to that induced by gram-positive bacterial infection. Upon gram-positive bacterial infection, 9 miRNAs (mir-193b-3p, mir-133a-1-3p, mir-133a-2-3p, mir-133a-1-5p, mir-133b-3p, mir-434-3p, mir-127-3p, mir-676-3p, mir-215-5p) showed upregulation greater than 4-fold with a p-value < 0.01. Among them, mir-193b-3p, mir-133a-1-3p, and mir-133a-2-3p presented the most common miRNA targets expressed in the mice exposed to gram-positive bacterial infection.

Conclusions

This study identified mir-193b-3p, mir-133a-1-3p, and mir-133a-2-3p as potential circulating miRNAs for gram-positive bacterial infections.

Electronic supplementary material

The online version of this article (doi:10.1186/s12929-014-0106-y) contains supplementary material, which is available to authorized users.  相似文献   

14.

Background

A recessive mutation “c” in the Mexican axolotl, Ambystoma mexicanum, results in the failure of normal heart development. In homozygous recessive embryos, the hearts do not have organized myofibrils and fail to beat. In our previous studies, we identified a noncoding Myofibril-Inducing RNA (MIR) from axolotls which promotes myofibril formation and rescues heart development.

Results

We randomly cloned RNAs from fetal human heart. RNA from clone #291 promoted myofibril formation and induced heart development of mutant axolotls in organ culture. This RNA induced expression of cardiac markers in mutant hearts: tropomyosin, troponin and α-syntrophin. This cloned RNA matches in partial sequence alignment to human microRNA-499a and b, although it differs in length. We have concluded that this cloned RNA is unique in its length, but is still related to the microRNA-499 family. We have named this unique RNA, microRNA-499c. Thus, we will refer to this RNA derived from clone #291 as microRNA-499c throughout the rest of the paper.

Conclusions

This new form, microRNA-499c, plays an important role in cardiac development.  相似文献   

15.

Background

While next-generation sequencing technologies have made sequencing genomes faster and more affordable, deciphering the complete genome sequence of an organism remains a significant bioinformatics challenge, especially for large genomes. Low sequence coverage, repetitive elements and short read length make de novo genome assembly difficult, often resulting in sequence and/or fragment “gaps” – uncharacterized nucleotide (N) stretches of unknown or estimated lengths. Some of these gaps can be closed by re-processing latent information in the raw reads. Even though there are several tools for closing gaps, they do not easily scale up to processing billion base pair genomes.

Results

Here we describe Sealer, a tool designed to close gaps within assembly scaffolds by navigating de Bruijn graphs represented by space-efficient Bloom filter data structures. We demonstrate how it scales to successfully close 50.8 % and 13.8 % of gaps in human (3 Gbp) and white spruce (20 Gbp) draft assemblies in under 30 and 27 h, respectively – a feat that is not possible with other leading tools with the breadth of data used in our study.

Conclusion

Sealer is an automated finishing application that uses the succinct Bloom filter representation of a de Bruijn graph to close gaps in draft assemblies, including that of very large genomes. We expect Sealer to have broad utility for finishing genomes across the tree of life, from bacterial genomes to large plant genomes and beyond. Sealer is available for download at https://github.com/bcgsc/abyss/tree/sealer-release.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0663-4) contains supplementary material, which is available to authorized users.  相似文献   

16.

Background

A long juvenile period between germination and flowering is a common characteristic among fruit trees, including Malus hupehensis (Pamp.) Rehd., which is an apple rootstock widely used in China. microRNAs (miRNAs) play an important role in the regulation of phase transition and reproductive growth processes.

Results

M. hupehensis RNA libraries, one adult and one juvenile phase, were constructed using tree leaves and underwent high-throughput sequencing. We identified 42 known miRNA families and 172 novel miRNAs. We also identified 127 targets for 25 known miRNA families and 168 targets for 35 unique novel miRNAs using degradome sequencing. The identified miRNA targets were categorized into 58 biological processes, and the 123 targets of known miRNAs were associated with phase transition processes. The KEGG analysis revealed that these targets were involved in starch and sucrose metabolism, and plant hormone signal transduction. Expression profiling of miRNAs and their targets indicated multiple regulatory functions in the phase transition. The higher expression level of mdm-miR156 and lower expression level of mdm-miR172 in the juvenile phase leaves implied that these two small miRNAs regulated the phase transition. mdm-miR160 and miRNA393, which regulate genes involved in auxin signal transduction, could also be involved in controlling this process. The identification of known and novel miRNAs and their targets provides new information on this regulatory process in M. hupehensis, which will contribute to the understanding of miRNA functions during growth, phase transition and reproduction in woody fruit trees.

Conclusions

The combination of sRNA and degradome sequencing can be used to better illustrate the profiling of hormone-regulated miRNAs and miRNA targets involving complex regulatory networks, which will contribute to the understanding of miRNA functions during growth, phase transition and reproductive growth in perennial woody fruit trees.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1125) contains supplementary material, which is available to authorized users.  相似文献   

17.

Background

Whales have captivated the human imagination for millennia. These incredible cetaceans are the only mammals that have adapted to life in the open oceans and have been a source of human food, fuel and tools around the globe. The transition from land to water has led to various aquatic specializations related to hairless skin and ability to regulate their body temperature in cold water.

Results

We present four common minke whale (Balaenoptera acutorostrata) genomes with depth of ×13 ~ ×17 coverage and perform resequencing technology without a reference sequence. Our results indicated the time to the most recent common ancestors of common minke whales to be about 2.3574 (95% HPD, 1.1521 – 3.9212) million years ago. Further, we found that genes associated with epilation and tooth-development showed signatures of positive selection, supporting the morphological uniqueness of whales.

Conclusions

This whole-genome sequencing offers a chance to better understand the evolutionary journey of one of the largest mammals on earth.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1213-1) contains supplementary material, which is available to authorized users.  相似文献   

18.
19.

Background

Though rare in occurrence, patients with rare bleeding disorders (RBDs) are highly heterogeneous and may manifest with severe bleeding diathesis. Due to the high rate of consanguinity in many caste groups, these autosomal recessive bleeding disorders which are of rare occurrence in populations across the world, may not be as rare in India.

Objectives

To comprehensively analyze the frequency and nature of mutations in Indian patients with RBDs.

Methods

Pubmed search was used (www.pubmed.com) to explore the published literature from India on RBDs using the key words “rare bleeding disorders”, “mutations”, “India”, “fibrinogen”, “afibrinogenemia”, “factor II deficiency”, “prothrombin” “factor VII deficiency”, “factor V deficiency”, “factor X deficiency”, “factor XI deficiency”, “combined factor V and VIII deficiency”, “factor XIII deficiency”, “Bernard Soulier syndrome” and “Glanzmanns thrombasthenia” in different combinations. A total of 60 relevant articles could be retrieved. The distribution of mutations from India was compared with that of the world literature by referring to the Human Gene Mutation Database (HGMD) (www.hgmd.org).

Results

Taken together, 181 mutations in 270 patients with different RBDs have been reported from India. Though the types of mutations reported from India and their percentage distribution with respect to the world data are largely similar, yet much higher percentage of small deletions, duplication mutations, insertions, indels were observed in this analysis. Besides the identification of novel mutations and polymorphisms, several common mutations have also been reported, which will allow to develop a strategy for mutation screening in Indian patients with RBDs.

Conclusion

There is a need for a consortium of Institutions working on the molecular pathology of RBDs in India. This will facilitate a quicker and cheaper diagnosis of RBDs besides its utility in first trimester prenatal diagnosis of the affected families.  相似文献   

20.

Background

Trypanosomatid parasites possess a single mitochondrion which is classically involved in the energetic metabolism of the cell, but also, in a much more original way, through its single and complex DNA (termed kinetoplast), in the correct progress of cell division. In order to identify proteins potentially involved in the cell cycle, we performed RNAi knockdowns of 101 genes encoding mitochondrial proteins using procyclic cells of Trypanosoma brucei.

Results

A major cell growth reduction was observed in 10 cases and a moderate reduction in 29 other cases. These data are overall in agreement with those previously obtained by a case-by-case approach performed on chromosome 1 genes, and quantitatively with those obtained by “high-throughput phenotyping using parallel sequencing of RNA interference targets” (RIT-seq). Nevertheless, a detailed analysis revealed many qualitative discrepancies with the RIT-seq-based approach. Moreover, for 37 out of 39 mutants for which a moderate or severe growth defect was observed here, we noted abnormalities in the cell cycle progress, leading to increased proportions of abnormal cell cycle stages, such as cells containing more than 2 kinetoplasts (K) and/or more than 2 nuclei (N), and modified proportions of the normal phenotypes (1N1K, 1N2K and 2N2K).

Conclusions

These data, together with the observation of other abnormal phenotypes, show that all the corresponding mitochondrial proteins are involved, directly or indirectly, in the correct progress or, less likely, in the regulation, of the cell cycle in T. brucei. They also show how post-genomics analyses performed on a case-by-case basis may yield discrepancies with global approaches.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1505-5) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号