首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation.  相似文献   

2.
Despite the current wealth of sequencing data, one‐third of all biochemically characterized metabolic enzymes lack a corresponding gene or protein sequence, and as such can be considered orphan enzymes. They represent a major gap between our molecular and biochemical knowledge, and consequently are not amenable to modern systemic analyses. As 555 of these orphan enzymes have metabolic pathway neighbours, we developed a global framework that utilizes the pathway and (meta)genomic neighbour information to assign candidate sequences to orphan enzymes. For 131 orphan enzymes (37% of those for which (meta)genomic neighbours are available), we associate sequences to them using scoring parameters with an estimated accuracy of 70%, implying functional annotation of 16 345 gene sequences in numerous (meta)genomes. As a case in point, two of these candidate sequences were experimentally validated to encode the predicted activity. In addition, we augmented the currently available genome‐scale metabolic models with these new sequence–function associations and were able to expand the models by on average 8%, with a considerable change in the flux connectivity patterns and improved essentiality prediction.  相似文献   

3.
In this work, we isolated and characterized 14 bacteriophages that infect Rhizobium etli. They were obtained from rhizosphere soil of bean plants from agricultural lands in Mexico using an enrichment method. The host range of these phages was narrow but variable within a collection of 48 R. etli strains. We obtained the complete genome sequence of nine phages. Four phages were resistant to several restriction enzymes and in vivo cloning, probably due to nucleotide modifications. The genome size of the sequenced phages varied from 43 kb to 115 kb, with a median size of ∼45 to 50 kb. A large proportion of open reading frames of these phage genomes (65 to 70%) consisted of hypothetical and orphan genes. The remainder encoded proteins needed for phage morphogenesis and DNA synthesis and processing, among other functions, and a minor percentage represented genes of bacterial origin. We classified these phages into four genomic types on the basis of their genomic similarity, gene content, and host range. Since there are no reports of similar sequences, we propose that these bacteriophages correspond to novel species.  相似文献   

4.
5.
ObjectiveTo review existing regulations and policies utilised by countries to enable patient access to orphan drugs.MethodsA review of the literature (1998 to 2014) was performed to identify relevant, peer-reviewed articles. Using content analysis, we synthesised regulations and policies for access to orphan drugs by type and by country.ResultsFifty seven articles and 35 countries were included in this review. Six broad categories of regulation and policy instruments were identified: national orphan drug policies, orphan drug designation, marketing authorization, incentives, marketing exclusivity, and pricing and reimbursement. The availability of orphan drugs depends on individual country’s legislation and regulations including national orphan drug policies, orphan drug designation, marketing authorization, marketing exclusivity and incentives such as tax credits to ensure research, development and marketing. The majority of countries (27/35) had in place orphan drug legislation. Access to orphan drugs depends on individual country’s pricing and reimbursement policies, which varied widely between countries. High prices and insufficient evidence often limit orphan drugs from meeting the traditional health technology assessment criteria, especially cost-effectiveness, which may influence access.ConclusionsOverall many countries have implemented a combination of legislations, regulations and policies for orphan drugs in the last two decades. While these may enable the availability and access to orphan drugs, there are critical differences between countries in terms of range and types of legislations, regulations and policies implemented. Importantly, China and India, two of the largest countries by population size, both lack national legislation for orphan medicines and rare diseases, which could have substantial negative impacts on their patient populations with rare diseases.  相似文献   

6.
7.
8.
The rates and patterns of deletions in the human factor IX gene.   总被引:4,自引:2,他引:2       下载免费PDF全文
Deletions are commonly observed in genes with either segments of highly homologous sequences or excessive gene length. However, in the factor IX gene and in most genes, deletions (of > or = 21 bp) are uncommon. We have analyzed DNA from 290 families with hemophilia B (203 independent mutations) and have found 12 deletions > 20 bp. Eleven of these are > 2 kb (range > 3-163 kb), and one is 1.1 kb. The junctions of the four deletions that are completely contained within the factor IX gene have been determined. A novel mutation occurred in patient HB128: the data suggest that a 26.8-kb deletion occurred between two segments of alternating purines and pyrimidines and that a 2.3-kb sense strand segment derived from the deleted region was inserted. For our sample of 203 independent mutations, we estimate the "baseline" rates of deletional mutation per base pair per generation as a function of size. The rate for large (> 2 kb) deletions is exceedingly low. For every mutational event in which a given base is at the junction of a large deletion, there are an estimated 58 microdeletions (< 20 bp) and 985 single-base substitutions at that base. Analysis of the nine reported deletion junctions in the factor IX gene literature reveals that (i) five are associated with inversions, orphan sequences, or sense strand insertions; (ii) four are simple deletions that display an excess of short direct repeats at their junctions; (iii) there is no dramatic clustering of junctions within the gene; and (iv) with the exception of alternating purines and pyrimidines, deletion junctions are not preferentially associated with repetitive DNA.  相似文献   

9.
Small nucleolar RNAs (snoRNAs) represent an abundant group of non-coding RNAs in eukaryotes. They can be divided into guide and orphan snoRNAs according to the presence or absence of antisense sequence to rRNAs or snRNAs. Current snoRNA-searching programs, which are essentially based on sequence complementarity to rRNAs or snRNAs, exist only for the screening of guide snoRNAs. In this study, we have developed an advanced computational package, snoSeeker, which includes CDseeker and ACAseeker programs, for the highly efficient and specific screening of both guide and orphan snoRNA genes in mammalian genomes. By using these programs, we have systematically scanned four human–mammal whole-genome alignment (WGA) sequences and identified 54 novel candidates including 26 orphan candidates as well as 266 known snoRNA genes. Eighteen novel snoRNAs were further experimentally confirmed with four snoRNAs exhibiting a tissue-specific or restricted expression pattern. The results of this study provide the most comprehensive listing of two families of snoRNA genes in the human genome till date.  相似文献   

10.
Phytoplasmas are unculturable bacterial plant pathogens transmitted by phloem-feeding hemipteran insects. DNA of phytoplasmas is difficult to purify because of their exclusive phloem location and low abundance in plants. To overcome this constraint, suppression subtractive hybridization (SSH) was modified and used to selectively amplify DNA of the stolbur phytoplasma infecting a periwinkle plant. Plasmid libraries were constructed, and the origins of the DNA inserts were verified by hybridization and PCR screenings. After a single round of SSH, there was still a significant level of contamination with plant DNA (around 50%). However, the modified SSH, which included a second round of subtraction (double SSH), resulted in an increased phytoplasma DNA purity (97%). Results validated double SSH as an efficient way to produce a genome survey for microbial agents unavailable in culture. Assembly of 266 insert sequences revealed 181 phytoplasma genetic loci which were annotated. Comparative analysis of 113 kbp indicated that among 217 protein coding sequences, 83% were homologous to “Candidatus Phytoplasma asteris” (OY-M strain) genes, with hits widely distributed along the chromosome. Most of the stolbur-specific SSH sequences were orphan genes, with the exception of two partial coding sequences encoding proteins homologous to a mycoplasma surface protein and riboflavin kinase.  相似文献   

11.
The genome sequences completed so far contain more than 20 000 genes with unknown function and no similarity to genes in other genomes. The origin and evolution of the orphan genes is an enigma. Here, we discuss the suggestion that some orphan genes may represent pseudogenes or short fragments of genes that were functional in the genome of a common ancestor. These may be the remains of unsuccessful duplication or horizontal gene transfer events, in which the acquired sequences have entered the fragmentation process and thereby lost their similarity to genes in other species. This scenario is supported by a recent case study of orphan genes in several closely related species of Rickettsia, where full-length ancestral genes were reconstructed from sets of short, overlapping orphan genes. One of these was found to display similarity to genes encoding proteins with ankyrin-repeat domains.  相似文献   

12.
Satoshi Fukuchi  Ken Nishikawa 《DNA research》2004,11(4):219-31, 311-313
Genome annotation produces a considerable number of putative proteins lacking sequence similarity to known proteins. These are referred to as "orphans." The proportion of orphan genes varies among genomes, and is independent of genome size. In the present study, we show that the proportion of orphan genes roughly correlates with the isolation index of organisms (IIO), an indicator introduced in the present study, which represents the degree of isolation of a given genome as measured by sequence similarity. However, there are outlier genomes with respect to the linear correlation, consisting of those genomes that may contain excess amounts of orphan genes. Comparisons of genome sequences among closely related strains revealed that some of the annotated genes are not conserved, suggesting that they are ORFs occurring by chance. Exclusion of these non-conserved ORFs within closely related genomes improved the correlation between the proportion of orphan genes and the IIO values. Assuming that the correlation holds in general, this relationship was used to estimate the number of "authentic" orphan genes in a genome. Using this definition of authentic orphan genes, the anomalies arising from over-assignments, e.g., the percentages of structural annotations, were corrected for 16 genomes, including those of five archaea.  相似文献   

13.
The BioBricks standard has made the construction of DNA modules easier, quicker and cheaper. So far, over 100 BioBricks assembly schemes have been developed and many of them, including the original standard of BBF RFC 10, are now widely used. However, because the restriction endonucleases employed by these standards usually recognize short DNA sequences that are widely spread among natural DNA sequences, and these recognition sites must be removed before the parts construction, there is much inconvenience in dealing with large-size DNA parts (e.g., more than couple kilobases in length) with the present standards. Here, we introduce a new standard, namely iBrick, which uses two homing endonucleases of I-SceI and PI-PspI. Because both enzymes recognize long DNA sequences (>18 bps), their sites are extremely rare in natural DNA sources, thus providing additional convenience, especially in handling large pieces of DNA fragments. Using the iBrick standard, the carotenoid biosynthetic cluster (>4 kb) was successfully assembled and the actinorhodin biosynthetic cluster (>20 kb) was easily cloned and heterologously expressed. In addition, a corresponding nomenclature system has been established for the iBrick standard.  相似文献   

14.
In the sequences released by the Arabidopsis Genome Initiative (AGI), we discovered a new and unexpectedly large family of orphan genes (127 genes by 01.08.99), named AtPCMP. The distribution of the AtPCMP genes on the five chromosomes suggests that the genome of Arabidopsis thaliana contains more than 200 genes of this family (1% of the whole genome). The deduced AtPCMP proteins are characterized by a surprising combinatorial organization of sequence motifs. The amino-terminal domain is made of a succession of three conserved motifs which generate an important diversity. These proteins are classified into three subfamilies based on the length and nature of their carboxy-terminal domain constituted by 1–6 motifs. All the motifs characterized have an important level of conservation in both sequence and spacing. A specific signature of this large family is defined. The presence of ESTs in databases and the detection of clones in A. thaliana cDNA libraries indicate that most of the genes of this family are expressed. The absence of similar sequences outside the plant kingdom strongly suggests that this unusually large orphan family is unique to plants. Features, the genesis, the potential function and the evolution of this plant combinatorial and modular protein family are discussed.  相似文献   

15.
The C-terminal peptides of ubiquitin (UB) and UB-like proteins (UBLs) play a key role in their recognition by the specific activating enzymes (E1s) to launch their transfer through the respective enzymatic cascades thus modifying cellular proteins. UB and Nedd8, a UBL regulating the activity of cullin-RING UB ligases, only differ by one residue at their C-termini; yet each has its specific E1 for the activation reaction. It has been reported recently that UAE can cross react with Nedd8 to enable its passage through the UB transfer cascade for protein neddylation. To elucidate differences in UB recognition by UAE and NAE, we carried out phage selection of a UB library with randomized C-terminal sequences based on the catalytic formation of UB∼NAE thioester conjugates. Our results confirmed the previous finding that residue 72 of UB plays a “gate-keeping” role in E1 selectivity. We also found that diverse sequences flanking residue 72 at the UB C-terminus can be accommodated by NAE for activation. Furthermore heptameric peptides derived from the C-terminal sequences of UB variants selected for NAE activation can function as mimics of Nedd8 to form thioester conjugates with NAE and the downstream E2 enzyme Ubc12 in the Nedd8 transfer cascade. Once the peptides are charged onto the cascade enzymes, the full-length Nedd8 protein is effectively blocked from passing through the cascade for the critical modification of cullin. We have thus identified a new class of inhibitors of protein neddylation based on the profiles of the UB C-terminal sequences recognized by NAE.  相似文献   

16.
Somatostatin and its related peptides (SSRPs) form an important family of hormones with diverse physiological roles. The ubiquitous presence of SSRPs in vertebrates and several invertebrate deuterostomes suggests an ancient origin of the SSRP signaling system. However, the existence of SSRP genes outside of deuterostomes has not been established, and the evolutionary history of this signaling system remains poorly understood. Our recent discovery of SSRP-like toxins (consomatins) in venomous marine cone snails (Conus) suggested the presence of a related signaling system in mollusks and potentially other protostomes. Here, we identify the molluscan SSRP-like signaling gene that gave rise to the consomatin family. Following recruitment into venom, consomatin genes experienced strong positive selection and repeated gene duplications resulting in the formation of a hyperdiverse family of venom peptides. Intriguingly, the largest number of consomatins was found in worm-hunting species (>400 sequences), indicating a homologous system in annelids, another large protostome phylum. Consistent with this, comprehensive sequence mining enabled the identification of SSRP-like sequences (and their corresponding orphan receptor) in annelids and several other protostome phyla. These results established the existence of SSRP-like peptides in many major branches of bilaterians and challenge the prevailing hypothesis that deuterostome SSRPs and protostome allatostatin-C are orthologous peptide families. Finally, having a large set of predator–prey SSRP sequences available, we show that although the cone snail’s signaling SSRP-like genes are under purifying selection, the venom consomatin genes experience rapid directional selection to target receptors in a changing mix of prey.  相似文献   

17.
Homing endonucleases are enzymes that catalyze DNA sequence specific double-strand breaks and can significantly stimulate homologous recombination at these breaks. These enzymes have great potential for applications such as gene correction in gene therapy or gene alteration in systems biology and metabolic engineering. However, homing endonucleases have a limited natural repertoire of target sequences, which severely hamper their applications. Here we report the development of a highly sensitive selection method for the directed evolution of homing endonucleases that couples enzymatic DNA cleavage with the survival of host cells. Using I-SceI as a model homing endonuclease, we have demonstrated that cells with wild-type I-SceI showed a high cell survival rate of 80–100% in the presence of the original I-SceI recognition site, whereas cells without I-SceI showed a survival rate <0.003%. This system should also be readily applicable for directed evolution of other DNA cleavage enzymes.  相似文献   

18.
Site-specific recombinases (SSRs) can perform DNA rearrangements, including deletions, inversions and translocations when their naive target sequences are placed strategically into the genome of an organism. Hence, in order to employ SSRs in heterologous hosts, their target sites have to be introduced into the genome of an organism before the enzyme can be practically employed. Engineered SSRs hold great promise for biotechnology and advanced biomedical applications, as they promise to extend the usefulness of SSRs to allow efficient and specific recombination of pre-existing, natural genomic sequences. However, the generation of enzymes with desired properties remains challenging. Here, we use substrate-linked directed evolution in combination with molecular modeling to rationally engineer an efficient and specific recombinase (sTre) that readily and specifically recombines a sequence present in the HIV-1 genome. We elucidate the role of key residues implicated in the molecular recognition mechanism and we present a rationale for sTre’s enhanced specificity. Combining evolutionary and rational approaches should help in accelerating the generation of enzymes with desired properties for use in biotechnology and biomedicine.  相似文献   

19.
Mitochondrial tRNA (mt-tRNA) 5′-editing was first described more than 20 years ago; however, the first candidates for 5′-editing enzymes were only recently identified in a eukaryotic microbe (protist), the slime mold Dictyostelium discoideum. In this organism, eight of 18 mt-tRNAs are predicted to be edited based on the presence of genomically encoded mismatched nucleotides in their aminoacyl-acceptor stem sequences. Here, we demonstrate that mt-tRNA 5′-editing occurs at all predicted sites in D. discoideum as evidenced by changes in the sequences of isolated mt-tRNAs compared with the expected sequences encoded by the mitochondrial genome. We also identify two previously unpredicted editing events in which G-U base pairs are edited in the absence of any other genomically encoded mismatches. A comparison of 5′-editing in D. discoideum with 5′-editing in another slime mold, Polysphondylium pallidum, suggests organism-specific idiosyncrasies in the treatment of U-G/G-U pairs. In vitro activities of putative D. discoideum editing enzymes are consistent with the observed editing reactions and suggest an overall lack of tRNA substrate specificity exhibited by the repair component of the editing enzyme. Although the presence of terminal mismatches in mt-tRNA sequences is highly predictive of the occurrence of mt-tRNA 5′-editing, the variability in treatment of U-G/G-U base pairs observed here indicates that direct experimental evidence of 5′-editing must be obtained to understand the complete spectrum of mt-tRNA editing events in any species.  相似文献   

20.
G protein-coupled receptors (GPCRs) are seven-transmembrane-spanning proteins that mediate cellular and physiological responses. They are critical for cardiovascular function and are targeted for the treatment of hypertension and heart failure. Nevertheless, current therapies only target a small fraction of the cardiac GPCR repertoire, indicating that there are many opportunities to investigate unappreciated aspects of heart biology. Here, we offer an update on the contemporary view of GPCRs and the complexities of their signalling, and review the roles of the ‘classical’ GPCRs in cardiovascular physiology and disease. We then provide insights into other GPCRs that have been less extensively studied in the heart, including orphan, odorant and taste receptors. We contend that these novel cardiac GPCRs contribute to heart function in health and disease and thereby offer exciting opportunities to therapeutically modulate heart function.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号