Similar literature
 20 similar documents found; search time 31 ms
1.
We here present jTraML, a Java API for the Proteomics Standards Initiative TraML data standard. The library provides fully functional classes for all elements specified in the TraML XSD document, as well as convenient methods to construct controlled vocabulary-based instances required to define SRM transitions. The use of jTraML is demonstrated via a two-way conversion tool between TraML documents and vendor-specific files, facilitating the adoption process of this new community standard. The library is released as open source under the permissive Apache2 license and can be downloaded from http://jtraml.googlecode.com. TraML files can also be converted online at http://iomics.ugent.be/jtraml.

2.
The theme of the third annual Spring workshop of the HUPO-PSI was "proteomics and beyond" and its underlying goal was to reach beyond the boundaries of the proteomics community to interact with groups working on the similar issues of developing interchange standards and minimal reporting requirements. Significant developments in many of the HUPO-PSI XML interchange formats, minimal reporting requirements and accompanying controlled vocabularies were reported, with many of these now feeding into the broader efforts of the Functional Genomics Experiment (FuGE) data model and Functional Genomics Ontology (FuGO) ontologies.

3.
The EMBL-EBI Complex Portal is a knowledgebase of macromolecular complexes providing persistent stable identifiers. Entries are linked to literature evidence and provide details of complex membership, function, structure and complex-specific Gene Ontology annotations. Data are freely available and downloadable in HUPO-PSI community standards and missing entries can be requested for curation. In collaboration with Saccharomyces Genome Database and UniProt, the yeast complexome, a compendium of all known heteromeric assemblies from the model organism Saccharomyces cerevisiae, was curated. This expansion of knowledge and scope has led to a 50% increase in curated complexes compared to the previously published dataset, CYC2008. The yeast complexome is used as a reference resource for the analysis of complexes from large-scale experiments. Our analysis showed that genes coding for proteins in complexes tend to have more genetic interactions, are co-expressed with more genes, are more multifunctional, localize more often in the nucleus, and are more often involved in nucleic acid-related metabolic processes and processes where large machineries are the predominant functional drivers. A comparison to genetic interactions showed that about 40% of expanded co-complex pairs also have genetic interactions, suggesting strong functional links between complex members.

4.
Topic models and neural networks can discover meaningful low-dimensional latent representations of text corpora; as such, they have become a key technology of document representation. However, such models presume all documents are non-discriminatory, resulting in latent representations that depend on all other documents and an inability to provide discriminative document representation. To address this problem, we propose a semi-supervised manifold-inspired autoencoder to extract meaningful latent representations of documents, taking the local perspective that the latent representations of nearby documents should be correlated. We first determine the discriminative neighbor set using Euclidean distance in the observation space. Then, the autoencoder is trained by joint minimization of the Bernoulli cross-entropy error between input and output and the sum of the squared error between neighbors of the input and the output. Results on two widely used corpora show that our method yields at least a 15% improvement in document clustering and a nearly 7% improvement in classification tasks compared with the methods it was evaluated against. The evidence demonstrates that our method can readily capture more discriminative latent representations of new documents. Moreover, some meaningful combinations of words can be efficiently discovered by activating features that promote the comprehensibility of the latent representation.
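The joint objective described in this abstract can be illustrated with a minimal NumPy sketch. It is one possible reading of the abstract, not the authors' implementation: the function names, the `weight` parameter, and the choice to compare the reconstruction of document i against each of its k nearest neighbors are assumptions, and the semi-supervised use of labels is left out because the abstract does not specify it.

```python
import numpy as np

def bernoulli_cross_entropy(x, x_hat, eps=1e-12):
    """Bernoulli cross-entropy between a binary input x and its reconstruction x_hat."""
    x_hat = np.clip(x_hat, eps, 1.0 - eps)  # avoid log(0)
    return float(-np.sum(x * np.log(x_hat) + (1.0 - x) * np.log(1.0 - x_hat)))

def nearest_neighbors(X, i, k):
    """Indices of the k rows of X closest to row i by Euclidean distance (excluding i)."""
    d = np.linalg.norm(X - X[i], axis=1)
    d[i] = np.inf  # a document is not its own neighbor
    return np.argsort(d)[:k]

def joint_loss(X, X_hat, i, k=2, weight=1.0):
    """Reconstruction error for document i plus squared error to its neighbors (assumed form)."""
    recon = bernoulli_cross_entropy(X[i], X_hat[i])
    nbrs = nearest_neighbors(X, i, k)
    neighbor_term = float(np.sum((X[nbrs] - X_hat[i]) ** 2))
    return recon + weight * neighbor_term
```

In a real model `X_hat` would come from the autoencoder's decoder and the loss would be minimized by gradient descent; here it is only evaluated on given arrays.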

5.
Orchard S, Jones AR, Stephan C, Binz PA. Proteomics, 2007, 7(7): 1006-1008
The plenary session of the Proteomics Standards Initiative of the Human Proteome Organisation provided an opportunity to update delegates on the progress of the work of the Human Proteome Organisation's Proteomics Standards Initiative (HUPO-PSI) to develop and implement standards in the field of proteomics. Significant advances have been made since the previous congress, with several of the interchange standards and minimal requirements documents being submitted for publication in the literature and being more widely adopted by both manufacturers and data repositories. An exciting development over the interim twelve months is the ongoing merger of the two existing mass spectrometry standards, the PSI mzData and Institute for Systems Biology mzXML, into a single product. This should be achieved by early in 2007.

6.
The role of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) is to produce and release community-accepted reporting requirements, interchange formats and controlled vocabularies for mass spectrometry proteomics and related technologies such as gel electrophoresis, column chromatography and molecular interactions. A number of significant advances were made at this workshop, with the new MS standard, mzML, being finalised prior to release on 1st June 2008 and analysisXML, which will allow protein and peptide identifications and post-translational modifications to be captured, being prepared to enter the review process this summer. The accompanying controlled vocabularies are continuing to evolve and a number of standards papers are now being finalised prior to publication.

7.
Evolutionary processes such as adaptation, ecological filtering, and niche conservatism involve the interaction of organisms with their environment and are thus commonly studied along environmental gradients. Elevational gradients have become among the most studied environmental gradients to understand large-scale patterns of species richness and composition because they are highly replicated with different combinations of geographical, environmental and historical factors. We here review the literature on using elevational gradients to understand evolutionary processes in ferns. Some phylogenetic studies of individual fern clades have considered elevation in the analysis or interpretation and postulated that fern diversification is linked to the colonization of mountain habitats. Other studies that have linked elevational community composition, and hence ecological filtering, with phylogenetic community composition and morphological traits usually found only limited phylogenetic signal. However, these studies are ultimately only correlational, and there are few actual tests of the evolutionary mechanisms leading to these patterns. We identify a number of challenges for improving our understanding of how evolutionary and ecological processes are linked to elevational richness patterns in ferns: i) limited information on traits and their ecological relevance, ii) uncertainties about the dispersal kernels of ferns and hence the delimitation of regional species pools from which local assemblages are recruited, iii) limited genomic data to identify candidate genes under selection and hence actually document adaptation and selection, and iv) conceptual challenges in developing clear and testable hypotheses about how specific evolutionary processes can be linked to patterns in community composition and species richness.

8.
Submittals under CERCLA, RCRA, and related environmental regulatory programs often require voluminous data and narrative, supported by multiple appendices of supplemental data. Often lost in the encyclopedic maze of documentation is the key message of the respondent. Also lost is the fact that every document, however data-driven or technical, has a message that should be readable and understandable to both the technical expert and the general reader alike. This is especially true for documents that will undergo public scrutiny at some phase. The regulatory review process can be facilitated if the respondents submitting the documentation present a clear and understandable message that is first understood by the respondents and subsequently by the intended audience. Technical communicators can play an important role in assuring that even a highly complex technical document filled with assumptions, qualifiers, analysis, and quantification retains a clear and understandable message. Simple tools that we may take for granted can be employed throughout the writing process. These tools can range from titles, outlines, and team message slogans to editorial style sheets and review checklists. These tools, used as message reminders, can assist members of a documentation team to stay en pointe, to keep the message in focus while the document is being prepared. Properly applied, these tools translate into a clear message for readers, despite the legitimate and real concerns of sophisticated analysis and scientific inquiry. A clear message, in turn, can expedite the regulatory review process by helping to minimize extraneous comments and excessive requests for clarification from regulatory agencies.

9.
Falkner JA, Hill JA, Andrews PC. Proteomics, 2008, 8(9): 1756-1757
A FASTA file archive and reference resource has been added to ProteomeCommons.org. Motivation for this new functionality derives from two primary sources. The first is the recent FASTA standardization work done by the Human Proteome Organization's Proteomics Standards Initiative (HUPO-PSI). Second is the general lack of a uniform mechanism to properly cite FASTA files used in a study, and to publicly access such FASTA files post-publication. An extension to the Tranche data sharing network has been developed that includes web-pages, documentation, and tools for facilitating the use of FASTA files. These include conversion to the new HUPO-PSI format, and provisions for both citing and publicly archiving FASTA files. This new resource is available immediately, free of charge, and can be accessed at http://www.proteomecommons.org/data/fasta/. Source-code for related tools is also freely available under the BSD license.

10.
We introduce and make publicly available a large corpus of digitized primary source human rights documents which are published annually by monitoring agencies that include Amnesty International, Human Rights Watch, the Lawyers Committee for Human Rights, and the United States Department of State. In addition to the digitized text, we also make available and describe document-term matrices, which are datasets that systematically organize the word counts from each unique document by each unique term within the corpus of human rights documents. To contextualize the importance of this corpus, we describe the development of coding procedures in the human rights community and several existing categorical indicators that have been created by human coding of the human rights documents contained in the corpus. We then discuss how the new human rights corpus and the existing human rights datasets can be used with a variety of statistical analyses and machine learning algorithms to help scholars understand how human rights practices and reporting have evolved over time. We close with a discussion of our plans for dataset maintenance, updating, and availability.
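A document-term matrix of the kind described in this abstract can be sketched in a few lines of standard-library Python. The whitespace tokenizer, the function name `document_term_matrix`, and the toy input strings are assumptions for illustration only; they are not taken from the corpus or from the authors' processing pipeline.

```python
from collections import Counter

def document_term_matrix(docs):
    """Return (vocabulary, rows), where rows[d][t] is the count of term t in document d."""
    # Naive tokenization: lowercase and split on whitespace (an assumption).
    tokenized = [doc.lower().split() for doc in docs]
    # One column per unique term in the whole corpus, in sorted order.
    vocab = sorted({term for tokens in tokenized for term in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)
        rows.append([counts[term] for term in vocab])  # Counter returns 0 for absent terms
    return vocab, rows

# Toy usage with made-up strings:
vocab, rows = document_term_matrix(["annual report filed", "no report no filing"])
```

Each row is one document and each column one unique term, which is the layout the abstract describes; a production pipeline would normally use a sparse representation and a more careful tokenizer.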

11.
The Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) was established in 2002 with the aim of defining community standards for data representation in proteomics and facilitating data comparison, exchange and verification. Over the last 10 years significant advances have been made, with common data standards now published and implemented in the field of both mass spectrometry and molecular interactions. The 2012 meeting further advanced this work, with the mass spectrometry groups finalising approaches to capturing the output from recent developments in the field, such as quantitative proteomics and SRM. The molecular interaction group focused on improving the integration of data from multiple resources. Both groups united with a guest work track, organized by the HUPO Technology/Standards Committee, to formulate proposals for data submissions from the HUPO Human Proteome Project and to start an initiative to collect standard experimental protocols.

12.
The Human Proteome Organization's Proteomics Standards Initiative (PSI) promotes the development of exchange standards to improve data integration and interoperability. PSI specifies the suitable level of detail required when reporting a proteomics experiment (via the Minimum Information About a Proteomics Experiment), and provides extensible markup language (XML) exchange formats and dedicated controlled vocabularies (CVs) that must be combined to generate a standard compliant document. The framework presented here tackles the issue of checking that experimental data reported using a specific format, CVs and public bio‐ontologies (e.g. Gene Ontology, NCBI taxonomy) are compliant with the Minimum Information About a Proteomics Experiment recommendations. The semantic validator not only checks the XML syntax but also enforces rules regarding the use of an ontology class or CV terms by checking that the terms exist in the resource and that they are used in the correct location of a document. Moreover, this framework is extremely fast, even on sizable data files, and flexible, as it can be adapted to any standard by customizing the parameters it requires: an XML Schema Definition, one or more CVs or ontologies, and a mapping file describing in a formal way how the semantic resources and the format are interrelated. As such, the validator provides a general solution to the common problem in data exchange: how to validate the correct usage of a data standard beyond simple XML Schema Definition validation. The framework source code and its various applications can be found at http://psidev.info/validator.
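The two semantic checks named in this abstract (a term must exist in the vocabulary, and it must appear in a permitted location) can be sketched with Python's standard XML parser. This is a hypothetical illustration, not the PSI validator's code or API: the `cvParam` element and `accession` attribute are assumed names, the accessions in the usage note are made up, the mapping is a plain dictionary rather than a mapping file, and XML Schema validation is skipped entirely.

```python
import xml.etree.ElementTree as ET

def validate(xml_text, known_terms, allowed_by_parent):
    """Return a list of error strings for CV terms that are unknown or misplaced.

    known_terms: set of accessions present in the controlled vocabulary.
    allowed_by_parent: dict mapping a parent element tag to the set of
    accessions permitted directly under it (a stand-in for a mapping file).
    """
    errors = []
    root = ET.fromstring(xml_text)  # raises ParseError if the XML is not well-formed
    for parent in root.iter():      # every element, in document order
        for child in parent:        # direct children only
            if child.tag != "cvParam":
                continue
            acc = child.get("accession")
            if acc not in known_terms:
                errors.append(f"{acc}: term not found in vocabulary")
            elif acc not in allowed_by_parent.get(parent.tag, set()):
                errors.append(f"{acc}: not allowed under <{parent.tag}>")
    return errors
```

Calling `validate` with a document in which the made-up term `CV:0001` is permitted under `<instrument>` but also appears under `<sample>` would report the second occurrence as misplaced, while an accession absent from `known_terms` would be reported as unknown.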

13.
Orchard S, Ping P. Proteomics, 2006, 6(16): 4436-4438
This meeting was convened with the aim of bringing together representatives from scientific journals, granting authorities, software and instrumentation manufacturers, data producers and database providers to discuss the implementation and adoption of the HUPO-PSI data standards and how these can be best used to support the publication and dissemination of proteomics data. The current status of data formats and reporting requirements was reviewed and the attendees agreed that the use of data standards was essential as the field of proteomics grows and matures.

14.
The future for dynamic signature verification systems seems inextricably linked to the future of digital signatures, e-commerce, workflow management and non-repudiation of documents. Over the past year, the lines between document management and e-commerce markets have been blurring as document management has moved on-line.

15.
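Bioinformatics draws on knowledge from biology, chemistry, physics, mathematics, computer science and Internet applications, so research in this field requires command of a broad range of subjects. Much of the field's material and literature currently exists online as electronic documents. Electronic documents can hold broader and richer content than conventional documents, and collecting and organizing them can greatly reduce document search time and provide more specialized and effective retrieval results in support of research. This paper describes a scheme covering the collection, annotation and optimized retrieval of electronic documents, in which search results are listed in ranked order by the similarity between each document and the query. Practice shows that the system can effectively support the management and retrieval of specialized bioinformatics information.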

16.
Patterns and Processes of Microbial Community Assembly   (cited by: 1; self-citations: 0; citations by others: 1)

SUMMARY

Recent research has expanded our understanding of microbial community assembly. However, the field of community ecology is inaccessible to many microbial ecologists because of inconsistent and often confusing terminology as well as unnecessarily polarizing debates. Thus, we review recent literature on microbial community assembly, using the framework of Vellend (Q. Rev. Biol. 85:183–206, 2010) in an effort to synthesize and unify these contributions. We begin by discussing patterns in microbial biogeography and then describe four basic processes (diversification, dispersal, selection, and drift) that contribute to community assembly. We also discuss different combinations of these processes and where and when they may be most important for shaping microbial communities. The spatial and temporal scales of microbial community assembly are also discussed in relation to assembly processes. Throughout this review paper, we highlight differences between microbes and macroorganisms and generate hypotheses describing how these differences may be important for community assembly. We end by discussing the implications of microbial assembly processes for ecosystem function and biodiversity.

17.
Photosynthesis Research - In vivo analyses of electron and proton transport-related processes as well as photoprotective responses were carried out at different stages of growth in chlorophyll b...

18.
19.
Categorization of biomedical articles is a central task for supporting various curation efforts. It can also form the basis for effective biomedical text mining. Automatic text classification in the biomedical domain is thus an active research area. Contests organized by the KDD Cup (2002) and the TREC Genomics track (since 2003) defined several annotation tasks that involved document classification, and provided training and test data sets. So far, these efforts focused on analyzing only the text content of documents. However, as was noted in the KDD'02 text mining contest (where figure captions proved to be an invaluable feature for identifying documents of interest), images often provide curators with critical information. We examine the possibility of using information derived directly from image data, and of integrating it with text-based classification, for biomedical document categorization. We present a method for obtaining features from images and for using them, both alone and in combination with text, to perform the triage task introduced in the TREC Genomics track 2004. The task was to determine which documents are relevant to a given annotation task performed by the Mouse Genome Database curators. We show preliminary results, demonstrating that the method has a strong potential to enhance and complement traditional text-based categorization methods.

20.
Waterborne cryptosporidiosis   (cited by: 5; self-citations: 0; citations by others: 5)
Awareness of the importance of Cryptosporidium as a gastrointestinal parasite of developed countries not only stems from its prevalence in AIDS patients but also from its recent recognition as a possible contaminant of drinking water supplies. The importance of Cryptosporidium to public health has recently been revealed by a series of major epidemics of diarrhoeal disease in the USA and UK. In this review, Huw Smith and Joan Rose document what is known of the causes of some of these outbreaks and explain why this parasite can escape the battery of treatment processes normally used for drinking water supplies in these countries.
