Similar articles
1.
2.
MOTIVATION: The human genome project and the development of new high-throughput technologies have created unparalleled opportunities to study the mechanisms of disease, monitor disease progression and evaluate effective therapies. Gene expression profiling is a critical tool for accomplishing these goals. The use of nucleic acid microarrays to assess the expression of thousands of genes simultaneously has seen phenomenal growth over the past five years. Although commercial sources of microarrays exist, investigators wanting more flexibility in the genes represented on the array will turn to in-house production. The creation and use of cDNA microarrays is a complicated process that generates an enormous amount of information. Effective management of this information is essential to efficiently access, analyze, troubleshoot and evaluate the microarray experiments. RESULTS: We have developed a distributable software package designed to track and store the various pieces of data generated by a cDNA microarray facility. These include the clone collection storage data, annotation data, workflow queues, microarray data, data repositories, sample submission information, and project/investigator information. The application was designed using a three-tier client-server model. The data access layer (1st tier) contains the relational database system tuned to support a large number of transactions. The data services layer (2nd tier) is a distributed COM server with full database transaction support. The application layer (3rd tier) is an Internet-based user interface that contains both client- and server-side code for dynamic interactions with the user. AVAILABILITY: This software is freely available to academic institutions and non-profit organizations at http://www.genomics.mcg.edu/niddkbtc.
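The three-tier separation described in this abstract can be illustrated with a minimal sketch. It is not the package's actual design: an in-memory SQLite database stands in for the relational database, a plain Python class stands in for the distributed COM server, and the table layout and all names (`open_database`, `CloneService`, `render_clone`) are hypothetical.

```python
import sqlite3


# Tier 1 (data access): an in-memory SQLite database stands in for the
# relational database system. The table layout is hypothetical.
def open_database():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE clones (clone_id TEXT PRIMARY KEY, plate TEXT, well TEXT)"
    )
    return conn


# Tier 2 (data services): the only layer that talks to the database.
class CloneService:
    def __init__(self, conn):
        self.conn = conn

    def register_clones(self, clones):
        # "with conn" commits if every insert succeeds and rolls the whole
        # batch back if any insert raises (e.g. a duplicate clone_id).
        with self.conn:
            self.conn.executemany("INSERT INTO clones VALUES (?, ?, ?)", clones)

    def find_clone(self, clone_id):
        cursor = self.conn.execute(
            "SELECT clone_id, plate, well FROM clones WHERE clone_id = ?",
            (clone_id,),
        )
        return cursor.fetchone()


# Tier 3 (application): formats results for the user and never issues SQL.
def render_clone(service, clone_id):
    row = service.find_clone(clone_id)
    if row is None:
        return f"Clone {clone_id}: not found"
    return f"Clone {row[0]}: plate {row[1]}, well {row[2]}"
```

The point of the layering is that the user interface depends only on the service layer, so the storage engine could be swapped without touching the presentation code.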

3.
The analysis of proteomes of biological organisms represents a major challenge of the post-genome era. Classical proteomics combines two-dimensional electrophoresis (2-DE) and mass spectrometry (MS) for the identification of proteins. Novel technologies such as isotope coded affinity tag (ICAT)-liquid chromatography/mass spectrometry (LC/MS) open new insights into protein alterations. The vast amount and diverse types of proteomic data require adequate web-accessible computational and database technologies for storage, integration, dissemination, analysis and visualization. A proteome database system (http://www.mpiib-berlin.mpg.de/2D-PAGE) for microbial research has been constructed which integrates 2-DE/MS, ICAT-LC/MS and functional classification data of proteins with genomic, metabolic and other biological knowledge sources. The two-dimensional polyacrylamide gel electrophoresis database delivers experimental data on microbial proteins, including mass spectra for the validation of protein identification. The ICAT-LC/MS database comprises experimental data on protein alterations between the mycobacterial strains BCG and H37Rv. By formulating complex queries within a functional protein classification database "FUNC_CLASS" for Mycobacterium tuberculosis and Helicobacter pylori, the researcher can gather precise information on genes, proteins, protein classes and metabolic pathways. The use of the R language in the database architecture allows high-level data analysis and visualization to be performed "on-the-fly". The database system is centrally administered, and investigators without specific bioinformatic competence in database construction can submit their data. The database system also serves as a template for a prototype of a European Proteome Database of Pathogenic Bacteria. Currently, the database system includes proteome information for six strains of microorganisms.

4.
Some participants in HIV prevention trials will become infected with HIV despite risk reduction measures. The subsequent treatment responsibilities of sponsor-investigators have been widely debated, especially where antiretroviral therapy (ART) is not accessible. In this paper, we explore two accounts of beneficence to establish whether they can shed light on sponsor-investigator responsibilities. We find the notion of general beneficence helpful insofar as it clarifies that some beneficent actions will be obligatory where they can be dispensed without scuppering the trial. We find the notion of specific beneficence helpful insofar as it directs investigators to attend to the needs of trial participants; however, the range of interventions that could be provided remains unhelpfully broad. We then examine accounts of the investigator-participant relationship to narrow the range of interventions that investigators should provide, concluding that health care, and HIV infection, are appropriate foci. We conclude that when investigators are able to meet the ART needs of their participants (e.g. by referral, assisted referral or direct provision) without sacrificing trial quality, they must do so. However, little of this explicit direction is to be found in the account of specific beneficence itself; rather, it is found in accounts of the relationship that are compatible with beneficence.

5.
6.

Background

Male circumcision (MC) is a well-established component of HIV prevention in countries with high HIV prevalence and heterosexually driven epidemics. Delivery and monitoring of MC programs are reliant on good quality MC data. Such data are often generated through self-reported MC status surveys. This study examined self-reported MC status in comparison with genital photographs from men in Papua New Guinea (PNG).

Methods

This retrospective non-interventional study collated self-reported MC status data from the ‘acceptability and feasibility of MC’ study at 4 sites in PNG during 2010–2011. Participants reported their MC status based on an 8-category photographic classification covering the range of foreskin cutting practices in PNG. Genital photographs of 222 participants from this study were independently classified by 2 investigators. The 8-category photographic classification was simplified into a 3-category classification of ‘no cut’, ‘straight cut’ and ‘round cut’ before agreement between self-reporting and investigator assessment was measured using Cohen’s kappa.

Results

Using the 3-category classification, there was 90.6% (201/222) agreement between self-assessment and investigator classification (κ value 0.805). Of the discordant 9.4% (21/222), 3.6% (8/222) self-classified as having a cut foreskin (5 straight cut; 3 round cut) while investigators classified them as having no cut; 4.1% (9/222) self-classified as having no cut while investigators classified them as having had a cut (6 straight cut; 3 round cut); and 1.8% (4/222) self-classified as having a round cut while investigators classified them as having a straight cut. Given the great variety of foreskin cutting practices and appearances, feasible explanations are suggested for two-thirds (13/21) of these discordant results.

Conclusions

This study demonstrates a high level of agreement between self-reporting and investigator assessment of MC status in PNG and suggests self-reporting of MC status to be highly reliable among men in PNG.
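The agreement statistic named in this abstract, Cohen's kappa, compares the observed agreement p_o with the agreement p_e expected by chance from the two raters' marginal totals: κ = (p_o − p_e) / (1 − p_e). A minimal sketch follows. The abstract does not report the full 3×3 table, so the example uses a toy 2×2 table that is purely illustrative and is not the study's data; the function name `cohen_kappa` is likewise an assumption.

```python
def cohen_kappa(matrix):
    """Cohen's kappa for a square confusion matrix of counts.

    matrix[i][j] is the number of subjects placed in category i by the
    first rater and in category j by the second rater.
    """
    k = len(matrix)
    n = sum(sum(row) for row in matrix)
    # Observed agreement: share of subjects on the diagonal.
    observed = sum(matrix[i][i] for i in range(k)) / n
    # Chance agreement from the row and column marginal totals.
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(matrix[i][j] for i in range(k)) for j in range(k)]
    expected = sum(row_totals[i] * col_totals[i] for i in range(k)) / (n * n)
    # Undefined (division by zero) when chance agreement is exactly 1.
    return (observed - expected) / (1 - expected)


# Toy table, not taken from the study: the raters agree on 35 of 50 subjects.
toy_table = [[20, 5],
             [10, 15]]
toy_kappa = cohen_kappa(toy_table)
```

For the toy table, p_o = 35/50 = 0.7 and p_e = (25·30 + 25·20)/50² = 0.5, so κ = 0.4. Reproducing the reported κ of 0.805 would require the diagonal counts of the study's own table, which the abstract does not give.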

7.
8.
As an archive of sequence data for over 165,000 species, GenBank is an indispensable resource for phylogenetic inference. Here we describe an informatics processing pipeline and online database, the PhyLoTA Browser (http://loco.biosci.arizona.edu/pb), which offers a view of GenBank tailored for molecular phylogenetics. The first release of the Browser is computed from 2.6 million sequences representing the taxonomically enriched subset of GenBank sequences for eukaryotes (excluding most genome survey sequences, ESTs, and other high-throughput data). In addition to summarizing sequence diversity and species diversity across nodes in the NCBI taxonomy, it reports 87,000 potentially phylogenetically informative clusters of homologous sequences, which can be viewed or downloaded, along with provisional alignments and coarse phylogenetic trees. At each node in the NCBI hierarchy, the user can display a "data availability matrix" of all available sequences for entries in a subtaxa-by-clusters matrix. This matrix provides a guidepost for subsequent assembly of multigene data sets or supertrees. The database allows for comparison of results from previous GenBank releases, highlighting recent additions of either sequences or taxa to GenBank and letting investigators track progress on data availability worldwide. Although the reported alignments and trees are extremely approximate, the database reports several statistics correlated with alignment quality to help users choose from alternative data sources.
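The "data availability matrix" mentioned in this abstract can be pictured as a table with one row per subtaxon and one column per sequence cluster. The sketch below is only an illustration of that data structure, not the PhyLoTA implementation: it assumes each cell holds a count of available sequences, and the function name, record format and taxon/cluster labels are all hypothetical.

```python
def availability_matrix(records):
    """Build a subtaxa-by-clusters matrix of sequence counts.

    records: iterable of (taxon, cluster) pairs, one pair per sequence.
    Returns (taxa, clusters, matrix), where matrix[i][j] is the number of
    sequences from taxa[i] that belong to clusters[j].
    """
    records = list(records)  # allow generators; we iterate more than once
    taxa = sorted({taxon for taxon, _ in records})
    clusters = sorted({cluster for _, cluster in records})
    counts = {(t, c): 0 for t in taxa for c in clusters}
    for taxon, cluster in records:
        counts[(taxon, cluster)] += 1
    matrix = [[counts[(t, c)] for c in clusters] for t in taxa]
    return taxa, clusters, matrix


# Hypothetical records: three taxa, two gene clusters.
example_records = [
    ("taxon_A", "cluster_1"),
    ("taxon_A", "cluster_1"),
    ("taxon_A", "cluster_2"),
    ("taxon_B", "cluster_2"),
    ("taxon_C", "cluster_1"),
]
taxa, clusters, matrix = availability_matrix(example_records)
```

Zero cells mark taxon and cluster combinations with no data, which is the kind of gap an investigator would look for when assembling a multigene data set.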

9.
10.
11.
Automated genome sequence analysis and annotation.   Total citations: 5 (self-citations: 0; citations by others: 5)
MOTIVATION: Large-scale genome projects generate a rapidly increasing number of sequences, most of them biochemically uncharacterized. Research in bioinformatics contributes to the development of methods for the computational characterization of these sequences. However, the installation and application of these methods require experience and are time consuming. RESULTS: We present here an automatic system for preliminary functional annotation of protein sequences that has been applied to the analysis of sets of sequences from complete genomes, both to refine overall performance and to make new discoveries comparable to those made by human experts. The GeneQuiz system includes a Web-based browser that allows examination of the evidence leading to an automatic annotation and offers additional information, views of the results, and links to biological databases that complement the automatic analysis. System structure and operating principles concerning the use of multiple sequence databases, underlying sequence analysis tools, lexical analyses of database annotations and decision criteria for functional assignments are detailed. The system makes automatic quality assessments of results based on prior experience with the underlying sequence analysis tools; overall error rates in functional assignment are estimated at 2.5-5% for cases annotated with highest reliability ('clear' cases). Sources of over-interpretation of results are discussed with proposals for improvement. A conservative definition for reporting 'new findings' that takes account of database maturity is presented along with examples of possible kinds of discoveries (new function, family and superfamily) made by the system. System performance in relation to sequence database coverage, database dynamics and database search methods is analysed, demonstrating the inherent advantages of an integrated automatic approach using multiple databases and search methods applied in an objective and repeatable manner. 
AVAILABILITY: The GeneQuiz system is publicly available for analysis of protein sequences through a Web server at http://www.sander.ebi.ac.uk/gqsrv/submit

12.
HOWDY: an integrated database system for human genome research   Total citations: 1 (self-citations: 0; citations by others: 1)
HOWDY is an integrated database system for accessing and analyzing human genomic information (http://www-alis.tokyo.jst.go.jp/HOWDY/). HOWDY stores information about relationships between genetic objects and the data extracted from a number of databases. HOWDY consists of an Internet-accessible user interface that allows thorough searching of the human genomic databases using gene symbols and their aliases. It also permits flexible editing of the sequence data. The database can be searched using simple words, and the search can be restricted to a specific cytogenetic location. Linear maps displaying markers and genes on contig sequences are available, from which an object can be chosen. Any search starting point identifies all the information matching the query. HOWDY provides a convenient environment for searching human genomic data for scientists unsure which database is most appropriate for their search.

13.
14.
15.
As high-throughput techniques including proteomics become more accessible to individual laboratories, there is an urgent need for a user-friendly bioinformatics analysis system. Here, we describe FunRich, an open access, standalone functional enrichment and network analysis tool. FunRich is designed to be used by biologists with minimal or no support from computational and database experts. Using FunRich, users can perform functional enrichment analysis on background databases that are integrated from heterogeneous genomic and proteomic resources (>1.5 million annotations). Besides the default human-specific FunRich database, users can download data from the UniProt database, which currently supports 20 different taxonomies against which enrichment analysis can be performed. Moreover, users can build their own custom databases and perform the enrichment analysis irrespective of organism. In addition to proteomics datasets, the custom database allows the tool to be used for genomics, lipidomics and metabolomics datasets. Thus, FunRich allows complete database customization and thereby permits the tool to be used as a skeleton for enrichment analysis irrespective of the data type or organism used. FunRich (http://www.funrich.org) is user-friendly and provides graphical representation (Venn, pie charts, bar graphs, column, heatmap and doughnuts) of the data with customizable font, scale and color (publication quality).
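Functional enrichment analysis of the kind described in this abstract asks whether an annotation term appears in a user's gene or protein list more often than expected from the background database. The abstract does not say which statistical test FunRich applies; the sketch below assumes a one-sided hypergeometric test, a common choice for this task, and the function name and example numbers are hypothetical.

```python
from math import comb


def enrichment_p_value(background_size, annotated_in_background,
                       list_size, annotated_in_list):
    """One-sided hypergeometric p-value for over-representation.

    Probability of seeing at least `annotated_in_list` annotated items when
    `list_size` items are drawn without replacement from a background of
    `background_size` items, of which `annotated_in_background` carry the
    annotation.
    """
    upper = min(list_size, annotated_in_background)
    favourable = sum(
        comb(annotated_in_background, i)
        * comb(background_size - annotated_in_background, list_size - i)
        for i in range(annotated_in_list, upper + 1)
    )
    # Exact integer arithmetic up to this single division.
    return favourable / comb(background_size, list_size)


# Hypothetical numbers: 4 of 10 background proteins carry the term, and
# 2 of the 3 proteins in the user's list carry it.
p_value = enrichment_p_value(10, 4, 3, 2)
```

With these toy numbers the p-value is (C(4,2)·C(6,1) + C(4,3)·C(6,0)) / C(10,3) = 40/120 = 1/3. A real analysis would also correct for testing many terms at once.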

16.
Computer-based sequence analysis, notation, and manipulation are a necessity for all molecular biologists working with any but the most simple DNA sequences. As sequence data become increasingly available, tools that can be used to manipulate and annotate individual sequences and sequence elements will become an even more vital implement in the molecular biologist's arsenal. The Omiga DNA and Protein Sequence Analysis Software tool, version 2.0 provides an effective and comprehensive tool for the analysis of both nucleic acid and protein sequences that runs on a standard PC available in every molecular biology laboratory. Omiga allows the import of sequences in several common formats. Upon importing sequences and assigning them to various projects, Omiga allows the user to produce, analyze, and edit sequence alignments. Sequences may also be queried for the presence of restriction sites, sequence motifs, and other sequence features, all of which can be added into the notations accompanying each sequence. This newest version of Omiga also allows for sequencing and polymerase chain reaction (PCR) primer prediction, a functionality missing in earlier versions. Finally, Omiga allows rapid searches for putative coding regions, and Basic Local Alignment Search Tool (BLAST) queries against public databases at the National Center for Biotechnology Information (NCBI).
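Querying a sequence for restriction sites, one of the features listed in this abstract, reduces to locating every occurrence of a recognition sequence. The sketch below shows the idea only and says nothing about how Omiga implements it. The function name is hypothetical, and the example uses the EcoRI recognition sequence GAATTC, which is palindromic, so scanning one strand is sufficient; a non-palindromic site would also need a scan of the reverse complement.

```python
def find_sites(sequence, site):
    """Return the 0-based start positions of every occurrence of `site`.

    Overlapping occurrences are reported, and matching is case-insensitive.
    """
    sequence = sequence.upper()
    site = site.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        # Resume one base further on so overlapping matches are not missed.
        start = sequence.find(site, start + 1)
    return positions


ECORI_SITE = "GAATTC"  # EcoRI recognition sequence
example_positions = find_sites("aaGAATTCgggaattc", ECORI_SITE)
```

In the example sequence the site starts at positions 2 and 10. The same function serves for short exact motifs, while degenerate motifs would need pattern matching instead.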

17.
18.
The Los Alamos hepatitis C sequence database   Total citations: 6 (self-citations: 0; citations by others: 6)
MOTIVATION: The hepatitis C virus (HCV) is a significant threat to public health worldwide. The virus is highly variable and evolves rapidly, making it an elusive target for the immune system and for vaccine and drug design. At present, some 30 000 HCV sequences have been published. A central website that provides annotated sequences and analysis tools will be helpful to HCV scientists worldwide. RESULTS: The HCV sequence database collects and annotates sequence data and provides them to the public via a website that contains a user-friendly search interface and a large number of sequence analysis tools, based on the model of the highly regarded Los Alamos HIV database. The HCV sequence database was officially launched in September 2003. Since then, its usage has steadily increased and is now at an average of approximately 280 visits per day from distinct IP addresses. AVAILABILITY: The HCV website can be accessed via http://hcv.lanl.gov and http://hcv-db.org.

19.
The Phytophthora Genome Initiative (PGI) is a distributed collaboration to study the genome and evolution of a particularly destructive group of plant-pathogenic oomycetes, with the goal of understanding the mechanisms of infection and resistance. NCGR provides informatics support for the collaboration as well as a centralized data repository. In the pilot phase of the project, several investigators prepared Phytophthora infestans and Phytophthora sojae EST and Phytophthora sojae BAC libraries and sent them to another laboratory for sequencing. Data from sequencing reactions were transferred to NCGR for analysis and curation. An analysis pipeline transforms raw data by performing simple analyses (i.e., vector removal and similarity searching), the results of which are stored and can be retrieved by investigators using a web browser. Here we describe the database and access tools, provide an overview of the data therein and outline future plans. This resource has provided a unique opportunity for the distributed, collaborative study of a genus for which relatively little sequence data are available. Results may lead to insight into how better to control these pathogens. The homepage of PGI can be accessed at http://www.ncgr.org/pgi, with database access through the database access hyperlink.

20.
Phylogenetic analysis of the formin homology 2 domain   Total citations: 6 (self-citations: 0; citations by others: 6)
Formin proteins are key regulators of eukaryotic actin filament assembly and elongation, and many species possess multiple formin isoforms. A nomenclature system based on fundamental features would be desirable, to aid the rapid identification and characterization of novel formins. In this article, we attempt to systematize the formin family by performing phylogenetic analyses of the formin homology 2 (FH2) domain, an independently folding region common to all formins, which alone can influence actin dynamics. Through database searches, we identify 101 FH2 domains from 26 eukaryotic species, including 15 in mice. Sequence alignments reveal a highly conserved yeast-specific insert in the "knob loop" region of the FH2 domain, with unknown functional consequences. Phylogenetic analysis using minimum evolution (ME), maximum parsimony (MP), and maximum likelihood (ML) algorithms strongly supports the existence of seven metazoan groups. Yeast FH2 domains segregate from those of all other eukaryotes, including metazoans, other fungi, plants, and protists. Sequence comparisons of non-FH2 regions support relationships between three metazoan groups (Dia, DAAM, and FRL) and examine previously identified coiled-coil and Diaphanous auto-regulatory domain sequences. This analysis allows for a formin nomenclature system based on sequence relationships and suggests strategies for determining the biochemical and cellular activities of these proteins.
