Similar articles
 20 similar articles found (search time: 15 ms)
1.
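Version 4.0 of the Human Protein Atlas now contains more than 6,000 antibodies corresponding to 5,000 human genes. This release holds more than 5 million high-resolution immunohistochemistry and confocal laser microscopy images. Every image has been annotated by experienced pathologists, providing a knowledge base for functional studies and allowing both queries of protein expression profiles in normal and diseased tissues and literature searches. A new structure has been implemented that covers all predicted genes (about 20,400) and visualizes the features of all protein-coding genes. A new search tool has also been launched; it supports advanced queries, including searches by chromosomal location, protein class and/or tissue specificity. Used as a search tool, the Protein Atlas can help to discover potential biomarkers for cancer diagnostics.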

2.
3.
4.
5.
Multipurpose genes in the human genome which are over-expressed in a large variety of different cancers have been identified. Forty-two of the 19,016 human genes annotated to date (0.2%) are ubiquitously over-expressed in half or more of the 36 investigated human cancers. Of these genes, 15 are involved in protein biosynthesis and folding, and six in glycolysis. A group of 13 solid tumours over-express almost all (39-42 of 42) ubiquitous cancer genes, suggesting a common mechanism underlying these cancers. Others, such as endocrine cancers, have only a few over-expressed ubiquitous cancer genes. The proteins for which these genes code, or the corresponding antibodies, are candidates for small protein microarrays aiming at maximum information with only a limited number of proteins. Since the over-expression pattern varies from cancer to cancer, different cancer classes can be distinguished using one single set of protein or antibody molecules.

6.
Antibody microarrays offer new opportunities for exploring the proteome and identifying biomarker candidates in human serum and plasma. Here, we have investigated the effect of heat and detergents on an antibody-based suspension bead array (SBA) assay using polyclonal antibodies and biotinylated plasma samples. With protein profiles from more than 2300 antibodies generated in 384-plex antibody SBAs, three major classes of heat and detergent susceptibility could be described. The results show that washing the beads with SDS (rather than Tween) after target binding lowered the intensity levels of essentially all profiles, and that about 50% of the profiles appeared to be lowered to a similar extent by heating the sample. About 33% of the profiles appeared to be insensitive to heat treatment, while another 17% responded positively to heat, yielding elevated profiles. The results suggest that the classification of antibodies is driven by the molecular properties of the antibody-antigen interaction and generally cannot be predicted from protein class or Western blot data. The experimental scheme presented here can be used to systematically categorize antibodies and thereby combine antibodies with similar properties into targeted arrays for analysis of plasma and serum.

7.
Understanding the categorization of human diseases is critical for reliably identifying disease causal genes. Recently, genome-wide studies of abnormal chromosomal locations related to diseases have mapped >2000 phenotype–gene relations, which provide valuable information for classifying diseases and identifying candidate genes as drug targets. In this article, a regularized non-negative matrix tri-factorization (R-NMTF) algorithm is introduced to co-cluster phenotypes and genes, and simultaneously detect associations between the detected phenotype clusters and gene clusters. The R-NMTF algorithm factorizes the phenotype–gene association matrix using prior knowledge from a phenotype similarity network and a protein–protein interaction network, supervised by the label information from known disease classes and biological pathways. In the experiments on disease phenotype–gene associations in OMIM and KEGG disease pathways, R-NMTF significantly improved the classification of disease phenotypes and disease pathway genes compared with support vector machines and Label Propagation in cross-validation on the annotated phenotypes and genes. The newly predicted phenotypes in each disease class are highly consistent with human phenotype ontology annotations. The roles of the new member genes in the disease pathways are examined and validated in the protein–protein interaction subnetworks. Extensive literature review also confirmed many new members of the disease classes and pathways as well as the predicted associations between disease phenotype classes and pathways.
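The co-clustering idea in this abstract can be illustrated with plain non-negative matrix tri-factorization, X ≈ F S Gᵀ, fitted with standard multiplicative updates. This is only a minimal sketch: it omits the network regularization and label supervision that define R-NMTF, and the function names (`nmtf`, `sq_error`), the toy matrix, and the parameter choices are illustrative assumptions, not the authors' implementation.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def mult_update(m, numer, denom, eps=1e-9):
    """Element-wise multiplicative update m * numer / denom (eps avoids 0/0)."""
    return [[v * n / (d + eps) for v, n, d in zip(rm, rn, rd)]
            for rm, rn, rd in zip(m, numer, denom)]

def sq_error(x, f, s, g):
    """Squared Frobenius error of the approximation X ~ F S G^T."""
    approx = matmul(matmul(f, s), transpose(g))
    return sum((a - b) ** 2 for ra, rb in zip(x, approx) for a, b in zip(ra, rb))

def nmtf(x, k1, k2, iters=200, seed=0):
    """Unregularized NMTF (hypothetical helper, not R-NMTF itself).

    F (rows x k1) holds phenotype-cluster memberships, G (cols x k2) holds
    gene-cluster memberships, and S (k1 x k2) links the two sets of clusters.
    """
    rng = random.Random(seed)
    n, m = len(x), len(x[0])
    f = [[rng.random() + 0.1 for _ in range(k1)] for _ in range(n)]
    s = [[rng.random() + 0.1 for _ in range(k2)] for _ in range(k1)]
    g = [[rng.random() + 0.1 for _ in range(k2)] for _ in range(m)]
    xt = transpose(x)
    for _ in range(iters):
        # F <- F * (X G S^T) / (F S G^T G S^T)
        gst = matmul(g, transpose(s))
        f = mult_update(f, matmul(x, gst), matmul(f, matmul(transpose(gst), gst)))
        # G <- G * (X^T F S) / (G S^T F^T F S)
        fs = matmul(f, s)
        g = mult_update(g, matmul(xt, fs), matmul(g, matmul(transpose(fs), fs)))
        # S <- S * (F^T X G) / (F^T F S G^T G)
        ft = transpose(f)
        numer = matmul(matmul(ft, x), g)
        denom = matmul(matmul(matmul(ft, f), s), matmul(transpose(g), g))
        s = mult_update(s, numer, denom)
    return f, s, g

# Toy phenotype-by-gene association matrix with two obvious blocks (made up).
toy = [[1, 1, 0, 0],
       [1, 1, 0, 0],
       [0, 0, 1, 1],
       [0, 0, 1, 1]]
f, s, g = nmtf(toy, k1=2, k2=2)
print("reconstruction error:", sq_error(toy, f, s, g))
```

Reading the largest entry in each row of F (or G) gives a cluster assignment for each phenotype (or gene), and S indicates which phenotype cluster is associated with which gene cluster. A regularized variant would add penalty terms to the objective and change the update rules accordingly.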

8.
9.
Towards proteome-wide production of monoclonal antibody by phage display   Cited by: 5 in total (self-citations: 0, citations by others: 5)
Sequencing of the human genome reveals that there are approximately 30,000 genes that encode an even greater number of proteins which comprise the human proteome. Characterization of gene products at the genome-wide scale requires the development of high throughput methods to generate temporo-spatial information on each and every protein in the cell under normal and pathological conditions. Monoclonal antibodies are important reagents for these studies. We have developed a method to generate human monoclonal antibodies by selecting phage antibody libraries directly on antigen blotted onto poly(vinylidene fluoride) membranes. Cellular proteins are first separated by two dimensional (2D) gel electrophoresis, Western blotted onto poly(vinylidene fluoride) membranes, and used to select phage antibody libraries. Monoclonal antibodies can be generated against individual protein spots on a 2D gel. The antibodies are functional in Western blotting, ELISA, and immunohistochemistry. Automation of this process should allow high throughput production of monoclonal phage antibodies against cellular proteins as well as proteins that are uniquely expressed under pathological conditions.

10.
A new approach has been used to examine DNA sequence organization in the chicken genome. The interspersion pattern was determined by studying the fraction of labelled DNA fragments of different lengths that hybridized to an excess of short chicken repeated DNA sequences. The results indicate that chicken DNA has a pattern of sequence organization quite different from the standard ‘Xenopus’ or ‘Drosophila’ patterns. Two classes of unique sequences are found. One, 34% of the genome, consists of unique sequences approx. 4 kb long interspersed with repeated sequences. The second, non-interspersed fraction, 38% of the genome, consists of unique sequences found in long tracts, a minimum of approx. 22 kb in length. In an attempt to determine whether a relationship exists between DNA sequence organization and the distribution of structural genes, we have isolated chicken DNA sequences belonging to different interspersion classes and tested each for the presence of structural genes by hybridization to excess poly(A)+ mRNA. Sequences complementary to poly(A)+ mRNA can be found with approximately the same frequency in both the non-interspersed fraction of the genome and a repeat-contiguous fraction enriched for interspersed sequences.

11.
Improving gene annotation of complete viral genomes   Cited by: 4 in total (self-citations: 0, citations by others: 4)
Gene annotation in viruses often relies upon similarity search methods. These methods possess high specificity but some genes may be missed, either those unique to a particular genome or those highly divergent from known homologs. To identify potentially missing viral genes we have analyzed all complete viral genomes currently available in GenBank with a specialized and augmented version of the gene finding program GeneMarkS. In particular, by implementing genome-specific self-training protocols we have better adjusted the GeneMarkS statistical models to sequences of viral genomes. Hundreds of new genes were identified, some in well studied viral genomes. For example, a new gene predicted in the genome of the Epstein–Barr virus was shown to encode a protein similar to α-herpesvirus minor tegument protein UL14 with heat shock functions. Convincing evidence of this similarity was obtained only after 12 PSI-BLAST iterations. In another example, several iterations of PSI-BLAST were required to demonstrate that a gene predicted in the genome of Alcelaphine herpesvirus 1 encodes a BALF1-like protein which is thought to be involved in apoptosis regulation and, potentially, carcinogenesis. New predictions were used to refine annotations of viral genomes in the RefSeq collection curated by the National Center for Biotechnology Information. Importantly, even in those cases where no sequence similarities were detected, GeneMarkS significantly reduced the number of primary targets for experimental characterization by identifying the most probable candidate genes. The new genome annotations were stored in VIOLIN, an interactive database which provides access to similarity search tools for up-to-date analysis of predicted viral proteins.

12.
Probing the S100 protein family through genomic and functional analysis   Cited by: 8 in total (self-citations: 0, citations by others: 8)
The EF-hand superfamily of calcium binding proteins includes the S100, calcium binding protein, and troponin subfamilies. This study represents a genome, structure, and expression analysis of the S100 protein family, in mouse, human, and rat. We confirm the high level of conservation between mammalian sequences but show that four members, including S100A12, are present only in the human genome. We describe three new members of the S100 family in the three species and their locations within the S100 genomic clusters and propose a revised nomenclature and phylogenetic relationship between members of the EF-hand superfamily. Two of the three new genes were induced in bone-marrow-derived macrophages activated with bacterial lipopolysaccharide, suggesting a role in inflammation. Normal human and murine tissue distribution profiles indicate that some members of the family are expressed in a specific manner, whereas others are more ubiquitous. Structure-function analysis of the chemotactic properties of murine S100A8 and human S100A12, particularly within the active hinge domain, suggests that the human protein is the functional homolog of the murine protein. Strong similarities between the promoter regions of human S100A12 and murine S100A8 support this possibility. This study provides insights into the possible processes of evolution of the EF-hand protein superfamily. Evolution of the S100 proteins appears to have occurred in a modular fashion, also seen in other protein families such as the C2H2-type zinc-finger family.

13.
Arabidopsis thaliana has a relatively small genome of approximately 130 Mb containing about 10% repetitive DNA. Genome sequencing studies reveal a gene-rich genome, predicted to contain approximately 25,000 genes spaced on average every 4.5 kb. Between 10 and 20% of the predicted genes occur as clusters of related genes, indicating that local sequence duplication and subsequent divergence generates a significant proportion of gene families. In addition to gene families, repetitive sequences comprise individual and small clusters of two to three retroelements and other classes of smaller repeats. The clustering of highly repetitive elements is a striking feature of the A. thaliana genome emerging from sequence and other analyses.

14.
15.
Genome-wide functional linkages among proteins in cellular complexes and metabolic pathways can be inferred from high throughput experimentation, such as DNA microarrays, or from bioinformatic analyses. Here we describe a method for the visualization and interpretation of genome-wide functional linkages inferred by the Rosetta Stone, Phylogenetic Profile, Operon and Conserved Gene Neighbor computational methods. This method involves the construction of a genome-wide functional linkage map, where each significant functional linkage between a pair of proteins is displayed on a two-dimensional scatter-plot, organized according to the order of genes along the chromosome. Subsequent hierarchical clustering of the map reveals clusters of genes with similar functional linkage profiles and facilitates the inference of protein function and the discovery of functionally linked gene clusters throughout the genome. We illustrate this method by applying it to the genome of the pathogenic bacterium Mycobacterium tuberculosis, assigning cellular functions to previously uncharacterized proteins involved in cell wall biosynthesis, signal transduction, chaperone activity, energy metabolism and polysaccharide biosynthesis.
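The two steps described in this abstract, building a linkage map indexed by chromosomal gene order and then grouping genes by the similarity of their linkage profiles, can be sketched as follows. This is a hedged illustration only: the abstract does not state which distance measure or clustering variant was used, so the Jaccard distance, the single-linkage rule cut at a fixed threshold, the function names, and the toy linkages are all assumptions.

```python
from itertools import combinations

def linkage_map(n_genes, links):
    """Symmetric 0/1 grid; entry [i][j] is 1 when genes i and j are linked.

    Genes are indexed by their order along the chromosome, so the grid is
    the scatter-plot described in the abstract.
    """
    grid = [[0] * n_genes for _ in range(n_genes)]
    for i, j in links:
        grid[i][j] = grid[j][i] = 1
    return grid

def jaccard_distance(row_a, row_b):
    """1 - |shared partners| / |all partners| for two linkage profiles."""
    both = sum(1 for a, b in zip(row_a, row_b) if a and b)
    either = sum(1 for a, b in zip(row_a, row_b) if a or b)
    return 1.0 - both / either if either else 1.0

def single_linkage_clusters(grid, cutoff):
    """Single-linkage clustering cut at `cutoff`.

    Cutting a single-linkage tree at a distance threshold is the same as
    taking connected components of the graph that joins every pair of genes
    whose profile distance is at most the threshold, so union-find suffices.
    """
    parent = list(range(len(grid)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(grid)), 2):
        if jaccard_distance(grid[i], grid[j]) <= cutoff:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(grid)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

# Made-up linkages among six genes: genes 0 and 1 share partners 3 and 4.
toy_links = [(0, 3), (0, 4), (1, 3), (1, 4), (2, 5)]
grid = linkage_map(6, toy_links)
print(single_linkage_clusters(grid, cutoff=0.5))  # [[0, 1], [2], [3, 4], [5]]
```

Genes 0 and 1 land in one cluster because their linkage profiles are identical, which is the kind of evidence the abstract describes using to infer the function of an uncharacterized protein from its better-characterized cluster neighbours.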

16.
For over 3 decades, the rate of replacement mutations has been assumed to be equal to, and estimated from, the rate of "strictly" neutral sequence divergence in noncoding regions and in silent-codon positions where mutations do not alter the amino acid encoded. This assumption is fundamental to estimating the fraction of harmful protein mutations and to identifying adaptive evolution at individual codons and proteins. We show that the assumption is not justifiable because a much larger fraction of codon positions is involved in hypermutable CpG dinucleotides as compared with the introns, leading to a higher expected replacement mutation rate per site in a vast majority of the genes. Consideration of this difference reveals a higher intensity of purifying natural selection than previously inferred in human genes. We also show that a much smaller number of genes are expected to be evolving with positive selection than that predicted using sequence divergence at intron and silent positions in the human genome. These patterns indicate the need for using new approaches for estimating rates of amino acid-altering mutations in order to find positively selected genes and codons in genomes that contain hypermutable CpGs.

17.
Asplund A, Edqvist PH, Schwenk JM, Pontén F. Proteomics, 2012, 12(13): 2067-2077
In this review, we present an update on the progress of the Human Protein Atlas, with an emphasis on strategies for validating immunohistochemistry-based protein expression patterns and on the possibilities to extend the map of protein expression patterns for cancer research projects. The objectives underlying the Human Protein Atlas include (i) the generation of validated antibodies toward a major isoform of all proteins encoded by the human genome, (ii) creating an information database of protein expression patterns in normal human tissues, in cells, and in cancer, and (iii) utilizing generated antibodies and protein expression data as tools to identify clinically useful biomarkers. The success of such an effort is dependent on the validity of antibodies as specific binders of intended targets in applications used to map protein expression patterns. The development of strategies to support specific target binding is crucial and remains a challenge as a large fraction of proteins encoded by the human genome is poorly characterized, including the approximately one-third of all proteins lacking evidence of existence. Conceivable methods for validation include the use of paired antibodies, i.e. two independent antibodies targeting different and nonoverlapping epitopes on the same protein as well as comparative analysis of mRNA expression patterns with corresponding proteins.

18.
19.
20.