1.
Extensive Sequence Conservation Among Insect, Nematode, and Vertebrate Vitellogenins Reveals Ancient Common Ancestry (cited by: 10 total, 0 self-citations, 10 by others)
Jeng-Shong Chen Thomas W. Sappington Alexander S. Raikhel 《Journal of molecular evolution》1997,44(4):440-451
The eggs of most oviparous animals are provisioned with a class of protein called vitellogenin (Vg) which is stored as the
major component of yolk. Until recently, deduced amino acid sequences were available only from vertebrate and nematode Vgs,
which proved to be homologous. The sequences of several insect Vgs are now known, but early attempts at pairwise alignments
with vertebrate and nematode Vgs have been problematic, leading to conflicting conclusions about how closely insect Vgs are
related to the others. In this paper we demonstrate that insect Vg sequences can be confidently aligned with one another along
their entire lengths and with multiple vertebrate and nematode Vg sequences along most of their spans. Although divergence
is high, conservation among insect, vertebrate, and nematode Vg sequences is widespread with a preponderance of glycine, proline,
and cysteine residues among strictly conserved amino acids, establishing conclusively that Vgs from the three phyla are homologous.
Areas of least-certain alignment are primarily in and around insect and vertebrate polyserine domains which are not homologous.
Phylogenetic reconstructions of Vgs based on sequence identities indicate that the insect lineage is the most diverged and
that the mammalian serum protein, apolipoprotein B-100, arose from a Vg ancestor after the nematode/vertebrate divergence.
Received: 6 May 1996 / Accepted: 27 September 1996
2.
Background
Predicting protein function from primary sequence is an important open problem in modern biology. Not only are there many thousands of proteins of unknown function, current approaches for predicting function must be improved upon. One problem in particular is overly-specific function predictions which we address here with a new statistical model of the relationship between protein sequence similarity and protein function similarity.
Methodology
Our statistical model is based on sets of proteins with experimentally validated functions and numeric measures of function specificity and function similarity derived from the Gene Ontology. The model predicts the similarity of function between two proteins given their amino acid sequence similarity measured by statistics from the BLAST sequence alignment algorithm. A novel aspect of our model is that it predicts the degree of function similarity shared between two proteins over a continuous range of sequence similarity, facilitating prediction of function with an appropriate level of specificity.
Significance
Our model shows nearly exact function similarity for proteins with high sequence similarity (bit score >244.7, e-value >1e−62, non-redundant NCBI protein database (NRDB)) and only small likelihood of specific function match for proteins with low sequence similarity (bit score <54.6, e-value <1e−05, NRDB). For sequence similarity ranges in between our annotation model shows an increasing relationship between function similarity and sequence similarity, but with considerable variability. We applied the model to a large set of proteins of unknown function, and predicted functions for thousands of these proteins ranging from general to very specific. We also applied the model to a data set of proteins with previously assigned, specific functions that were electronically based. We show that, on average, these prior function predictions are more specific (quite possibly overly-specific) compared to predictions from our model that is based on proteins with experimentally determined function.
3.
4.
Samuel Hertig Thomas D. Goddard Graham T. Johnson Thomas E. Ferrin 《Biophysical journal》2015,108(9):2097-2102
Homology modeling predicts protein structures using known structures of related proteins as templates. We developed MULTIDOMAIN ASSEMBLER (MDA) to address the special problems that arise when modeling proteins with large numbers of domains, such as fibronectin with 30 domains, as well as cases with hundreds of templates. These problems include how to spatially arrange nonoverlapping template structures, and how to get the best template coverage when some sequence regions have hundreds of available structures while other regions have a few distant homologs. MDA automates the tasks of template searching, visualization, and selection followed by multidomain model generation, and is part of the widely used molecular graphics package UCSF CHIMERA (University of California, San Francisco). We demonstrate applications and discuss MDA’s benefits and limitations.
5.
Over the course of HIV infection, virus replication is facilitated by the phosphorylation of HIV proteins by human ERK1 and ERK2 mitogen-activated protein kinases (MAPKs). MAPKs are known to phosphorylate their substrates by first binding with them at a docking site. Docking site interactions could be viable drug targets because the sequences guiding them are more specific than phosphorylation consensus sites. In this study we use multiple bioinformatics tools to discover candidate MAPK docking site motifs on HIV proteins known to be phosphorylated by MAPKs, and we discuss the possibility of targeting docking sites with drugs. Using sequence alignments of HIV proteins of different subtypes, we show that MAPK docking patterns previously described for human proteins appear on the HIV matrix, Tat, and Vif proteins in a strain-dependent manner, but are absent from HIV Rev and appear on all HIV Nef strains. We revise the regular expressions of previously annotated MAPK docking patterns in order to provide a subtype-independent motif that annotates all HIV proteins. One revision is based on a documented human variant of one of the substrate docking motifs, and the other reduces the number of required basic amino acids in the standard docking motifs from two to one. The proposed patterns are shown to be consistent with in silico docking between ERK1 and the HIV matrix protein. The motif usage on HIV proteins is sufficiently different from human proteins in amino acid sequence similarity to allow for HIV-specific targeting using small-molecule drugs.
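The kind of revision described in this abstract, relaxing a docking-motif regular expression from two required basic residues to one, can be illustrated with a minimal sketch. The two patterns and the toy sequences below are hypothetical and simplified for illustration only; they are not the motifs published in the paper, and the function name `find_motifs` is likewise an assumption.

```python
import re

# Hypothetical, simplified docking-style motifs (NOT the paper's patterns):
# basic residue(s), a 2-6 residue spacer, then hydrophobic-X-hydrophobic.
STRICT_MOTIF = re.compile(r"[KR]{2}.{2,6}[ILVM].[ILVM]")   # two basic residues required
RELAXED_MOTIF = re.compile(r"[KR].{2,6}[ILVM].[ILVM]")     # only one basic residue required


def find_motifs(sequence, pattern):
    """Return (start_index, matched_text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in pattern.finditer(sequence)]


# Made-up toy sequences, not real HIV or human protein sequences.
two_basic = "AAKRAAALALAA"   # contains "KR" followed by a hydrophobic pair
one_basic = "AAKAAALALAA"    # contains a single "K" before the same pair

for name, seq in [("two_basic", two_basic), ("one_basic", one_basic)]:
    print(name, "strict:", find_motifs(seq, STRICT_MOTIF))
    print(name, "relaxed:", find_motifs(seq, RELAXED_MOTIF))
```

In this sketch the strict pattern misses the sequence with a single basic residue, while the relaxed pattern matches both, which is the general effect of lowering the required number of basic residues.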
6.
7.
C. George Priya Doss Chiranjib Chakrabarty C. Debajyoti S. Debottam 《Cell biochemistry and biophysics》2014,70(2):897-906
Open questions about the recruitment pathways of centromere proteins, cell cycle regulation mechanisms, spindle checkpoint assembly, and the chromosome segregation process are a focus of cancer research. With established databases, a range of computational platforms now makes it possible to examine almost all the physiological and biochemical evidence in disease-associated phenotypes. Using existing computational methods, we used amino acid residues to understand the similarity within the evolutionary variance of different associated centromere proteins. This study of sequence similarity, protein–protein networking, co-expression analysis, and the evolutionary trajectory of centromere proteins will speed up the understanding of centromere biology and provide a road map for researchers who are beginning clinical sequencing work using centromere proteins.
8.
Bobyleva L. G. Yakupova E. I. Ulanova A. D. Udaltsov S. N. Shumeyko S. A. Salmov N. N. Bobylev A. G. Vikhlyantsev I. M. 《Biophysics》2019,64(5):667-670
The hypothesis that formed the basis of this work arose from our studies showing that giant multi-domain muscle proteins of the titin family (titin...
9.
Chantel D. Sloan Angeline D. Andrew Eric J. Duell Scott M. Williams Margaret R. Karagas Jason H. Moore 《PloS one》2009,4(9)
Genetic structure due to ancestry has been well documented among many divergent human populations. However, the ability to associate ancestry with genetic substructure without using supervised clustering has not been explored in more presumably homogeneous and admixed US populations. The goal of this study was to determine if genetic structure could be detected in a United States population from a single state where the individuals have mixed European ancestry. Using Bayesian clustering with a set of 960 single nucleotide polymorphisms (SNPs) we found evidence of population stratification in 864 individuals from New Hampshire that can be used to differentiate the population into six distinct genetic subgroups. We then correlated self-reported ancestry of the individuals with the Bayesian clustering results. Finnish and Russian/Polish/Lithuanian ancestries were most notably found to be associated with genetic substructure. The ancestral results were further explained and substantiated using New Hampshire census data from 1870 to 1930 when the largest waves of European immigrants came to the area. We also discerned distinct patterns of linkage disequilibrium (LD) between the genetic groups in the growth hormone receptor gene (GHR). To our knowledge, this is the first time such an investigation has uncovered a strong link between genetic structure and ancestry in what would otherwise be considered a homogenous US population.
10.
RxLR effectors represent one of the largest and most diverse effector families in oomycete plant pathogens. These effectors have attracted enormous attention since they can be delivered inside the plant cell and manipulate host immunity. With the exceptions of a signal peptide and the following RxLR-dEER and C-terminal W/Y/L motifs identified from the sequences themselves, nearly no functional domains have been found. Recently, protein structures of several RxLRs were revealed to comprise alpha-helical bundle repeats. However, approximately half of all RxLRs lack obvious W/Y/L motifs, which are associated with helical structures. In this study, secondary structure prediction of the putative RxLR proteins was performed. We found that the C-terminus of the majority of these RxLR proteins, irrespective of the presence of W/Y/L motifs, contains abundant short alpha-helices. Since a large-scale experimental determination of protein structures has been difficult to date, results of the current study extend our understanding of the protein secondary structures of oomycete RxLR effectors from individual members to the entire family. Moreover, we identified fewer alpha-helix-rich proteins from secretomes of several oomycete and fungal organisms in which RxLRs have not been identified, providing additional evidence that these organisms are unlikely to harbor RxLR-like proteins. Therefore, these results provide additional information that will aid further studies on the evolution and functional mechanisms of RxLR effectors.
11.
A 2550-bp portion of the mitochondrial genome of a Demosponge, genus Tetilla, was amplified from whole genomic DNA extract and sequenced. The sequence was found to code for the 3′ end of the 16S rRNA
gene, cytochrome c oxidase subunit II, a lysine tRNA, ATPase subunit 8, and a 5′ portion of ATPase subunit 6. The Porifera cluster distinctly
within the eumetazoan radiation, as a sister group to the Cnidaria. Also, the mitochondrial genetic code of this sponge is
likely identical to that found in the Cnidaria. Both the full COII DNA and protein sequences and a portion of the 16S rRNA
gene were found to possess a striking similarity to published Cnidarian mtDNA sequences, allying the Porifera more closely
to the Cnidaria than to any other metazoan phylum. The gene arrangement, COII—tRNALys—ATP8—ATP6, is observed in many Eumetazoan phyla and is apparently ancestral in the metazoa.
Received: 24 November 1997 / Accepted: 14 September 1998
12.
Multidomain Carbohydrate-binding Proteins Involved in Bacteroides thetaiotaomicron Starch Metabolism
Elizabeth A. Cameron Mallory A. Maynard Christopher J. Smith Thomas J. Smith Nicole M. Koropatkin Eric C. Martens 《The Journal of biological chemistry》2012,287(41):34614-34625
Human colonic bacteria are necessary for the digestion of many dietary polysaccharides. The intestinal symbiont Bacteroides thetaiotaomicron uses five outer membrane proteins to bind and degrade starch. Here, we report the x-ray crystallographic structures of SusE and SusF, two outer membrane proteins composed of tandem starch-specific carbohydrate-binding modules (CBMs) with no enzymatic activity. Examination of the two CBMs in SusE and three CBMs in SusF reveals subtle differences in the way each binds starch, which are reflected in their Kd values for both high molecular weight starch and small maltooligosaccharides. Thus, each site seems to have a unique starch preference that may enable these proteins to interact with different regions of starch or its breakdown products. Proteins similar to SusE and SusF are encoded in many other polysaccharide utilization loci that are possessed by human gut bacteria in the phylum Bacteroidetes. Thus, these proteins are likely to play an important role in carbohydrate metabolism in these abundant symbiotic species. Understanding structural changes that diversify and adapt related proteins in the human gut microbial community will be critical to understanding the detailed mechanistic roles that they perform in the complex digestive ecosystem.
13.
Larry L. Mai 《American anthropologist》2002,104(3):1003-1004
The Seven Daughters of Eve: The Science That Reveals Our Genetic Ancestry. Bryan Sykes. New York: W. W. Norton and Company, Inc., 2001. 306 pp.
14.
Lipid rafts are specialized cholesterol-enriched microdomains in the cell membrane. They have been known as a platform for protein-protein interactions and to take part in multiple biological processes. Nevertheless, how lipid rafts influence protein properties at the proteomic level is still an open question for researchers using traditional biochemical approaches. Here, by annotating the lipid raft localization of proteins in human protein-protein interaction networks, we performed a systematic analysis of the function of proteins related to lipid rafts. Our results demonstrated that lipid raft proteins and their interactions were critical for the structure and stability of the whole network, and that the interactions between them were significantly enriched. Furthermore, for each protein in the network, we calculated its “lipid raft dependency (LRD),” which indicates how closely it is topologically associated with lipid rafts, and we then uncovered the connection between LRD and protein functions. Proteins with high LRD tended to be essential for mammalian development, and malfunction of these proteins was inclined to cause human diseases. Coordinated with their neighbors, high-LRD proteins participated in multiple biological processes and targeted many pathways in disease pathogenesis. High-LRD proteins were also found to have tissue specificity of expression. In summary, our network-based analysis indicates that lipid raft proteins have higher centrality in the network, and that lipid-raft-related proteins have multiple functions and are probably involved in many biological processes in disease development.
15.
Yedael Y. Waldman Arjun Biddanda Natalie R. Davidson Paul Billing-Ross Maya Dubrovsky Christopher L. Campbell Carole Oddoux Eitan Friedman Gil Atzmon Eran Halperin Harry Ostrer Alon Keinan 《PloS one》2016,11(3)
The Bene Israel Jewish community from West India is a unique population whose history before the 18th century remains largely unknown. Bene Israel members consider themselves descendants of Jews, yet the identity of their Jewish ancestors and their arrival time in India are unknown, with speculations on arrival time varying between the 8th century BCE and the 6th century CE. Here, we characterize the genetic history of Bene Israel by collecting and genotyping 18 Bene Israel individuals. Combining with 486 individuals from 41 other Jewish, Indian and Pakistani populations, and additional individuals from worldwide populations, we conducted comprehensive genome-wide analyses based on FST, principal component analysis, ADMIXTURE, identity-by-descent sharing, admixture linkage disequilibrium decay, haplotype sharing and allele sharing autocorrelation decay, as well as contrasted patterns between the X chromosome and the autosomes. The genetics of Bene Israel individuals resemble local Indian populations, while at the same time constituting a clearly separated and unique population in India. They are unique among Indian and Pakistani populations we analyzed in sharing considerable genetic ancestry with other Jewish populations. Taken together, the results from all analyses point to Bene Israel being an admixed population with both Jewish and Indian ancestry, with the genetic contribution of each of these ancestral populations being substantial. The admixture took place in the last millennium, about 19–33 generations ago. It involved Middle-Eastern Jews and was sex-biased, with more male Jewish and local female contribution. It was followed by a population bottleneck and high endogamy, which can lead to increased prevalence of recessive diseases in this population. This study provides an example of how genetic analysis advances our knowledge of human history in cases where other disciplines lack the relevant data to do so.
16.
Tiago R. Magalhães Jillian P. Casey Judith Conroy Regina Regan Darren J. Fitzpatrick Naisha Shah João Sobral Sean Ennis 《PloS one》2012,7(11)
Knowledge of human origins, migrations, and expansions is greatly enhanced by the availability of large datasets of genetic information from different populations and by the development of bioinformatic tools used to analyze the data. We present Ancestry Mapper, which we believe improves on existing methods, for the assignment of genetic ancestry to an individual and to study the relationships between local and global populations. The principal function of the method, named Ancestry Mapper, is to give each individual analyzed a genetic identifier, made up of just 51 genetic coordinates, that corresponds to its relationship to the HGDP reference population. As a consequence, the Ancestry Mapper Id (AMid) has intrinsic biological meaning and provides a tool to measure similarity between world populations. We applied Ancestry Mapper to a dataset comprised of the HGDP and HapMap data. The results show distinctions at the continental level, while simultaneously giving details at the population level. We clustered AMids of HGDP/HapMap and observe a recapitulation of human migrations: for a small number of clusters, individuals are grouped according to continental origins; for a larger number of clusters, regional and population distinctions are evident. Calculating distances between AMids allows us to infer ancestry. The number of coordinates is expandable, increasing the power of Ancestry Mapper. An R package called Ancestry Mapper is available to apply this method to any high density genomic data set.
17.
Silvia Galiani Dominic Waithe Katharina Reglinski Luis Daniel Cruz-Zaragoza Esther Garcia Mathias P. Clausen Wolfgang Schliebs Ralf Erdmann Christian Eggeling 《The Journal of biological chemistry》2016,291(33):16948-16962
Membrane-associated events during peroxisomal protein import processes play an essential role in peroxisome functionality. Many details of these processes are not known due to missing spatial resolution of technologies capable of investigating peroxisomes directly in the cell. Here, we present the use of super-resolution optical stimulated emission depletion microscopy to investigate with sub-60-nm resolution the heterogeneous spatial organization of the peroxisomal proteins PEX5, PEX14, and PEX11 around actively importing peroxisomes, showing distinct differences between these peroxins. Moreover, imported protein sterol carrier protein 2 (SCP2) occupies only a subregion of larger peroxisomes, highlighting the heterogeneous distribution of proteins even within the peroxisome. Finally, our data reveal subpopulations of peroxisomes showing only weak colocalization between PEX14 and PEX5 or PEX11 but at the same time a clear compartmentalized organization. This compartmentalization, which was less evident in cases of strong colocalization, indicates dynamic protein reorganization linked to changes occurring in the peroxisomes. Through the use of multicolor stimulated emission depletion microscopy, we have been able to characterize peroxisomes and their constituents to a yet unseen level of detail while maintaining a highly statistical approach, paving the way for equally complex biological studies in the future.
18.
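The Principle of Common Ancestry and the Interpretation of Phylogenetic Trees (cited by: 1 total, 0 self-citations, 1 by others)
Phylogenetic trees, which are based on the principle of common ancestry, are a common way of depicting the lineages of descent of organisms. They can intuitively convey both the relationships among taxa (species or genes) and the direction of evolution, yet both aspects are often misread in biological research and teaching. Using concrete examples, this paper explains how to apply the principle of common ancestry to interpret phylogenetic trees correctly.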
19.
Gaone Retshabile Busisiwe C. Mlotshwa Lesedi Williams Savannah Mwesigwa Gerald Mboowa Zhuoyi Huang Navin Rustagi Shanker Swaminathan Eric Katagirya Samuel Kyobe Misaki Wayengera Grace P. Kisitu David P. Kateete Eddie M. Wampande Koketso Maplanka Ishmael Kasvosve Edward D. Pettitt Mogomotsi Matshaba Neil A. Hanchard 《American journal of human genetics》2018,102(5):731-743
20.
Kabir Hassan Biswas Avinash R. Shenoy Anindya Dutta Sandhya S. Visweswariah 《Journal of molecular evolution》2009,68(6):587-602
Guanylyl cyclases (GCs) are enzymes that generate cyclic GMP and regulate different physiologic and developmental processes in a number of organisms. GCs possess sequence similarity to class III adenylyl cyclases (ACs) and are present as either membrane-bound receptor GCs or cytosolic soluble GCs. We sought to determine the evolution of GCs using a large-scale bioinformatic analysis and found multiple lineage-specific expansions of GC genes in the genomes of many eukaryotes. Moreover, a few GC-like proteins were identified in prokaryotes, which come fused to a number of different domains, suggesting allosteric regulation of nucleotide cyclase activity. Eukaryotic receptor GCs are associated with a kinase homology domain (KHD), and phylogenetic analysis of these proteins suggests coevolution of the KHD and the associated cyclase domain as well as a conservation of the sequence and the size of the linker region between the KHD and the associated cyclase domain. Finally, we also report the existence of mimiviral proteins that contain putative active kinase domains associated with a cyclase domain, which could suggest early evolution of the fusion of these two important domains involved in signal transduction.