共查询到20条相似文献,搜索用时 15 毫秒
1.
Giovanni Minervini Giuseppe Evangelista Fabio Polticelli Monika Piwowar Marek Kochanczyk Lukasz Flis Maciej Malawski Tomasz Szepieniec Zdzisaw Winiowski Ewa Matczyska Katarzyna Prymula Irena Roterman 《Bioinformation》2008,3(4):177-179
The number of natural proteins although large is significantly smaller than the theoretical number of proteins that can be obtained combining the 20 natural amino acids, the so-called “never born proteins” (NBPs). The study of the structure and properties of these proteins allows to investigate the sources of the natural proteins being of unique characteristics or special properties. However the structural study of NPBs can also been intended as an ideal test for evaluating the efficiency of software packages for the ab initio protein structure prediction. In this research, 10.000 three-dimensional structures of proteins of completely random sequence generated according to ROSETTA and FOD model were compared. The results show the limits of these software packages, but at the same time indicate that in many cases there is a significant agreement between the prediction obtained. 相似文献
2.
3.
We suggest a new approach to the generation of candidate structures (decoys) for ab initio prediction of protein structures. Our method is based on random sampling of conformation space and subsequent local energy minimization. At the core of this approach lies the design of a novel type of energy function. This energy function has local minima with native structure characteristics and wide basins of attraction. The current work presents our motivation for deriving such an energy function and also tests the derived energy function.Our approach is novel in that it takes advantage of the inherently rough energy landscape of proteins, which is generally considered a major obstacle for protein structure prediction. When local minima have wide basins of attraction, the protein's conformation space can be greatly reduced by the convergence of large regions of the space into single points, namely the local minima corresponding to these funnels. We have implemented this concept by an iterative process. The potential is first used to generate decoy sets and then we study these sets of decoys to guide further development of the potential. A key feature of our potential is the use of cooperative multi-body interactions that mimic the role of the entropic and solvent contributions to the free energy.The validity and value of our approach is demonstrated by applying it to 14 diverse, small proteins. We show that, for these proteins, the size of conformation space is considerably reduced by the new energy function. In fact, the reduction is so substantial as to allow efficient conformational sampling. As a result we are able to find a significant number of near-native conformations in random searches performed with limited computational resources. 相似文献
4.
Eicosapenta peptide repeats (EPRs) occur exclusively in flowering plant genomes and exhibit very high amino acid residue
conservation across occurrence. DNA and amino acid sequence searches yielded no indications about the function due to absence
of similarity to known sequences. Tertiary structure of an EPR protein coded by rice (Oryza sativa japonica) cDNA (GI: 32984786)
was determined based on ab initio methodology in order to draw clues on functional significance of EPRs. The resultant structure
comprised of seven α-helices and thirteen anti-parallel β-sheets. Surface-mapping of conserved residues onto the structure deduced
that (i) regions equivalent to β α4-
the primary function of EPR protein could be Ca2+ binding, and (iii) the putative EPR Ca2+ binding domain is structurally similar to
calcium-binding domains of plant lectins. Additionally, the phylogenetic analysis showed an evolving taxa-specific distribution of
EPR proteins observed in some GNA-like lectins. 相似文献
5.
6.
7.
Xiaosen Guo Max Brenner Xuemei Zhang Teresina Laragione Shuaishuai Tai Yanhong Li Junjie Bu Ye Yin Anish A. Shah Kevin Kwan Yingrui Li Wang Jun Pércio S. Gulko 《Genetics》2013,194(4):1017-1028
DA (D-blood group of Palm and Agouti, also known as Dark Agouti) and F344 (Fischer) are two inbred rat strains with differences in several phenotypes, including susceptibility to autoimmune disease models and inflammatory responses. While these strains have been extensively studied, little information is available about the DA and F344 genomes, as only the Brown Norway (BN) and spontaneously hypertensive rat strains have been sequenced to date. Here we report the sequencing of the DA and F344 genomes using next-generation Illumina paired-end read technology and the first de novo assembly of a rat genome. DA and F344 were sequenced with an average depth of 32-fold, covered 98.9% of the BN reference genome, and included 97.97% of known rat ESTs. New sequences could be assigned to 59 million positions with previously unknown data in the BN reference genome. Differences between DA, F344, and BN included 19 million positions in novel scaffolds, 4.09 million single nucleotide polymorphisms (SNPs) (including 1.37 million new SNPs), 458,224 short insertions and deletions, and 58,174 structural variants. Genetic differences between DA, F344, and BN, including high-impact SNPs and short insertions and deletions affecting >2500 genes, are likely to account for most of the phenotypic variation between these strains. The new DA and F344 genome sequencing data should facilitate gene discovery efforts in rat models of human disease. 相似文献
8.
Ilina EL Logachov AA Laplaze L Demchenko NP Pawlowski K Demchenko KN 《Annals of botany》2012,110(2):479-489
Background and Aims
In most plant species, initiation of lateral root primordia occurs above the elongation zone. However, in cucurbits and some other species, lateral root primordia initiation and development takes place in the apical meristem of the parental root. Composite transgenic plants obtained by Agrobacterium rhizogenes-mediated transformation are known as a suitable model to study root development. The aim of the present study was to establish this transformation technique for squash.Methods
The auxin-responsive promoter DR5 was cloned into the binary vectors pKGW-RR-MGW and pMDC162-GFP. Incorporation of 5-ethynyl-2′-deoxyuridine (EdU) was used to evaluate the presence of DNA-synthesizing cells in the hypocotyl of squash seedlings to find out whether they were suitable for infection. Two A. rhizogenes strains, R1000 and MSU440, were used. Roots containing the respective constructs were selected based on DsRED1 or green fluorescent protein (GFP) fluorescence, and DR5::Egfp-gusA or DR5::gusA insertion, respectively, was verified by PCR. Distribution of the response to auxin was visualized by GFP fluorescence or β-glucuronidase (GUS) activity staining and confirmed by immunolocalization of GFP and GUS proteins, respectively.Key Results
Based on the distribution of EdU-labelled cells, it was determined that 6-day-old squash seedlings were suited for inoculation by A. rhizogenes since their root pericycle and the adjacent layers contain enough proliferating cells. Agrobacterium rhizogenes R1000 proved to be the most virulent strain on squash seedlings. Squash roots containing the respective constructs did not exhibit the hairy root phenotype and were morphologically and structurally similar to wild-type roots.Conclusions
The auxin response pattern in the root apex of squash resembled that in arabidopsis roots. Composite squash plants obtained by A. rhizogenes-mediated transformation are a good tool for the investigation of root apical meristem development and root branching. 相似文献9.
Dystrophin (DMD) gene is the largest gene containing 79 exons involving various mutation types and regions, and targeted next-generation sequencing (NGS) was employed in detecting DMD gene mutation in the present study. A literature-annotated disease nonsense mutation (c.10141C>T, NM_004006.1) in exon 70 that has been reported as Duchenne Muscular Dystrophy (DMD)-causing mutation was found in our two patients, the proband and his cousin. In the present study two main methods were used, the next-generation sequencing and the classic Sanger sequencing. The exon capture followed by HiSeq2000 sequencing was specifically used in this study. Combined applications of the next-generation sequencing platform and bioinformatics are proved to be effective methods for DMD diagnosis. 相似文献
10.
11.
Barbara Franke Alexander Gasch Dayté Rodriguez Mohamed Chami Muzamil M. Khan Rüdiger Rudolf Jaclyn Bibby Akira Hanashima Julijus Bogomolovas Eleonore von Castelmur Daniel J. Rigden Isabel Uson Siegfried Labeit Olga Mayans 《Open biology》2014,4(3)
MuRF1 is an E3 ubiquitin ligase central to muscle catabolism. It belongs to the TRIM protein family characterized by a tripartite fold of RING, B-box and coiled-coil (CC) motifs, followed by variable C-terminal domains. The CC motif is hypothesized to be responsible for domain organization in the fold as well as for high-order assembly into functional entities. But data on CC from this family that can clarify the structural significance of this motif are scarce. We have characterized the helical region from MuRF1 and show that, contrary to expectations, its CC domain assembles unproductively, being the B2- and COS-boxes in the fold (respectively flanking the CC) that promote a native quaternary structure. In particular, the C-terminal COS-box seemingly forms an α-hairpin that packs against the CC, influencing its dimerization. This shows that a C-terminal variable domain can be tightly integrated within the conserved TRIM fold to modulate its structure and function. Furthermore, data from transfected muscle show that in MuRF1 the COS-box mediates the in vivo targeting of sarcoskeletal structures and points to the pharmacological relevance of the COS domain for treating MuRF1-mediated muscle atrophy. 相似文献
12.
13.
Kevin M. Dorn Johnathon D. Fankhauser Donald L. Wyse M. David Marks 《DNA research》2015,22(2):121-131
Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress. 相似文献
14.
PacBio Sequencing and Its Applications 总被引:2,自引:0,他引:2
15.
Barbara Turner Ovidiu Paun Jér?me Munzinger Mark W. Chase Rosabelle Samuel 《Annals of botany》2016,117(7):1175-1185
Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species.Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices.Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species.Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much lower than in the entire plastid genome. The geographical clustering of the individuals against a background of overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression following secondary contact. 相似文献
16.
Versées W Loverix S Vandemeulebroucke A Geerlings P Steyaert J 《Journal of molecular biology》2004,338(1):1-6
General acid catalysis is a powerful and widely used strategy in enzymatic nucleophilic displacement reactions. For example, hydrolysis/phosphorolysis of the N-glycosidic bond in nucleosides and nucleotides commonly involves the protonation of the leaving nucleobase concomitant with nucleophilic attack. However, in the nucleoside hydrolase of the parasite Trypanosoma vivax, crystallographic and mutagenesis studies failed to identify a general acid. This enzyme binds the purine base of the substrate between the aromatic side-chains of Trp83 and Trp260. Here, we show via quantum chemical calculations that face-to-face stacking can raise the pKa of a heterocyclic aromatic compound by several units. Site-directed mutagenesis combined with substrate engineering demonstrates that Trp260 catalyzes the cleavage of the glycosidic bond by promoting the protonation of the purine base at N-7, hence functioning as an alternative to general acid catalysis. 相似文献
17.
18.
19.
Shruthi Sridhar Vembar Matthew Seetin Christine Lambert Maria Nattestad Michael C. Schatz Primo Baybayan Artur Scherf Melissa Laird Smith 《DNA research》2016,23(4):339-351
The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [∼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [∼90–99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. 相似文献
20.
Clio Der Sarkissian Julia T. Vilstrup Mikkel Schubert Andaine Seguin-Orlando David Eme Jacobo Weinstock Maria Teresa Alberdi Fabiana Martin Patricio M. Lopez Jose L. Prado Alfredo Prieto Christophe J. Douady Tom W. Stafford Eske Willerslev Ludovic Orlando 《Biology letters》2015,11(3)
Hippidions were equids with very distinctive anatomical features. They lived in South America 2.5 million years ago (Ma) until their extinction approximately 10 000 years ago. The evolutionary origin of the three known Hippidion morphospecies is still disputed. Based on palaeontological data, Hippidion could have diverged from the lineage leading to modern equids before 10 Ma. In contrast, a much later divergence date, with Hippidion nesting within modern equids, was indicated by partial ancient mitochondrial DNA sequences. Here, we characterized eight Hippidion complete mitochondrial genomes at 3.4–386.3-fold coverage using target-enrichment capture and next-generation sequencing. Our dataset reveals that the two morphospecies sequenced (H. saldiasi and H. principale) formed a monophyletic clade, basal to extant and extinct Equus lineages. This contrasts with previous genetic analyses and supports Hippidion as a distinct genus, in agreement with palaeontological models. We date the Hippidion split from Equus at 5.6–6.5 Ma, suggesting an early divergence in North America prior to the colonization of South America, after the formation of the Panamanian Isthmus 3.5 Ma and the Great American Biotic Interchange. 相似文献