期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Prevalence and species distribution of the low-complexity,amyloid-like,reversible, kinked segment structural motif in amyloid-like fibrils

Michael P. Hughes Lukasz Goldschmidt David S. Eisenberg 《The Journal of biological chemistry》2021,297(4)

Membraneless organelles (MLOs) are vital and dynamic reaction centers in cells that compartmentalize the cytoplasm in the absence of a membrane. Multivalent interactions between protein low-complexity domains contribute to MLO organization. Previously, we used computational methods to identify structural motifs termed low-complexity amyloid-like reversible kinked segments (LARKS) that promote phase transition to form hydrogels and that are common in human proteins that participate in MLOs. Here, we searched for LARKS in the proteomes of six model organisms: Homo sapiens, Drosophila melanogaster, Plasmodium falciparum, Saccharomyces cerevisiae, Mycobacterium tuberculosis, and Escherichia coli to gain an understanding of the distribution of LARKS in the proteomes of various species. We found that LARKS are abundant in M. tuberculosis, D. melanogaster, and H. sapiens but not in S. cerevisiae or P. falciparum. LARKS have high glycine content, which enables kinks to form as exemplified by the known LARKS-rich amyloidogenic structures of TDP43, FUS, and hnRNPA2, three proteins that are known to participate in MLOs. These results support the idea of LARKS as an evolved structural motif. Based on these results, we also established the LARKSdb Web server, which permits users to search for LARKS in their protein sequences of interest. 相似文献

2.

Methodological Issues in Modern Track Analysis

Tania Escalante Elkin A. Noguera-Urbano Bruno Pimentel Oscar Aguado-Bautista 《Evolutionary biology》2017,44(2):284-293

Track analysis is the core of panbiogeographic analysis. In this work, we reflect on the formalization of track analysis, its methodological issues, and interpretations by using new software developments and from a contemporary evolutionary biogeographical viewpoint. From a geometric perspective, we analyze the meaning of a minimal spanning tree, considering that Prim’s algorithm is the most commonly used to draw individual tracks. We then show the existing methodologies (graphs, PAE, combined method, AE) and software packages (Trazos2004, Croizat, Martitracks, fossil) used to perform track analysis. Finally, we illustrate a track analysis using Nearctic mammals as an example. Based on our review, connectivity matrix analysis may be the best way to associate individual tracks into generalized tracks because it compares the minimal spanning tree topologies. However, it is the most demanding of all methods, since it requires a high spatial congruence among species, and therefore more algorithmic development. 相似文献

3.

Enumeration of Drosophila form II DNA-dependent RNA polymerase initiation sites on Drosophila DNA

John P. Phillips 《Biochemical genetics》1979,17(1-2):97-104

相似文献

4.

SmProt: A Reliable Repository with Comprehensive Annotation of Small Proteins Identified from Ribosome Profiling

《基因组蛋白质组与生物信息学报(英文版)》2021,19(4):602-610

Small proteins specifically refer to proteins consisting of less than 100 amino acids translated from small open reading frames (sORFs), which were usually missed in previous genome annotation. The significance of small proteins has been revealed in current years, along with the discovery of their diverse functions. However, systematic annotation of small proteins is still insufficient. SmProt was specially developed to provide valuable information on small proteins for scientific community. Here we present the update of SmProt, which emphasizes reliability of translated sORFs, genetic variants in translated sORFs, disease-specific sORF translation events or sequences, and remarkably increased data volume. More components such as non-ATG translation initiation, function, and new sources are also included. SmProt incorporated 638,958 unique small proteins curated from 3,165,229 primary records, which were computationally predicted from 419 ribosome profiling (Ribo-seq) datasets or collected from literature and other sources from 370 cell lines or tissues in 8 species (Homo sapiens, Mus musculus, Rattus norvegicus, Drosophila melanogaster, Danio rerio, Saccharomyces cerevisiae, Caenorhabditis elegans, and Escherichia coli). In addition, small protein families identified from human microbiomes were also collected. All datasets in SmProt are free to access, and available for browse, search, and bulk downloads at http://bigdata.ibp.ac.cn/SmProt/. 相似文献

5.

The most frequent short sequences in non-coding DNA

Juan A. Subirana Xavier Messeguer 《Nucleic acids research》2010,38(4):1172-1181

The purpose of this work is to determine the most frequent short sequences in non-coding DNA. They may play a role in maintaining the structure and function of eukaryotic chromosomes. We present a simple method for the detection and analysis of such sequences in several genomes, including Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. We also study two chromosomes of man and mouse with a length similar to the whole genomes of the other species. We provide a list of the most common sequences of 9–14 bases in each genome. As expected, they are present in human Alu sequences. Our programs may also give a graph and a list of their position in the genome. Detection of clusters is also possible. In most cases, these sequences contain few alternating regions. Their intrinsic structure and their influence on nucleosome formation are not known. In particular, we have found new features of short sequences in C. elegans, which are distributed in heterogeneous clusters. They appear as punctuation marks in the chromosomes. Such clusters are not found in either A. thaliana or D. melanogaster. We discuss the possibility that they play a role in centromere function and homolog recognition in meiosis. 相似文献

6.

Intrapopulation Genome Size Variation in D. melanogaster Reflects Life History Variation and Plasticity

Lisa L. Ellis Wen Huang Andrew M. Quinn Astha Ahuja Ben Alfrejd Francisco E. Gomez Carl E. Hjelmen Kristi L. Moore Trudy F. C. Mackay J. Spencer Johnston Aaron M. Tarone 《PLoS genetics》2014,10(7)

We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions. 相似文献

7.

Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements

下载免费PDF全文

Wu M Sun LV Vamathevan J Riegler M Deboy R Brownlie JC McGraw EA Martin W Esser C Ahmadinejad N Wiegand C Madupu R Beanan MJ Brinkac LM Daugherty SC Durkin AS Kolonay JF Nelson WC Mohamoud Y Lee P Berry K Young MB Utterback T Weidman J Nierman WC Paulsen IT Nelson KE Tettelin H O'Neill SL Eisen JA 《PLoS biology》2004,2(3):E69

The complete sequence of the 1,267,782 bp genome of Wolbachia pipientis wMel, an obligate intracellular bacteria of Drosophila melanogaster, has been determined. Wolbachia, which are found in a variety of invertebrate species, are of great interest due to their diverse interactions with different hosts, which range from many forms of reproductive parasitism to mutualistic symbioses. Analysis of the wMel genome, in particular phylogenomic comparisons with other intracellular bacteria, has revealed many insights into the biology and evolution of wMel and Wolbachia in general. For example, the wMel genome is unique among sequenced obligate intracellular species in both being highly streamlined and containing very high levels of repetitive DNA and mobile DNA elements. This observation, coupled with multiple evolutionary reconstructions, suggests that natural selection is somewhat inefficient in wMel, most likely owing to the occurrence of repeated population bottlenecks. Genome analysis predicts many metabolic differences with the closely related Rickettsia species, including the presence of intact glycolysis and purine synthesis, which may compensate for an inability to obtain ATP directly from its host, as Rickettsia can. Other discoveries include the apparent inability of wMel to synthesize lipopolysaccharide and the presence of the most genes encoding proteins with ankyrin repeat domains of any prokaryotic genome yet sequenced. Despite the ability of wMel to infect the germline of its host, we find no evidence for either recent lateral gene transfer between wMel and D. melanogaster or older transfers between Wolbachia and any host. Evolutionary analysis further supports the hypothesis that mitochondria share a common ancestor with the α-Proteobacteria, but shows little support for the grouping of mitochondria with species in the order Rickettsiales. With the availability of the complete genomes of both species and excellent genetic tools for the host, the wMel–D. melanogaster symbiosis is now an ideal system for studying the biology and evolution of Wolbachia infections. 相似文献

8.

Eukaryotic and prokaryotic promoter prediction using hybrid approach

Lin H Li QZ 《Theorie in den Biowissenschaften》2011,130(2):91-100

相似文献

9.

An integrated gene annotation and transcriptional profiling approach towards the full gene content of the Drosophila genome

下载免费PDF全文

Hild M Beckmann B Haas SA Koch B Solovyev V Busold C Fellenberg K Boutros M Vingron M Sauer F Hoheisel JD Paro R 《Genome biology》2003,5(1):R3-17

相似文献

10.

ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data

Lihua J Zhu Claude Gazin Nathan D Lawson Hervé Pagès Simon M Lin David S Lapointe Michael R Green 《BMC bioinformatics》2010,11(1):237

Background

Chromatin immunoprecipitation (ChIP) followed by high-throughput sequencing (ChIP-seq) or ChIP followed by genome tiling array analysis (ChIP-chip) have become standard technologies for genome-wide identification of DNA-binding protein target sites. A number of algorithms have been developed in parallel that allow identification of binding sites from ChIP-seq or ChIP-chip datasets and subsequent visualization in the University of California Santa Cruz (UCSC) Genome Browser as custom annotation tracks. However, summarizing these tracks can be a daunting task, particularly if there are a large number of binding sites or the binding sites are distributed widely across the genome. 相似文献

11.

A Complete Mitochondrial Genome Sequence from a Mesolithic Wild Aurochs (Bos primigenius)

Ceiridwen J. Edwards David A. Magee Stephen D. E. Park Paul A. McGettigan Amanda J. Lohan Alison Murphy Emma K. Finlay Beth Shapiro Andrew T. Chamberlain Martin B. Richards Daniel G. Bradley Brendan J. Loftus David E. MacHugh 《PloS one》2010,5(2)

Background

The derivation of domestic cattle from the extinct wild aurochs (Bos primigenius) has been well-documented by archaeological and genetic studies. Genetic studies point towards the Neolithic Near East as the centre of origin for Bos taurus, with some lines of evidence suggesting possible, albeit rare, genetic contributions from locally domesticated wild aurochsen across Eurasia. Inferences from these investigations have been based largely on the analysis of partial mitochondrial DNA sequences generated from modern animals, with limited sequence data from ancient aurochsen samples. Recent developments in DNA sequencing technologies, however, are affording new opportunities for the examination of genetic material retrieved from extinct species, providing new insight into their evolutionary history. Here we present DNA sequence analysis of the first complete mitochondrial genome (16,338 base pairs) from an archaeologically-verified and exceptionally-well preserved aurochs bone sample.

Methodology

DNA extracts were generated from an aurochs humerus bone sample recovered from a cave site located in Derbyshire, England and radiocarbon-dated to 6,738±68 calibrated years before present. These extracts were prepared for both Sanger and next generation DNA sequencing technologies (Illumina Genome Analyzer). In total, 289.9 megabases (22.48%) of the post-filtered DNA sequences generated using the Illumina Genome Analyzer from this sample mapped with confidence to the bovine genome. A consensus B. primigenius mitochondrial genome sequence was constructed and was analysed alongside all available complete bovine mitochondrial genome sequences.

Conclusions

For all nucleotide positions where both Sanger and Illumina Genome Analyzer sequencing methods gave high-confidence calls, no discrepancies were observed. Sequence analysis reveals evidence of heteroplasmy in this sample and places this mitochondrial genome sequence securely within a previously identified aurochsen haplogroup (haplogroup P), thus providing novel insights into pre-domestic patterns of variation. The high proportion of authentic, endogenous aurochs DNA preserved in this sample bodes well for future efforts to determine the complete genome sequence of a wild ancestor of domestic cattle. 相似文献

12.

Mutational process in protein-coding genes of human mitochondrial genome in context of evolution of Homo genus

B. A. Malyarchuk 《Molecular Biology》2013,47(6):807-813

The human mitochondrial genome, although small in size, shows a high level of variation that differs across nucleotide groups. In this work, mutation rates in mtDNA were compared in species of the Homo genus, including humans, Neanderthals, Denisova hominins, and other primate species. It was found that more than half (56.5%) of the polymorphisms in protein-coding genes of human mtDNA are actually reverse mutations to the pre-H. sapiens state of the mitochondrial genome. Among hypervariable nucleotide positions, only a small portion of mutations are specific to H. sapiens, while the majority of mutations (both nucleotide and amino acid substitutions) result in a loss of Homo-specific variants of polymorphisms. Most commonly, polymorphism variants specific to H. sapiens arise as a result of unique forward mutations and disappear mainly due to multiple reverse mutations, including those in mutational hot spots. 相似文献

13.

Effects of female behavior on wing display and courtship of male Drosophila simulans and Drosophila melanogaster

David Wood John M. Ringo 《Behavioural processes》1981,6(1):85-89

Corrolations between female rejection behaviors and male wing display were calculated for both Drosophila simulans and Drosophila melanogaster intraspicific pair-matings. No significant correlations were found for D. melanogaster, but in D. simulans flicking by the female appeared to be associated with a shift in male wing display pattern resulting in higher levels of vibration. Flicking did not appear to discourage courtship by males in either species. 相似文献

14.

The Phylogenetic Distribution of Non-CTCF Insulator Proteins Is Limited to Insects and Reveals that BEAF-32 Is Drosophila Lineage Specific

Todd A. Schoborg Mariano Labrador 《Journal of molecular evolution》2010,70(1):74-84

相似文献

15.

Fine-structured multi-scaling long-range correlations in completely sequenced genomes—features,origin, and classification

Tobias A. Knoch Markus Göker Rudolf Lohner Anis Abuseiris Frank G. Grosveld 《European biophysics journal : EBJ》2009,38(6):757-779

The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation analysis on almost the entire observable scale of 132 completely sequenced chromosomes of 0.5 × 10⁶ to 3.0 × 10⁷ bp from Archaea, Bacteria, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, and Homo sapiens. The local correlation coefficients show a species-specific multi-scaling behaviour: close to random correlations on the scale of a few base pairs, a first maximum from 40 to 3,400 bp (for Arabidopsis thaliana and Drosophila melanogaster divided in two submaxima), and often a region of one or more second maxima from 10⁵ to 3 × 10⁵ bp. Within this multi-scaling behaviour, an additional fine-structure is present and attributable to codon usage in all except the human sequences, where it is related to nucleosomal binding. Computer-generated random sequences assuming a block organization of genomes, the codon usage, and nucleosomal binding explain these results. Mutation by sequence reshuffling destroyed all correlations. Thus, the stability of correlations seems to be evolutionarily tightly controlled and connected to the spatial genome organization, especially on large scales. In summary, genomes show a complex sequential organization related closely to their three-dimensional organization. This article has been submitted as a contribution to the festschrift entitled “Uncovering cellular sub-structures by light microscopy” in honor of Professor Cremer’s 65th birthday. 相似文献

16.

Regularities of context-dependent codon bias in eukaryotic genes 总被引：10，自引：1，他引：9

下载免费PDF全文

Fedorov A Saxonov S Gilbert W 《Nucleic acids research》2002,30(5):1192-1197

Nucleotides surrounding a codon influence the choice of this particular codon from among the group of possible synonymous codons. The strongest influence on codon usage arises from the nucleotide immediately following the codon and is known as the N₁ context. We studied the relative abundance of codons with N₁ contexts in genes from four eukaryotes for which the entire genomes have been sequenced: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana. For all the studied organisms it was found that 90% of the codons have a statistically significant N₁ context-dependent codon bias. The relative abundance of each codon with an N₁ context was compared with the relative abundance of the same 4mer oligonucleotide in the whole genome. This comparison showed that in about half of all cases the context-dependent codon bias could not be explained by the sequence composition of the genome. Ranking statistics were applied to compare context-dependent codon biases for codons from different synonymous groups. We found regularities in N₁ context-dependent codon bias with respect to the codon nucleotide composition. Codons with the same nucleotides in the second and third positions and the same N₁ context have a statistically significant correlation of their relative abundances. 相似文献

17.

Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes 总被引：1，自引：0，他引：1

Phuc Vinh Nguyen Lam Radoslav Goldman Konstantinos Karagiannis Tejas Narsule Vahan Simonyan Valerii Soika Raja Mazumder 《基因组蛋白质组与生物信息学报(英文版)》2013,11(2):96-104

The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sites (NGS). Our analysis shows that there is enough structural information from diverse glycoproteins to allow the development of rules which can be used to predict NGS. A Python-based tool was developed to investigate asparagines implicated in N-glycosylation in five species: Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana and Saccharomyces cerevisiae. Our analysis shows that 78% of all asparagines of NXS/T motif involved in N-glycosylation are localized in the loop/turn conformation in the human proteome. Similar distribution was revealed for all the other species examined. Comparative analysis of the occurrence of NXS/T motifs not known to be glycosylated and their reverse sequence (S/TXN) shows a similar distribution across the secondary structural elements, indicating that the NXS/T motif in itself is not biologically relevant. Based on our analysis, we have defined rules to determine NGS. Using machine learning methods based on these rules we can predict with 93% accuracy if a particular site will be glycosylated. If structural information is not available the tool uses structural prediction results resulting in 74% accuracy. The tool was used to identify glycosylation sites in 108 human proteins with structures and 2247 proteins without structures that have acquired NXS/T site/s due to non-synonymous variation. The tool, Structure Feature Analysis Tool (SFAT), is freely available to the public at http://hive.biochemistry.gwu.edu/tools/sfat. 相似文献

18.

CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization

Chang YF Chang CH 《PloS one》2011,6(11):e27080

CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago. 相似文献

19.

Evolutionarily conserved motifs and modules in mitochondrial protein–protein interaction networks

Mohieddin Jafari Mehdi Sadeghi Mehdi Mirzaie Sayed-Amir Marashi Mostafa Rezaei-Tavirani 《Mitochondrion》2013,13(6):668-675

Advances in organelle interactomics have led to new insights into organelle functions. In this study, we considered the common mitochondrial PIN of four evolutionarily distant eukaryotic species, namely Homo sapiens, Mus musculus, Drosophila melanogaster and Caenorhabditis elegans. By comparative interactomics analysis of mitochondrial PINs in these organisms, five conserved modules were identified. Modules comprise the main mitochondrial tasks, including proteins involved in translation process, mitochondrial import inner membrane proteins, TCA cycle enzymes, mitochondrial electron transport chain, and metabolic enzymes. Furthermore, we reemphasize that subgraphs of network, i.e., motifs and themes, may represent evolutionarily conserved topological units which are biologically significant. 相似文献

20.

The role played by exons in genomic DNA sequence correlations

João Carlos de Oliveira Guerra Pedro Licinio 《Journal of theoretical biology》2010,264(3):830-837

The codon structure inside exons imposes a strong modulation with period-3 for genomic composition correlations. A new formalism for calculating nucleotide correlations along DNA sequences in terms of an irreducible set of six correlation functions is presented. New procedures to extract the corresponding period-3 modulations are also developed. These modulations are seen to be stronger for the irreducible self-correlation C_zz(k), which accounts only for the binding strength of dinucleotides (z stands for adenine or thymine minus cytosine or guanine concentrations). We investigate and model the relationship between exon distribution and genomic period-3 correlations for the D. melanogaster genome. 相似文献