Found 20 similar articles (search time: 0 ms)
1.
2.
Despite the current good level of annotation, the Drosophila genome still holds surprises. A recent study has added perhaps 2,000 genes to the predicted total, and raises a number of questions about how genome annotation data should be stored and presented.
3.
4.
PETER E. SMOUSE 《Molecular ecology》2010,19(7):1265-1266
Since the days of allozyme analysis, we have been enamored with the idea that if we just had enough polymorphic Mendelian loci, we could gauge the inbreeding level of individuals by measuring heterozygosity and simultaneously measure the degree of genetic relatedness between pairs of individuals. Given Mendel’s Laws, we have always known that we would need numerous independently segregating loci to achieve any reasonable degree of accuracy. Santure et al. (2010, this issue) use a 771 marker SNP panel to assess heterozygosity levels and to assess pairwise relatedness, and compare both with theoretical expectations obtained from a carefully recorded pedigree of a zebra finch breeding colony, as a function of increasing numbers of SNP markers. They also compare the SNP results with those from a 20‐locus microsatellite panel, showing that adding SNPs to a fairly large microsatellite panel improves accuracy, but given an existing panel of 125 SNPs, little is to be gained by adding microsatellites. They show that the accuracy available for estimating individual levels of inbreeding is somewhat limited. They also show that the average pairwise relatedness measures bracket pedigree relationship very nicely, but the variances for individual pairs remain substantial, even with a very large panel.
5.
Nigel E. Stork 《Biodiversity and Conservation》1993,2(3):215-232
"How many species are there?" is a question receiving more attention from biologists, and reasons for this are suggested. Different methods of answering this question are examined and include: counting all species; extrapolations from known faunas and regions; extrapolations from samples; methods using ecological models; censusing taxonomists' views. Most of these methods indicate that global totals of 5 to 15 million species are reasonable. The implications of much higher estimates of 30 million species or more are examined, particularly the question of where these millions of species might be found.
6.
Dmitry N. Ivankov Samuel H. Payne Michael Y. Galperin Stefano Bonissone Pavel A. Pevzner Dmitrij Frishman 《Environmental microbiology》2013,15(4):983-990
Over the last 5 years proteogenomics (using mass spectroscopy to identify proteins predicted from genomic sequences) has emerged as a promising approach to the high‐throughput identification of protein N‐termini, which remains a problem in genome annotation. Comparison of the experimentally determined N‐termini with those predicted by sequence analysis tools allows identification of the signal peptides and therefore conclusions on the cytoplasmic or extracytoplasmic (periplasmic or extracellular) localization of the respective proteins. We present here the results of a proteogenomic study of the signal peptides in Escherichia coli K‐12 and compare its results with the available experimental data and predictions by such software tools as SignalP and Phobius. A single proteogenomics experiment recovered more than a third of all signal peptides that had been experimentally determined during the past three decades and confirmed at least 31 additional signal peptides, mostly in the known exported proteins, which had been previously predicted but not validated. The filtering of putative signal peptides for the peptide length and the presence of an eight‐residue hydrophobic patch and a typical signal peptidase cleavage site proved sufficient to eliminate the false‐positive hits. Surprisingly, the results of this proteogenomics study, as well as a re‐analysis of the E. coli genome with the latest version of SignalP program, show that the fraction of proteins containing signal peptides is only about 10%, or half of previous estimates.
7.
Attempts to assess the magnitude of global biodiversity have focused on estimating species richness. However, this is but one component of biodiversity, and others, such as numbers of individuals or biomass, are at least as poorly known and just as important to quantify. Here, we use a variety of methods to estimate the global number of individuals for a single taxon, birds. The different methods yield surprisingly consistent estimates of a global bird population of between 200 billion and 400 billion individuals (1 billion = 10^9). We discuss some of the implications of this figure.
8.
The predisposition to develop a majority of autoimmune diseases is associated with specific genes within the human leukocyte antigen (HLA) complex. However, it is frequently difficult to determine which of the many genes of the HLA complex are directly involved in the disease process. The main reasons for these difficulties are the complexity of associations where several HLA complex genes might be involved, and the strong linkage disequilibrium that exists between the genes in this complex. The latter phenomenon leads to secondary disease associations, or what has been called 'hitchhiking polymorphisms'. Here, we give an overview of the complexity of HLA associations in autoimmune disease, focusing on type 1 diabetes and trying to answer the question: how many and which HLA genes are directly involved?
9.
Sterck L Rombauts S Vandepoele K Rouzé P Van de Peer Y 《Current opinion in plant biology》2007,10(2):199-203
Annotation of the first few complete plant genomes has revealed that plants have many genes. For Arabidopsis, over 26,500 gene loci have been predicted, whereas for rice, the number adds up to 41,000. Recent analysis of the poplar genome suggests more than 45,000 genes, and partial sequence data from Medicago and Lotus also suggest that these plants contain more than 40,000 genes. Nevertheless, estimations suggest that ancestral angiosperms had no more than 12,000-14,000 genes. One explanation for the large increase in gene number during angiosperm evolution is gene duplication. It has been shown previously that the retention of duplicates following small- and large-scale duplication events in plants is substantial. Taking into account the function of genes that have been duplicated, we are now beginning to understand why many plant genes might have been retained, and how their retention might be linked to the typical lifestyle of plants.
10.
How essential are nonessential genes? Total citations: 8 (self-citations: 0, citations by others: 8)
Gene essentiality in bacteria has been identified in silico, focusing on gene persistence, or experimentally, focusing on the growth of knockouts in rich media. Comparing 55 genomes of Firmicutes and Gamma-proteobacteria to identify the genes which, while persistent among genomes, do not lead to a lethal phenotype when inactivated, we show that the characteristics of persistence, conservation, expression, and location are shared between persistent nonessential (PNE) genes and experimentally essential genes. PNE genes show an overrepresentation of genes related to maintenance and stress response. This outlines the limits of current experimental techniques to define gene essentiality and highlights the essential role of genes implicated in maintenance which, although dispensable for growth, are not dispensable from an evolutionary point of view. Firmicutes and Gamma-proteobacteria are mostly differing in the construction of the cell envelope, DNA replication and proofreading, and RNA degradation. In addition to suggesting functions for persistent genes that had until now resisted identification, we show that these genes have many characters in common with experimentally identified essential genes. They should then be regarded as truly essential genes.
11.
Plants contain far more carbohydrate-active enzyme-encoding genes than any other organism sequenced to date. The extremely large number of glycosidase and glycosyltransferase-related genes in plant genomes can be explained by the complex structure of the plant cell wall, by ancient genome duplication and by recent local duplications, but also by the recent emergence of novel and unrelated protein functions based on widely available pre-existing scaffolds.
12.
13.
How many membrane proteins are there? Total citations: 9 (self-citations: 1, citations by others: 8)
D. Boyd C. Schierle J. Beckwith 《Protein science : a publication of the Protein Society》1998,7(1):201-205
One of the basic issues that arises in functional genomics is the ability to predict the subcellular location of proteins that are deduced from gene and genome sequencing. In particular, one would like to be able to readily specify those proteins that are soluble and those that are inserted in a membrane. Traditional methods of distinguishing between these two locations have relied on extensive, time-consuming biochemical studies. The alternative approach has been to make inferences based on a visual search of the amino acid sequences of presumed gene products for stretches of hydrophobic amino acids. This numerical, sequence-based approach is usually seen as a first approximation pending more reliable biochemical data. The recent availability of large and complete sequence data sets for several organisms allows us to determine just how accurate such a numerical approach could be, and to attempt to minimize and quantify the error involved. We have optimized a statistical approach to protein location determination. Using our approach, we have determined that surprisingly few proteins are misallocated using the numerical method. We also examine the biological implications of the success of this technique.
14.
Zhi-Xin Wang 《Proteins》1996,26(2):186-191
Many protein structures have now been determined and reveal that protein molecules can adopt the same fold despite having very different sequences. It has been suggested that, owing to different stereochemical constraints, the number of ways that a sequence can fold may be limited. Therefore, it is reasonable to ask how many fold types exist in nature. Several groups have tackled this problem with very different results. In the present study, a novel statistical sampling approach is used to reestimate this number. The results suggest that the number of protein folds in nature is probably several hundreds. © 1996 Wiley-Liss, Inc.
15.
N. M. Korovchinsky 《Hydrobiologia》1996,321(3):191-204
An estimation of the number of taxa within families, genera and local faunas of Cladocera reveals that only c. 129 species (17% of all known species) may be considered as sufficiently well described (valid species), and c. 146 as rather well described (fair species) but needing further study using modern methods of investigation. The status of all other species is vague. The families Chydoridae, Daphniidae and Sididae and genera Diaphanosoma, Daphnia (including Daphniopsis), Megafenestra, Scapholeberis, Eurycercus, Chydorus, Ephemeroporus and Pleuroxus have been comparatively studied best. The largest number of valid species is known from Europe, North America, Australia and South America, and the smallest number from Africa. The presence of a large number of vague species of Cladocera negatively affects faunistic, zoogeographic, and ecological studies of continental waters. Dedicated to the memory of Professor D. J. Frey
16.
17.
Anette Schreiber Peter Schramm Hans-Jörg Hofmann 《Journal of molecular modeling》2011,17(6):1393-1400
The formation of α-turns is a possibility to reverse the direction of peptide sequences via five amino acids. In this paper, a systematic conformational analysis was performed to find the possible isolated α-turns with a hydrogen bond between the first and fifth amino acid employing the methods of ab initio MO theory in vacuum (HF/6-31G*, B3LYP/6-311+G*) and in solution (CPCM/HF/6-31G*). Only few α-turn structures with glycine and alanine backbones fulfill the geometry criteria for the i←(i+4) hydrogen bond satisfactorily. The most stable representatives agree with structures found in the Protein Data Bank. There is a general tendency to form additional hydrogen bonds for smaller pseudocycles corresponding to β- and γ-turns with better hydrogen bond geometries. Sometimes, this competition weakens or even destroys the i←(i+4) hydrogen bond leading to very stable double β-turn structures. This is also the reason why an “ideal” α-turn with three central amino acids having the perfect backbone angle values of an α-helix could not be localized. There are numerous hints for stable α-turns with a distance between the Cα-atoms of the first and fifth amino acid smaller than 6-7 Å, but without an i←(i+4) hydrogen bond.
18.
19.
Owing to developments in DNA-PCR techniques, more and more systems with a high number of alleles have been established in twin diagnosis. Because of their high effectiveness in resolving genetic questions, it is not surprising that some authors have postulated that typing of 5 to 10 DNA-PCR systems can prove monozygosity. For this paper, the use of different systems (conventional and PCR systems) has been tested for twin diagnosis and the observed effects are discussed.
20.
How many species of cichlid fishes are there in African lakes? Total citations: 14 (self-citations: 0, citations by others: 14)
The endemic cichlid fishes of Lakes Malawi, Tanganyika and Victoria are textbook examples of explosive speciation and adaptive radiation, and their study promises to yield important insights into these processes. Accurate estimates of species richness of lineages in these lakes, and elsewhere, will be a necessary prerequisite for a thorough comparative analysis of the intrinsic and extrinsic factors influencing rates of diversification. This review presents recent findings on the discoveries of new species and species flocks and critically appraises the relevant evidence on species richness from recent studies of polymorphism and assortative mating, generally using behavioural and molecular methods. Within the haplochromines, the most species-rich lineage, there are few reported cases of postzygotic isolation, and these are generally among allopatric taxa that are likely to have diverged a relatively long time in the past. However, many taxa, including many which occur sympatrically and do not interbreed in nature, produce viable, fertile hybrids. Prezygotic barriers are more important, and persist in laboratory conditions in which environmental factors have been controlled, indicating the primary importance of direct mate preferences. Studies to date indicate that estimates of alpha (within-site) diversity appear to be robust. Although within-species colour polymorphisms are common, these have been taken into account in previous estimates of species richness. However, overall estimates of species richness in Lakes Malawi and Victoria are heavily dependent on the assignation of species status to allopatric populations differing in male colour. Appropriate methods for testing the specific status of allopatric cichlid taxa are reviewed and preliminary results presented.