首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
The Linnaean system of nomenclature has been used and adapted by biologists over a period of almost 250 years. Under the current system of codes, it is now applied to more than 2 million species of organisms. Inherent in the Linnaean system is the indication of hierarchical relationships. The Linnaean system has been justified primarily on the basis of stability. Stability can be assessed on at least two grounds: the absolute stability of names, irrespective of taxonomic concept; and the stability of names under changing concepts. Recent arguments have invoked conformity to phylogenetic methods as the primary basis for choice of nomenclatural systems, but even here stability of names as they relate to monophyletic groups is stated as the ultimate objective. The idea of absolute stability as the primary justification for nomenclatural methods was wrong from the start. The reasons are several. First, taxa are concepts, no matter the frequency of assertions to the contrary; as such, they are subject to change at all levels and always will be, with the consequence that to some degree the names we use to refer to them will also be subject to change. Second, even if the true nature of all taxa could be agreed upon, the goal would require that we discover them all and correctly recognize them for what they are. Much of biology is far from that goal at the species level and even further for supraspecific taxa. Nomenclature serves as a tool for biology. Absolute stability of taxonomic concepts—and nomenclature—would hinder scientific progress rather than promote it. It can been demonstrated that the scientific goals of systematists are far from achieved. Thus, the goal of absolute nomenclatural stability is illusory and misguided. The primary strength of the Linnaean system is its ability to portray hierarchical relationships; stability is secondary. No single system of nomenclature can ever possess all desirable attributes: i.e., convey information on hierarchical relationships, provide absolute stability in the names portraying those relationships, and provide simplicity and continuity in communicating the identities of the taxa and their relationships. Aside from myriad practical problems involved in its implementation, it must be concluded that “phylogenetic nomenclature” would not provide a more stable and effective system for communicating information on biological classifications than does the Linnaean system.  相似文献   

3.
Collaborative effort among four lead indexes of taxon names and nomenclatural acts ( International Plant Name Index (IPNI), Index Fungorum, MycoBank and ZooBank) and the journals PhytoKeys, MycoKeys and ZooKeys to create an automated, pre-publication, registration workflow, based on a server-to-server, XML request/response model. The registration model for ZooBank uses the TaxPub schema, which is an extension to the Journal Tag Publishing Suite (JATS) of the National Library of Medicine (NLM). The indexing or registration model of IPNI and Index Fungorum will use the Taxonomic Concept Transfer Schema (TCS) as a basic standard for the workflow. Other journals and publishers who intend to implement automated, pre-publication, registration of taxon names and nomenclatural acts can also use the open sample XML formats and links to schemas and relevant information published in the paper.  相似文献   

4.
MOTIVATION: As research into disease pathology and cellular function continues to generate vast amounts of data pertaining to protein, gene and small molecule (PGSM) interactions, there exists a critical need to capture these results in structured formats allowing for computational analysis. Although many efforts have been made to create databases that store this information in computer readable form, populating these sources largely requires a manual process of interpreting and extracting interaction relationships from the biological research literature. Being able to efficiently and accurately automate the extraction of interactions from unstructured text, would greatly improve the content of these databases and provide a method for managing the continued growth of new literature being published. RESULTS: In this paper, we describe a system for extracting PGSM interactions from unstructured text. By utilizing a lexical analyzer and context free grammar (CFG), we demonstrate that efficient parsers can be constructed for extracting these relationships from natural language with high rates of recall and precision. Our results show that this technique achieved a recall rate of 83.5% and a precision rate of 93.1% for recognizing PGSM names and a recall rate of 63.9% and a precision rate of 70.2% for extracting interactions between these entities. In contrast to other published techniques, the use of a CFG significantly reduces the complexities of natural language processing by focusing on domain specific structure as opposed to analyzing the semantics of a given language. Additionally, our approach provides a level of abstraction for adding new rules for extracting other types of biological relationships beyond PGSM relationships. AVAILABILITY: The program and corpus are available by request from the authors.  相似文献   

5.
Amphibians, reptiles, birds and mammals serve as hosts for 19 species of Cryptosporidium. All 19 species have been confirmed by morphological, biological, and molecular data. Fish serve as hosts for three additional species, all of which lack supporting molecular data. In addition to the named species, gene sequence data from more than 40 isolates from various vertebrate hosts are reported in the scientific literature or are listed in GenBank. These isolates lack taxonomic status and are referred to as genotypes based on the host of origin. Undoubtedly, some will eventually be recognized as species. For them to receive taxonomic status sufficient morphological, biological, and molecular data are required and names must comply with the rules of the International Code for Zoological Nomenclature (ICZN). Because the ICZN rules may be interpreted differently by persons proposing names, original names might be improperly assigned, original literature might be overlooked, or new scientific methods might be applicable to determining taxonomic status, the names of species and higher taxa are not immutable. The rapidly evolving taxonomic status of Cryptosporidium sp. reflects these considerations.  相似文献   

6.
Feminist news media researchers have long contended that masculine news values shape journalists’ quotidian decisions about what is newsworthy. As a result, it is argued, topics and issues traditionally regarded as primarily of interest and relevance to women are routinely marginalised in the news, while men’s views and voices are given privileged space. When women do show up in the news, it is often as “eye candy,” thus reinforcing women’s value as sources of visual pleasure rather than residing in the content of their views. To date, evidence to support such claims has tended to be based on small-scale, manual analyses of news content. In this article, we report on findings from our large-scale, data-driven study of gender representation in online English language news media. We analysed both words and images so as to give a broader picture of how gender is represented in online news. The corpus of news content examined consists of 2,353,652 articles collected over a period of six months from more than 950 different news outlets. From this initial dataset, we extracted 2,171,239 references to named persons and 1,376,824 images resolving the gender of names and faces using automated computational methods. We found that males were represented more often than females in both images and text, but in proportions that changed across topics, news outlets and mode. Moreover, the proportion of females was consistently higher in images than in text, for virtually all topics and news outlets; women were more likely to be represented visually than they were mentioned as a news actor or source. Our large-scale, data-driven analysis offers important empirical evidence of macroscopic patterns in news content concerning the way men and women are represented.  相似文献   

7.
MOTIVATION: Despite substantial efforts to develop and populate the back-ends of biological databases, front-ends to these systems often rely on taxonomic expertise. This research applies techniques from human-computer interaction research to the biodiversity domain. RESULTS: We developed an interactive node-link tool, TaxonTree, illustrating the value of a carefully designed interaction model, animation, and integrated searching and browsing towards retrieval of biological names and other information. Users tested the tool using a new, large integrated dataset of animal names with phylogenetic-based and classification-based tree structures. These techniques also translated well for a tool, DoubleTree, to allow comparison of trees using coupled interaction. Our approaches will be useful not only for biological data but as general portal interfaces.  相似文献   

8.
9.
Overviews are provided for traditional and phylogenetic nomenclature. In traditional nomenclature, a name is provided with a type and a rank. In the rankless phylogenetic nomenclature, a taxon name is provided with an explicit phylogenetic definition, which attaches the name to a clade. Linnaeus’s approach to nomenclature is also reviewed, and it is shown that, although the current system of nomenclature does use some Linnaean conventions (e.g., certain rank-denoting terms, binary nomenclature), it is actually quite different from Linnaean nomenclature. The primary differences between traditional and phylogenetic nomenclature are reviewed. In phylogenetic nomenclature, names are provided with explicit phylogenetic definitions, whereas in traditional nomenclature names are not explicitly defined. In phylogenetic nomenclature, a name remains attached to a clade regardless of how future changes in phylogeny alter the clade’s content; in traditional nomenclature a name is not “married” to any particular clade. In traditional nomenclature, names must be assigned ranks (an admittedly arbitrary process), whereas in phylogenetic nomenclature there are no formal ranks. Therefore, in phylogenetic nomenclature, the name itself conveys no hierarchical information, and the name conveys nothing regarding set exclusivity. It is concluded that the current system is better able to handle new and unexpected changes in ideas about taxonomic relationships. This greater flexibility, coupled with the greater information content that the names themselves (i.e., when used outside the context of a given taxonomy or phytogeny) provide, makes the current system better designed for use by all users of taxon names.  相似文献   

10.
11.

Background

The digitization of biodiversity data is leading to the widespread application of taxon names that are superfluous, ambiguous or incorrect, resulting in mismatched records and inflated species numbers. The ultimate consequences of misspelled names and bad taxonomy are erroneous scientific conclusions and faulty policy decisions. The lack of tools for correcting this ‘names problem’ has become a fundamental obstacle to integrating disparate data sources and advancing the progress of biodiversity science.

Results

The TNRS, or Taxonomic Name Resolution Service, is an online application for automated and user-supervised standardization of plant scientific names. The TNRS builds upon and extends existing open-source applications for name parsing and fuzzy matching. Names are standardized against multiple reference taxonomies, including the Missouri Botanical Garden's Tropicos database. Capable of processing thousands of names in a single operation, the TNRS parses and corrects misspelled names and authorities, standardizes variant spellings, and converts nomenclatural synonyms to accepted names. Family names can be included to increase match accuracy and resolve many types of homonyms. Partial matching of higher taxa combined with extraction of annotations, accession numbers and morphospecies allows the TNRS to standardize taxonomy across a broad range of active and legacy datasets.

Conclusions

We show how the TNRS can resolve many forms of taxonomic semantic heterogeneity, correct spelling errors and eliminate spurious names. As a result, the TNRS can aid the integration of disparate biological datasets. Although the TNRS was developed to aid in standardizing plant names, its underlying algorithms and design can be extended to all organisms and nomenclatural codes. The TNRS is accessible via a web interface at http://tnrs.iplantcollaborative.org/ and as a RESTful web service and application programming interface. Source code is available at https://github.com/iPlantCollaborativeOpenSource/TNRS/.  相似文献   

12.
Given the current trends, it seems inevitable that all biological documents will eventually exist in a digital format and be distributed across the internet. New network services and tools need to be developed to increase retrieval rates for documents and to refine data recovery. Biological data have traditionally been well managed using taxonomic principles. As part of a larger initiative to build an array of names-based network services that emulate taxonomic principles for managing biological information, we undertook the digitization of a major taxonomic reference text, Nomenclator Zoologicus. The process involved replicating the text to a high level of fidelity, parsing the content for inclusion within a database, developing tools to enable expert input into the product, and integrating the metadata and factual content within taxonomic network services. The result is a high-quality and freely available web application (http://uio.mbl.edu/NomenclatorZoologicus/) capable of being exploited in an array of biological informatics services.  相似文献   

13.
This study summarizes results of a DNA barcoding campaign on German Diptera, involving analysis of 45,040 specimens. The resultant DNA barcode library includes records for 2,453 named species comprising a total of 5,200 barcode index numbers (BINs), including 2,700 COI haplotype clusters without species‐level assignment, so called “dark taxa.” Overall, 88 out of 117 families (75%) recorded from Germany were covered, representing more than 50% of the 9,544 known species of German Diptera. Until now, most of these families, especially the most diverse, have been taxonomically inaccessible. By contrast, within a few years this study provided an intermediate taxonomic system for half of the German Dipteran fauna, which will provide a useful foundation for subsequent detailed, integrative taxonomic studies. Using DNA extracts derived from bulk collections made by Malaise traps, we further demonstrate that species delineation using BINs and operational taxonomic units (OTUs) constitutes an effective method for biodiversity studies using DNA metabarcoding. As the reference libraries continue to grow, and gaps in the species catalogue are filled, BIN lists assembled by metabarcoding will provide greater taxonomic resolution. The present study has three main goals: (a) to provide a DNA barcode library for 5,200 BINs of Diptera; (b) to demonstrate, based on the example of bulk extractions from a Malaise trap experiment, that DNA barcode clusters, labelled with globally unique identifiers (such as OTUs and/or BINs), provide a pragmatic, accurate solution to the “taxonomic impediment”; and (c) to demonstrate that interim names based on BINs and OTUs obtained through metabarcoding provide an effective method for studies on species‐rich groups that are usually neglected in biodiversity research projects because of their unresolved taxonomy.  相似文献   

14.
As metagenomic studies continue to increase in their number, sequence volume and complexity, the scalability of biological analysis frameworks has become a rate-limiting factor to meaningful data interpretation. To address this issue, we have developed JCVI Metagenomics Reports (METAREP) as an open source tool to query, browse, and compare extremely large volumes of metagenomic annotations. Here we present improvements to this software including the implementation of a dynamic weighting of taxonomic and functional annotation, support for distributed searches, advanced clustering routines, and integration of additional annotation input formats. The utility of these improvements to data interpretation are demonstrated through the application of multiple comparative analysis strategies to shotgun metagenomic data produced by the National Institutes of Health Roadmap for Biomedical Research Human Microbiome Project (HMP) (http://nihroadmap.nih.gov). Specifically, the scalability of the dynamic weighting feature is evaluated and established by its application to the analysis of over 400 million weighted gene annotations derived from 14 billion short reads as predicted by the HMP Unified Metabolic Analysis Network (HUMAnN) pipeline. Further, the capacity of METAREP to facilitate the identification and simultaneous comparison of taxonomic and functional annotations including biological pathway and individual enzyme abundances from hundreds of community samples is demonstrated by providing scenarios that describe how these data can be mined to answer biological questions related to the human microbiome. These strategies provide users with a reference of how to conduct similar large-scale metagenomic analyses using METAREP with their own sequence data, while in this study they reveal insights into the nature and extent of variation in taxonomic and functional profiles across body habitats and individuals. Over one thousand HMP WGS datasets and the latest open source code are available at http://www.jcvi.org/hmp-metarep.  相似文献   

15.
Ceci n'est pas une pipe: names, clades and phylogenetic nomenclature   总被引:2,自引:0,他引:2  
An introduction is provided to the literature and to issues relating to phylogenetic nomenclature and the PhyloCode, together with a critique of the current Linnaean system of nomenclature. The Linnaean nomenclature fixes taxon names with types, and associates the names with ranks (genus, family, etc.). In phylogenetic nomenclature, names are instead defined with reference to cladistic relationships, and the names are not associated with ranks. We argue that taxon names under the Linnaean system are unclear in meaning and provide unstable group–name associations, notwithstanding whether or not there are agreements on relationships. Furthermore, the Linnaean rank assignments lack justification and invite unwarranted comparisons across taxa. On the contrary, the intention of taxon names in phylogenetic nomenclature is clear and stable, and the application of the names will be unambiguous under any given cladistic hypothesis. The extension of the names reflects current knowledge of relationships, and will shift as new hypotheses are forwarded. The extension of phylogenetic names is, therefore, clear but is associated to (and thus dependent upon) cladistic hypotheses. Stability in content can be maximized with carefully formulated name definitions. A phylogenetic nomenclature will shift the focus from discussions of taxon names towards the understanding of relationships. Also, we contend that species should not be recognized as taxonomic units. The term ‘species’ is ambiguous, it mixes several distinct classes of entities, and there is a large gap between most of the actual concepts and the evidence available to identify the entities. Instead, we argue that only clades should be recognized. Among these, it is useful to tag the smallest named clades, which all represent non-overlapping groups. Such taxa  – LITUs (Least Inclusive Taxonomic Units) – are distinguished from more inclusive clades by being spelled with lower-case initial letter. In contrast to species, LITUs are conceptually straightforward and are, like other clades, identified by apomorphies.  相似文献   

16.
根据物种学名、分类号、任意一段核酸或蛋白质的序列,判定其属于什么物种及其详细分类的信息如何,是生物信息分析的最为基础且重要的环节,但该过程的分析及结果的获取均为手动,费时费力且容易出错。本研究旨在解决如何在NCBI网站上自动或批量获取物种信息。通过解析NCBI在线BLAST结果及其网页源程序特点,利用Perl语言编写自动化脚本,以达到批量获取查询或比对结果的物种分类信息。本研究编写的Perl语言脚本可解决序列在NCBI在线比对后自动或批量获取物种的分类信息问题,适用于细菌、真菌、动物、植物等物种学名、分类号、核酸或蛋白质的任意序列,可以为同行生物数据分析提供参考。  相似文献   

17.
Taxonomic indexing refers to a new array of taxonomically intelligent network services that use nomenclatural principles and elements of expert taxonomic knowledge to manage information about organisms. Taxonomic indexing was introduced to help manage the increasing amounts of digital information about biology. It has been designed to form a near basal layer in a layered cyberinfrastructure that deals with biological information. Taxonomic Indexing accommodates the special problems of using names of organisms to index biological material. It links alternative names for the same entity (reconciliation), and distinguishes between uses of the same name for different entities (disambiguation), and names are placed within an indefinite number of hierarchical schemes. In order to access all information on all organisms, Taxonomic indexing must be able to call on a registry of all names in all forms for all organisms. NameBank has been developed to meet that need. Taxonomic indexing is an area of informatics that overlaps with taxonomy, is dependent on the expert input of taxonomists, and reveals the relevance of the discipline to a wide audience.  相似文献   

18.
The correct use of taxonomic names must become widespread if a clear understanding of primate paleontology is to exist among anthropologists. Physical anthropologists are urged to acquire genuine competence in the paleontology, systematics, and taxonomy of mammals. Examples are given of improper taxonomic procedure and of the perpetuation of invalid names. The need for a stable and correct nomenclature of the primates is emphasized. The importance of examining actual fossil specimens is stressed. The taxonomy of the Hominoidea is discussed and a summary of invalid names in current use is given. Recently discovered fossils from Oligocene strata in the Egyptian Fayum are figured and the pertinence of these to the origins of higher primates is suggested.  相似文献   

19.
The RESID Database is a comprehensive collection of annotations and structures for protein post-translational modifications including N-terminal, C-terminal and peptide chain cross-link modifications. The RESID Database includes systematic and frequently observed alternate names, Chemical Abstracts Service registry numbers, atomic formulas and weights, enzyme activities, taxonomic range, keywords, literature citations with database cross-references, structural diagrams and molecular models. The NRL-3D Sequence-Structure Database is derived from the three-dimensional structure of proteins deposited with the Research Collaboratory for Structural Bioinformatics Protein Data Bank. The NRL-3D Database includes standardized and frequently observed alternate names, sources, keywords, literature citations, experimental conditions and searchable sequences from model coordinates. These databases are freely accessible through the National Cancer Institute-Frederick Advanced Biomedical Computing Center at these web sites: http://www. ncifcrf.gov/RESID, http://www.ncifcrf.gov/NRL-3D; or at these National Biomedical Research Foundation Protein Information Resource web sites: http://pir.georgetown.edu/pirwww/dbinfo/resid .html, http://pir.georgetown.edu/pirwww/dbinfo/nrl3d .html  相似文献   

20.
段维军  严进  刘芳  蔡磊  朱水芳 《菌物学报》2015,34(5):942-960
中华人民共和国现行进境检疫性菌物名录中共有130种。近年来由于菌物分类研究的快速发展,许多检疫性菌物的分类地位已经发生变化。本文对我国进境检疫性菌物名录中的名称与国际公认的分类体系和现用名进行了初步的比较和分析,进一步以茎点霉属和轮枝菌属为例说明分类系统的变化对菌物名称的影响。另外,名录中很多汉语学名的使用也不符合规范。我国进境检疫性菌物名录亟需修订。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号