首页 | 本学科首页   官方微博 | 高级检索  
     


Retrieving taxa names from large biodiversity data collections using a flexible matching workflow
Affiliation:1. Vrije Universiteit Brussel (VUB), Brussels, Belgium;2. Istituto di Scienza e Tecnologie dell''Informazione A. Faedo, CNR, Pisa, Italy;3. WorldFish, Penang, Malaysia;4. Fisheries and Aquaculture Department, Statistics and Information (FIPS), FAO, Rome, Italy;5. FishBase Information and Research Group, Inc. (FIN), Los Baños, Laguna, Philippines;1. U.S. Geological Survey, Southwest Biological Science Center, Rm. 123, University of Arizona, 1110 E. So. Campus Drive, Tucson, AZ 85719, United States;2. U.S. Geological Survey, Core Science Analytics and Synthesis, Denver Federal Center Bldg. 810, Lakewood, CO 80225, United States;3. U.S. Geological Survey, USA National Phenology Network, 1955 E 6th St., Tucson, AZ 85721, United States;4. NOAA Fisheries Service, Office of Science and Technology, 1315 East West Highway, Silver Spring, MD 20910, United States
Abstract:In the domain of biological classification there are several taxon name matching services that can search for a species scientific name in a large collection of taxonomic names. Many of these services are available online, and many others run on computers of individual scientists. While these systems may work very well, most suffer from the fact that the list of names used as a reference, and the criteria to decide on a match, are hard-coded in the engine that performs the name matching. In this paper we present BiOnym, a taxon name matching system that separates reference namelists, search criteria and matching engine. The user is offered a choice of several taxonomic reference lists, including the option to upload his/her own list onto the system. Furthermore, BiOnym is a flexible workflow, which embeds and combines techniques using lexical matching algorithms as well as expert knowledge. It is also an open platform allowing developers to contribute with new techniques. In this paper we demonstrate the benefits brought by this approach in terms of the efficiency and effectiveness of the information retrieval process with respect to other solutions.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号