MBA: a literature mining system for extracting biomedical abbreviations期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

MBA: a literature mining system for extracting biomedical abbreviations

Authors:	Yun Xu ZhiHao Wang YiMing Lei YuZhong Zhao Yu Xue

Affiliation:	(1) Department of Computer Science and Technology, University of Science and Technology of China Hefei, Anhui, 230027, PR, China;(2) Anhui Province-MOST Co-Key Laboratory of High Performance Computing and Its Application Hefei, , Anhui, 230027, PR, China;(3) School of Life Science, University of Science and Technology of China Hefei, Anhui, 230027, PR, China

Abstract:	Background The exploding growth of the biomedical literature presents many challenges for biological researchers. One such challenge is from the use of a great deal of abbreviations. Extracting abbreviations and their definitions accurately is very helpful to biologists and also facilitates biomedical text analysis. Existing approaches fall into four broad categories: rule based, machine learning based, text alignment based and statistically based. State of the art methods either focus exclusively on acronym-type abbreviations, or could not recognize rare abbreviations. We propose a systematic method to extract abbreviations effectively. At first a scoring method is used to classify the abbreviations into acronym-type and non-acronym-type abbreviations, and then their corresponding definitions are identified by two different methods: text alignment algorithm for the former, statistical method for the latter.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Background