BIOSMILE: A semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features |
| |
Authors: | Richard Tzong-Han Tsai Wen-Chi Chou Ying-Shan Su Yu-Chun Lin Cheng-Lung Sung Hong-Jie Dai Irene Tzu-Hsuan Yeh Wei Ku Ting-Yi Sung Wen-Lian Hsu |
| |
Affiliation: | (1) Institute of Information Science, Academia Sinica, Nankang, Taipei 115 PRoC, Taiwan;(2) Institute of Human Nutrition, Columbia University, New York, NY 10032, USA;(3) Biological Sciences & Psychology, Mellon College of Sciences, Carnegie Mellon University, Pittsburgh, PA, USA |
| |
Abstract: | Background Bioinformatics tools for automatic processing of biomedical literature are invaluable for both the design and interpretation of large-scale experiments. Many information extraction (IE) systems that incorporate natural language processing (NLP) techniques have thus been developed for use in the biomedical field. A key IE task in this field is the extraction of biomedical relations, such as protein-protein and gene-disease interactions. However, most biomedical relation extraction systems usually ignore adverbial and prepositional phrases and words identifying location, manner, timing, and condition, which are essential for describing biomedical relations. Semantic role labeling (SRL) is a natural language processing technique that identifies the semantic roles of these words or phrases in sentences and expresses them as predicate-argument structures. We construct a biomedical SRL system called BIOSMILE that uses a maximum entropy (ME) machine-learning model to extract biomedical relations. BIOSMILE is trained on BioProp, our semi-automatic, annotated biomedical proposition bank. Currently, we are focusing on 30 biomedical verbs that are frequently used or considered important for describing molecular events. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|