fulltext.study @t Gmail

Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature

Paper ID Volume ID Publish Year Pages File Format Full-Text
15355 1406 2008 5 PDF Available
Title
Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature
Abstract

Bio-entity name recognition is the key step for information extraction from biomedical literature. This paper presents a dictionary-based bio-entity name recognition approach. The approach expands the bio-entity name dictionary via the Abbreviation Definitions identifying algorithm, improves the recall rate through the improved edit distance algorithm and adopts some post-processing methods including Pre-keyword and Post-keyword expansion, Part of Speech expansion, merge of adjacent bio-entity names and the exploitation of the contextual cues to further improve the performance. Experiment results show that with this approach even an internal dictionary-based system could achieve a fairly good performance.

Keywords
Text mining; Entity recognition; Edit distance; Conditional random fields
First Page Preview
Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature
Publisher
Database: Elsevier - ScienceDirect
Journal: Computational Biology and Chemistry - Volume 32, Issue 4, August 2008, Pages 287–291
Authors
, , ,
Subjects
Physical Sciences and Engineering Chemical Engineering Bioengineering