Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition

Lee, Wangjin; Choi, Jinwook

doi:10.1186/s12911-019-0865-1

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition

DC Field	Value	Language
dc.contributor.author	Lee, Wangjin	-
dc.contributor.author	Choi, Jinwook	-
dc.date.accessioned	2019-07-23T07:54:09Z	-
dc.date.available	2019-07-23T16:55:19Z	-
dc.date.issued	2019-07-15	-
dc.identifier.citation	BMC Medical Informatics and Decision Making. 19(1):132	ko_KR
dc.identifier.issn	1472-6947	-
dc.identifier.uri	https://hdl.handle.net/10371/160718	-
dc.description.abstract	Background This paper presents a conditional random fields (CRF) method that enables the capture of specific high-order label transition factors to improve clinical named entity recognition performance. Consecutive clinical entities in a sentence are usually separated from each other, and the textual descriptions in clinical narrative documents frequently indicate causal or posterior relationships that can be used to facilitate clinical named entity recognition. However, the CRF that is generally used for named entity recognition is a first-order model that constrains label transition dependency of adjoining labels under the Markov assumption. Methods Based on the first-order structure, our proposed model utilizes non-entity tokens between separated entities as an information transmission medium by applying a label induction method. The model is referred to as precursor-induced CRF because its non-entity state memorizes precursor entity information, and the models structure allows the precursor entity information to propagate forward through the label sequence. Results We compared the proposed model with both first- and second-order CRFs in terms of their F1-scores, using two clinical named entity recognition corpora (the i2b2 2012 challenge and the Seoul National University Hospital electronic health record). The proposed model demonstrated better entity recognition performance than both the first- and second-order CRFs and was also more efficient than the higher-order model. Conclusion The proposed precursor-induced CRF which uses non-entity labels as label transition information improves entity recognition F1 score by exploiting long-distance transition factors without exponentially increasing the computational time. In contrast, a conventional second-order CRF model that uses longer distance transition factors showed even worse results than the first-order model and required the longest computation time. Thus, the proposed model could offer a considerable performance improvement over current clinical named entity recognition methods based on the CRF models.	ko_KR
dc.description.sponsorship	This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education [No. NRF-2015R1D1A1A01058075]; and also supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health &Welfare, Republic of Korea [grant number HI14C1277].	ko_KR
dc.language.iso	en	ko_KR
dc.publisher	BioMed Central	ko_KR
dc.subject	Clinical named entity recognition	ko_KR
dc.subject	Conditional random fields	ko_KR
dc.subject	High-order dependency	ko_KR
dc.subject	Clinical natural language processing	ko_KR
dc.subject	Induction method	ko_KR
dc.title	Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition	ko_KR
dc.type	Article	ko_KR
dc.contributor.AlternativeAuthor	이완진	-
dc.contributor.AlternativeAuthor	최진욱	-
dc.identifier.doi	10.1186/s12911-019-0865-1	-
dc.language.rfc3066	en	-
dc.rights.holder	The Author(s).	-
dc.date.updated	2019-07-21T03:31:44Z	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Program in Bioengineering (협동과정-바이오엔지니어링전공)
  - Journal Papers (저널논문_협동과정-바이오엔지니어링전공)

College of Medicine/School of Medicine (의과대학/대학원)
- Biomedical Engineering (의공학전공)
  - Journal Papers (저널논문_의공학전공)

Files in This Item:

12911_2019_Article_865.pdf 1.81 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share