Detailed Information

Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition

| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Wangjin | - |
| dc.contributor.author | Choi, Jinwook | - |
| dc.creator | 최진욱 | - |
| dc.date.accessioned | 2020-01-23T07:39:32Z | - |
| dc.date.available | 2020-04-05T07:39:32Z | - |
| dc.date.created | 2019-11-25 | - |
| dc.date.issued | 2019-07-15 | - |
| dc.identifier.citation | BMC Medical Informatics and Decision Making, Vol.19 No.1, p. 132 | - |
| dc.identifier.issn | 1472-6947 | - |
| dc.identifier.uri | https://hdl.handle.net/10371/163870 | - |
| dc.description.abstract | Background: This paper presents a conditional random fields (CRF) method that enables the capture of specific high-order label transition factors to improve clinical named entity recognition performance. Consecutive clinical entities in a sentence are usually separated from each other, and the textual descriptions in clinical narrative documents frequently indicate causal or posterior relationships that can be used to facilitate clinical named entity recognition. However, the CRF that is generally used for named entity recognition is a first-order model that restricts label transition dependencies to adjoining labels under the Markov assumption. Methods: Based on the first-order structure, our proposed model utilizes non-entity tokens between separated entities as an information transmission medium by applying a label induction method. The model is referred to as precursor-induced CRF because its non-entity state memorizes precursor entity information, and the model's structure allows the precursor entity information to propagate forward through the label sequence. Results: We compared the proposed model with both first- and second-order CRFs in terms of their F1-scores, using two clinical named entity recognition corpora (the i2b2 2012 challenge and the Seoul National University Hospital electronic health record). The proposed model demonstrated better entity recognition performance than both the first- and second-order CRFs and was also more efficient than the higher-order model. Conclusion: The proposed precursor-induced CRF, which uses non-entity labels as label transition information, improves the entity recognition F1-score by exploiting long-distance transition factors without exponentially increasing the computational time. In contrast, a conventional second-order CRF model that uses longer-distance transition factors showed even worse results than the first-order model and required the longest computation time. Thus, the proposed model could offer a considerable performance improvement over current clinical named entity recognition methods based on CRF models. | - |
| dc.language | English | - |
| dc.language.iso | ENG | en |
| dc.publisher | BioMed Central | - |
| dc.title | Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1186/s12911-019-0865-1 | - |
| dc.citation.journaltitle | BMC Medical Informatics and Decision Making | - |
| dc.identifier.wosid | 000475733400001 | - |
| dc.identifier.scopusid | 2-s2.0-85069160486 | - |
| dc.description.srnd | OAIID:RECH_ACHV_DSTSH_NO:T201915223 | - |
| dc.description.srnd | RECH_ACHV_FG:RR00200001 | - |
| dc.description.srnd | ADJUST_YN: | - |
| dc.description.srnd | EMP_ID:A079476 | - |
| dc.description.srnd | CITE_RATE:2.134 | - |
| dc.description.srnd | DEPT_NM:Department of Medicine | - |
| dc.description.srnd | EMAIL:jinchoi@snu.ac.kr | - |
| dc.description.srnd | SCOPUS_YN:Y | - |
| dc.citation.number | 1 | - |
| dc.citation.startpage | 132 | - |
| dc.citation.volume | 19 | - |
| dc.description.isOpenAccess | Y | - |
| dc.contributor.affiliatedAuthor | Choi, Jinwook | - |
| dc.identifier.srnd | T201915223 | - |
| dc.type.docType | Article | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordPlus | IDENTIFICATION | - |
| dc.subject.keywordPlus | ASSERTIONS | - |
| dc.subject.keywordPlus | EXTRACTION | - |
| dc.subject.keywordPlus | SYSTEM | - |
| dc.subject.keywordAuthor | Clinical named entity recognition | - |
| dc.subject.keywordAuthor | Conditional random fields | - |
| dc.subject.keywordAuthor | High-order dependency | - |
| dc.subject.keywordAuthor | Clinical natural language processing | - |
| dc.subject.keywordAuthor | Induction method | - |
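
Illustrative note on the abstract: the label induction idea described under Methods can be sketched as a preprocessing step on the training labels. This is only a rough sketch under stated assumptions, not the authors' implementation. The abstract does not give the label scheme, so the BIO input tags, the `O@TYPE` naming, and the function names `induce_precursor_labels` and `strip_induction` are all hypothetical.

```python
from typing import List, Optional

OUTSIDE = "O"  # non-entity label in the (assumed) BIO scheme


def induce_precursor_labels(labels: List[str], sep: str = "@") -> List[str]:
    """Relabel each non-entity token with the type of the most recent entity.

    An "O" token that follows an entity becomes "O@<TYPE>" (hypothetical
    naming); "O" tokens before the first entity are left unchanged.
    """
    induced: List[str] = []
    precursor: Optional[str] = None  # type of the latest entity seen so far
    for label in labels:
        if label == OUTSIDE:
            if precursor is None:
                induced.append(OUTSIDE)
            else:
                induced.append(f"{OUTSIDE}{sep}{precursor}")
        else:
            # "B-PROBLEM" or "I-PROBLEM" -> "PROBLEM"
            precursor = label.split("-", 1)[1]
            induced.append(label)
    return induced


def strip_induction(labels: List[str], sep: str = "@") -> List[str]:
    """Map induced labels back to plain BIO labels for evaluation."""
    return [label.split(sep, 1)[0] for label in labels]


if __name__ == "__main__":
    gold = ["O", "B-PROBLEM", "I-PROBLEM", "O", "O", "B-TREATMENT", "O"]
    print(induce_precursor_labels(gold))
```

With labels induced this way, a first-order CRF could in principle learn a transition such as `O@PROBLEM` to `B-TREATMENT`, so the non-entity tokens carry information about the preceding entity forward without a higher-order model. Predictions would be mapped back with `strip_induction` before scoring.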
Files in This Item:
There are no files associated with this item.

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.