srBERT: automatic article classification model for systematic review using BERT

Aum, Sungmin; Choe, Seon

doi:10.1186/s13643-021-01763-w

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

srBERT: automatic article classification model for systematic review using BERT

DC Field	Value	Language
dc.contributor.author	Aum, Sungmin	-
dc.contributor.author	Choe, Seon	-
dc.date.accessioned	2022-02-18T01:21:14Z	-
dc.date.available	2022-02-18T01:21:14Z	-
dc.date.issued	2021-10-30	-
dc.identifier.citation	Systematic Reviews. 2021 Oct 30;10(1):285	ko_KR
dc.identifier.issn	2046-4053	-
dc.identifier.uri	https://hdl.handle.net/10371/176961	-
dc.description.abstract	Background Systematic reviews (SRs) are recognized as reliable evidence, which enables evidence-based medicine to be applied to clinical practice. However, owing to the significant efforts required for an SR, its creation is time-consuming, which often leads to out-of-date results. To support SR tasks, tools for automating these SR tasks have been considered; however, applying a general natural language processing model to domain-specific articles and insufficient text data for training poses challenges. Methods The research objective is to automate the classification of included articles using the Bidirectional Encoder Representations from Transformers (BERT) algorithm. In particular, srBERT models based on the BERT algorithm are pre-trained using abstracts of articles from two types of datasets, and the resulting model is then fine-tuned using the article titles. The performances of our proposed models are compared with those of existing general machine-learning models. Results Our results indicate that the proposed srBERTmy model, pre-trained with abstracts of articles and a generated vocabulary, achieved state-of-the-art performance in both classification and relation-extraction tasks; for the first task, it achieved an accuracy of 94.35% (89.38%), F1 score of 66.12 (78.64), and area under the receiver operating characteristic curve of 0.77 (0.9) on the original and (generated) datasets, respectively. In the second task, the model achieved an accuracy of 93.5% with a loss of 27%, thereby outperforming the other evaluated models, including the original BERT model. Conclusions Our research shows the possibility of automatic article classification using machine-learning approaches to support SR tasks and its broad applicability. However, because the performance of our model depends on the size and class ratio of the training dataset, it is important to secure a dataset of sufficient quality, which may pose challenges.	ko_KR
dc.description.sponsorship	The authors received no fnancial support for the research, authorship, and publication of this article.	ko_KR
dc.language.iso	en	ko_KR
dc.publisher	BMC	ko_KR
dc.subject	Systematic review	-
dc.subject	Process automation	-
dc.subject	Deep learning	-
dc.subject	Text mining	-
dc.title	srBERT: automatic article classification model for systematic review using BERT	ko_KR
dc.type	Article	ko_KR
dc.contributor.AlternativeAuthor	엄성민	-
dc.contributor.AlternativeAuthor	최선	-
dc.identifier.doi	https://doi.org/10.1186/s13643-021-01763-w	-
dc.citation.journaltitle	Systematic Reviews	ko_KR
dc.language.rfc3066	en	-
dc.rights.holder	The Author(s)	-
dc.date.updated	2021-10-31T04:22:15Z	-
dc.citation.number	1	ko_KR
dc.citation.startpage	285	ko_KR
dc.citation.volume	10	ko_KR

Appears in Collections:

College of Medicine/School of Medicine (의과대학/대학원)
- Program in Medical Informatics (협동과정-의료정보학전공)
  - Journal Papers (저널논문_협동과정-의료정보학)

Files in This Item:

13643_2021_Article_1763.pdf 1.23 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share