An annotated corpus from biomedical articles to construct a drug-food interaction database


Cited 5 times in Web of Science; cited 5 times in Scopus.

Abstract

Motivation: While drug-food interaction (DFI) may undermine the efficacy and safety of drugs, DFI detection has been difficult because a well-organized database for DFI did not exist. To construct a DFI database and build a natural language processing system that extracts DFI from biomedical articles, we formulated the DFI extraction tasks and manually annotated texts that could contain DFI information. In this article, we introduce a new annotated corpus for extracting DFI, the DFI corpus.

Results: The DFI corpus contains 2270 abstracts of biomedical articles accessible through PubMed and 2498 sentences that contain DFI and/or drug-drug interaction (DDI) information, together with a substantial amount of information about drug/food entities, evidence levels of abstracts, and relations between named entities. BERT models pre-trained on the biomedical domain achieved an F1 score of 55.0% in extracting DFI key-sentences. To the best of our knowledge, the DFI corpus is the largest public corpus for drug-food interaction.

Availability and implementation: Our corpus is available at https://github.com/ccadd-snu/corpus-for-DFI-extraction.
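For readers unfamiliar with the metric reported in the Results, the following is a minimal sketch of how an F1 score can be computed for a binary key-sentence classifier. It assumes F1 is measured on the positive (key-sentence) class, which the abstract does not specify; the function name `binary_f1` and the labels are hypothetical and purely illustrative, not taken from the DFI corpus.

```python
def binary_f1(gold, predicted):
    """Return (precision, recall, f1) for binary labels.

    Assumption: label 1 marks a DFI key-sentence, label 0 any other sentence.
    """
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == 1 and p == 1)  # true positives
    fp = sum(1 for g, p in pairs if g == 0 and p == 1)  # false positives
    fn = sum(1 for g, p in pairs if g == 1 and p == 0)  # false negatives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Made-up labels for six sentences (illustration only, not corpus data).
gold = [1, 0, 1, 1, 0, 0]
predicted = [1, 1, 0, 1, 0, 0]

precision, recall, f1 = binary_f1(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Because F1 is the harmonic mean of precision and recall, a score of 55.0% indicates that the model misses a meaningful share of key-sentences, flags a meaningful share of non-key-sentences, or both.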
