An annotated corpus from biomedical articles to construct a drug-food interaction database

Cited 5 times in Web of Science; cited 5 times in Scopus
Authors

Kim, Siun; Choi, Yoona; Won, Jung-Hyun; Oh, Jung Mi; Lee, Howard

Issue Date
2022-02
Publisher
Academic Press
Citation
Journal of Biomedical Informatics, Vol.126, p. 103985
Abstract
Motivation: While drug-food interaction (DFI) may undermine the efficacy and safety of drugs, DFI detection has been difficult because a well-organized database for DFI did not exist. To construct a DFI database and build a natural language processing system that extracts DFI from biomedical articles, we formulated the DFI extraction tasks and manually annotated texts that could have contained DFI information. In this article, we introduce a new annotated corpus for extracting DFI, the DFI corpus. Results: The DFI corpus contains 2270 abstracts of biomedical articles accessible through PubMed and 2498 sentences that contain DFI and/or drug-drug interaction (DDI), together with a substantial amount of information about drug/food entities, evidence levels of abstracts, and relations between named entities. BERT models pre-trained on the biomedical domain achieved an F1 score of 55.0% in extracting DFI key-sentences. To the best of our knowledge, the DFI corpus is the largest public corpus for drug-food interaction. Availability and implementation: Our corpus is available at https://github.com/ccadd-snu/corpus-for-DFI-extraction.
ISSN
1532-0464
URI
https://hdl.handle.net/10371/179516
DOI
https://doi.org/10.1016/j.jbi.2022.103985
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.
