Browse

An annotated corpus from biomedical articles to construct a drug-food interaction database

Cited 0 time in Web of Science Cited 0 time in Scopus
Issue Date
2022-02
Publisher
Academic Press
Citation
Journal of Biomedical Informatics, Vol.126, p. 103985
Abstract
Motivation: While drug-food interaction (DFI) may undermine the efficacy and safety of drugs, DFI detection has been difficult because a well-organized database for DFI did not exist. To construct a DFI database and build a natural language processing system extracting DFI from biomedical articles, we formulated the DFI extraction tasks and manually annotated texts that could have contained DFI information. In this article, we introduced a new annotated corpus for extracting DFI, the DFI corpus. Results: The DFI corpus contains 2270 abstracts of biomedical articles accessible through PubMed and 2498 sentences that contain DFI and/or drug-drug information (DDI), a substantial amount of information about drug/ food entities, evidence-levels of abstracts and relations between named entities. BERT models pre-trained on the biomedical domain achieved a F1 score 55.0% in extracting DFI key-sentences. To the best of our knowledge, theDFI corpus is the largest public corpus for drug-food interaction. Availability and implementation: Our corpus is available at https://github. com/ccadd-snu/corpus-for-DFI-extraction.
ISSN
1532-0464
URI
https://hdl.handle.net/10371/179516
Files in This Item:
There are no files associated with this item.
Appears in Collections:
College of Pharmacy (약학대학)Dept. of Pharmacy (약학과)Journal Papers (저널논문_약학과)
Graduate School of Convergence Science and Technology (융합과학기술대학원)Dept. of Molecular and Biopharmaceutical Sciences (분자의학 및 바이오제약학과)Journal Papers (저널논문_분자의학 및 바이오제약학과)
College of Pharmacy (약학대학)Dept. of Pharmacy (약학과)Journal Papers (저널논문_약학과)
  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse