Publications
Detailed Information
An annotated corpus from biomedical articles to construct a drug-food interaction database
Cited 5 time in
Web of Science
Cited 5 time in Scopus
- Authors
- Issue Date
- 2022-02
- Publisher
- Academic Press
- Citation
- Journal of Biomedical Informatics, Vol.126, p. 103985
- Abstract
- Motivation: While drug-food interaction (DFI) may undermine the efficacy and safety of drugs, DFI detection has been difficult because a well-organized database for DFI did not exist. To construct a DFI database and build a natural language processing system extracting DFI from biomedical articles, we formulated the DFI extraction tasks and manually annotated texts that could have contained DFI information. In this article, we introduced a new annotated corpus for extracting DFI, the DFI corpus. Results: The DFI corpus contains 2270 abstracts of biomedical articles accessible through PubMed and 2498 sentences that contain DFI and/or drug-drug information (DDI), a substantial amount of information about drug/ food entities, evidence-levels of abstracts and relations between named entities. BERT models pre-trained on the biomedical domain achieved a F1 score 55.0% in extracting DFI key-sentences. To the best of our knowledge, theDFI corpus is the largest public corpus for drug-food interaction. Availability and implementation: Our corpus is available at https://github. com/ccadd-snu/corpus-for-DFI-extraction.
- ISSN
- 1532-0464
- Files in This Item:
- There are no files associated with this item.
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.