Browse
An annotated corpus from biomedical articles to construct a drug-food interaction database
- Authors
- Kim, Siun ; Choi, Yoona ; Won, Jung-Hyun ; Oh, Jung Mi ; Lee, Howard
- Issue Date
- 2022-02
- Publisher
- Academic Press
- Citation
- Journal of Biomedical Informatics, Vol.126, p. 103985
- Abstract
- Motivation: While drug-food interaction (DFI) may undermine the efficacy and safety of drugs, DFI detection has been difficult because a well-organized database for DFI did not exist. To construct a DFI database and build a natural language processing system extracting DFI from biomedical articles, we formulated the DFI extraction tasks and manually annotated texts that could have contained DFI information. In this article, we introduced a new annotated corpus for extracting DFI, the DFI corpus. Results: The DFI corpus contains 2270 abstracts of biomedical articles accessible through PubMed and 2498 sentences that contain DFI and/or drug-drug information (DDI), a substantial amount of information about drug/ food entities, evidence-levels of abstracts and relations between named entities. BERT models pre-trained on the biomedical domain achieved a F1 score 55.0% in extracting DFI key-sentences. To the best of our knowledge, theDFI corpus is the largest public corpus for drug-food interaction. Availability and implementation: Our corpus is available at https://github. com/ccadd-snu/corpus-for-DFI-extraction.
- ISSN
- 1532-0464
- Files in This Item: There are no files associated with this item.
- Appears in Collections:
- College of Pharmacy (약학대학)Dept. of Pharmacy (약학과)Journal Papers (저널논문_약학과)
Graduate School of Convergence Science and Technology (융합과학기술대학원)Dept. of Molecular and Biopharmaceutical Sciences (분자의학 및 바이오제약학과)Journal Papers (저널논문_분자의학 및 바이오제약학과)
College of Pharmacy (약학대학)Dept. of Pharmacy (약학과)Journal Papers (저널논문_약학과)
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.