Publications
Detailed Information
Comprehensive ensemble in QSAR prediction for drug discovery
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kwon, Sunyoung | - |
dc.contributor.author | Bae, Ho | - |
dc.contributor.author | Jo, Jeonghee | - |
dc.contributor.author | Yoon, Sungroh | - |
dc.date.accessioned | 2020-03-02T07:44:52Z | - |
dc.date.available | 2020-03-02T16:46:28Z | - |
dc.date.issued | 2019-10-26 | - |
dc.identifier.citation | BMC Bioinformatics, 20(1):521 | ko_KR |
dc.identifier.issn | 1471-2105 | - |
dc.identifier.uri | 10.1186/s12859-019-3135-4 | - |
dc.identifier.uri | https://hdl.handle.net/10371/164398 | - |
dc.description.abstract | Background
Quantitative structure-activity relationship (QSAR) is a computational modeling method for revealing relationships between structural properties of chemical compounds and biological activities. QSAR modeling is essential for drug discovery, but it has many constraints. Ensemble-based machine learning approaches have been used to overcome constraints and obtain reliable predictions. Ensemble learning builds a set of diversified models and combines them. However, the most prevalent approach random forest and other ensemble approaches in QSAR prediction limit their model diversity to a single subject. Results The proposed ensemble method consistently outperformed thirteen individual models on 19 bioassay datasets and demonstrated superiority over other ensemble approaches that are limited to a single subject. The comprehensive ensemble method is publicly available at http://data.snu.ac.kr/QSAR/ Conclusions We propose a comprehensive ensemble method that builds multi-subject diversified models and combines them through second-level meta-learning. In addition, we propose an end-to-end neural network-based individual classifier that can automatically extract sequential features from a simplified molecular-input line-entry system (SMILES). The proposed individual models did not show impressive results as a single model, but it was considered the most important predictor when combined, according to the interpretation of the meta-learning. | ko_KR |
dc.description.sponsorship | Publication costs were funded by Seoul National University. This research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) [2014M3C9A3063541, 2018R1A2B3001628], the Brain Korea 21 Plus Project in 2018, and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea [HI15C3224]. The funding bodies did not play any roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. | ko_KR |
dc.language.iso | en | ko_KR |
dc.publisher | BMC | ko_KR |
dc.subject | Ensemble-learning | - |
dc.subject | Meta-learning | - |
dc.subject | Drug-prediction | - |
dc.title | Comprehensive ensemble in QSAR prediction for drug discovery | ko_KR |
dc.type | Article | ko_KR |
dc.contributor.AlternativeAuthor | 권순영 | - |
dc.contributor.AlternativeAuthor | 배호 | - |
dc.contributor.AlternativeAuthor | 조정희 | - |
dc.contributor.AlternativeAuthor | 윤성로 | - |
dc.citation.journaltitle | BMC Bioinformatics | ko_KR |
dc.language.rfc3066 | en | - |
dc.rights.holder | The Author(s) | - |
dc.date.updated | 2019-10-27T06:28:49Z | - |
dc.citation.number | 1 | ko_KR |
dc.citation.startpage | 521 | ko_KR |
dc.citation.volume | 20 | ko_KR |
- Appears in Collections:
- Files in This Item:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.