Comprehensive ensemble in QSAR prediction for drug discovery

Kwon, Sunyoung; Bae, Ho; Jo, Jeonghee; Yoon, Sungroh

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Comprehensive ensemble in QSAR prediction for drug discovery

DC Field	Value	Language
dc.contributor.author	Kwon, Sunyoung	-
dc.contributor.author	Bae, Ho	-
dc.contributor.author	Jo, Jeonghee	-
dc.contributor.author	Yoon, Sungroh	-
dc.date.accessioned	2020-03-02T07:44:52Z	-
dc.date.available	2020-03-02T16:46:28Z	-
dc.date.issued	2019-10-26	-
dc.identifier.citation	BMC Bioinformatics, 20(1):521	ko_KR
dc.identifier.issn	1471-2105	-
dc.identifier.uri	10.1186/s12859-019-3135-4	-
dc.identifier.uri	https://hdl.handle.net/10371/164398	-
dc.description.abstract	Background Quantitative structure-activity relationship (QSAR) is a computational modeling method for revealing relationships between structural properties of chemical compounds and biological activities. QSAR modeling is essential for drug discovery, but it has many constraints. Ensemble-based machine learning approaches have been used to overcome constraints and obtain reliable predictions. Ensemble learning builds a set of diversified models and combines them. However, the most prevalent approach random forest and other ensemble approaches in QSAR prediction limit their model diversity to a single subject. Results The proposed ensemble method consistently outperformed thirteen individual models on 19 bioassay datasets and demonstrated superiority over other ensemble approaches that are limited to a single subject. The comprehensive ensemble method is publicly available at http://data.snu.ac.kr/QSAR/ Conclusions We propose a comprehensive ensemble method that builds multi-subject diversified models and combines them through second-level meta-learning. In addition, we propose an end-to-end neural network-based individual classifier that can automatically extract sequential features from a simplified molecular-input line-entry system (SMILES). The proposed individual models did not show impressive results as a single model, but it was considered the most important predictor when combined, according to the interpretation of the meta-learning.	ko_KR
dc.description.sponsorship	Publication costs were funded by Seoul National University. This research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) [2014M3C9A3063541, 2018R1A2B3001628], the Brain Korea 21 Plus Project in 2018, and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea [HI15C3224]. The funding bodies did not play any roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.	ko_KR
dc.language.iso	en	ko_KR
dc.publisher	BMC	ko_KR
dc.subject	Ensemble-learning	-
dc.subject	Meta-learning	-
dc.subject	Drug-prediction	-
dc.title	Comprehensive ensemble in QSAR prediction for drug discovery	ko_KR
dc.type	Article	ko_KR
dc.contributor.AlternativeAuthor	권순영	-
dc.contributor.AlternativeAuthor	배호	-
dc.contributor.AlternativeAuthor	조정희	-
dc.contributor.AlternativeAuthor	윤성로	-
dc.citation.journaltitle	BMC Bioinformatics	ko_KR
dc.language.rfc3066	en	-
dc.rights.holder	The Author(s)	-
dc.date.updated	2019-10-27T06:28:49Z	-
dc.citation.number	1	ko_KR
dc.citation.startpage	521	ko_KR
dc.citation.volume	20	ko_KR

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Electrical and Computer Engineering (전기·정보공학부)
  - Journal Papers (저널논문_전기·정보공학부)

College of Natural Sciences (자연과학대학)
- Program in Bioinformatics (협동과정-생물정보학전공)
  - Journal Papers (저널논문_협동과정-생물정보학전공)

Files in This Item:

12859_2019_Article_3135.pdf 1.15 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share