Publications
Detailed Information
Protein pKa Prediction by Tree-Based Machine Learning : Protein p<i>K</i><sub>a</sub> Prediction by Tree-Based Machine Learning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chen, Ada Y. | - |
dc.contributor.author | Lee, Ju Yong | - |
dc.contributor.author | Damjanovic, Ana | - |
dc.contributor.author | Brooks, Bernard R. | - |
dc.date.accessioned | 2024-05-13T04:59:50Z | - |
dc.date.available | 2024-05-13T04:59:50Z | - |
dc.date.created | 2024-05-13 | - |
dc.date.issued | 2022-04 | - |
dc.identifier.citation | Journal of Chemical Theory and Computation, Vol.18 No.4, pp.2673-2686 | - |
dc.identifier.issn | 1549-9618 | - |
dc.identifier.uri | https://hdl.handle.net/10371/201507 | - |
dc.description.abstract | Protonation states of ionizable protein residuesmodulate many essential biological processes. For correct modelingand understanding of these processes, it is crucial to accuratelydetermine their pKavalues. Here, we present four tree-basedmachine learning models for protein pKaprediction. The fourmodels, Random Forest, Extra Trees, eXtreme Gradient Boosting(XGBoost), and Light Gradient Boosting Machine (LightGBM),were trained on three experimental PDB and pKadatasets, two ofwhich included a notable portion of internal residues. We observedsimilar performance among the four machine learning algorithms.The best model trained on the largest dataset performs 37% betterthan the widely used empirical pKaprediction tool PROPKA and15% better than the published result from the pKapredictionmethod DelPhiPKa. The overall root-mean-square error (RMSE) for this model is 0.69, with surface and buried RMSE values being0.56 and 0.78, respectively, considering six residue types (Asp, Glu, His, Lys, Cys, and Tyr), and 0.63 when considering Asp, Glu,His, and Lys only. We provide pKapredictions for proteins in human proteome from the AlphaFold Protein Structure Database andobserved that 1% of Asp/Glu/Lys residues have highly shifted pKavalues close to the physiological pH. | - |
dc.language | 영어 | - |
dc.publisher | American Chemical Society | - |
dc.title | Protein pKa Prediction by Tree-Based Machine Learning | - |
dc.title.alternative | Protein pKa Prediction by Tree-Based Machine Learning | - |
dc.type | Article | - |
dc.identifier.doi | 10.1021/acs.jctc.1c01257 | - |
dc.citation.journaltitle | Journal of Chemical Theory and Computation | - |
dc.identifier.wosid | 000789656500050 | - |
dc.identifier.scopusid | 2-s2.0-85127436337 | - |
dc.citation.endpage | 2686 | - |
dc.citation.number | 4 | - |
dc.citation.startpage | 2673 | - |
dc.citation.volume | 18 | - |
dc.description.isOpenAccess | Y | - |
dc.contributor.affiliatedAuthor | Lee, Ju Yong | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.subject.keywordPlus | PH MOLECULAR-DYNAMICS | - |
dc.subject.keywordPlus | POISSON-BOLTZMANN EQUATION | - |
dc.subject.keywordPlus | SMOOTH DIELECTRIC FUNCTION | - |
dc.subject.keywordPlus | CONSTANT-PH | - |
dc.subject.keywordPlus | EXPLICIT SOLVENT | - |
dc.subject.keywordPlus | HYDROPHOBIC INTERIOR | - |
dc.subject.keywordPlus | IONIZABLE RESIDUES | - |
dc.subject.keywordPlus | STRUCTURAL-CHANGES | - |
dc.subject.keywordPlus | PROTEIN PK(A) | - |
dc.subject.keywordPlus | CONFORMATIONAL FLEXIBILITY | - |
- Appears in Collections:
- Files in This Item:
- There are no files associated with this item.
Related Researcher
- Graduate School of Convergence Science & Technology
- Dept. of Molecular and Biopharmaceutical Sciences
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.