Publications

Detailed Information

Development of Prediction Models using Clustering

DC Field Value Language
dc.contributor.advisor박태성-
dc.contributor.author민병주-
dc.date.accessioned2017-07-19T08:44:19Z-
dc.date.available2017-07-19T08:44:19Z-
dc.date.issued2014-02-
dc.identifier.other000000017456-
dc.identifier.urihttps://hdl.handle.net/10371/131277-
dc.description학위논문 (석사)-- 서울대학교 대학원 : 통계학과, 2014. 2. 박태성.-
dc.description.abstractRepeated measures data have been commonly generated in many clinical studies.
One of main objectives of the repeated measures data analysis is to predict a future outcome from the previously observed values. Repeated measures data often show certain patterns over time which can be easily checked by simple scatter plots. In this paper, we demonstrate that the use pattern information increases the accuracy of prediction in repeated measures data analysis. We propose to make a prediction model first by clustering data patterns via clustering methods and later by adding this clustering information into our model as a variable. We illustrate our approach using a real clinical data for bipolar patients. One of the clinical outcomes is Clinical Global Impression (CGI) value to predict patients the extent of depression. We chose the better measure from calculated distances, Euclidean and 1-corr, between individual CGI values and clustered them by hierarchical clustering methods. Then we developed best prediction model of the extent of depression via above results. Here, we used linear mixed effects model in order to consider the effect of individual by adjustment of random effect. In terms of relative quality and prediction, our proposed method outperformed the models without clustering information according to AIC standards and prediction error.
-
dc.description.tableofcontentsContents
1 Introduction ·························································································1
2 Subjects and methods ······································································3
2. 1. Study Subjects ··············································································································3
2. 1. 1. Education Status ······································································································3
2. 1. 2. Depression counts ···································································································4
2. 2. Statistical Method ·······································································6
2. 2. 1 Distance ······················································································································6
2. 2. 2. Clustering method ···································································································6
2. 2. 3. Silhouette ···················································································································7
2. 2 .4. Selection of the number of clusters via the bootstrap ································ 7
2. 2. 5. Linear Mixed Effects Model ················································································8
3. Results ·······························································································10
4. Discussion ························································································17
5. Reference ··························································································18
Abstract in Korean ·············································································21
-
dc.formatapplication/pdf-
dc.format.extent742223 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectclustering method-
dc.subjectlinear mixed effects model-
dc.subjectprediction model-
dc.subject.ddc519-
dc.titleDevelopment of Prediction Models using Clustering-
dc.typeThesis-
dc.description.degreeMaster-
dc.citation.pagesⅲ,21-
dc.contributor.affiliation자연과학대학 통계학과-
dc.date.awarded2014-02-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share