Publications

Detailed Information

Utilizing Genetic Algorithm to LambdaMART Forests to Predict Ranking Labels Accurately : 랭킹 라벨을 정확히 예측하기 위한 유전 알고리즘의 LambdaMART 포레스트에 대한 적용

DC Field Value Language
dc.contributor.advisorSrinivasa Rao Satti-
dc.contributor.author정원일-
dc.date.accessioned2017-10-31T07:40:43Z-
dc.date.available2017-10-31T07:40:43Z-
dc.date.issued2017-08-
dc.identifier.other000000145305-
dc.identifier.urihttps://hdl.handle.net/10371/137440-
dc.description학위논문 (석사)-- 서울대학교 대학원 공과대학 컴퓨터공학부, 2017. 8. Srinivasa Rao Satti.-
dc.description.abstractIn this thesis, principles of genetic algorithm (GA) will be applied to forests of LambdaMART to get more accurate ranking results. Ranking problem is considered one kind of prediction function problems, and various solutions were proposed for the ranking problem. Applying machine learning techniques has improved ranking quality of algorithm. One of the techniques is ensemble of decision tree learning where each tree is trained one by one and these trees are used to predict the result with the given input values.
LambdaMART is a fusion of LambdaRank and MART (Multiple Additive Regression Trees), where gradients of scores are calculated by LambdaRank and multiple trees are generated and trained with predefined steps in MART. LambdaMART is also main contributor for the winner of ``Yahoo! Learning to Rank Challenge (2010)" though the challenge reports that ranking solution performance has reached saturation point. However, LambdaMART might have problems about overfitting to training data, which means it could not predict outcome precisely on other unobserved data after being trained with data. In addition, genetic algorithm can provide greater searching ability for solution space though the ability depends on designing core operations such as crossover, mutation, and so on.
Combining this search ability with LambdaMART could enhance solution's quality and reduce some chance of overfitting to training data. Each LambdaMART forest will become a chromosome in this scheme, and multiple forests will be operands of genetic operations. This scheme shows higher accuracy measure value than original LambdaMART and total training time per forest has also been saved.
-
dc.description.tableofcontentsChapter 1 Introduction 1
Chapter 2 Background 3
2.1 Information Retrieval: Ranking 3
2.1.1 Ranking Problem 3
2.1.2 Ranking Measures 4
2.2 Classication and Regression Trees 7
2.3 Genetic Algorithm (GA) 7
2.3.1 Selection 8
2.3.2 Crossover 9
2.3.3 Mutation 10
2.3.4 Replacement 10
Chapter 3 Related Work 11
3.1 RankNet 11
3.2 LambdaRank 13
3.3 MART (Multiple Additive Regression Tree) 14
3.4 LambdaMART 15
Chapter 4 LambdaMART with GA 17
4.1 Overview 17
4.2 Genetic Operations 18
4.2.1 Selection 19
4.2.2 Crossover 19
4.2.3 Mutation 20
4.2.4 Replacement 21
Chapter 5 Experimental Results 22
5.1 System Settings and Datasets 22
5.2 Implementation 23
5.3 Results 23
Chapter 6 Conclusion 31
Bibliography 32
요약 35
Acknowledgements 36
-
dc.formatapplication/pdf-
dc.format.extent2418835 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectLambdaMART-
dc.subjectGenetic Algorithm-
dc.subjectLearning-
dc.subjectRanking-
dc.subjectRegression tree-
dc.subject.ddc621.39-
dc.titleUtilizing Genetic Algorithm to LambdaMART Forests to Predict Ranking Labels Accurately-
dc.title.alternative랭킹 라벨을 정확히 예측하기 위한 유전 알고리즘의 LambdaMART 포레스트에 대한 적용-
dc.typeThesis-
dc.contributor.AlternativeAuthorJeong Wonil-
dc.description.degreeMaster-
dc.contributor.affiliation공과대학 컴퓨터공학부-
dc.date.awarded2017-08-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share