Adaptive Matching Time Intervals based on Reinforcement Learning for Ride-Hailing Services

신용근

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Adaptive Matching Time Intervals based on Reinforcement Learning for Ride-Hailing Services : 승차 공유 서비스를 위한 강화학습 기반의 적응형 매칭 시간 간격 결정

DC Field	Value	Language
dc.contributor.advisor	김동규	-
dc.contributor.author	신용근	-
dc.date.accessioned	2023-06-29T01:47:43Z	-
dc.date.available	2023-06-29T01:47:43Z	-
dc.date.issued	2023	-
dc.identifier.other	000000175951	-
dc.identifier.uri	https://hdl.handle.net/10371/193008	-
dc.identifier.uri	https://dcollection.snu.ac.kr/common/orgView/000000175951	ko_KR
dc.description	학위논문(석사) -- 서울대학교대학원 : 공과대학 건설환경공학부, 2023. 2. 김동규.	-
dc.description.abstract	Ride-hailing services helped daily travel by efficiently matching passengers and drivers. These services face inefficiency in system operations due to supply and demand imbalances. A widely adopted strategy is fixed batch-based matching, which accumulates requests and idle drivers and matches them in batches. Recent studies have proposed adaptive matching time intervals to consider dynamic supply and demand patterns. However, matching failure factors such as passenger request cancellation and driver acceptance are not considered. This study aims to control adaptive matching time intervals based on reinforcement learning considering matching failure factors. To this end, we propose a two-step framework to maximize the matching success rate. First, an agent based on Deep Q-Network (DQN) determines the matching time interval, and then combinatorial optimization is performed based on the driver's acceptance probability. We conduct experiments on various supply-demand patterns based on synthetic and real datasets and compare performance with previous strategies. We confirmed that the proposed strategy reduces the proportion of expired requests and achieves the highest matching success rate. We also discussed the trade-off between fixed matching time intervals and matching success rates and interpreted agent policies. Our approach provides insight by discussing matching failure factors, which cannot be captured with performance alone.	-
dc.description.abstract	승차 공유 서비스들은 승객과 운전자들을 효율적으로 연결함으로써 일상 생활의 이동에 많은 도움을 주고 있다. 이러한 서비스들은 수요와 공급의 불균형 문제로 인해 시스템 운영 측면에서 비효율적인 상황에 직면한다. 이를 위해 일정한 매칭 시간 간격 동안 승객의 요청과 공차 통행 중인 운전자들을 모아 일괄적으로 매칭하는 전략을 주로 사용한다. 최근에는 수요와 공급의 동적 패턴을 효과적으로 반영하기 위한 적응형 매칭 시간 간격에 대한 연구들이 있었으나, 승객의 요청 취소와 운전자 거부와 같은 매칭 실패 요인들은 간과되었다. 본 연구의 목표는 매칭 실패 요인이 존재하는 상황에서 강화학습 기반의 적응형 매칭 시간 간격을 통해 매칭 성공률을 최대화하는 것이다. 연구 방법은 2단계 프레임워크로 구성된다. 먼저 DQN (Deep Q-Network) 기반의 강화학습 에이전트는 각 매칭 시간 간격마다 배차 행동(Dispatch action)을 결정하며, 이후에는 운전자의 수락확률을 기반으로 한 조합최적화가 수행된다. 실제 데이터셋을 기반으로 한 실험을 통해 이전 전략들과 성능을 비교하고 매칭 실패 요인들에 대한 분석을 수행한다. 실험 결과, 제안된 방법은 대부분의 실험에서 가장 높은 매칭 성공률을 보였다. 구체적으로는 운전자의 미 수락에 의한 만료 요청의 비율을 감소시키며, 승객의 요청 취소 비율을 효율적으로 제어하는 것을 확인했다. 또한 학습된 에이전트의 정책 해석과 집계된 결과의 세분화를 기반으로 추가 분석이 수행되었다. 이러한 접근 방식은 매칭 성공률과 세부적인 매칭 실패 요인들에 대한 논의를 통해 기존 연구에서 간과되었던 통찰력을 제공한다.	-
dc.description.tableofcontents	Chapter 1. Introduction 1 Chapter 2. Literature Review 7 Chapter 3. Methodology 11 3.1. Problem Statement 11 3.2. MDP formulation 13 3.3. Simulation Framework 15 3.4. Deep Q-Networks (DQN) 21 Chapter 4. Results 23 4.1. Data Description 23 4.2. Experimental Setup 26 4.3. Experiments on synthetic datasets 29 4.4. Experiments on real datasets 33 Chapter 5. Conclusion 41 Bibliography 44 Abstract in Korean 50	-
dc.format.extent	iv, 50	-
dc.language.iso	eng	-
dc.publisher	서울대학교 대학원	-
dc.subject	Ride Hailing Service	-
dc.subject	Reinforcement Learning	-
dc.subject	Deep Q-Network (DQN)	-
dc.subject	Combinatorial Optimization	-
dc.subject	Matching Failure	-
dc.subject.ddc	624	-
dc.title	Adaptive Matching Time Intervals based on Reinforcement Learning for Ride-Hailing Services	-
dc.title.alternative	승차 공유 서비스를 위한 강화학습 기반의 적응형 매칭 시간 간격 결정	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	Yonggeun Shin	-
dc.contributor.department	공과대학 건설환경공학부	-
dc.description.degree	석사	-
dc.date.awarded	2023-02	-
dc.contributor.major	교통공학	-
dc.identifier.uci	I804:11032-000000175951	-
dc.identifier.holdings	000000000049▲000000000056▲000000175951▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Civil & Environmental Engineering (건설환경공학부)
  - Theses (Master's Degree_건설환경공학부)

Files in This Item:

000000175951.pdf 1.53 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share