Publications

Detailed Information

Joint Rectification and Stitching of Images Formulated as Camera Pose Estimation Problems

DC Field Value Language
dc.contributor.advisor조남익-
dc.contributor.author안재현-
dc.date.accessioned2017-07-13T07:10:22Z-
dc.date.available2017-07-13T07:10:22Z-
dc.date.issued2015-08-
dc.identifier.other000000056836-
dc.identifier.urihttps://hdl.handle.net/10371/119104-
dc.description학위논문 (박사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2015. 8. 조남익.-
dc.description.abstractThis dissertation presents a study of image rectification and stitching problems formulated as camera pose estimation problems. There have been many approaches to the rectification and/or stitching of images for their importance in image processing and computer vision areas. This dissertation adds a new approach to these problems, which finds appropriate optimization problems whose solutions give camera pose parameters for the given problems. Specifically, the contribution of this dissertation is to develop (i) a new optimization problem that can handle image rectification and stitching in a unified framework through the pose estimation formulation, and (ii) a new approach to planar object rectification problem which is also formulated as an optimal homography estimation problem.
First, a unified framework for the image rectification and stitching problem is studied, which can handle both assumptions or conditions that (i) the optical center of camera is fixed or (ii) the camera captures a plane target. For this, the camera pose is modeled with six parameters (three for the rotation and three for the translation) and a cost function is developed that reflects the registration errors on a reference plane (image stitching results). The designed cost function is effectively minimized via the Levenberg-Marquardt algorithm. From the estimated camera poses, the relative camera motion is computed: when the optical center is moved (i.e., the camera motion is large), metric rectification is possible and thus provides rectified composites as well as camera poses are obtained.
Second, this dissertation presents a rectification method for planar objects using line segments which can be augmented to the previous problem for further rectification or performed independently to single images when there are planar objects in the image such as building facades or name cards. Based on the 2D Manhattan world assumption (i.e., the majority of line segments are aligned with principal axes), a cost function is formulated as an optimal homography estimation problem that makes the line segments horizontally or vertically straight. Since there are outliers in the line segment detection, an iterative optimization scheme for the robust estimation is also developed.
The application of the proposed methods is the stitching of many images of the same scene into a high resolution image along with its rectification. Also it can be applied to the rectification of building facades, documents, name cards, etc, which helps the optical character recognition (OCR) rates of texts in the scene and also to improve the recognition of buildings and visual qualities of scenery images. In addition, this dissertation finally presents an application of the proposed method for finding boundaries of document in videos for mobile device based application. This is a challenging problem due to perspective distortion, focus and motion blur, partial occlusion, and so on. For this, a cost function is formulated which comprises a data term (color distributions of the document and background), boundary term (alignment and contrast errors after the contour of the documents is rectified), and temporal term (temporal coherence in consecutive frames).
-
dc.description.tableofcontents1 Introduction 1
1.1 Background 1
1.2 Contributions 2
1.3 Homography between the i-th image and pi_E 4
1.4 Structure of the dissertation 5
2 A unified framework for automatic image stitching and rectification 7
2.1 Related works 7
2.2 Proposed cost function and its optimization 8
2.2.1 Proposed cost function 12
2.2.2 Optimization 13
2.2.3 Relation to the model in [1] 14
2.3 Post-processing 15
2.3.1 Classification of the conditions 15
2.3.2 Skew removal 16
2.4 Experimental results 18
2.4.1 Quantitative evaluation on metric reconstruction performance 19
2.4.2 Determining the capturing environment 21
2.4.3 Experiments on real images 25
2.4.4 Applications to document image stitching and more results 28
2.5 Summary 28
3 Rectification of planar targets based on line segments 31
3.1 Related works 31
3.1.1 Rectification of planar objects 32
3.1.2 Rectification based on self calibration 33
3.2 Proposed rectification model 33
3.2.1 Optimization-based framework 36
3.2.2 Cost function based on line segment alignments 37
3.2.3 Optimization 38
3.3 Experimental results 40
3.3.1 Evaluation metrics 40
3.3.2 Quantitative evaluation 41
3.3.3 Computation complexity 45
3.3.4 Qualitative comparisons and limitations 45
3.4 Summary 52
4 Application: Document capture system for mobile devices 53
4.1 Related works 53
4.2 The proposed method 54
4.2.1 Notation 54
4.2.2 Optimization-based framework 55
4.3 Experimental results 62
4.3.1 Initialization 65
4.3.2 Quantitative evaluation 65
4.3.3 Qualitative evaluation and limitations 66
4.4 Summary 67
5 Conclusions and future works 75
Bibliography 77
Abstract (Korean) 83
-
dc.formatapplication/pdf-
dc.format.extent37581294 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectimage stitching-
dc.subjectimage rectification-
dc.subjectcamera pose estimation-
dc.subjectdocument detection and segmentation-
dc.subject.ddc621-
dc.titleJoint Rectification and Stitching of Images Formulated as Camera Pose Estimation Problems-
dc.typeThesis-
dc.description.degreeDoctor-
dc.citation.pagesxv, 86-
dc.contributor.affiliation공과대학 전기·컴퓨터공학부-
dc.date.awarded2015-08-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share