S-Space College of Business Administration/Business School (경영대학/대학원) Dept. of Business Administration (경영학과) Theses (Master's Degree_경영학과)
An algorithm for Finding a Relationship Between Entities : Semi-Automated Schema Integration Approach
- 경영대학 경영학과
- Issue Date
- 서울대학교 대학원
- Schema Integration ; Naming Conflicts ; Natural Language Processing ; XML ; Entity Relationship Diagram (ERD)
- 학위논문 (석사)-- 서울대학교 대학원 경영대학 경영학과, 2017. 8. 박진수.
- Database schema integration is a very important issue in information systems. Since schema integration is a time-consuming and labor-intensive task, many studies have attempted to automate this task. In the meantime, the researchers used xml as the source schema and still left much of the work to be done through DBA intervention. For example, there are various naming conflicts related to relationship names in schema integration. In the past, the DBA had to intervene to resolve the naming conflict name. In this paper, we introduce an algorithm that automatically generates relationship names to resolve relationship names conflicts that occur during schema integration. This algorithm is based on Internet collocation dictionary and english sentence example dictionary. The relationship between the two entities is generated by analyzing examples extracted based on dictionary data through natural language processing. By building a semi-automated schema integration system and testing this algorithm, we found that it showed about 90% accuracy. Using this algorithm, we can resolve the problems related to naming conflicts that occur at schema integration automatically without DBA intervention.