KoGPT-2 생성 문장의 주거 관련 편견 분석 및 감소 방법 연구

서나래

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

KoGPT-2 생성 문장의 주거 관련 편견 분석 및 감소 방법 연구 : Analyzing and mitigating housing-related bias in KoGPT-2

DC Field	Value	Language
dc.contributor.advisor	신효필	-
dc.contributor.author	서나래	-
dc.date.accessioned	2023-04-07T08:31:37Z	-
dc.date.available	2023-04-07T08:31:37Z	-
dc.date.issued	2022	-
dc.identifier.other	000000170271	-
dc.identifier.uri	https://hdl.handle.net/10371/189963	-
dc.identifier.uri	https://dcollection.snu.ac.kr/common/orgView/000000170271	ko_KR
dc.description	학위논문(석사)-- 서울대학교 대학원 : 데이터사이언스학과, 2022. 2. 신효필	-
dc.description.abstract	사전학습된 언어모델은 대용량의 데이터를 그대로 학습시키기 때문에 편견이 있는 문장을 생성하기도 한다. 이 연구에서는 KoGPT-2가 생성하는 문장의 주거 관련 편견을 분석하였다. 네이버 뉴스 및 부동산 카페 댓글로 미세조정한 KoGPT-2가 생성한 문장으로 주거 형태 및 지역에 관한 편견을 분석하였다. 그리고 주거 관련 편견이 있는 문장 생성 억제를 위해 강화학습을 적용하고, 효과를 측정하였다. 감성 스코어 기반의 평균 거리 지표를 기준으로 주거 형태에 관한 편견은 약 73%, 주거 지역에 관한 편견은 약 22% 개선되었다. 또한 긍/부정적 편견 상관없이 효과를 보였다. 강화학습을 이용한 KoGPT-2 생성 문장의 편견을 감소시키는 방안은 기존의 사전학습모델의 아키텍처를 변경하지 않고 미세조정을 하기 때문에 전이학습의 이점을 그대로 유지할 수 있다. 또한 인간이 주석처리하여 생성한 모델을 보상(Reward)으로 사용하기 때문에, 각기 다른 다양한 의도로 KoGPT-2의 문장 생성을 통제할 수 있다는 의의도 있다.	-
dc.description.abstract	Since pretrained language models learn a large amount of data as is, they sometimes generate biased sentences. In this study, housing-related biases in KoGPT-2 were analyzed. The sentences generated by KoGPT-2, which was fine-tuned based on the comments of Naver news and Naver cafe Real estate study, were analyzed for biases towards housing types and regions. In addition, reinforcement learning was applied to mitigate biases in KoGPT-2, and debiasing effects were measured. Based on the average distance of sentiment scores, biases on housing types were mitigated by about 73%, and biases on housing regions were mitigated by about 22%. It worked regardless of positive/negative biases. Debiasing method using reinforcement learning has two advantages; First, architectures of PLM, which is KoGPT-2 in this study, do not need to be changed and second, sentences generated by KoGPT-2 can easily be controlled based on human intentions since model created with human-annotated data is used as a reward function.	-
dc.description.tableofcontents	제 1 장 서 론 1 제 1 절 연구의 배경 1 제 2 절 연구의 내용 3 제 2 장 선행연구 검토 6 제 1 절 편견의 정의 6 제 2 절 언어모델의 편견 연구 7 제 3 절 언어모델의 편견 감소 연구 8 제 4 절 강화학습과 PPO 알고리즘 9 제 3 장 연구방법 17 제 1 절 주거 관련 데이터 수집 17 제 2 절 KoGPT-2 미세조정 22 제 3 절 KoGPT-2 생성 문장 감성분석 24 제 4 절 주거 관련 편견 보유 감성분석 모델 생성 26 제 5 절 강화학습을 이용한 KoGPT-2 미세조정 27 제 6 절 미세조정 전후 편견 보유 문장 생성 비교 31 제 4 장 연구결과 34 제 1 절 KoGPT-2 생성 문장의 주거에 대한 편견 34 제 2 절 강화학습을 이용한 편견 문장 생성 감소 방법 적용 37 제 5 장 결론 44 제 1 절 연구의 의의 및 한계 44 제 2 절 연구 확장 가능성 46 참고문헌 47 Abstract 51	-
dc.format.extent	52	-
dc.language.iso	kor	-
dc.publisher	서울대학교 대학원	-
dc.subject	사전학습모델	-
dc.subject	GPT2	-
dc.subject	편견	-
dc.subject	주거	-
dc.subject	강화학습	-
dc.subject	보상	-
dc.subject.ddc	005	-
dc.title	KoGPT-2 생성 문장의 주거 관련 편견 분석 및 감소 방법 연구	-
dc.title.alternative	Analyzing and mitigating housing-related bias in KoGPT-2	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	SEO, Narae	-
dc.contributor.department	데이터사이언스대학원 데이터사이언스학과	-
dc.description.degree	석사	-
dc.date.awarded	2022-02	-
dc.identifier.uci	I804:11032-000000170271	-
dc.identifier.holdings	000000000047▲000000000054▲000000170271▲	-

Appears in Collections:

Graduate School of Data Science (데이터사이언스 대학원)
- Theses (Master's Degree_데이터사이언스학과)

Files in This Item:

000000170271.pdf 1.48 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share