Development on sensing, prediction and control system of greenhouse environment using machine learning

Abstract: Crop production through facility cultivation has developed continuously over the past several years by ensuring a year round stable and high-quality productivity. Currently, the area and total production of domestic crop cultivation facilities are 52,571 ha and 2,441,000 tons in 2020, respectively (MAFRA, 2022). Smart farm greenhouses with the provision of precise monitoring and regulation of the internal environment are rapidly expanding. However, one major problem encountered by many farmers is the difficulty to secure installation costs of smart farms. Also, most domestic greenhouses are not compatible for the installation of numerous sensors and control systems, so an appropriate sensors and control system for monitoring and controlling the internal environment of greenhouses are needed.
The goal of this study was to develop a system that could monitor and predict the internal environment of naturally ventilated greenhouses. With this three machine learning models have been developed to serve different purpose. The prediction current air temperature using optimal sensor machine learning model (PCTO-ML) of Chapter 3 selected the optimal sensor location of air temperature. The prediction future air temperature using optimal sensor machine learning model (PFTO-ML) of Chapter 4 predicted the air temperature inside the greenhouse in the near future for preemptive control of the air conditioning system. Additionally, the optimal sensor location of PCTO-ML was introduced into the second model to minimize the number of sensors required for future air temperature prediction. The prediction local ventilation rate by CFD driven machine learning model (PLV-CFD driven ML) predicted natural ventilation.
In Chapter 2, the necessity and direction of this dissertation were presented through comprehensive review and analysis of the methodology and limitations of related previous studies. To establish the research methodology, research works related to importance of the greenhouse internal growth environment; greenhouse ventilation evaluation method; optimal sensor location for monitoring the greenhouse internal environment; and application of machine learning techniques in agriculture were reviewed.
In Chapter 3, the optimal sensor location was selected among the nine grid-shaped sensor locations for monitoring the internal air temperature of the naturally ventilated greenhouse using a machine learning model. To build a dataset of this model, environmental factors were collected and were preprocessed through linear interpolation to get rid of missing values. The machine learning model for the prediction of the internal air temperature of a natural ventilation greenhouse was developed from various models such as artificial neural networks (ANN), support vector regression (SVR), and long short-term memory (LSTM). Comparing the predictive performance of these machine learning models, the LSTM model had the highest accuracy (R2 = 0.974, RMSE = 0.024, and P-RMSE = 0.458). The environmental data measured at sensor 5 showed the highest accuracy in predicting air temperature for each sensor installation location in the greenhouse (R2 = 0.984, RMSE = 0.019, and P-RMSE = 0.365). Finally, in order to minimize the kind of sensor required for the machine learning model, the simplified LSTM models with reduced learning features were developed and it also proposed the optimal sensor as sensor 5.
In Chapter 4, the future air temperature inside the natural ventilation greenhouse was predicted using a machine learning model. The process of collecting and preprocessing learning data for model development followed the same procedure as in Chapter 3. Therefore, the accuracy of the LSTM model according to the sequence length was evaluated, and 30 minutes was evaluated as the optimal sequence length. The results of predicting the future air temperature at each location in the greenhouse using the LSTM model with a sequence length of 30 minutes showed high prediction accuracy of R2 > 0.95, and RMSE < 0.65. In order to minimize the installation of sensors in the greenhouse, the predictive accuracy of the LSTM model was evaluated by applying the optimal sensor location suggested in Chapter 3. The results showed a relatively large decrease in prediction accuracy when one optimal sensor was applied (R2=0.918), but the prediction accuracy was similar to that of using all nine sensors (R2=0.950) when three optimal sensors were applied (R2=0.939). Therefore, it was recommended to apply at least three optimal sensor locations for future air temperature prediction.
In Chapter 5, the ventilation rate in naturally ventilated greenhouses by region was predicted using a machine learning model build from the results of Computational Fluid Dynamics (CFD) simulation which were used as learning features. In order to generate learning data, 210 cases of CFD simulation for 10 wind speeds, 7 wind directions, and 3 greenhouse ventilation window types were performed. Multi linear regression (MLR), SVR, Random Forest, and deep neural network (DNN) were developed for machine learning models, and the optimization of hyper-parameters for each machine learning model was performed. For each optimized machine learning model, the bootstrapping technique was applied to supplement a relatively small number of learning data. The results show that models such as RF, DNN and SVR using the rbf kernel function, which have already shown high accuracy (R2 > 0.9), were less accurate after applying bootstrapping. while MLR and SVR models which used poly-kernel functions have improved R2. Finally, in order to minimize the simulation cases, which significant amount of time, the accuracy of the machine learning model was evaluated according to the reduction of the CFD simulation. Result showed that most machine learning models have high accuracy for the learning data considered from 120 cases with only 4 cases of wind direction. The RMSE value was also evaluated low for the model indicating the lowest error for the predicted value.
The greenhouse environment prediction and control system proposed in this dissertation predicted the current (PCTO-ML) and future (PETO-ML) air temperature inside the greenhouse and local ventilation rate (PLV-CFD driven ML). When the three models constituting the system are linked, it is of great significance that it is possible to predict the internal air temperature of the natural ventilation greenhouse and to control the proper ventilation of the predicted air temperature. In particular, the optimal sensor location proposed through the PCTO-ML model is expected to contribute to improving farmers' income by reducing the initial and maintenance costs of the greenhouse. In addition, the PVL-CFD driven ML model was able to predict the natural ventilation rate for all wind environment conditions because it was possible to learn about environmental conditions that were difficult to measure due to the small frequency of occurrence. Since only minimum number of sensors are required for the development and operation of the system, it is economical to introduce these models and is expected to be applicable to most domestic greenhouses where air temperature sensors are installed.
시설 재배를 통한 작물 생산은 연중 안정된 생산성과 고품질의 작물을 생산함으로써 지난 수년간 지속적으로 발전해오고 있다. 현재 국내 채소 재배 시설 면적과 총 생산량은 각각 2020년 52,571 ha와 2,441,000 ton으로 1970년대부터 현재까지 그 규모가 꾸준히 증가해 오고 있는 추세이다 (MAFRA, 2022). 또한 ICT 기술의 발전과 각종 센서 및 제어 시스템의 개발로 인해 온실 내부 환경에 대해 정밀한 모니터링과 제어가 가능한 스마트팜 온실이 빠르게 보급되고 있다. 하지만 스마트팜을 도입한 많은 농가에서 설치비용의 확보에 어려움을 느끼고 있으며, 센서 및 장비의 잦은 고장과 스마트팜 활용의 어려움을 가지고 있는 것으로 나타났다.
본 논문의 최종 목표는 자연환기 온실의 내부 환경을 모니터링하고 예측할 수 있는 모델을 개발하는 것이다. 따라서 온실 내부의 가장 중요한 환경 요소인 공기 온도의 최적 센서 위치를 선정하기 위한 기계학습 모델을 개발하였다. 현재의 온실 내부 공기 온도를 모니터링하는 것 뿐만 아니라 가까운 미래의 온실 내부 공기 온도를 예측하여 공조시스템의 선제적 제어에 활용될 수 있는 기계학습 모델을 개발하였다. 또한 예측된 온실 내부 공기 온도를 작물의 적정 생육 환경으로 조절할 수 있는 가장 기본적인 방법인 자연환기를 예측한 기계학습 모델을 개발하였다.
2장에서는, 선행 연구의 방법론, 한계점의 분석을 통해 연구의 필요성과 연구 방향성의 기틀을 다지고, 연구 방법론의 적립을 위해 온실 내부 생육 환경의 중요성, 온실 환기 평가 방법, 온실의 내부 환경 모니터링을 위한 최적 센서 위치, 농업 분야에서의 기계학습 기법 적용에 대한 연구사를 검토하였다.
3장에서는 기계학습 모델을 이용하여 자연환기 온실의 내부 공기 온도 모니터링을 위한 최적 센서 위치를 선정하였다. 이를 위하여 현장 실험을 통해 자연환기식 온실 내부의 환경 요소들을 수집하였으며, 기계학습 모델의 예측 정확도를 높이기 위하여 일련의 전처리 과정을 수행하였다. 자연환기 온실의 내부 공기 온도 예측을 위한 기계학습 모델은 ANN, SVR, LSTM을 개발하였다. 기계학습 모델들의 예측 성능을 비교한 결과 LSTM이 가장 높은 정확도(R2 = 0.974, RMSE = 0.024, P-RMSE = 0.458)를 보이는 것으로 평가되었다. 따라서 LSTM 모델을 이용하여 온실 내 공기 온도 모니터링을 위한 최적 센서 위치를 평가하였으며, 온실 중앙 위치인 5번 센서 위치에서 측정한 환경데이터를 학습할 경우, 온실 내부의 각 센서 설치 위치에 대한 공기 온도 예측에 가장 높은 정확도를 보였다 (R2 = 0.984, RMSE = 0.019, P-RMSE = 0.365). 마지막으로 기계학습 모델에 필요한 센서의 종류를 최소화하기 위하여 학습 인자를 줄인 Simplified LSTM 모델을 개발하였으며, 이때의 최적 센서 위치는 5번으로 평가되었다.
4장에서는 기계학습 모델을 이용하여 자연환기 온실 내부의 미래 공기 온도를 예측하였다. LSTM 모델이 개발되었으며, 모델 개발을 위한 학습 데이터의 수집과 전처리 과정은 3장과 동일하다. 단, LSTM 모델의 미래 상황 예측 시 입력값의 단위에 해당하는 시퀀스 길이에 따른 LSTM 모델의 정확도를 평가하였으며, 30 분이 최적 시퀀스 길이로 산정되었다. LSTM 모델을 이용하여 온실 내 각 위치에서의 미래 공기 온도를 예측한 결과 대부분 R2 > 0.85, RMSE < 0.65의 높은 예측 정확도를 나타내었다. Chapter 3에서 제시한 최적 센서 위치를 적용하여 LSTM 모델의 예측 정확도를 평가하였다. 그 결과 1개의 최적 센서를 적용할 경우 예측 정확도 (R2=0.918)의 상대적으로 큰 감소가 나타났지만, 3개의 최적 센서를 적용할 경우 (R2=0.939) 센서 9개를 모두 이용한 경우 (R2=0.950)와 유사한 예측 정확도를 나타냈다. 따라서 미래 공기 온기 예측을 위해 최소 3개의 최적 센서 위치를 적용할 것이 추천된다.
5장에서는 CFD 시뮬레이션의 연산 결과를 학습한 기계학습 모델을 이용하여 자연환기 온실의 지역별 환기량을 예측하였다. 학습 데이터의 생성을 위하여 10 가지 풍속, 7 가지 풍향, 3 가지 온실 환기창 타입에 대한 CFD 시뮬레이션을 수행하였다. 기계학습 모델은 MLR, SVR, Random forest, DNN이 개발되었으며, 각각의 기계학습 모델에 대한 hyper-parameter의 최적화를 수행하였다. 그 후 상대적으로 적은 수의 학습 데이터를 보완하기 위하여 Bootstrapping 기법을 적용하여 학습하였다. 그 결과 Bootstrapping 기법 적용 전 높은 예측 성능(R2>0.9)을 나타내었던 RF, DNN, rbf kernel 함수를 이용하는 SVR 모델은 R2가 감소한 반면, 상대적으로 예측 성능이 낮은(R2<0.9) 모델인 MLR과 poly kernel 함수를 이용하는 SVR 모델의 경우 R2가 개선되는 것으로 나타났다. 최종적으로 상대적으로 연산에 많은 시간을 필요로하는 CFD 시뮬레이션의 연산 case의 최소화를 위하여 CFD 시뮬레이션 case의 감소에 따른 기계학습 모델의 정확도를 평가하였다. 그 결과 대부분의 기계학습모델에서 풍향을 4 case만을 고려하여 총 120 case가 고려된 학습데이터에 대하여 가장 높은 정확도를 나타냈다. 뿐만 아니라 RMSE 값 또한 낮게 평가되어 예측값에 대한 오차가 가장 낮은 것으로 나타났다.
본 논문은 3가지 기계학습 모델(PCTO-ML, PETO-ML, PLV-CFD driven ML)로 구성된 온실 내부환경 예측 및 제어 시스템을 통하여 온실 내부의 현재 (PCTO-ML)와 미래 공기온도(PETO-ML)를 예측하며, 온실 내부의 지역적 환기율을 예측(PVL-CFD driven ML)함으로써 예측한 공기 온도의 적절한 제어가 가능하다는 것에 큰 의의가 있다고 판단된다. 특히 PCTO-ML 모델을 통하여 제안한 온실 내부 공기 온도 예측에 필요한 최적 센서 위치는 온실의 초기비용 및 유지보수 비용을 줄이므로써 농가의 소득 향상에 기여할 수 있을 것으로 기대된다. 또한 PVL-CFD driven ML 모델은 발생 빈도가 작아 실측이 어려운 환경조건에 대해서도 학습이 가능했기 때문에 모든 풍환경 조건에 대한 자연환기율을 예측할 수 있었다. 본 논문에서 제안한 온실 내부환경 예측 및 제어 시스템은 시스템의 개발과 운영을 위해 최소한의 온도 센서만을 필요로 하기 때문에 시스템 도입에 경제성을 가지며, 온도 센서가 설치되어 있는 국내 대부분의 온실에도 적용이 가능할 것으로 판단된다.

Language: eng

URI: https://hdl.handle.net/10371/204011

https://dcollection.snu.ac.kr/common/orgView/000000174815

Files in This Item:

000000174815.pdf 3.94 MB

Appears in Collections:

College of Agriculture and Life Sciences (농업생명과학대학)
- Dept. of Landscape Architecture and Rural System Engineering (생태조경·지역시스템공학부)
  - Theses (Ph.D. / Sc.D._생태조경·지역시스템공학부)

Altmetrics

Item View & Download Count

Show Full Item Record

Find it @ SNU

트윗하기

SNS Share