Status of ICT equipment in regional Hanwoo industry
Introduction
According to a recent trend report of the Hanwoo industry in Gyeongsangbuk-do, the total number of Hanwoo farms has decreased from 20,268 in 2017 to 19,158 in 2020. However, the total number of Hanwoo has increased from 615,929 in 2017 to 694,855 in 2020 (KOSTAT, 2020a). Based on the change in the number of farms, there are fewer than 50 Hanwoo farms, which decreased from 16,665 in 2017 to 15,115 in 2020, and farms raising more than 50 Hanwoo increased from 3,610 in 2017 to 4,043 in 2020 (KOSTAT, 2020b). Such finding indicates the performance of intensive animal farming on limited land. Intensive animal farming, also known as industrial livestock, is a type of agriculture with a higher level of input and output per animal raised on limited land. Owing to the aging of farming population, there has recently been a demand for the management of intensive farming using ICT (Information and Communications Technology) equipment to achieve precise management. Therefore, the Korean government has supported the allocation of ICT equipment and services for precise management through a local support project, which aims to supply ICT equipment to 5,750 farms by 2022. Owing to the increasing use of ICT equipment and services, the amount of data produced from farms is rapidly increasing (Cheon DW et al., 2016). Thus, a big data center is urgently needed to enable effective utilization of such data to achieve precise management.
Feeding, water, carcass traits, disease, physiological temperature, and activity data were collected from ICT equipment on farms. These data are then automatically converted into digital data and stored on the farm’s PC and ICT equipment company server. When these data are accumulated in a storage equipment, big data should be stored and managed separately.
A trend has been identified in the big data center industry abroad (Kim JH et al., 2014, Jang YJ and Kim TW, 2019). In the United States, the ‘Big Data R&D Strategic Plan’ was established (Jeong YC and Han EY, 2014). Accordingly, territories in the US were divided into the West (medical), Midwest (agricultural and food), South (health care and manufacturing), and Northeast (energy, education, climate change), and a ‘Big Data Regional Innovation Hub’ was established. This hub is found in more than 250 organizations in each region and is used to solve social problems (Son CM et al., 2019). In the United Kingdom, the ‘Data Strategy Committee’ was established under the government to strengthen access to data. Accordingly, new jobs and start-up industries related to big data technology such as manufacturing, distribution, and finance were created, which yielded an economic profit of 216 billion pounds (an estimated 350 trillion won) (National IT Industry Promotion Agency , 2014). The use of big data was also implemented in Japan at the IT Strategy Headquarters in 2012 to help the country become a society that uses IT (Information technology), such as machine to machine (M2M), big data infrastructure, and create new industries through big data (Korea Internet & Security Agency, 2014). In China, the ‘Action to promote big data development’ plan was announced to utilize the big data industry as a new growth foundation for national development. As a result, China government is investing the big data industry that raise market scale from 280 billion yuan (an estimated 47.7 trillion won) in 2015 to 1.01 trillion yuan (an estimated 172.08 trillion won) in 2020 (Korea-China Science & Technology Cooperation Center, 2018).
Currently, a gap exists in the technical level of big data between advanced countries and South Korea. In particular, the indicator of the convergence technique level is lower than that of the advanced country level (Jang YJ and Kim TW, 2019). According to data from the ITSA (Korea Information Technology Service Industry Association) in 2018, the centralization of the metropolitan area is a serious problem because 61% of domestic big data centers are located in Seoul and Gyeonggi-Incheon, 11% in Chungcheong and Gyeongsangnam-do, and 5.6% in Gangwon-do, Jeolla-do, and Gyeongsangbuk-do (Song JH et al., 2018). As the big data infrastructure is concentrated in the metropolitan area, regions outside of the metropolitan area face disadvantageous competition in the big data industry. In particular, the most of livestock industry is located in the provinces, and the competition in the big data industry is even more subsided. The livestock industry’s data share in the big data sector is 1.39% (Open Government Data Portal, 2020), which is an insignificant in the big data industry. To address these problems, the Ministry of Agriculture, Food and Rural Affairs (MAFRA) and Education, Promotion & Information Service in Food, Agriculture, Forestry & Fisheries (EPIS) collects data while operating a big data portal, however, most of the data are used to analyze consumer patterns to provide consumer-oriented customized purchase information services. As a result, the actual utilization in farms is very low.
Most companies that supply livestock management systems to Hanwoo farms are small, thus, the services that can be provided to farms using collected data with the ICT equipment are limited. Accordingly, integrated management system that enables Hanwoo farms to integrate and use data on breeding, improvement, and disease is inadequate. Therefore, it is necessary to establish a smart livestock model and an integrated management system for each species of livestock specialized in the region to achieve more precise Hanwoo management while improving farm management convenience. In addition, to strengthen the competitiveness of Hanwoo farms, a smart farm-based high-tech environment should be developed, and the integrated collection system of ICT data and management data for Hanwoo farms should be prepared.
Therefore, in this study, we suggest a method to collect and connect data related to breeding, improvement, disease, and environmental management in real-time for Hanwoo farms using ICT equipment in Gyeongsangbuk-do and propose a model for constructing an integrated management system to establish the ‘Regional Hanwoo Big Data center’.
Status of Hanwoo ICT equipment in Gyeongsangbuk-do province
In this study, we investigated information of Hanwoo ICT equipment in Gyeongsangbuk-do province through a literature search and field survey. the information of ICT equipment related to breeding, disease, and reproduction management in Hanwoo farm were shown in Table 1. The domestic equipment were found to mainly collect temperature and activity data for disease and breeding management. International ICT equipment are similar to the domestic ICT equipment, temperature and activity data for breeding and disease management were collected. Especially, due to the geographical characteristics of grazing, GPS location data were also collected by international ICT equipment.
Through a data survey, three of the thirty-six ICT equipment companies that are distributing ICT equipment in Gyeongsangbuk-do were selected, and the data collected by ICT equipment companies and the number of equipment deliveries in Gyeongsangbuk-do. The types of Hanwoo ICT equipment supplied including automated individual feeding machines, rumen activity analyzers, estrus detectors, and calving sensors. The ICT equipment in Gyeongsangbuk-do were investigated to collect data, such as activity level, weight, rumen activity level, tail movement pattern, and disease status of Hanwoo.
The information needs of farmers in Hanwoo industry will only increase as they have to make more and more complex decisions on how to increase productivity performance. so, we collected farm's opinions to know whether ICT equipment has effectively contributed to breeding, disease, and reproduction management. The farm’s opinion is shown in Figure 1. As shown in Figure 1, Farmers have positive thought about ICT equipment by words of ‘ICT’, ‘Positive’, ‘Assistance business’, ‘Helpful’. And the opinions of farmers on the establishment of a big data center for Hanwoo were positive. In fact, words such as ‘Positive’, ‘Disease management’, ‘Breeding management’, ‘Feeding management’, and ‘Integrated management’, were frequently used by the farmers. Therefore, most Hanwoo farmers have a positive outlook regarding the establishment of a big data center for Hanwoo. Moreover, these farmers need a system that can manage disease, breeding, and production in an integrated manner (Fig 1).
Integrated collection system for Hanwoo digital data
The Hanwoo big data model has different categories that must be managed according to the breeding environment and growth level. The data obtained from Hanwoo farm are separately collected and processed and must be stored in a form that is easy to analyze. For farms that do not have an ICT system, the data can be collected through visits and telephone surveys, and then be stored on the regional Hanwoo big data center server using the Web UI provided by the Hanwoo big data integration system (Seong KI et al., 2015). For farms that have ICT systems, the data can be transferred to the Hanwoo big data integration system through the ICT equipment companies’ system (Fig 2). To establish an integrated Hanwoo big data system, Personal Information Collection and Usage Agreement is necessary to standardize the unique data collection systems of many ICT companies and related organizations (Personal information protection commission, 2020). When consent is obtained from farmers to use personal information pursuant to Article 15 (1) 1, Article 3 (1), and Article 17 (2) of the Personal Information Protection Act, livestock data including personal information may be provided by the ICT equipment companies, related organizations. Consent for the use of personal information can be obtained by making it mandatory for farmers to provide personal information and submit consent forms when applying to the internet of things (IoT) equipment assistance program (Ministry of the Interior and Safety. 2019).
Because ICT equipment companies collect and manage data from sensors installed in Hanwoo farms through an integration controller, it is most suitable to use agent, a software that transmits data from the PC server system operated by the ICT equipment companies to the Hanwoo big data integrated system. However, depending on the situation of the ICT equipment companies, auxiliary means, such as file transfer protocol (FTP) and open application programming interface (Open API), might be needed (Fig 2). Related organizations can also share data using Agent, FTP, and Open API. However, related organizations that provide data through the homepage can collect data only by creating Web crawling software, and if crawling is not allowed, it can be downloaded directly into .csv or .xlsx format and uploaded to the Hanwoo big data integrated system (Fig 3).
A method for Hanwoo farms, ICT equipment companies, and related organizations to gather livestock data on Hanwoo safely and efficiently is to connect servers. To apply this method, various communication protocols should be prepared, such as stable data transfer and communication security, hyper-text transfer protocol (HTTP), transmission control protocol (TCP), user datagram protocol (UDP), and FTP, between physically distant servers. The Hanwoo big data center should manualize the protocol support plan by considering the environment of various ICT equipment companies.
Although the IT systems owned by ICT equipment companies and related organizations differ slightly in scale and complexity, standard systems for data connection between systems commonly use the agent and agentless. Agent is a method of receiving data by installing agent software for obtaining data on the PC of a Hanwoo farm or the ICT equipment companies, agentless is a method of bringing data directly from the data provider’s server in an environment where agent cannot be installed. The regional Hanwoo big data center establishes a system by installing agent at ICT equipment companies and Hanwoo data to the Hanwoo big data integrated system. As the data form of related organizations are different for each organization, regional Hanwoo big data center provides agent software to related organizations and connects the server of regional Hanwoo big data center and related organizations. Original data are stored in the Data Lake (DL) and metadata are stored in the data catalog. Thereafter, a system is established wherein the necessary data is retrieved via search of the data catalog.
The collected Hanwoo data is delivered to the user through the steps of ‘collection’-‘storage’-‘pre-processing’-‘analysis’-‘visualization’. ‘Pre-processing’ involves processing the collected data into a form suitable for analysis; this is the most important step in data analysis because different results can be derived based on who processes the collected data. The collected original data is stored in the DL through minimal processing in the ‘collection’ step. To access the data stored in the DL, a data catalog must be provided, and the data retrieved through the data catalog are processed again and stored in the Data Mart (DM). The data stored in the DM are used for analysis and management according to the purpose of the user (Fig 4).
Database integration and management system for Hanwoo digital data
Establishment of infrastructure for Hanwoo big data platform
The establishment category of the Hanwoo big data platform can be divided into data collection, processing, data analysis using artificial intelligence (AI), tools, services needed for analysis, and storage infrastructure that can store data. Agent and agentless were used for data collection and processing, and R or Python was used for data analysis. Management tools and services provide tools for managing users, data, resources, and analytical environments for data analysis and system operations. Finally, the hardware for each area is configured as an independent cluster, and the storage is established to ensure that the service is not interrupted even when scale-up and scale-out , which causes system expansion by adding hardware, disk, and memory, are performed.
Development of Hanwoo big data analysis and AI learning system
The software for data analysis using AI should be composed of open source-based software, except for security solutions and operating systems. In addition, when inquiring the data collected for analysis, various visualization tools for the result and extraction function for the inquiry result must be provided. Analysis tools such as R, Python, and Zeppelin were used for data analysis, and various algorithms such as statistics, machine learning, and deep learning were used according to the data analysis environment. Through AI technology-based data analysis, Hanwoo big data platform provide data which can be practical help with Hanwoo livestock farms by developing system model. All models are predictive models created based on data provided by farms, companies, and related organization. The types of models include a breeding prediction model such as estrus, fertilization and delivery, a carcass grade prediction model related to beef quality, and a Hanwoo disease prediction model.
Establishment of an integrated log management system
A log management system that enables automated real-time monitoring is needed to safely collect, store, and analyze all log data generated by Hanwoo farms. To establish an integrated log management system, big data processing technology, real-time data processing environment, and storage suitable for the characteristics of the collected data must be provided. Accordingly, individual characteristic data such as body temperature, activity, and livestock data, such as humidity and CO2 concentration, can be collected through a wireless network.
Establishment of an internet network connection system for data management
Unless the ICT equipment is directly developed and installed in farms, the most realistic method to receive livestock data from Hanwoo farmers is through servers from ICT equipment companies. To achieve this, consultation with an ICT equipment company must be performed and an agent that minimizes the overload on the server must be installed. Further, applied transport layer security (TLS)-based encryption must be applied between the agent and big data platform communication channel or a communication channel that applies its own certificate must be used. Internet networks connected with related organizations have additional costs, but the safest method, virtual private network (VPN), should first be considered. If VPN application is difficult, a communication channel using TLS, or a certificate method should be established.
Establishment and management plan for the regional Hanwoo digital big data center
When data can be transmitted from Hanwoo farms and ICT equipment companies using ICT equipment to select agent or agentless methods, the servers of Hanwoo farms and ICT equipment companies can be connected to the servers of the regional Hanwoo big data center. Conversely, if it is impossible to transmit data from Hanwoo farms and ICT equipment companies using ICT equipment, through ICT nodes and ICT controllers, which are equipment and programs that help Hanwoo farms and ICT equipment companies connect servers Hanwoo data can be provided via connection with the regional Hanwoo big data center server. Accordingly, the regional Hanwoo big data center receives management, reproduction, disease, and breeding data from data producers, predicts the estrous and optimal time of fertilization in real-time to effectively manage livestock on behalf of small Hanwoo farms, and actively responds to environmental, market, and disease situations. The regional Hanwoo big data center can identify the correlation between data and help solve various livestock problems by overseeing the management data, breeding data, production data, individual data, etc. In addition, the regional Hanwoo big data center can provide auction price, shipment count, disease data, breeding data, and reproduction data to farms by subject, and data based on laws on the provision and use of public data to ICT equipment companies and related organizations (Fig 5) (Ham YH et al., 2018).
Conclusion
The establishment of regional Hanwoo big data center and system using ICT equipment is expected to have various positive effects. In particular, Hanwoo farms are expected to save on labor force needs owing to the use of ICT equipment facilities. Additional labor cost savings are also expected through the utilization of various notification services offered by regional Hanwoo big data center. The regional Hanwoo big data center is also expected to markedly contribute to the increase in the income of farms by reducing calf losses through the provision of support for cattle calving management based on estrus and dystocia prediction. According to a report by the Ministry of Science and ICT, the establishment of a big data center is expected to lead to 1.6 employees being hired per year, and the total number of employees will reach approximately 160, depending on the R&D contribution over the next five years (Song JH et al., 2018). In addition, through pedigree management using IoT technology, positive effects can be expected in other industries, such as the production of high-quality food and the development of livestock management technology using AI (Samjung KPMG Economic Research Institute. Korea, 2016). Although the modern livestock industry is larger than that of the past, many Hanwoo farms cannot focus only on farm management. Hanwoo farms are engaged in farming and pomiculture in addition to livestock industry. As a result, it is easy to miss factors that can be directly related to the livelihood of farms, such as weak estrus and respiratory diseases, which cannot be determined in real-time. The regional Hanwoo big data center can preserve farm profits by integrating disease, production, and breeding management systems to minimize the loss of small Hanwoo farms. In addition, by datafication of pedigree data, including high-quality cattle and elite cattle lists at the regional Hanwoo big data center, the loss of the Hanwoo industry can be prevented by saving high-quality cattle and elite cattle that are unnecessarily slaughtered because they are deemed low grade.