نوع مقاله : مقاله پژوهشی
نویسندگان
گروه مهندسی کامپیوتر، مرکز تحقیقات هوشمندسازی شبکه توزیع برق، واحد مشهد، دانشگاه آزاد اسلامی، مشهد، ایران.
چکیده
کلیدواژهها
موضوعات
عنوان مقاله [English]
نویسندگان [English]
Introduction: Development of a predictive model using machine learning can help livestock farmers to increase their understanding of the performance potential of their livestock. It can assist in decision-making processes related to livestock management, elimination and replacement selection, nutrition, reproduction and other matters of livestock management. Predicting insemination outcomes provides valuable insights to improve reproductive performance, breeding processes, milk production and overall livestock efficiency. The integration of predicting models in the existing systems in animal husbandry increases its practical application as a decision support tool for animal farmers. By developing a tool that can determine the reproductive success of livestock, ranchers can optimize their production and breeding strategies and improve overall livestock management practices to increase reproductive efficiency and profitability. In this study
Material and Methods: This study utilized data from the Helal Agro-Industry Co., a commercial dairy farm associated with the Iranian Red Crescent Investment Company. The commercial dairy herds in this region primarily consist of Holstein-Friesian cattle. In terms of record-keeping and efficient data management, the agricultural enterprise utilizes the Modiran Farmer software. This software leverages the Microsoft SQL Server database infrastructure to facilitate the storage of pertinent information. The dataset encompasses a diverse array of tables containing entries spanning various aspects such as reproduction, milking, health profiles, genetic insights, and broader characteristic attributes. The temporal scope of the database spans from January 1994 through May 2023, encapsulating a substantial historical period. We executed a SQL query against the database to generate a dataset of insemination records and their corresponding features. For each insemination record, we retrieved 25 features encompassing covariates related to milking, reproduction, management factors, health, and insemination result. The data underwent further pre-processing after the extraction process to make it suitable for the proposed models. We proposed three different models of Long Short-Term Memory, Multi-Layer perceptron, and XGBoost. A distinct set of cow IDs was acquired, and then, it was partitioned into three subsets: 70% for training, 10% for validation, and 20% for testing. In order to work with LSTM model, by identifying the temporal dependencies relations between a cow’s insemination cycles, we stacked these cycles to form sequences that can then be processed by LSTM model. So, the sets of unique cow IDs were then used to generate the sequences for each cow. A data augmentation method was used to generate all possible sub-sequences of cows’ insemination. Then, the sequences were aligned and stacked to achieve a constant length of 20. In total, about 168,000 training sequences, 23,000 validation sequences, and about 46,000 test sequences were generated. We tuned the parameters and hyperparameters of each model and upon finalizing the optimal architectures for our models, we initiated training experiments by fitting the models to the prepared datasets.
Results and Discussion: Our experimental findings reveal that the proposed LSTM model significantly improved prediction accuracy compared to the MLP and XGBoost models. The LSTM model, with its architecture of three consecutive LSTM layers, was able to demonstrate the best performance across all evaluation metrics on average over the 10 training runs. LSTM networks are designed to handle long time dependencies well. These networks use memory cells to hold important information over time, which makes them suitable for ordinal data such as time series. In contrast, XGBoost and MLP are not specifically designed to handle temporal dependencies and their performance is more limited on this type of data. Also, LSTM network can learn complex dependencies between ordinal data well. This ability is attributed to the unique structure of LSTM and its gate mechanisms, which enable the network to filter out irrelevant information while retaining essential information. In contrast, models based on XGBoost and MLP are less effective in this area, as they primarily focus on direct interactions between features and struggle to capture temporal dependencies. LSTM-based models excel in extracting higher-level features from data due to their deep learning capabilities. These features provide richer information for classification tasks, ultimately improving classification accuracy. Although XGBoost-based models are known for their precision, they are less adept at extracting high-level features. Additionally, the memory structure of LSTM allows it to handle fluctuations and unexpected variations in sequential data, effectively distinguishing critical information from noise. This feature helps LSTM perform better in situations where the data contains a lot of noise and fluctuations.
Conclusion: Overall, we presented and tested the performance of different models for predicting the results of artificial insemination of livestock. This prediction can help livestock farmers improve performance, increase fertility, and reduce livestock costs. In the problem of predicting the results of artificial insemination of livestock, the presented LSTM neural network model shows the best performance based on the stated evaluation criteria, and then the XGBoost-based classifier has better performance than MLP.
کلیدواژهها [English]
©2023 The author(s). This is an open access article distributed under Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source.
ارسال نظر در مورد این مقاله