HAL Id: hal-02358044
https://hal.archives-ouvertes.fr/hal-02358044v3
Submitted on 5 Feb 2020
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires
Comparison of forecast models of production of dairy cows combining animal and diet parameters
Thong Nguyen, Remy Fouchereau, Emmanuel Frenod, Christine Gerard, Vincent Sincholle
To cite this version:
Thong Nguyen, Remy Fouchereau, Emmanuel Frenod, Christine Gerard, Vincent Sincholle. Compari-
son of forecast models of production of dairy cows combining animal and diet parameters. Computers
and Electronics in Agriculture, Elsevier, 2020, 170, pp.105258. �10.1016/j.compag.2020.105258�. �hal-
02358044v3�
Comparison of forecast models of production of dairy cows combining animal and diet parameters
Quoc Thong Nguyen a , R´ emy Fouchereau b , Emmanuel Fr´ enod a,b , Christine Gerard c , and Vincent Sincholle. c
a Universit´ e de Bretagne Sud, Laboratoire de Math´ ematiques de Bretagne Atlantique, UMR CNRS 6205, Campus de Tohannic, Vannes, France
b See-d, Parc Innovation Bretagne Sud, Vannes, France
c NEOVIA, France
Abstract
We study the effect of nutritional diet characteristics on the lactating Holstein- Friesian dairy cows in Brittany, France from 36 individuals. An analysis of the relations between fat/protein content and milk yield was implemented for our dataset. The fat and protein production increase at a slower rate as milk yield increases. The importance of chemical composition on milk production is studied using the linear model. The data analysis confirms the importance of Starch, crude fiber, and protein which have a positive effect on milk production. This analysis also confirms the previous study on the effect of parity on the production. After that, the milk production forecast- ing is investigated using both linear models and machine learning approaches (support vector machine, random forest, neural network). We study the per- formance of multiple linear regression and machine learning-based models in both non-autoregressive and autoregressive cases at the individual level.
The autoregressive models, which take into account the previously observed milk yield, have proven to significantly outperform the non-autoregressive approaches. Moreover, the computational cost of each approach is presented in the paper. While the random forest algorithm gives the best performance in both non-autoregressive and autoregressive approaches. The support vec- tor machine algorithm gives a very close performance with a substantial less computing time. The support vector machine is shown to be the best com-
∗ Corresponding author
Email address: quoc-thong.nguyen@univ-ubs.fr (Quoc Thong Nguyen)
promise between accuracy and computational cost.
Keywords: Milk production forecasting, Dairy modeling, Autoregression, Smart farming
1. Introduction
1
Milk production forecasting of the dairy cow is an essential factor that
2
is useful for the dairy farmers in management as well as health monitoring.
3
In literature, many parametric models have been developed to model the
4
lactation curve at the herd and individual level [1, 2, 3, 4, 5, 6]. Or the
5
studies on extended lactation in dairy production [7, 8]. Recently, there
6
are a number of modeling techniques on milk production forecasting that
7
showed to obtain a highly accurate prediction with adaptability at the herd
8
level [9, 10, 8]. The nonlinear autoregressive model with exogenous input
9
using artificial neural networks introduced by Murphy et al. [9] shown to be
10
most effective milk-production model.
11
On the other hand, understanding the effect of the nutritional diet on milk
12
production and the quality of milk is not only helpful in financial planning but
13
also in the production of other dairy products, such as yogurt, cheese, butter
14
[11]. The importance of feed intake, diet on dairy cows was investigated in
15
recent years. For example, the feed intake increases slowly at the beginning
16
of lactation [12]; or the effects of dietary starch concentration on yield of milk
17
and milk components were investigated by Boerman et al. [13].
18
In spite of that, not many studies are on individual cow level, and on the
19
milk forecasting based on the nutrition for the small scale farms. Milk yield
20
forecasting of each individual cow can be beneficial to many applications such
21
as monitoring health conditions and disease detection, i.e. mastitis [14, 15].
22
Recently, Zhang et al. [16] conducted a study on the effect of parity weighting
23
with the dataset in the south of Ireland; or Van Bebber et al. [17] applied
24
Kalman Filter on monitoring dairy milk yields.
25
The subject of this study is to improve livestock farming, particularly
26
milk production, by monitoring the performance in nutrition supplies. The
27
first objective is to analyze the importance of the chemical composition of
28
nutrition on the production and milk production monitoring of dairy cattle
29
in Brittany, France. Secondly, we compare the performance of different types
30
of multiple linear regression and machine learning-based models for predic-
31
tion of production of the individual cow. The practicability and ability for
32
industrial applications are also discussed.
33
The paper is organized as follows. Section 2 is devoted to describe in detail
34
the content of our dataset and to present the composition analysis. Section 3
35
briefly recalls and analyzes the linear regression models and machine learning
36
algorithms. Section 4 focuses on the performance of the regression algorithms
37
on forecasting. The concluding remarks are given in Section 5.
38
2. Data description and composition analysis
39
2.1. Data description
40
The empirical data were collected from 36 lactating Holstein-Friesian
41
dairy cows in a research farm in Brittany, France, equipped with a robotic
42
milking system. For a ten months period (from December 2015 to September
43
2016), there are 7691 valid milking records collected. Each milking record
44
contains Daily Milk Yield (DMY), Day In Milk (DIM), parity information
45
(first, second, third onward lactation, see Tab. 1), number of milking per
46
day and the collective (corn silage, grass silage, wheat straw, soybean meal)
47
or individual (pelleted feed distributed through an automatic feeder) con-
48
sumption of diet components. Each cow is milked one to four times per day
49
by the robotic milking system, the cow can possibly be milked each time
50
it comes to the freestall for food. In this experiment, the amount of given
51
diets are changed every week. In this study, we are interested in the effect of
52
the diet on milk production forecasting. Particularly, the chemical compo-
53
sition studied in this paper are starch, crude fiber, Net Energy (NE) Unit´ e
54
Fourrag` ere Lait (UFL 1 ) and protein (PDIE 2 ). Therefore, the consumption
55
of different diets was converted to these four chemical compositions. Table
56
2 presents the composition of each diet. It should be noted that, in Table
57
2, the consumption of the first eight diets (Corn silage, Grass silage, ..., Ni-
58
trogen supplement) is the same for 36 dairy cows at a specific week. On
59
the other hand, since the last four components (Production feed, ..., Liquid
60
1 which are respectively the units used in dairy production to estimate available energy and protein supply to dairy cows, estimated based on 1 UFL = 1.7 Mcal, see [18].
2 Prot´eines Digestibles dans l’Intestin limitantes par l’apport d’ Energie: true protein ´
absorbable in the small intestine when rumen fermentable energy (organic matter) is lim-
iting microbial protein synthesis in the rumen [19].
feed) in Table 2 are distributed by robot, which means the consumption of
61
these four components varies according to the milk production level of each
62
individual cow. Therefore, the consumption of each individual may differ at
63
a specific week. In order to have a regular effect of each nutrient on milk
64
production, we used the weekly data instead of the daily data. That means
65
each data point is the average of seven days’ observations. The statistical
66
characteristics of the interesting variables are presented in Table 3.
Parity number of cows
First lactation 20
Second lactation 13
Third onward lactation 3
Table 1: Number of individuals on each parity lactation.
DM * content, Protein, Starch, Crude fiber, NE, PDIE,
% g/kg of DM g/kg of DM g/kg of DM UFL/kg of DM g/kg of DM
Corn silage 34.1 75 360 174 0.95 69
Grass silage 23.4 141 0 231 0.92 63
Fescue 88 93 0 222 0.76 82
Alfalfa hay 91.8 160 0 169 0.72 93
Fresh grass 18.3 167 0 217 0.94 90
Wheat straw 88 35 0 420 0.42 44
Ears corn 64 51 580 72 1.06 95
Nitrogen supplement 88 455 0 170 1.09 278
Production feed 88 273 114 14 1.17 205
Soluble nitrogen supplement 88 489 0 13 1.08 256
Ruminoprotected nitrogen supplement 88 443 0 13 1.08 273
Liquid feed 100 0 0 0 2.20 0
* Dry Matter
Table 2: Chemical composition of different diet.
67
2.2. Milk fat and protein composition analysis
68
In this section, we analyze the correlation between fat and protein content
69
and milk yield with the collected data. The yield of cheese and butter mainly
70
depend on milk fat and protein yield. A factor that impacts milk fat and
71
protein concentration is milk yield [20]. It is well-known that, in daily rumi-
72
nants, correlations among fat and protein content (g over 1 kilogram of milk
73
yield) and milk yield are negative [21]. In our experiment, the reported cor-
74
relation coefficients between milk yield and fat and protein content are −0.04
75
and −0.21, respectively. In our observed data, the fat and protein content
76
Mean SD + Min Max Starch (kg) 0.185 0.124 0.000 0.451 Crude fiber (kg) 0.426 0.190 0.080 0.966 PDIE (kg) 0.730 0.304 0.159 1.683 Net energy (UFL) 3.692 1.630 0.672 8.046
Parity 1.631 0.972 1 5
Milking per day 2.731 0.541 1 5
+ Standard deviation
Table 3: The statistical characteristics of the interested variables.
decrease as the milk yield increase, but not significant. As shown in Figures
77
1a and 1c, the fat and protein content visually decrease as milk yield increase
78
to 20 (kg/day). This phenomenon can be explained as at the beginning of
79
the lactation, the milk production increases more rapidly than the ability
80
of consumption of the cow. Moreover, when dairy cows produce more milk,
81
they consume more, especially water [22], but nutrition absorption cannot
82
change so intensively.
83
Some studies discovered that as milk yield increases, fat and protein syn- thesis generally increases at a slower rate [23, 20]. This phenomenon can be described by the allometric model:
y = ax b
where y is fat or protein yield (g/day), x the milk yield (kg/day), and a and
84
b are equation coefficients. Parameter b represents a scaling factor describing
85
the effect of milk yield variation on its two main constituents. With b = 1,
86
milk yield shows a linear relationship with fat or protein yield whose content
87
in milk is equal to a; if b > 1, fat or protein yield tends to increase more
88
proportionally than milk yield; and finally, if b < 1, fat or protein yield
89
increases at a slower rate than the milk yield.
90
In Figures 1b and 1d, the application of this model to data showed that
91
fat and protein synthesis varied proportionally to the output of milk with an
92
exponent 0.964 and 0.910 for milk fat and milk protein, respectively. Thus,
93
the higher the milk yield, the more cheese produced, even each additional
94
unit of milk results a lower increase in fat and protein. Moreover, from this
95
dataset, since the relationship between milk fat and milk yield has higher
96
variability than that between milk protein and milk yield (see Figure 1),
97
modification of milk composition by nutritional means should be easier to
98
achieve for fat than for protein.
10 20 30 40 50
30405060708090
Milk yield, kg/d
Fat concentration, g/kg
y= −0.032x+40.904 R2=0.002
(a)
0 10 20 30 40 50 60
0.00.51.01.52.02.53.0
Milk yield, kg/d
Fat yield, kg/d
y=0.045x0.964 R2=0.814
(b)
10 20 30 40 50
25303540455055
Milk yield, kg/d
Protein concentration, g/kg
y= −0.082x+34.724 R2=0.044
(c)
0 10 20 30 40 50 60
0.00.51.01.52.0
Milk yield, kg/d
Protein yield, kg/d
y=0.044x0.91 R2=0.91