DOI: 10.18522/2308-9709-2021-34-1


The breeding of dairy cattle is an important component of agricultural production having a significant impact on its economic efficiency. The potential milk yield of a cow during the lactation period continuously changes according to the change in its physiological state. The regularity of these changes is reflected in the lactation curve.

The lactation curve for a dairy cow increases rapidly from calving until its peak. Then follows a gradual decline up until the cow’s milk stasis in ten months after calving with a dry period of 45 days. Changes in daily milk yield are determined by changes in the number and activity of the cells of the mammary glands. Attempts at a mathematical description of this curve often endeavor to predict milk yields, dietary requirements and cash flow. As a rule, gaining a better understanding of the lactation process at the quantitative level while so doing is not pursued as a goal [15].

Most of the works devoted to the lactation curve modeling in full feeding conditions are empirical or semi-empirical. In this paper, we present both existing and proposed lactation models.

Many authors have devoted their works to the given issue. To date, more than twenty formulas have been offered. All models can be conventionally divided into three groups depending on the method of their derivation: empirical (models that have been obtained after statistical processing of results from a large amount of experimental data; logical (models based on intuitive ideas fundamental to lactation curve tracing); semi-empirical (based on the principles and methods of the theory of population growth) [28].  


  1. Empirical Models.

Historical Development of Empirical Models for Lactation Process

In all the models described it is assumed that  denotes daily milk yield t, denotes time in days after parturition, and , and  denote model parameters.

The first attempt to develop a mathematical model to describe the lactation curve was launched by Brody, Ragsdale and Turner in 1923. They used the following model for this purpose:

This model resulted in a good attempt to describe the declining phase of lactation, it was unable to model the initial rise in production to peak yield. Wood attributes this model to Gaines, but work by Gaines in this field was only published in 1927, whereas Brody et al. already published their paper in 1923. To overcome this limitation, Brody, Turner and Ragsdale presented an improved version of their model in 1924. This time the model made provision for the initial rise to peak production by incorporating an inclining function into the model:

Although this was a great improvement on their first model, later researchers such as Cobby and Le Du (1978):

found on fitting this model to lactation data of cows, that it resulted in underestimation of milk yield in mid-lactation and overestimated milk yield in late lactation.

This was followed by a relatively complex model than (1.1) introduced by Sikka (1950):

In 1958 Fisher proposed to improve the model (1.3) in the following form, by substituting the exponential decline built into this model with a linear decline:

The first attempt to formalize the lactation curve appears to belong solely to Vučić and Bačić (1961) who developed Gaines’ model into the following form:

This model seems to be the first attempt to develop a model that varies both directly and exponentially with time. Unfortunately, this equation does not exactly conform to the actual data [15].

In an effort to improve on all models that existed at the time, in the 1960s it was proposed an empirical model described by rational functions in the following form:

Using model (1.7), we can consider various special cases depending on changes in values of coefficients (ai, bi) and degrees ( m, n).

For example, when for the coefficients:  and for the degrees:  m=1 and n=2, then from equality (1.7) follows the inverse polynomial model proposed by Nelder (1966):

For other combinations of values m and n from equation (1.7) can be easily obtained empirical model by Bianchini (1984):

and a number of other models (Narushin and Takma, 2003).

The main impetus to the development of empirical modeling of lactation did Wood (1967), who suggested using the gamma function:

In the 1970s in search for an improvement on the existing at that time lactation models continued with the proposal of the polynomial model of the following form:

For example, for n=2 models by Dave (1971), for n=3 models by Dag et al. (2005).

Numerous studies [4, 12, 17, 20, 22, 25-26, 29 et al.] suggest linear and nonlinear empirical models, which contain different combination of algebraic and transcendental functions for the approximation of lactation data.

These include the most common models by Wilmink (1978), Ali and Scheaffer (1987), Guo and Swalve (1995), Mikayilov (2013) and (Memmedova and Mikayilov, 2014), which respectively have the form:


  1. Logical models


  1. Semi-empirical models

Among early attempts to apply mathematical methods to the study of biological problems one of the most successful outcomes occurred while studying the problem of growth of the total number of living organisms based on birth and death rates.

Verhulst (1838; 1845) and Pearl (1920) specified the mathematical model of logistic growth. The model examined the arresting factor of the population growth which lowered the growth rate   by a value proportional to the squared number. This more precise model results in the differential equation:

Here it is assumed that y(t) describes the size of the population over time t.

Where the parameter r is the specific rate of population growth; K is the maximum possible population size; t is time.

Equation (3.1) was derived under the following assumption: it is assumed that the dependence of the specific (average) growth rate of the population on its size is linear.

The Verhulst logistic equation is also referred to in the literature as the Verhulst-Pearl equation after Verhulst, who first derived the curve, and Pearl, who used the curve to approximate population growth in the United States in 1920.

The work (Kutsenko, 2012) states the mathematical description method for the time dependence of the total amount of milk yield from a cow in the lactation period  using Verhulst’s logistic function obtained by solving the differential equation (3.1) over a limited period of growth in the size of biological populations as follows:

According to Lokhorst (1996) and Faridi et al. (2011) the over a limited period of feed intake equation as follows:

Predictive calculations of the formulas (3.2) – (3.3) does not fully correspond to the true state at the initial and at the final stage of lactation curve. To eliminate these defects, generalizing the above models (3.2) - (3.4), we propose the following model

where  empirical parameters, which are based on experimental data on cow milk yield.


  1. Materials and Methods

4.1. Database

In order to compare our proposed and previously used models (France, 1987, etc.), we identified the parameters of these models. Further, various statistical criteria was used to select the best model. For this purpose we used the data of experimental milk yields of cows (Keskin et. al, 2010) from a subsidiary farm of the Selcuk University (Konya, Turkey).

Milking was carried out in accordance with the recommendations of the Animal Science Department of the Agricultural Faculty.

Daily and weekly milk yields were used for each cow.

The observed total milk yield (TMY) per lactation was estimated using the Fleischmann’s method (FLS) (Ruiz et al., 2000):

where TMY is total milk yield, y1 is yield at first milk record, t1 is interval between calving and first recording; yi is yield of the record i and ti is interval between the record i and the record (i +1), (i=1, . . . , k).


4.2. The Commonly used and proposed models

In the present work we have considered in the existing literature and our proposed models describing a lactation curve. The Commonly used models, which have been described above, is presented below (Tab 1).


Generalizing the models (3.2) and (3.4) as well using the solution (2.5) of the differential equation (2.3), which describes the rate of productivity change, we obtained lactation models, that are given below (Tab 2).  


4.3. Statistical Analysis

For comparison of model fitting, we used the seven comparison criteria. These criteria are given below:

  1. Root Mean Squared Error (RMSE) (also called the root mean square deviation, RMSD) is a frequently used measure of the difference between values predicted by a model and the values actually observed from the environment that is being modelled.


where, m=n-p-1 if n≤30 and m=n-p if to n>30;  n ist the number of observations;

p – ist the number of estimable parameters in the approximating linear and nonlinear empirical models, and should always be p<n; yi: is actual milk yield values; ỹi is predicted milk yield values.

  1. The coefficient of determination (R2), which indicates the precision of the modelled daily milk yield in relation to the measured values.


y̅= ∑yi/n –is the average of the milk yield observed values. When R2 is used as model selection criterion, we are faced with several problems [18, 38].

1) To compare the degree of success of different models using R2, the functional structure of the model and estimators must be the same.

2) The R2 value increases as the number of explanatory variables in the model increases.

However, this situation will lead to a rise in the variance of forecast errors. In this case, the use of the adjusted coefficient of determination is more appropriate. Therefore, it is more appropriate to use the adjusted coefficient of determination in nonlinear models [34].

  1. Adjusted coefficient of determination. The use of an adjusted R2 (often written as Radj2) is an attempt to take account of the phenomenon of the R2 automatically and spuriously increasing when extra explanatory variables are added to the model. The adjustedR2is defined as:


  1. The agreement index (D), which indicates the accuracy of the modelled daily milk yield in relation to the measured values [43], and is given by:

The index D ranges from 0 to 1, where the value 1 means a perfect accuracy of the estimated data, and the value 0 means that there is no accuracy.

  1. For the purposes of determining the adequacy of the models used Theil (1966) coefficient mismatch, which is calculated according to the equation:


Theil's inequality coefficient is useful for comparing different forecast methods. To interpret the U statistics the general guide is: U1 is bound between 0 and 1, with values closer to 0 indicating greater forecasting accuracy.

  1. The Mean Absolute Percentage Error (MAPE), also known as Mean Absolute Percentage Deviation (MAPD), is a measure of prediction accuracy of in statistics and is the most common measure of forecast error. Is usually expresses accuracy as a percentage, and is defined by the formula:


  1. Akaike information criterion (AIC). The Akaike information criterion (AIC) [1] is a popular method for comparing the adequacy of simple and multiple linear or nonlinear and nested or non-nested models. It is often used to select the model [7], which is calculated according to the equations:


  1. Results and Discussion

For comparison the existing in literature (Tab 1) and our proposed models (Tab 2), carried out the identification of parameters as well calculated statistical criteria for comparison these models.

For this purpose the parameters of existing and suggested patterns based on data of yields from two cows from a part-time farm of the University Zootechnics Chair “Selchuk” (Konya, Turkey) have been defined.

For computation of statistical parameters was used the program pack Statıstıka 5.0 using the Quasi-Newton and Rosenbrock and Quasi-Newton methods. The calculation results are shown in Table3.

To select the best model, the values of the statistical parameters calculated utilizing equations (4.1) - (4.7) are given in Table 3.

The existing models in the literature and our proposed models (Table 2 and 3) used to describe the lactation process are compared with each other using these values, and the best model is selected.

According to these results (Table 3), it is clear that the best model for both cows among the existing ones is Model 14 proposed by Mikayilov (2013) and among the proposed is the Model 16, when all the seven criteria are taken into account.

When statistical results related to the measured values and values calculated according to all the above models are examined, it is seen that the maximum (R2=78.20% and 78.97%, R2adj=77.94% and 78.77%, D=0.9363 and 0.9383) and minimum (RMSE=1.3383 and 1.4017, UI=0.0376 and 0.0410, A=5.66% and 6.47, AIC=0.6094 and 0.6978) values for both cows are obtained for proposed model 16:


Once a model was chosen as the best, total milk yield per lactation (ETMYL) was estimated for the selected equation (5.1). Using the expression (5.1) and following the formula:


we derive the milk output for a given animal in a period of time between moments  and , and also the total yield time between moments  and  .

 One of the main characteristics, i.e. milk yield to day 259 (for y1) and day 308 (for y2) of the overall lactation curve were estimated using the proposed model (5.1).

The results of calculations for these models were compared with the observed values, which are as follows:

The results of calculations, calculated using the formula (5.2), very close to the observed values

Thus, using the values of the model’s parameters (5.1) and the relation (5.2) we can evaluate the yield at an arbitrary lactation point. Such a procedure can be useful in modeling the processes involved in milk production, for example, calculating feed rations.


It should be noted that earlier, when choosing a models, preference was given to models with fewer parameters, in order to simplify the calculation. However, given the advances in computer technology in today’s world, the number of parameters is no longer relevant as a criterion.  It is certain that, despite a heavy computational burden, the model that is thought to be closest to reality according to statistical analyses should be preferred.

Given the results obtained, it would be wrong to be content with the 2 or 3 simple classical criteria, which yield misleading results in the selection of the most appropriate model in pursuit of solutions to such problems.

 It is thought that the application of detailed analyses is more appropriate in model selection.

Comparison of statistical parameters revealed that among all studied models the proposeds model 16 best describes the yields’ data for both cows. The models presented in this paper require further theoretical and experimental development and testing under varying conditions and on different animals. The aims of further investigations into the problems of tracing dynamic models for the elaboration of animals’ productivity and methods for tracing lactation curves must include the following:

  • consideration of the most important properties of the physiological state of animals in the lactation period such as initial, maximal and final daily milk yields;
  • calculation of statistical parameters according to the above-mentioned models based on experimental data;
  • selection of the best-fit lactation curve models.



The author thank Prof. Dr.İsmail Keskin from Konya (Deparment of Animal Science, Faculty of Agriculture, Selcuk University, Konya, Turkey) and Assoc. Prof. Dr. Nazire Mikail from Siirt (Department of Animal Science, Faculty of Agriculture, Siirt University, Siirt-TURKEY) for the for providing data used in this study



  1. Akaike, H. Information theory and an extension of the maximum likelihood principle. Second International Symposium on Information Theory, Akademiai Kiado, Budapest, 1973. P. 267–281.
  2. Ali, T. E. and Schaeffer, L. R. Accounting for covariance among test day milk yields in dairy cows// Can. J. of Anim. Sci., 1987. № 67. P. 637 – 644.
  3. Bianchini Sobrinho, E. Estudo da curva de lactaçao em vacas da raça Gir//Tese (Doutorado em Genetica). Faculdade de Medicina Veterinaria de Ribeirao Preto. Universidade de Sao Paulo. Ribeirao Preto, 1984. 88 p.
  4. Bouallegue, M. and M’Hamdi, N. Mathematical modeling of lactation curves: A review of parametric models//Lactation in Farm Animals-Biology, Physiological Basis, Nutritional Requirements, and Modelization/ Edited by Naceur M'Hamdi, 2019. P. 1-20.
  5. Brody, S., Ragsdale, A. C. and Turner, C. W. The rate of decline of milk secretion with the advance of the period of lactation//The Journal of General Physiology, 1923. № 5. P. 441–444.
  6. Brody, S., Turner, C. W. and Ragsdale, A. C. The relation between the initial rise and the subsequent decline of milk secretion following parturition// The Journal of General Physiology, 1924. № 6. P. 541 – 545.
  7. Burnham, K. P. and Anderson, D. R. Model selection and multimodel inference: A practical–theoretic approach, 2nd Ed., Springer-Verlag, New York, NY. 2002. 488 p.
  8. Cankaya, S., Unalan, A., and Soydan, E. Selection of a mathematical model to describe the lactation curves of Jersey cattle//Archives Animal Breeding, 2011. № 54(1). P. 27-35.
  9. Cobby, J. M. and Le Du, Y. L. P. On fitting curves to lactation data// Animal Production, 1978. № 26. P. 127 – 133.
  10. Dag, B., Keskin, I. and Mikailsoy, F. Application of different models to the lactation curves of unimproved Awassi ewes in Turkey// South African Journal of Animal Science, 2005. № 35(4). P. 238 – 243.
  11. Dave, B. K. First lactation curves of Indian Water Buffalo// JNKVV Research Journal. 1971. № 5. P. 93 – 98.
  12. Dongre, V.B., Gandhi, R.S., Sing, A., Gupta, A. A brief review on lactation curve model for predicting milk yield and different factors affecting lactation curve in dairy cattle//International Journal of Agriculture: Research and Review. 2011. № 1(1). P. 6–15. Available online at
  13. Faridi, A., Mottaghitalab, M., Rezaee, F., France, J. Narushin-Takma models as flexible alternatives for describing economic traits in broiler breeder flocks// Poult Sci. 2011. №11. 90. P. 507–515
  14. Fischer, A. Research with Württemberg Spotted Mountain cows on the shape of the lactation curve and how it may be influenced by non-genetic factors// Züchtungskunde. 1958. № 30. P.296-304.
  15. France, J. and Thornley, J. H. M. Mathematical models in agriculture. Butterworths, London. 1984. 355 p.
  16. Gaines, W. L. Persistency of lactation in dairy cows: a preliminary study of certain Guernsey and Holstein records// Bulletin of the Illinois Agricultural Experimental Station, 1927. № 288. P.355–424.
  17. Gök, T., Mikail, N. and Akkol, S. Analysis of the first lactation curve in holstein cows with different mathematical models// KSÜ Tarım ve Doğa Derg. 2019. № 22(4). P.601-608.
  18. Gujarati, D.N. Basic econometrics, 4th Edition, the McGraw−Hill Companies, New York City. 2004. 1003 p.
  19. Guo, Z. and Swalve, H.H. Modeling of the lactation curve as a sub-model in the evaluation of test-day records. Proceedings of the Interbull Annual Meeting, Prague, 7-8 September 1995, International Bull Evaluation Service, Uppsala, Sweden. Interbull Bulletin, 1995. № 11. P. 22 – 25.
  20. Kawata, Y. Lactation curves of dairy animals―An interim literature review//Research Bulletin of Obihiro University of Agriculture and Veterinary Medicine. 2011. № 32. P. 71-91.
  21. Keskin, I., Memmedova, N., Ilhan, F., Dağ, B. and Mikailsoy, F. Comparison of eleven mathematical models for describing the first lactation curve of Holstein cattle in Turkey, Second Int. Symposium on Sustainable Development (ISSD’10), Proceedings, V.3. P. 246–255, 8–9 June, Sarajevo, 2010.
  22. Koçak, O. and Ekiz, B. Comparison of different lactation curve models in Holstein cows raised on a farm in the South Eastern Anatolia region//Arch Tierz Dummerstorf.2008. № 51. P. 329–337
  23. Kutsenko A.I. Modelirovaniye dinamiki molochnoy produktivnosti krupnogo rogatogo skota na osnove logisticheskoy funktsii Ferkhyul'sta//Izvestiya TSKHA, 2012. № 2. P. 147-155.
  24. Lokhorst, C. Mathematical curves for the description of input and output variables of the daily production process in aviary housing systems for laying hens//Poult. Sci. 1996. № 75. P. 838–848.
  25. Lombard, C.S. Hierarchical Bayesian modelling for the analysis of the lactation of dairy animals, PhD Thesis, University of The Free State Bloemfontein, South Africa. 2006. 265 p.
  26. Lopez, S. Non-linear functions in animal nutrition//In: France, J.E., Kebreab, E. editors. Mathematical Modeling in Animal Nutrition. Oxfordshire: CABI, 2008. P. 47-88.
  27. Mamedova, N.I., Mikayilov, F.D. Modelirovaniye dinamiki molochnoy produktivnosti krupnogo rogatogo skota//Materialy mezhdunarodnoy nauchno-prakticheskoy konferentsii v ramkakh XXIV mezhdunarodnoy spetsializirovannoy vystavki ‘AGROKOMPLEKS–2014’.11-13 marta 2014 g. Bashkirskiy GAU. Ufa, Rossiya. Tom 1. S. 312-321.
  28. Mikayilov F. D. Opredeleniye molochnoy produktivnosti krupnogo rogatogo skota na osnove matematicheskogo modelirovaniya//Mezhdunarodnoy nauchno-prakticheskoy konferentsii ‘Sovremennyye tekhnologii v veterinarii i zootekhnii. Tvorcheskoye naslediye V.K. Birikha (k 110-letiyu so dnya rozhdeniya)’. 3-4 aprelya 2013, Perm', Rossiya, S. 25-30.
  29. Morant, S. V. and Gnanasakthy, A. A new approachto the mathematical formulation of lactation curves//Animal Production, 1989. № 49. P. 151 – 162.
  30. Narushin, V. G. and Takma, C. Sigmoid model for the evaluation of growth and production curves in laying hens//Biosystems Eng. 2003. № 84. P. 343–348.
  31. Nelder, J. A. Inverse polynomials, a useful group of multifactor response functions// Biometrics, 1966. № 22. P. 128–141.
  32. Pearl, R and Reed, l. J. On the rate of growth of the population of the United States since 1790 and its mathematical representation// Proceedings of National Academy of Sciences USA, 1920. № 6(6). P. 275–288.
  33. Ruiz, R., Oregui, L.M. & Herrero, M. Comparison of models for describing the lactation curve of Latxa sheep and an analysis of factors affecting milk yield//J. Dairy Sci. 2000. № 83. P. 2709-2719. 10.3168/jds.S0022-0302(00)75165-4
  34. Sit, V., Costello, M.P. Catalog of curves for curve fitting. Biometrics Information Hand book Series. Ministry of Forests, B. C. Victoria, Canada. ISSN 1994. № 4. P. 1183- 9759.
  35. Sikka, L.C. A study of lactation as affected by heredity and environment//Journal of Dairy Research. 1950.17: 231-252.
  36. Smagin B.I. Modelirovaniye laktatsionnoy krivoy v molochnom skotovodstve /B.I. Smagin // Matematicheskiye i instrumental'nyye metody ekonomicheskogo analiza: upravleniye kachestvom. Sbornik nauchnykh trudov. Vypusk 19. Tambov, TGTU, 2005. P. 298 – 309.
  37. Theil H. Applied Economic Forecasting. Amsterdam: North Holland. 1966. 474 p.
  38. Tuşat, E., Mikailsoy, F. An investigation of the criteria used to select the polynomial modelsemployed in local GNSS/leveling geoid determination studies//Arabian Journal of Geosciences, 2018. 8(24). P. 801,
  39. Verhulst, P. F. Notice sur la loi que la population poursuit dans son accroissement//Correspon. Mathématique et Physique, 1838. № 10. P. 113 – 121.
  40. Verhulst, P. F. Recherches mathématiques sur la loi d’accroissement de la population//Nouveaux mémoires de l’Académie Royale des Sciences et Belles-Lettres de Bruxelles, 1845. № 18. P.1– 41.
  41. Vujicic, I. and Bacic, B. New equation of the lactation curve (in Croatian)// Letopis Naucnih Radova, Poljorivredni Fakultet u Novum Sadu (University of Novi Sad, Novi Sad, Yugoslavia), 1961. № 5. 9 p.
  42. Wilmink, J.B.M. Adjustment of test day milk, fat and protein yield for age, season and stage of lactation// Livest. Prod. Sci., 1987. № 16. P. 335 – 348.
  43. Willmott, C.J. and Wicks, D.E. An empirical method for the spatial interpolation of monthly precipitation within California// Physical Geography, 1980. № 1. P. 59–73.
  44. Wood, P. D. P. Algebraic model of the lactation curve in cattle//Nature, Lond. 1967. № 216. P. 164–165.

Библиографическая ссылка

Mikailsoy Fariz, Mathematical modeling of lactation curves of dairy animals // «Живые и биокосные системы». – 2020. – № 34; URL: DOI: 10.18522/2308-9709-2021-34-1.