A Tobit-Model for breeding value evaluation in German trotters
Transcription
A Tobit-Model for breeding value evaluation in German trotters
A Tobit-Model for breeding value evaluation in German trotters A.-E. Bugislaus1, N. Reinsch2 1Agrar- und Umweltwissenschaftliche Fakultät der Universität Rostock, 2Leibniz-Institut 16.07.2012 für Nutztierbiologie Dummerstorf (FBN) UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN Breeding goal in German trotters The goal in German trotter breeding is a fast, sound, welltempered and good-gaited trotter with precocity and a correct exterior. Actual breeding value evaluation in German trotters ¾ Model: BLUP animal model utilizing individual race results from all starting trotters ¾ Traits: Earnings per race Rank at finish Racing time per km ¾ Question: Do only trotters with earnings show their real racing potential whereas trotters without earnings are not driven to their limit? Censored race results could be responsible for a potentially severe bias in actual genetic evaluation in German trotters 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 3 Objectives of the study ¾ Definition of censoring in race results of German trotters ¾ Tobit-like-Threshold-Model of racing performances ¾ Use of a real Tobit-Model for the censored trait racing time per km ¾ Comparison with linear model that treated all individual racing times per km as uncensored 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 4 Data for genetic estimations ¾ Total data set consisted of 105,981 race performances from 6,504 trotters (mean of racing time per km = 79.7 s/km) ¾ Data set involved 14,148 races with 7.5 participants in average ¾ Starting method was in all races the auto start ¾ Pedigree back to the fourth generation was included (20,703 animals) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 5 Censoring in race results of German trotters and description of different used genetic models Trait Exemplary results of one race sorted by ranks at finish Uncensored racing Tobit-like-Threshold-Model Censored racing time per time per km (y) km for Tobit-Model (y*) for placing status y1 y2 y3 y4 y5 y6 y7 y8 y9 y10 5 4 3 2 1 0 0 0 0 0 y1 y2 y3 y4 y5 y6* y7* y8* y9* y10* Model Linear-Model: Bayesian analysis Threshold-Model: Bayesian analysis Tobit-Model: Bayesian analysis with data augmentation Program LMMG (REINSCH, 1996) LMMG_TH (REINSCH, 1996) LMMG_TOB (REINSCH, 2011) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 6 Censoring in race results of German trotters and description of different used genetic models Trait Exemplary results of one race sorted by ranks at finish Uncensored racing Tobit-like-Threshold-Model Censored racing time per time per km (y) km for Tobit-Model (y*) for placing status y1 y2 y3 y4 y5 y6 y7 y8 y9 y10 5 4 3 2 1 0 0 0 0 0 y1 y2 y3 y4 y5 y6* y7* y8* y9* y10* Model Linear-Model: Bayesian analysis Threshold-Model: Bayesian analysis Tobit-Model: Bayesian analysis with data augmentation Program LMMG (REINSCH, 1996) LMMG_TH (REINSCH, 1996) LMMG_TOB (REINSCH, 2011) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 7 Censoring in race results of German trotters and description of different used genetic models Trait Exemplary results of one race sorted by ranks at finish Uncensored racing Tobit-like-Threshold-Model Censored racing time per time per km (y) km for Tobit-Model (y*) for placing status y1 y2 y3 y4 y5 y6 y7 y8 y9 y10 5 4 3 2 1 0 0 0 0 0 y1 y2 y3 y4 y5 y6* y7* y8* y9* y10* Model Linear-Model: Bayesian analysis Threshold-Model: Bayesian analysis Tobit-Model: Bayesian analysis with data augmentation Program LMMG (REINSCH, 1996) LMMG_TH (REINSCH, 1996) LMMG_TOB (REINSCH, 2011) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 8 Censoring in race results of German trotters and description of different used genetic models Trait Exemplary results of one race sorted by ranks at finish Uncensored racing Tobit-like-Threshold-Model Censored racing time per time per km (y) km for Tobit-Model (y*) for placing status y1 y2 y3 y4 y5 y6 y7 y8 y9 y10 5 4 3 2 1 0 0 0 0 0 y1 y2 y3 y4 y5 y6* y7* y8* y9* y10* Model Linear-Model: Bayesian analysis Threshold-Model: Bayesian analysis Tobit-Model: Bayesian analysis with data augmentation Program LMMG (REINSCH, 1996) LMMG_TH (REINSCH, 1996) LMMG_TOB (REINSCH, 2011) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 9 Tobit-Model for censored trait racing time per km: Data augmentation ¾ For each y* > y5 the threshold is determined as standardized value: t= y 5 − xi ' β − zi ' u σe ¾ A random variable ai < t is drawn from a truncated standard normal distribution and is subsequently transformed to the original scale: yi* = ai * σe + xi ' β + zi ' u ¾ For one iteration yi* is treated as observation, in the following round yi* is again determined 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 10 Univariate genetic-statistical model y = Xb + Z1a + Z2pe + e y: vector of observations containing either the uncensored or censored trait racing time per km or the threshold trait placing status of each trotter in each individual race b: fixed effects a: random animal effect pe: random permanent environmental effect e: residual effect X, Z1, Z2: incidence matrices ¾ 1 million cycles were generated (Gibbs sampling algorithm) ¾ As burn-in period 250,000 rounds were considered 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 11 Fixed effects ¾ sex (stallion, mare, gelding) ¾ age of trotter (12 classes) ¾ year-season of race (three months are one season) ¾ condition of race track (fast, good, medium, heavy, muddy) ¾ distance of race (10 distance classes) ¾ driver (1, …, 1572) ¾ each individual race (1, …, 14148) 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 12 % h² = 0.095 (0.016) Uncensored racing time per km (Linear model) Placing status (Tobit-like-Threshold model) h² = 0.207 (0.025) Censored racing time per km (Tobit model) h² = 0.208 (0.024) 0.034 16.07.2012 0.067 0.100 0.133 0.166 0.199 heritability UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 0.232 0.265 0.298 0.331 13 % h² = 0.095 (0.016) Uncensored racing time per km (Linear model) Placing status (Tobit-like-Threshold model) h² = 0.207 (0.025) Censored racing time per km (Tobit model) h² = 0.208 (0.024) 0.034 16.07.2012 0.067 0.100 0.133 0.166 0.199 heritability UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 0.232 0.265 0.298 0.331 14 % h² = 0.095 (0.016) Uncensored racing time per km (Linear model) Placing status (Tobit-like-Threshold model) h² = 0.207 (0.025) Censored racing time per km (Tobit model) h² = 0.208 (0.024) 0.034 16.07.2012 0.067 0.100 0.133 0.166 0.199 heritability UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 0.232 0.265 0.298 0.331 15 (seconds/km)² Variance components estimated for racing time per km with two different models 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Tobit model Linear model additive‐genetic 16.07.2012 permanent environment residual UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 16 Boxplots for residuals estimated with a Linear Model for the trait racing time per km over different placings Plot of breeding values for the trait racing time per km estimated either with a Linear- or with a Tobit-Model Rank correlations (r) between breeding values estimated with different genetic models as well as the percentage (%) of incorrectly selected stallions r % Tobit-Model vs. Linear Model 0.89 25.5 Tobit-Model vs. Tobit-like-Threshold-Model 0.96 16.1 Tobit-like-Threshold-Model vs. Linear Model 0.86 33.8 Conclusion for trotter breed ¾ Trotters without earnings didn‘t show their real racing potential and should be regarded as censored observations. ¾ Tobit-(like-Threshold)-Models with censored race results represented good suitability for genetic evaluation. ¾ Heritability estimates for the threshold trait placing status and the censored trait racing time per km were almost identical. ¾ Also the high rank correlation between the breeding values of placing status and the breeding values of censored racing time per km showed great agreement. Thank you for your attention! Breeding goal in German thoroughbreds The goal in German thoroughbred breeding is a highly competitive and sound horse with a correct exterior. Previous developed genetic evaluation systems for German thoroughbreds ¾ Model: BLUP animal model using individual race results from all starting thoroughbreds ¾ Traits: Rank at finish or Distance to first placed horse ¾ Handicap: Consideration of carried weights in the genetic model either as fixed linear regression or creation of new performance traits independent of carried weights Objectives of the study ¾ Definition of censoring in race results of German thoroughbreds ¾ Tobit-like-Threshold-Model of racing performances ¾ Derivation of individual racing times for each starting thoroughbred using the racing time of the first placed horse and the stewards decision for the further placed horses ¾ Proof if a real Tobit Model for the new created and censored trait racing time per km is appropriate 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 23 Data for first genetic estimations ¾ Total data set consisted of 62,412 race performances from 6,244 thoroughbreds ¾ Data set involved 6,524 races with 9.6 participants in average ¾ Four generations were conisdered in pedigree (18,766 animals) Censoring in race results of German thoroughbreds and description of different used genetic models Trait Exemplary results of one race sorted by ranks at finish Uncensored Tobit-like-Threshold-Model square root of rank for placing status at finish (y) New created and censored racing time for Tobit-Model (y*) y1 y2 y3 y4 y5 y6 y7 y8 y9 y10 4 3 2 1 0 0 0 0 0 0 y1 y2 y3 y4 y5* y6* y7* y8* y9* y10* Model Linear-Model: Bayesian analysis Threshold-Model: Bayesian analysis Tobit-Model: Bayesian analysis with data augmentation Program LMMG LMMG_TH (REINSCH, 1996) (REINSCH, 1996) UNIVERSITÄT ROSTOCK | FAKULTÄT AGRARUND UMWELTWISSENSCHAFTEN 16.07.2012 LMMG_TOB (REINSCH, 2011)25 Average distances between different ranked horses between rank A and rank B Distance in body lengths 1 and 2 2,23 2 and 3 2,25 3 and 4 2,23 4 and 5 2,91 5 and 6 3,78 6 and 7 4,46 7 and 8 5,19 8 and 9 5,46 Univariate genetic-statistical model y = Xb + Z1a + Z2pe + e y: vector of observations containing either the uncensored trait square root of rank at finish or the threshold trait placing status of each thoroughbred in each individual race b: fixed effects a: random animal effect pe: random permanent environmental effect e: residual effect X, Z1, Z2: incidence matrices ¾ 1 million cycles were generated (Gibbs sampling algorithm) ¾ As burn-in period 250,000 rounds were considered 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 27 Fixed effects ¾ sex (stallion, mare, gelding) ¾ age of trotter (10 classes) ¾ year-season of race (12 seasons) ¾ distance of race (4 distance classes) ¾ trainer (1, …, 706) ¾ jockey (1, …, 686) ¾ each individual race (1, …, 6524) ¾ fixed linear regression of carried weights 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 28 h² = 0.091 (0.014) Uncensored square root of rank at finish (Linear Model) Placing status (Tobit-like-Threshold-Model) h² = 0.189 (0.027) 0.0356 0.0740 0.1124 0.1508 0.1892 heritability 0.2276 0.2660 0.3044 Objectives of the study ¾ Definition of censoring in race results of German thoroughbreds ¾ Tobit-like-Threshold-Model of racing performances ¾ Derivation of individual racing times for each starting thoroughbred using the racing time of the first placed horse and the stewards decision for the further placed horses ¾ Proof if a real Tobit Model for the new created and censored trait racing time per km is appropriate 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 30 Derivation of racing time for non-winning horses (Mota et al., 2005) ¾ Only winning horses receive a racing time at finish ¾ Racing times for the other non-winning animals in the race were calculated by multiplying the number of body lengths behind the winner per 2/10 of second ¾ The trait racing time was analysed at distances of 1000, 1100, 1200, 1300, 1400, 1500 and 1600 m ¾ Heritabilities for the trait racing time at finish varied from 0.29 to 0.05 for the different distances in races Creation of new performance traits independent of carried weights (Bugislaus et al., 2004) ¾ Carried weights ranging from 47 kg to 74.5 kg ¾ Analysed trait was „Square root of distance to first ranked horses in races“ ¾ Observed performances were biased because of different carried weights in individual flat races Creation of new performance traits independent of carried weights (Bugislaus et al., 2004) ¾ Estimation of regression coefficients within thoroughbreds ¾ Total data set consisted of 33,223 performance observations from 2,507 thoroughbreds that started at least in six flat races ¾ A univariate random regression model for the transformed distance to first rank was used for estimations of coefficients on carried weights ¾ ¾ ¾ Model included only a random animal effect and a residual effect The space variable was the carried weight A first order polynomial on carried weights for the animal effect was applied Distribution of coefficients from the carried weight on the square root of distance to first rank 350 300 Frequency 250 200 150 100 50 0 0,1825 0,1325 Regression coefficients over all horses 0,0575 Creation of new performance traits independent of carried weights (Bugislaus et al., 2004) New distance to first rank = ((10 – distance0.5) + (0.1325 * carried weight)) ¾ Estimated heritability was h² = 0.145 (0.019) First conclusion for the thoroughbred breed ¾ Thoroughbreds without earnings didn‘t also show their real racing performance potential and should be regarded as censored observations ¾ A Tobit-like-Threshold-Model was suitable for genetic evaluation ¾ A real Tobit-Model is only appropriate when using new calculated racing times for all placed thoroughbreds 16.07.2012 UNIVERSITÄT ROSTOCK | FAKULTÄT AGRAR- UND UMWELTWISSENSCHAFTEN 36