• Aucun résultat trouvé

2/7/2013 1

N/A
N/A
Protected

Academic year: 2021

Partager "2/7/2013 1"

Copied!
4
0
0

Texte intégral

(1)

2/7/2013

1

Factors controlling accuracy of genomic

selection in oil palm (Elaeis guineensis)

PAG XXI – January 13, 2013 Palm genomics and genetics Workshop

Cros D, Denis M, Sanchez L,

Cochard B, Edyana-Suryana H, Omoré A, Durand-Gasselin T & Bouvet JM

 allogamous

(monoecious with male and female cycles)

male female

 vegetative multiplication difficult

The oil palm

 one cultivated species,

Elaeis guineensis

 world major oil crop

Selection criteria:

Average bunch weight (ABW), Bunch number (BN)

Fruits to bunch ratio (F/B), Pulp to fruits ratio (P/F),

Oil to pulp ratio (O/P)

Bunch production (FFB)

Height increment

(INC)

Oil extraction rate (OER)

Breeding populations:

Distant populations with narrow genetic bases

 Deli

 La Mé

Method of MAS (Meuwissen et al 2001): • Training population phenotyped and genotyped • Dense genotyping of the whole genome • All markers effects estimated simultaneously • No test of significance of marker effects

• Selection on markers alone (GEBV) in test population

Genomic selection

“Selection on genetic values predicted from markers could substantially increase the rate of genetic gain in animals and plants”

Deli

La Mé

Cycle 3

Cycle 2

markers

Cycle 4

Cycle …

markers

markers

markers

18 years

(2)

2/7/2013

2

Could we increase the rate of genetic gain in

oil palm breeding with genomic selection ?

Hypothesis: Progeny-tested individuals could be

used to train a GS model that could be applied to

predict breeding values of individuals of the same

populations

With:

• Narrow genetic base / Low effective size

• Small training populations

• Small number of markers

• Multiallelic markers

Hypothesis: Progeny-tested individuals could be

used to train a GS model that could be applied to

predict breeding values of individuals of the same

populations

 Will be checked by measuring the accuracy of

GS in a cross-validation study with real data

Genetic gain per year = Intensity * Accuracy * σa Generation interval

With accuracy = r(TBV, EBV) and current accuracy ~ 0.8

Materials and methods

Plant material:

Deli

: 131 individuals

La Mé

: 93 individuals

Materials and methods

Molecular data:

235 SSR

(Billotte et al 2005; Tranbarger et al 2011)

~ 1 SSR / 7.4 cM

Phenotypic data:

1.

Progeny tests, 10 quantitative traits

2.

Estimated breeding values (BLUP)

3.

Deregressed and used in a weighted

analysis to derive genomic estimated

breeding values

(Garrick et al 2009)

5-fold cross-validation:

1/ Individuals (genotyped and phenotyped) divided into 5 groups to make training population (4 groups) and test population (5th group)

 5 replicates

2/ Estimation of allelic effects

3/ Calculation of GEBV for test individuals

4/ Calculation of accuracy of genomic selection in test population

(3)

2/7/2013

3

Materials and methods

Definition of groups for training and test populations:

2 methods, in order to get a range of accuracy of GS: 1. Lower bound: CLUSTERING (Saatchi et al 2011)

• Calculate matrix of additive genetic relationships between individuals, • Use K-means clustering to make 5 groups

 Increases within-group relationships / Decreases

between-group relationships (groups represent subpopulations) 2. Upper bound: ACROSS FAMILIES

• Each family is randomly divided into 5 groups

 Maximizes relationships between training and test

populations Progeny 1 2 3 4 10

ACROSS FAMILIES K-MEANS CLUSTERING G1 G1 G2 G3 G4 G5 G2 G5 … …

Materials and methods

Definition of groups for training and test populations:

Materials and methods

Statistical methods to estimate GEBV:

Group of method Method Marker effects Comments Reference

Mixed Model ABLUP no Control Henderson 1975

Mixed Model BLUP yes gi ~ N(0, Vm) Meuwissen et al 2001

Mixed Model GBLUP no Estimate GEBV Henderson 1975,

Eding and Meuwissen 2001

Bayes BRR yes gi ~ N(0, σ²BR) Perez et al 2010

Bayes BL yes gi ~ N(0, τi * σ²e) Perez et al 2010

Semi-parametric RKHS no Estimate GEBV Gianola et al 2006,

Heslot et al 2012 2 GEBV 2 a GEBV DEBV, T BV GEBV, σˆ σ σˆ r Accuracy: (Saatchi et al 2011)

 Some methods better suited for traits with many small effect genes, others for traits with major genes + small effect genes

 Factors with the strongest effect on accuracy:

(1) TRAINING MODE,

(2) TRAIT, POPULATION

(3) TRAIT * POPULATION INTERACTION

Results

 No effects of statistical method, no statistical

method * trait interaction

0.0 0.2 0.4 0.6 0.8 O/P FFB Fruit wgt

INC K/F ABW BN OER P/F F/B

A c c u ra c y Trait CLUSTER DELI FAMILY DELI 0.0 0.2 0.4 0.6 0.8 ABW Fruit wgt

INC FFB BN F/B K/F P/F O/P OER

A c c u ra c y Trait CLUSTER LM FAMILY LM

Results

-47% -44%

… Effect of training mode related to amax:

family  cluster, -22% 1.1 0.86

family  cluster, -18% 0.93 0.77 Effect of training mode on accuracy:

 Range of accuracy for GS

Results

Effect of trait on accuracy:

Accuracy varies with a factor 3 according to trait (very low to very high)

Due to genetic architecture of each trait ?

(number of QTLs, distribution of QTL effects, distribution of QTL along genome versus distribution of SSRs, LD between markers and QTLs)

0 0.1 0.2 0.3 0.4 0.5 0.6 O/P FFB Fruit wgt

INC K/F ABW BN OER P/F F/B

A cc u ra cy Traits Deli a e d d cd c c c c bc 0 0.1 0.2 0.3 0.4 0.5 0.6 ABW Fruit wgt

INC FFB BN F/B K/F P/F O/P OER

A cc u ra cy Traits La Mé a f ef def cde cde cd bc ab ab

(4)

2/7/2013

4

Results

Effect of trait * population interaction on accuracy:

LM DELI 0,7 0,6 0,5 0,4 0,3 0,2 0,1 Population M e a n a c c u ra c y P/F ABW BN F/B FFB Fruit wgt INC K/F O/P OER Trait Diagram of interactions 0 50 100 150 200 250 300 350 400 0 50 100 150 200 250 300 R at io o f ac cu ra cy i n L a M é t o ac cu ra cy i n D e li (% )

Ratio of phenotypic variance in La Mé to phenotypic variance in Deli (%)

Due to differences in genetic architecture between populations and traits ? Related to differences in phenotypic variance

Results

Example: La Mé, training population « family »

Effect of statistical method on accuracy:

No effect, no interaction with trait

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 A B LU P G B LU P B LU P_M B R R _M B L_ M R K HS _M A cc u ra cy Model ABW BN F/B FFB Fruit wgt INC K/F O/P OER P/F Trait:

… contradictory with trait effect and trait * population interaction  Too small number of phenotypic records ?

Conclusions

Some traits with very low accuracy

 bigger training populations / more markers

More studies required before implementing GS in our oil palm breeding program…

- Effect of increasing training population size ? - Rate of decrease of accuracy over generations ? - Accuracy between experimental designs ? - Genetic architecture of traits in each population ? …

Some answers in 2013 (simulations)

and 2014 (more real data: 2 experimental designs, 2 generations + GBS genotyping)

Références

Documents relatifs

The total population of the 15 European Union coun- tries is estimated to have risen by 0.4% in 2001, reach- ing 379 million on 1st January 2002 according to the provisional

This year, instead of the traditional Gross National Income (GNI) per capita (former- ly called Gross National Product, or GNP, by the World Bank), the large central Data Sheet

Using the focal point approach detailed in the methods, 16,330 SNPs were identified from the re-sequencing of 14 genotypes of the discovery panel, as well as 3,670 validated

almost identical to that of the ADD-POE model when using the simulated data, which indicated that when dense SNPs are used for prediction, modeling a relatively “rare” effect

peaks of Ne decline were recorded around eight and 13 generations ago in 24 (excluded Dagestan Mountain) and 22 (excluded Karakul, Dagestan Mountain and Soviet Merino)

In conclusion, despite some early indications that resistance to MLs may be due to specific polymorphisms in the drug target receptors, there are not yet any mutations identified that

11,16 Considering the importance of the role of Metrnl in reducing insulin resistance and adipose tissue, as well as its relation to cardiovascular risk factors such as LDL, HDL,

When the training population was made up of the first cycle progeny tests and the candidate populations were the progeny tests of three successive cycles, GS accuracy decreased