• Aucun résultat trouvé

A Finite Mixture Model with Trajectories Depending on Covariates

N/A
N/A
Protected

Academic year: 2021

Partager "A Finite Mixture Model with Trajectories Depending on Covariates"

Copied!
72
0
0

Texte intégral

(1)

A Finite Mixture Model with Trajectories Depending on Covariates

Jang SCHILTZ (University of Luxembourg) joint work with

Jean-Daniel GUIGOU (University of Luxembourg),

& Bruno LOVAT (University of Lorraine)

June 11, 2014

(2)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(3)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(4)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(5)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(6)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis. Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

mixture : population composed of a mixture of unobserved groups finite : sums across a finite number of groups

(7)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis. Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

mixture : population composed of a mixture of unobserved groups finite : sums across a finite number of groups

(8)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) mixture : population composed of a mixture of unobserved groups finite : sums across a finite number of groups

(9)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

mixture : population composed of a mixture of unobserved groups finite : sums across a finite number of groups

(10)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) mixture : population composed of a mixture of unobserved groups

finite : sums across a finite number of groups

(11)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate, at the same time, a typical trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) mixture : population composed of a mixture of unobserved groups finite : sums across a finite number of groups

(12)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

where Pj(Yi) is probability of Yi if subject i belongs to group j .

(13)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

where Pj(Yi) is probability of Yi if subject i belongs to group j .

(14)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

where Pj(Yi) is probability of Yi if subject i belongs to group j .

(15)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

where Pj(Yi) is probability of Yi if subject i belongs to group j .

(16)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

where Pj(Yi) is probability of Yi if subject i belongs to group j .

(17)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

⇒ P(Yi) =

r

X

j =1

πjPj(Yi), (1)

j

(18)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(19)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4. We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(20)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4. We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(21)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4. We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(22)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4. We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(23)

The Likelihood Function (2)

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4. We try to estimate a set of parameters Ω =n

β0j, β1j, β2j, β3j, β4j, πjo which allow to maximize the probability of the measured data.

Possible data distributions:

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(24)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(25)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t.

φ: density of standard centered normal law. Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(26)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(27)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(28)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(29)

The case of a normal distribution (1)

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Then,

L = 1 σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ yit − βjtit

σ



. (2)

It is too complicated to get closed-forms equations.

(30)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(31)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(32)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(33)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(34)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa).

Just implements a zero-inflation Poission model.

(35)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa).

Just implements a zero-inflation Poission model.

(36)

Model Selection (1)

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (3)

where k denotes the number of parameters in the model.

Rule:

The bigger the BIC, the better the model!

(37)

Model Selection (2)

Leave-one-out Cross-Validation Apporach:

CVE = 1 N

N

X

i =1

1 T

T

X

t=1

yit − ˆyi[−i ]

t

. (4)

Rule:

The smaller the CVE, the better the model!

(38)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (5)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be

(39)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers. Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(40)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers. Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(41)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(42)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(43)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(44)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker)

year of birth

age in the first year of professional activity

(45)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(46)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(47)

Result for 9 groups (dataset 1)

(48)

Result for 9 groups (dataset 1)

(49)

Results for 9 groups (dataset 1)

(50)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(51)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership (measured before t1).

Multinomial logit model:

πj(xi) = exiθj

r

X

k=1

exiθk

, (6)

where θj denotes the effect of xi on the probability of group membership.

L = 1 σ

N

Y

i =1 r

X

j =1

exiθj

r

X

k=1

exiθk

T

Y

t=1

φ yit − βjtit σ



. (7)

(52)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership (measured before t1).

Multinomial logit model:

πj(xi) = exiθj

r

X

k=1

exiθk

, (6)

where θj denotes the effect of xi on the probability of group membership.

L = 1 σ

N

Y

i =1 r

X

j =1

exiθj

r

X

k=1

exiθk

T

Y

t=1

φ yit − βjtit σ



. (7)

(53)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership (measured before t1).

Multinomial logit model:

πj(xi) = exiθj

r

X

k=1

exiθk

, (6)

where θj denotes the effect of xi on the probability of group membership.

L = 1 σ

N

Y

i =1 r

X

j =1

exiθj

r

X

k=1

exiθk

T

Y

t=1

φ yit − βjtit σ



. (7)

(54)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership (measured before t1).

Multinomial logit model:

πj(xi) = exiθj

r

X

k=1

exiθk

, (6)

where θj denotes the effect of xi on the probability of group membership.

L = 1 σ

N

Y

r

X exiθj

r

T

Yφ yit − βjtit σ



. (7)

(55)

Adding covariates to the trajectories (1)

Let z1...zL be covariates potentially influencing Y . We are then looking for trajectories

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (8) where εit ∼ N (0, σ), σ being a constant standard deviation and zl are covariates that may depend or not upon time t.

Unfortunately the influence of the covariates in this model is limited to the intercept of the trajectory.

(56)

Adding covariates to the trajectories (1)

Let z1...zL be covariates potentially influencing Y .

We are then looking for trajectories

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (8) where εit ∼ N (0, σ), σ being a constant standard deviation and zl are covariates that may depend or not upon time t.

Unfortunately the influence of the covariates in this model is limited to the intercept of the trajectory.

(57)

Adding covariates to the trajectories (1)

Let z1...zL be covariates potentially influencing Y . We are then looking for trajectories

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (8) where εit ∼ N (0, σ), σ being a constant standard deviation and zl are covariates that may depend or not upon time t.

Unfortunately the influence of the covariates in this model is limited to the intercept of the trajectory.

(58)

Adding covariates to the trajectories (1)

Let z1...zL be covariates potentially influencing Y . We are then looking for trajectories

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (8) where εit ∼ N (0, σ), σ being a constant standard deviation and zl are covariates that may depend or not upon time t.

Unfortunately the influence of the covariates in this model is limited to the intercept of the trajectory.

(59)

Adding covariates to the trajectories (2)

(60)

Adding covariates to the trajectories (2)

(61)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Our model

(62)

Our model

Let x1...xL and zi1, ..., ziT be covariates potentially influencing Y . We propose the following model:

yit = β0j +

L

X

l =1

αj0lxl+ γ0jzit

!

+ β1j +

L

X

l =1

αj1lxl+ γ1jzit

! Ageit

+ β2j +

L

X

l =1

αj2lxl + γ2jzit

!

Agei2t + β3j +

L

X

l =1

αj3lxl+ γ3jzit

! Agei3t

+ β4j +

L

X

l =1

αj4lxl + γ4jzit

!

Agei4t + εit,

where εit ∼ N (0, σ), σ being a constant standard deviation.

(63)

Our model

Let x1...xL and zi1, ..., ziT be covariates potentially influencing Y .

We propose the following model:

yit = β0j +

L

X

l =1

αj0lxl+ γ0jzit

!

+ β1j +

L

X

l =1

αj1lxl+ γ1jzit

! Ageit

+ β2j +

L

X

l =1

αj2lxl + γ2jzit

!

Agei2t + β3j +

L

X

l =1

αj3lxl+ γ3jzit

! Agei3t

+ β4j +

L

X

l =1

αj4lxl + γ4jzit

!

Agei4t + εit,

where εit ∼ N (0, σ), σ being a constant standard deviation.

(64)

Our model

Let x1...xL and zi1, ..., ziT be covariates potentially influencing Y . We propose the following model:

yit = β0j +

L

X

l =1

αj0lxl+ γ0jzit

!

+ β1j +

L

X

l =1

αj1lxl+ γ1jzit

! Ageit

+ β2j +

L

X

l =1

αj2lxl + γ2jzit

!

Agei2t + β3j +

L

X

l =1

αj3lxl+ γ3jzit

! Agei3t

+ β4j +

L

X

l =1

αj4lxl + γ4jzit

!

Agei4t + εit,

(65)

Men versus women

(66)

Statistical Properties

Both Nagin’s and our model can be written as L = 1

σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ observed data - modelled data σ



. (9)

Nagin’s model:

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (10) Our model:

yit = β0j +

L

X

l =1

αj0lxl+ γ0jzit

!

+ β1j +

L

X

l =1

αj1lxl+ γ1jzit

! Ageit

+ β2j +

L

X

l =1

αj2lxl + γ2jzit

!

Agei2t + β3j +

L

X

l =1

αj3lxl+ γ3jzit

! Agei3t

+ β4j +

L

X

l =1

αj4lxl + γ4jzit

!

Agei4t + εit,

(67)

Statistical Properties

Both Nagin’s and our model can be written as L = 1

σ

N

Y

i =1 r

X

j =1

πj T

Y

t=1

φ observed data - modelled data σ



. (9)

Nagin’s model:

yit = β0jj1Ageit2jAgei2t3jAgei3t4jAgei4tj1z1+...+αjLzLit, (10) Our model:

yit = β0j +

L

X

l =1

αj0lxl+ γ0jzit

!

+ β1j +

L

X

l =1

αj1lxl+ γ1jzit

! Ageit

+ β2j +

L

X

l =1

αj2lxl + γ2jzit

!

Agei2t + β3j +

L

X

l =1

αj3lxl+ γ3jzit

! Agei3t

(68)

Parameter estimation

A way of estimating our model with the existing software (if group membership does not depend on the covariates) :

Use the latest version of proc.traj to test if the covariates have indeed an influence on the trajectories.

Apply proc.traj to the data without covariates do the clustering and obtain the number of groups and the constitution of the groups. Use your favorite regression model software to get the trajectories separately for each group.

(69)

Parameter estimation

A way of estimating our model with the existing software (if group membership does not depend on the covariates) :

Use the latest version of proc.traj to test if the covariates have indeed an influence on the trajectories.

Apply proc.traj to the data without covariates do the clustering and obtain the number of groups and the constitution of the groups. Use your favorite regression model software to get the trajectories separately for each group.

(70)

Parameter estimation

A way of estimating our model with the existing software (if group membership does not depend on the covariates) :

Use the latest version of proc.traj to test if the covariates have indeed an influence on the trajectories.

Apply proc.traj to the data without covariates do the clustering and obtain the number of groups and the constitution of the groups.

Use your favorite regression model software to get the trajectories separately for each group.

(71)

Parameter estimation

A way of estimating our model with the existing software (if group membership does not depend on the covariates) :

Use the latest version of proc.traj to test if the covariates have indeed an influence on the trajectories.

Apply proc.traj to the data without covariates do the clustering and obtain the number of groups and the constitution of the groups.

Use your favorite regression model software to get the trajectories separately for each group.

(72)

Bibliography

Nagin, D.S. 2005: Group-based Modeling of Development.

Cambridge, MA.: Harvard University Press.

Jones, B. and Nagin D.S. 2007: Advances in Group-based Trajectory Modeling and a SAS Procedure for Estimating Them. Sociological Research and Methods 35 p.542-571.

Nielsen, J.D. et al. 2012: Group-Based criminal trajectory analysis using cross-validation criteria.

http://www.probability.ca/jeff/research.html.

Muth´en, B., Shedden, K. 1999: Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm. Biometrics 55 p.463-469.

Guigou, J.D, Lovat, B. and Schiltz, J. 2012: Optimal mix of funded and unfunded pension systems: the case of Luxembourg. Pensions 17-4 p. 208-222.

Références

Documents relatifs

Expanding access to family planning information, products and services requires a more expansive approach to how the global health community engages with the corporate

 Christina Klein, The National Socialist Cultural Policy in Luxembourg 1940 - 1944, BA thesis, University of Düsseldorf 2011...

2012: Analysis of the salary trajectories in Luxembourg : a finite mixture model approach. 2011: Zur Anwendung der

gender (male, female) nationality and residentship sector of activity. year

The second contribution of this paper is the estimation of a semiparametric single index regression model for the censored response Y given X under assumption (A0).. The

Trajectories from public sector of research to private sector : an analysis using French data on young PhD graduates Pierre Béret ♠ , Jean-François Giret ♣ , Isabelle

This process is based on the raw data from these sensors which help identify how, why and when mobile objects are moving, in order to add semantic information to the trajectories

These include the Chinese Academy of Forestry, the Centre for Chinese Agricultural Policy, the China National Forestry Economics and Development Research Centre, Beijing