• Aucun résultat trouvé

An R-package for Finite Mixture Models

N/A
N/A
Protected

Academic year: 2021

Partager "An R-package for Finite Mixture Models"

Copied!
131
0
0

Texte intégral

(1)

An R-package for finite mixture models

Jang SCHILTZ (University of Luxembourg) joint work with

Jean-Daniel GUIGOU (University of Luxembourg),

& Bruno LOVAT (University of Lorraine)

December 7, 2013

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 1 / 38

(2)

An R-package for finite mixture models

Jang SCHILTZ (University of Luxembourg) joint work with

Jean-Daniel GUIGOU (University of Luxembourg),

& Bruno LOVAT (University of Lorraine)

December 7, 2013

(3)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 3 / 38

(4)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

(5)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 3 / 38

(6)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

(7)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 4 / 38

(8)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate a mean trajectory for each sub-population. Hence, this model can be interpreted as functional fuzzy cluster analysis.

(9)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate a mean trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 5 / 38

(10)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

sub-populations and to estimate a mean trajectory for each sub-population.

Hence, this model can be interpreted as functional fuzzy cluster analysis.

(11)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 6 / 38

(12)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

(13)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 6 / 38

(14)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

(15)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 6 / 38

(16)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

(17)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

Aim of the analysis: Find r groups of trajectories of a given kind (for instance polynomials of degree 4, P(t) = β0+ β1t + β2t2+ β3t3+ β4t4.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 6 / 38

(18)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

(19)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 7 / 38

(20)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

(21)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 7 / 38

(22)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

(23)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 7 / 38

(24)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

mixture : population composed of a mixture of unobserved groups

(25)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j . We try to estimate a set of parameters Ω =

n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Yi) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

mixture : population composed of a mixture of unobserved groups

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 7 / 38

(26)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) =

T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j . Likelihood of the estimator:

L =

N

Y

i =1

P(Yi) =

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

pj(yit). (3)

(27)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) =

T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

L =

N

Y

i =1

P(Yi) =

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

pj(yit). (3)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 8 / 38

(28)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) =

T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j . Likelihood of the estimator:

L =

N

Y

i =1

P(Yi) =

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

pj(yit). (3)

(29)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) =

T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j . Likelihood of the estimator:

L =

N

Y

i =1

P(Yi)

=

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

pj(yit). (3)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 8 / 38

(30)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) =

T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j . Likelihood of the estimator:

L =

N

Y

i =1

P(Yi) =

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

pj(yit). (3)

(31)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 9 / 38

(32)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

(33)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t.

φ: density of standard centered normal law. Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 9 / 38

(34)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

(35)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 9 / 38

(36)

The case of a normal distribution (1)

yit = β0j + βj1Ageit + β2jAgei2t + β3jAgei3t + β4jAgei4t + εit, (4) where εit ∼ N (0, σ), σ being a constant standard deviation.

Notations :

βjtit = β0j + β1jAgeit + β2jAgei2

t + β3jAgei3

t + β4jAgei4

t. φ: density of standard centered normal law.

Hence,

pj(yit) = 1

σφ yit − βjtit

σ .



(5)

(37)

The case of a normal distribution (2)

So we get

L = 1 σ

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

φ yit − βjtit σ



. (6)

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 10 / 38

(38)

The case of a normal distribution (2)

So we get

L = 1 σ

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

φ yit − βjtit σ



. (6)

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

(39)

The case of a normal distribution (2)

So we get

L = 1 σ

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

φ yit − βjtit σ



. (6)

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 10 / 38

(40)

The case of a normal distribution (2)

So we get

L = 1 σ

N

Y

i =1 r

X

j =1

πj

T

Y

t=1

φ yit − βjtit σ



. (6)

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

(41)

A computational trick

Instead, we estimate the real parameters θj such that

πj = eθj

r

X

j =1

eθj

, (7)

Finally,

L = 1 σ

N

Y

i =1 r

X

j =1

eθj

r

X

j =1

eθj

T

Y

t=1

φ yit − βjtit σ



. (8)

It is too complicated to get closed-forms equations.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 11 / 38

(42)

A computational trick

Instead, we estimate the real parameters θj such that πj = eθj

r

X

j =1

eθj

, (7)

Finally,

L = 1 σ

N

Y

i =1 r

X

j =1

eθj

r

X

j =1

eθj

T

Y

t=1

φ yit − βjtit σ



. (8)

It is too complicated to get closed-forms equations.

(43)

A computational trick

Instead, we estimate the real parameters θj such that πj = eθj

r

X

j =1

eθj

, (7)

Finally,

L = 1 σ

N

Y

i =1 r

X

j =1

eθj

r

X

j =1

eθj

T

Y

t=1

φ yit − βjtit σ



. (8)

It is too complicated to get closed-forms equations.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 11 / 38

(44)

A computational trick

Instead, we estimate the real parameters θj such that πj = eθj

r

X

j =1

eθj

, (7)

Finally,

L = 1 σ

N

Y

i =1 r

X

j =1

eθj

r

X

j =1

eθj

T

Y

t=1

φ yit − βjtit σ



. (8)

(45)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 12 / 38

(46)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(47)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 12 / 38

(48)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa). Just implements a zero-inflation Poission model.

(49)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa).

Just implements a zero-inflation Poission model.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 12 / 38

(50)

Available Software

SAS-based Proc Traj procedure

By Bobby L. Jones (Carnegie Mellon University).

Uses a quasi-Newton procedure maximum research routine.

Since the likelyhood is nor convex, nor a contraction, there are issues with local maxima.

R-package crimCV

By Jason D. Nielsen (Carleton University Ottawa).

Just implements a zero-inflation Poission model.

(51)

Future Software

R-package FMM

By Jang Schiltz & Mounir Shal (University of Luxembourg). Uses the EM Algortihm.

Allows the estimation of a generalised version of Nagin’s model, as well as Muthen’s model.

Will take us probably another year before completion.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 13 / 38

(52)

Future Software

R-package FMM

By Jang Schiltz & Mounir Shal (University of Luxembourg).

Uses the EM Algortihm.

Allows the estimation of a generalised version of Nagin’s model, as well as Muthen’s model.

Will take us probably another year before completion.

(53)

Future Software

R-package FMM

By Jang Schiltz & Mounir Shal (University of Luxembourg).

Uses the EM Algortihm.

Allows the estimation of a generalised version of Nagin’s model, as well as Muthen’s model.

Will take us probably another year before completion.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 13 / 38

(54)

Future Software

R-package FMM

By Jang Schiltz & Mounir Shal (University of Luxembourg).

Uses the EM Algortihm.

Allows the estimation of a generalised version of Nagin’s model, as well as Muthen’s model.

Will take us probably another year before completion.

(55)

Future Software

R-package FMM

By Jang Schiltz & Mounir Shal (University of Luxembourg).

Uses the EM Algortihm.

Allows the estimation of a generalised version of Nagin’s model, as well as Muthen’s model.

Will take us probably another year before completion.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 13 / 38

(56)

Model Selection (1)

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (9)

where k denotes the number of parameters in the model. Rule:

The bigger the BIC, the better the model!

(57)

Model Selection (1)

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (9)

where k denotes the number of parameters in the model.

Rule:

The bigger the BIC, the better the model!

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 14 / 38

(58)

Model Selection (1)

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (9)

where k denotes the number of parameters in the model.

Rule:

The bigger the BIC, the better the model!

(59)

Model Selection (2)

Leave-one-out Cross-Validation Apporach:

CVE = 1 N

N

X

i =1

1 T

T

X

t=1

yit − ˆyi[−i ]

t

. (10)

Rule:

The smaller the CVE, the better the model!

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 15 / 38

(60)

Model Selection (2)

Leave-one-out Cross-Validation Apporach:

CVE = 1 N

N

X

i =1

1 T

T

X

t=1

yit − ˆyi[−i ]

t

. (10)

Rule:

The smaller the CVE, the better the model!

(61)

Model Selection (2)

Leave-one-out Cross-Validation Apporach:

CVE = 1 N

N

X

i =1

1 T

T

X

t=1

yit − ˆyi[−i ]

t

. (10)

Rule:

The smaller the CVE, the better the model!

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 15 / 38

(62)

Their ”proof ” that CVE is better than BIC

(63)

Their ”proof ” that CVE is better than BIC

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 16 / 38

(64)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi). Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (11)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be strongly consistent with it.

(65)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (11)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be strongly consistent with it.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 17 / 38

(66)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (11)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be strongly consistent with it.

(67)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (11)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be strongly consistent with it.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 17 / 38

(68)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem

⇒ P(j/Yi) = P(Yi/j )ˆπj

r

X

j =1

P(Yi/j )ˆπj

. (11)

Bigger groups have on average larger probability estimates.

To be classified into a small group, an individual really needs to be strongly consistent with it.

(69)

Model Fit (1)

Diagnostic 1: Average Posterior Probability of Assignment AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification OCCj = AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj . (12)

OCCj should be greater than 5 for all groups.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 18 / 38

(70)

Model Fit (1)

Diagnostic 1: Average Posterior Probability of Assignment AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification OCCj = AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj . (12)

OCCj should be greater than 5 for all groups.

(71)

Model Fit (1)

Diagnostic 1: Average Posterior Probability of Assignment AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification OCCj = AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj . (12)

OCCj should be greater than 5 for all groups.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 18 / 38

(72)

Model Fit (1)

Diagnostic 1: Average Posterior Probability of Assignment AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification OCCj = AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj . (12)

OCCj should be greater than 5 for all groups.

(73)

Model Fit (2)

Diagonostic 3: Comparing ˆπj to the Proportion of the Sample Assigned to Group j

The ratio of the two should be close to 1.

Diagonostic 4: Confidence Intervals for Group Membership Probabilities

The confidence intervals for group membership probabilities estimates should be narrow, i.e. standard deviation of ˆπj should be small.

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 19 / 38

(74)

Model Fit (2)

Diagonostic 3: Comparing ˆπj to the Proportion of the Sample Assigned to Group j

The ratio of the two should be close to 1.

Diagonostic 4: Confidence Intervals for Group Membership Probabilities

The confidence intervals for group membership probabilities estimates should be narrow, i.e. standard deviation of ˆπj should be small.

(75)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers. Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 20 / 38

(76)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers. Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(77)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 20 / 38

(78)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(79)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 20 / 38

(80)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker)

year of birth

age in the first year of professional activity

(81)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 20 / 38

(82)

An application example

The data : first dataset Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables:

gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

age in the first year of professional activity

(83)

Result for 9 groups (dataset 1)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 21 / 38

(84)

Result for 9 groups (dataset 1)

(85)

Results for 9 groups (dataset 1)

Jang SCHILTZ (University of Luxembourg) joint work with Jean-Daniel GUIGOU (University of Luxembourg), & Bruno LOVAT (University of Lorraine) ()An R-package for finite mixture models December 7, 2013 22 / 38

(86)

Outline

1 The Basic Finite Mixture Model of Nagin

2 Generalizations of the basic model

3 Muth´en’s model

4 Research Agenda

Références

Documents relatifs

– Full-text electronic copies of all peer-reviewed journal articles and papers from published conference.. proceedings published since 1 Jan 2009 (while employed by

construction européenne et dans le cadre du projet de recherche « Pierre Werner et l’Europe ». Ø Archives radiophoniques de RTL Luxembourg et archives télévisées de

The ZIP employ two different process : a binary distribution that generate structural

Damien Sagrillo Szeged, 13 April 2019.. • Luxemburger

This model implies a bicameral de- cision-making structure in which authority is shared by a Board of Governors with overall re- sponsibility for the management of the

Jang SCHILTZ (University of Luxembourg) A generalized finite mixture model July 28, 2015 9 / 23.. An

Jang SCHILTZ Identifiability of Finite Mixture Models with underlying Normal Distribution June 4, 2020 2 /

We prove that the closed testing principle leads to a sequential testing procedure (STP) that allows for confidence statements to be made regarding the order of a finite mixture