• Aucun résultat trouvé

Analysis of the salary distribution in Luxembourg - a finite mixture approach

N/A
N/A
Protected

Academic year: 2021

Partager "Analysis of the salary distribution in Luxembourg - a finite mixture approach"

Copied!
158
0
0

Texte intégral

(1)

Analysis of the salary distribution in Luxembourg

A finite mixture model approach

Jang SCHILTZ (University of Luxembourg)

joint work with

(2)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(3)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(4)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(5)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(6)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(7)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

subpopulations and to estimate a mean trajectory for each subpopulation.

(8)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

subpopulations and to estimate a mean trajectory for each subpopulation.

(9)

General description of Nagin’s model

We have a collection of individual trajectories.

We try to divide the population into a number of homogenous

subpopulations and to estimate a mean trajectory for each subpopulation.

(10)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(11)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(12)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(13)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution

binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(14)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(15)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(16)

The Likelihood Function (1)

Consider a population of size N and a variable of interest Y .

Let Yi = yi1, yi2, ..., yiT be T measures of the variable, taken at times

t1, ...tT for subject number i .

P(Yi) denotes the probability of Yi

count data ⇒ Poisson distribution binary data ⇒ Binary logit distribution

censored data ⇒ Censored normal distribution

(17)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

(18)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

(19)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

(20)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

(21)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University) finite : sums across a finite number of groups

(22)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

Finite mixture model Daniel S. Nagin (Carnegie Mellon University)

finite : sums across a finite number of groups

(23)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

(24)

The Likelihood Function (2)

πj : probability of a given subject to belong to group number j

⇒ πj is the size of group j .

We try to estimate a set of parameters Ω = n

β0j, β1j, β2j, β3j, β4j, πj

o which allow to maximize the probability of the measured data.

Pj(Y

i) : probability of Yi if subject i belongs to group j

⇒ P(Yi) =

r

X

j =1

πjPj(Yi). (1)

(25)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) = T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

(26)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) = T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

(27)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) = T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

(28)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) = T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

(29)

The Likelihood Function (3)

Hypothesis: In a given group, conditional independence is assumed for the sequential realizations of the elements of Yi!!!

⇒ Pj(Yi) = T

Y

t=1

pj(yit), (2)

where pj(yit) denotes the probability of yit given membership in group j .

Likelihood of the estimator:

(30)

The case of a censored normal distribution (1)

Y∗: latent variable measured by Y .

yit∗ = β0j + β1jAgeit + β2jAge 2 it + β j 3Age 3 it + β j 4Age 4 it + εit, (4)

where εit ∼ N (0, σ), σ being a constant standard deviation.

Hence,

yit = Smin si yit∗ < Smin,

yit = yit∗ si Smin≤ yit∗ ≤ Smax,

yit = Smax si yit∗ > Smax,

where Smin and Smax dennote the minimum and maximum of the censored

(31)

The case of a censored normal distribution (1)

Y∗: latent variable measured by Y .

yit∗ = β0j + β1jAgeit + β2jAge 2 it + β j 3Age 3 it + β j 4Age 4 it + εit, (4)

where εit ∼ N (0, σ), σ being a constant standard deviation.

Hence,

yit = Smin si yit∗ < Smin,

yit = yit∗ si Smin≤ yit∗ ≤ Smax,

yit = Smax si yit∗ > Smax,

where Smin and Smax dennote the minimum and maximum of the censored

(32)

The case of a censored normal distribution (1)

Y∗: latent variable measured by Y .

yit∗ = β0j + β1jAgeit + β2jAge 2 it + β j 3Age 3 it + β j 4Age 4 it + εit, (4)

where εit ∼ N (0, σ), σ being a constant standard deviation.

Hence,

yit = Smin si yit∗ < Smin,

yit = yit∗ si Smin≤ yit∗ ≤ Smax,

yit = Smax si yit∗ > Smax,

where Smin and Smax dennote the minimum and maximum of the censored

(33)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(34)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(35)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(36)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(37)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(38)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(39)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(40)

The case of a censored normal distribution (2)

Notations : βjtit = β0j + β j 1Ageit + β2jAgeit2+ β j 3Ageit3+ β j 4Ageit4.

φ: density of standard centered normal law.

Φ: cumulative distribution function of standard centered normal law.

Hence, pj(yit = Smin) = Φ  Smin− βjtit σ  , (5) pj(yit) = 1 σφ  yit − βjtit σ 

pour Smin≤ yit ≤ Smax, (6)

pj(yit = Smax) = 1 − Φ

 Smax − βjtit

σ



(41)

The case of a censored normal distribution (3)

If all the measures are in the interval [Smin, Smax], we get

L = 1 σ N Y i =1 r X j =1 πj T Y t=1 φ yit − β jt it σ  . (8)

It is too complicated to get closed-forms equations

⇒ quasi-Newton procedure maximum research routine

Software:

SAS-based Proc Traj procedure

(42)

The case of a censored normal distribution (3)

If all the measures are in the interval [Smin, Smax], we get

L = 1 σ N Y i =1 r X j =1 πj T Y t=1 φ yit − β jt it σ  . (8)

It is too complicated to get closed-forms equations

⇒ quasi-Newton procedure maximum research routine

Software:

SAS-based Proc Traj procedure

(43)

The case of a censored normal distribution (3)

If all the measures are in the interval [Smin, Smax], we get

L = 1 σ N Y i =1 r X j =1 πj T Y t=1 φ yit − β jt it σ  . (8)

It is too complicated to get closed-forms equations

⇒ quasi-Newton procedure maximum research routine

Software:

SAS-based Proc Traj procedure

(44)

The case of a censored normal distribution (3)

If all the measures are in the interval [Smin, Smax], we get

L = 1 σ N Y i =1 r X j =1 πj T Y t=1 φ yit − β jt it σ  . (8)

It is too complicated to get closed-forms equations

⇒ quasi-Newton procedure maximum research routine

Software:

SAS-based Proc Traj procedure

(45)

The case of a censored normal distribution (3)

If all the measures are in the interval [Smin, Smax], we get

L = 1 σ N Y i =1 r X j =1 πj T Y t=1 φ yit − β jt it σ  . (8)

It is too complicated to get closed-forms equations

⇒ quasi-Newton procedure maximum research routine

(46)

A computational trick

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Instead, we estimate the real parameters θj such that

(47)

A computational trick

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Instead, we estimate the real parameters θj such that

(48)

A computational trick

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Instead, we estimate the real parameters θj such that

(49)

A computational trick

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Instead, we estimate the real parameters θj such that

(50)

A computational trick

The estimations of πj must be in [0, 1].

It is difficult to force this constraint in model estimation.

Instead, we estimate the real parameters θj such that

(51)

Muth´

en’s model (1)

Muth´en and Shedden (1999): Generalized growth curve model

Elegant and technically demanding extension of the uncensored normal model.

Adds random effects to the parameters βj that define a group’s mean trajectory.

Trajectories of individual group members can vary from the group trajectory.

Software:

(52)

Muth´

en’s model (1)

Muth´en and Shedden (1999): Generalized growth curve model

Elegant and technically demanding extension of the uncensored normal model.

Adds random effects to the parameters βj that define a group’s mean trajectory.

Trajectories of individual group members can vary from the group trajectory.

Software:

(53)

Muth´

en’s model (1)

Muth´en and Shedden (1999): Generalized growth curve model

Elegant and technically demanding extension of the uncensored normal model.

Adds random effects to the parameters βj that define a group’s mean trajectory.

Trajectories of individual group members can vary from the group trajectory.

Software:

(54)

Muth´

en’s model (1)

Muth´en and Shedden (1999): Generalized growth curve model

Elegant and technically demanding extension of the uncensored normal model.

Adds random effects to the parameters βj that define a group’s mean trajectory.

Trajectories of individual group members can vary from the group trajectory.

Software:

(55)

Muth´

en’s model (1)

Muth´en and Shedden (1999): Generalized growth curve model

Elegant and technically demanding extension of the uncensored normal model.

Adds random effects to the parameters βj that define a group’s mean trajectory.

Trajectories of individual group members can vary from the group trajectory.

(56)

Muth´

en’s model (2)

Advantage of GGCM

Fewer groups are required to specify a satisfactory model.

Disadvantages of GGCM

1 Difficult to extend to other types of data. 2 Group cross-over effects.

(57)

Muth´

en’s model (2)

Advantage of GGCM

Fewer groups are required to specify a satisfactory model.

Disadvantages of GGCM

1 Difficult to extend to other types of data.

2 Group cross-over effects.

(58)

Muth´

en’s model (2)

Advantage of GGCM

Fewer groups are required to specify a satisfactory model.

Disadvantages of GGCM

1 Difficult to extend to other types of data. 2 Group cross-over effects.

(59)

Muth´

en’s model (2)

Advantage of GGCM

Fewer groups are required to specify a satisfactory model.

Disadvantages of GGCM

1 Difficult to extend to other types of data. 2 Group cross-over effects.

(60)

Model Selection

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (11) where k denotes the number of parameters in the model.

Rule:

(61)

Model Selection

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (11) where k denotes the number of parameters in the model.

Rule:

(62)

Model Selection

Bayesian Information Criterion:

BIC = log(L) − 0, 5k log(N), (11) where k denotes the number of parameters in the model.

Rule:

(63)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem ⇒ P(j/Yi) = P(Yi/j )ˆπj r X j =1 P(Yi/j )ˆπj . (12)

Bigger groups have on average larger probability estimates.

(64)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem ⇒ P(j/Yi) = P(Yi/j )ˆπj r X j =1 P(Yi/j )ˆπj . (12)

Bigger groups have on average larger probability estimates.

(65)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem ⇒ P(j/Yi) = P(Yi/j )ˆπj r X j =1 P(Yi/j )ˆπj . (12)

Bigger groups have on average larger probability estimates.

(66)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem ⇒ P(j/Yi) = P(Yi/j )ˆπj r X j =1 P(Yi/j )ˆπj . (12)

Bigger groups have on average larger probability estimates.

(67)

Posterior Group-Membership Probabilities

Posterior probability of individual i ’s membership in group j : P(j /Yi).

Bayes’s theorem ⇒ P(j/Yi) = P(Yi/j )ˆπj r X j =1 P(Yi/j )ˆπj . (12)

Bigger groups have on average larger probability estimates.

(68)

Use for Model diagnostics (2)

Diagnostic 1: Average Posterior Probability of Assignment

AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification

OCCj =

AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj

. (13)

(69)

Use for Model diagnostics (2)

Diagnostic 1: Average Posterior Probability of Assignment

AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification

OCCj =

AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj

. (13)

(70)

Use for Model diagnostics (2)

Diagnostic 1: Average Posterior Probability of Assignment

AvePP should be at least 0, 7 for all groups.

Diagonostic 2: Odds of Correct Classification

OCCj =

AvePPj/1 − AvePPj

ˆ

πj/1 − ˆπj

. (13)

(71)

Use for Model diagnostics (2)

Diagonostic 3: Comparing ˆπj to the Proportion of the Sample Assigned to Group j

The ratio of the two should be close to 1.

Diagonostic 4: Confidence Intervals for Group Membership Probabilities

(72)

Use for Model diagnostics (2)

Diagonostic 3: Comparing ˆπj to the Proportion of the Sample Assigned to Group j

The ratio of the two should be close to 1.

Diagonostic 4: Confidence Intervals for Group Membership Probabilities

(73)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(74)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(75)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(76)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(77)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(78)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker)

year of birth

(79)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(80)

The data : first dataset

Salaries of workers in the private sector in Luxembourg from 1940 to 2006.

About 7 million salary lines corresponding to 718.054 workers.

Some sociological variables: gender (male, female)

nationality and residentship (luxemburgish residents, foreign residents, foreign non residents)

working status (white collar worker, blue collar worker) year of birth

(81)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(82)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(83)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female)

nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(84)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship

sector of activity year of birth

age in the first year of professional activity marital status

(85)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(86)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(87)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity

marital status

(88)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

age in the first year of professional activity marital status

(89)

The data : second dataset

Salaries of all workers in Luxembourg which began to work in Luxembourg between 1980 and 1990 at an age less than 30 years.

1.303.010 salary lines corresponding to 85.049 workers.

Some sociological variables: gender (male, female) nationality and residentship sector of activity

year of birth

(90)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(91)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker

selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(92)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(93)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years

selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(94)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(95)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

(96)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

the number of months problem!

elimination of all the workers who had monthly salaries above 15.000

(97)

Dataset transformations

Mathematica programming

1 row per year → 1 row per worker selection of the period we are interested in

taking out the years without work up to a maximum of five years selection of the people who worked at least during 20 years

Transformations in SPSS

(98)

Proc Traj procedure

Selection of the time period for macroeconomic reasons

(Crisis in the steel industry and emergence of the financial market place of Luxembourg) 20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(99)

Proc Traj procedure

Selection of the time period for macroeconomic reasons (Crisis in the steel industry and emergence of the financial market place of Luxembourg)

20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(100)

Proc Traj procedure

Selection of the time period for macroeconomic reasons (Crisis in the steel industry and emergence of the financial market place of Luxembourg) 20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(101)

Proc Traj procedure

Selection of the time period for macroeconomic reasons (Crisis in the steel industry and emergence of the financial market place of Luxembourg) 20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro:

DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(102)

Proc Traj procedure

Selection of the time period for macroeconomic reasons (Crisis in the steel industry and emergence of the financial market place of Luxembourg) 20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(103)

Proc Traj procedure

Selection of the time period for macroeconomic reasons (Crisis in the steel industry and emergence of the financial market place of Luxembourg) 20 years of work for workers beginning their carrier between 1982 and 1987

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20; CARDS;

data RUN;

(104)
(105)
(106)
(107)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(108)

Result for 3 groups (dataset 2):

(109)

Result for 3 groups (dataset 2):

(110)

Result for 3 groups (dataset 2):

(111)

Result for 3 groups (dataset 2):

(112)

Result for 3 groups (dataset 2):

(113)

Result for 3 groups (dataset 2):

(114)

The statistical shape analysis approach

Comparing the geometrical figure of the trajectories −→ statistical shape analyis:

Compute the mean shape of the different results.

(115)

The statistical shape analysis approach

Comparing the geometrical figure of the trajectories

−→ statistical shape analyis:

Compute the mean shape of the different results.

(116)

The statistical shape analysis approach

Comparing the geometrical figure of the trajectories −→ statistical shape analyis:

Compute the mean shape of the different results.

(117)

The statistical shape analysis approach

Comparing the geometrical figure of the trajectories −→ statistical shape analyis:

Compute the mean shape of the different results.

(118)

The statistical shape analysis approach

Comparing the geometrical figure of the trajectories −→ statistical shape analyis:

Compute the mean shape of the different results.

(119)

The statistical shape analysis approach

Are these set of trajectories different?

(120)

The statistical shape analysis approach

Are these set of trajectories different?

(121)

The statistical shape analysis approach

(122)

The statistical shape analysis approach

Are these set of trajectories different?

(123)

The statistical shape analysis approach

Are these set of trajectories different?

(124)

The statistical shape analysis approach

Are these set of trajectories different?

Shape Analysis says yes,

(125)

The statistical shape analysis approach

(126)

The classical statistics approach

Compare the estimated parameters:

Performing the Wald test to see if the parameters differ between two models.

(127)

The classical statistics approach

Compare the estimated parameters:

Performing the Wald test to see if the parameters differ between two models.

(128)

The classical statistics approach

Compare the estimated parameters:

Performing the Wald test to see if the parameters differ between two models.

(129)

The classical statistics approach

Compare the estimated parameters:

Performing the Wald test to see if the parameters differ between two models.

(130)

Functional Data Analysis Approach

Compare the set of trajectories as functions:

(131)

Functional Data Analysis Approach

Compare the set of trajectories as functions:

(132)

Functional Data Analysis Approach

Compare the set of trajectories as functions:

(133)

Outline

1 Nagin’s Finite Mixture Model

2 The Luxemburgish salary trajectories

3 Stability of the results

(134)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership

(measured before t1).

Multinomial logit model:

πj(xi) = exiθj r X k=1 exiθk , (14)

where θj denotes the effect of xi on the probability of group membership.

(135)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership

(measured before t1).

Multinomial logit model:

πj(xi) = exiθj r X k=1 exiθk , (14)

where θj denotes the effect of xi on the probability of group membership.

(136)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership

(measured before t1).

Multinomial logit model:

πj(xi) = exiθj r X k=1 exiθk , (14)

where θj denotes the effect of xi on the probability of group membership.

(137)

Predictors of trajectory group membership

xi : vector of variables potentially associated with group membership

(measured before t1).

Multinomial logit model:

πj(xi) = exiθj r X k=1 exiθk , (14)

where θj denotes the effect of xi on the probability of group membership.

(138)

Group membership probabilities

The Wald test which indicates whether any number of ocefficients is significally different, allows the statistical testing of the predictors.

(139)

Group membership probabilities

The Wald test which indicates whether any number of ocefficients is significally different, allows the statistical testing of the predictors.

(140)

Group membership probabilities

The Wald test which indicates whether any number of ocefficients is significally different, allows the statistical testing of the predictors.

(141)

Group membership probabilities: macro

Proc Traj Macro:

DATA TEST;

INPUT ID O1-O20 T1-T20 NATIO SEXE; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(142)

Group membership probabilities: macro

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20 NATIO SEXE; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(143)

Group membership probabilities: macro

Proc Traj Macro: DATA TEST;

INPUT ID O1-O20 T1-T20 NATIO SEXE; CARDS;

data RUN;

PROC TRAJ DATA=TEST OUTPLOT=OP OUTSTAT=OS OUT=OF OUTEST=OE ITDETAIL;

ID ID; VAR O1-O20; INDEP T1-T20;

(144)
(145)
(146)
(147)
(148)

Adding covariates to the trajectories (2)

Y∗: latent variable measured by Y .

yit∗ = β0j + β1jt + β2jt2+ β3jt3+ β4jt4+ αj1z1t+ ... + αjLzLt+ εit, (16)

where εit ∼ N (0, σ), σ being a constant standard deviation and zlt are

covariates that may depend or not upon time t.

(149)

Adding covariates to the trajectories (2)

Y∗: latent variable measured by Y .

yit∗ = β0j + β1jt + β2jt2+ β3jt3+ β4jt4+ αj1z1t+ ... + αjLzLt+ εit, (16)

where εit ∼ N (0, σ), σ being a constant standard deviation and zlt are

covariates that may depend or not upon time t.

(150)

Adding covariates to the trajectories (2)

Y∗: latent variable measured by Y .

yit∗ = β0j + βj1t + β2jt2+ βj3t3+ β4jt4+ αj1z1t+ ... + αjLzLt+ εit, (16)

where εit ∼ N (0, σ), σ being a constant standard deviation and zlt are

covariates that may depend or not upon time t.

(151)

Adding covariates to the trajectories (2)

Y∗: latent variable measured by Y .

yit∗ = β0j + βj1t + β2jt2+ βj3t3+ β4jt4+ αj1z1t+ ... + αjLzLt+ εit, (16)

where εit ∼ N (0, σ), σ being a constant standard deviation and zlt are

covariates that may depend or not upon time t.

(152)
(153)
(154)

Functional PLS regression with functional response

Response Y = {Yt}t∈TY and predictor X = {Xt}t∈TX of functional type.

The Escoufier operators associated to X and Y are defined by

WXZ = Z TX E(XtZ )Xtdt, ∀ r.v Z (17) and WYZ = Z TY E(YtZ )Ytdt, ∀ r.v Z . (18) Theorem (C.Preda, J.S., 2011)

At each step of the PLS regression, the PLS components th, h > 1 are

eigenvectors of the product of the two Esoufier operators i.e.

(155)

Functional PLS regression with functional response

Response Y = {Yt}t∈TY and predictor X = {Xt}t∈TX of functional type.

The Escoufier operators associated to X and Y are defined by

WXZ = Z TX E(XtZ )Xtdt, ∀ r.v Z (17) and WYZ = Z TY E(YtZ )Ytdt, ∀ r.v Z . (18) Theorem (C.Preda, J.S., 2011)

At each step of the PLS regression, the PLS components th, h > 1 are

eigenvectors of the product of the two Esoufier operators i.e.

(156)

Functional PLS regression with functional response

Response Y = {Yt}t∈TY and predictor X = {Xt}t∈TX of functional type.

The Escoufier operators associated to X and Y are defined by

WXZ = Z TX E(XtZ )Xtdt, ∀ r.v Z (17) and WYZ = Z TY E(YtZ )Ytdt, ∀ r.v Z . (18) Theorem (C.Preda, J.S., 2011)

At each step of the PLS regression, the PLS components th, h > 1 are

eigenvectors of the product of the two Esoufier operators i.e.

(157)

Functional PLS regression with functional response

Response Y = {Yt}t∈TY and predictor X = {Xt}t∈TX of functional type.

The Escoufier operators associated to X and Y are defined by

WXZ = Z TX E(XtZ )Xtdt, ∀ r.v Z (17) and WYZ = Z TY E(YtZ )Ytdt, ∀ r.v Z . (18) Theorem (C.Preda, J.S., 2011)

(158)

Bibliography

Nagin, D.S. 2005: Group-based Modeling of Development. Cambridge, MA.: Harvard University Press.

Jones, B. and Nagin D.S. 2007: Advances in Group-based Trajectory Modeling and a SAS Procedure for Estimating Them. Sociological Research and Methods, 35 p.542-571.

Guigou, J.D, Lovat, B. and Schiltz, J. 2012: Analysis of the salary trajectories in Luxembourg : a finite mixture model approach. To appear.

Giebel S. 2011: Zur Anwendung der statistischen Formanalyse. Phd Thesis, University of Luxembourg.

Références

Documents relatifs

There are actually three different types of dynamics: Groups 2, 5 and 8 show “flat” careers, meaning that people from these groups have almost no increase in their salary after

This is still an inter-individual model, but unlike other classical models such as standard growth curve models, it allows the existence of subpolulations with completely

Jang SCHILTZ (University of Luxembourg, LSF) joint work with Analysis of the salary trajectories in Luxembourg Jean-Daniel GUIGOU (University of Luxembourg, LSF) &amp; Bruno

Effets civils (art. Mais seule l’incorporation parfaite p rod uit complète assim i­ lation.. L’incorporation parfaite déploie son plein effet : l’incor­ poré est

The success of interval analysis as a tool to compute guaranteed set estimates in the context of bounded-error estimation seems to have somewhat obscured the fact that the very

In the Falck case, the far-sighted family champion of change Alberto Falck—with crucial support of the external CEO Achille Colombo—was able to de-escalate the family business

In this paper we have proven the local convergence of the continuous time model associ- ated to step-size adaptive ESs towards local minima on monotonic C 2 -composite func- tions1.

Eh-pH diagram of both the monocrystalline antimony electrode (Sb(s)) and antimony-based screen-printed electrodes (Sb(SPEs)) in ammonia pH buffer solutions, in the absence or