• Aucun résultat trouvé

Mining a medieval social network by kernel SOM and related methods

N/A
N/A
Protected

Academic year: 2022

Partager "Mining a medieval social network by kernel SOM and related methods"

Copied!
53
0
0

Texte intégral

(1)

Mining a medieval social network by kernel SOM and related methods

Nathalie Villa-Vialaneix http://www.nathalievilla.org

Institut de Mathématiques de Toulouse (Univ. Toulouse)

& IUT de Carcassonne (Univ. Perpignan VD) France

Joint work withFabrice Rossi, INRIA, Rocquencourt, France andQuoc-Dinh Truong, IRIT, Toulouse, France

Journées MASHS, June 5th, 2008

(2)

Table of contents

1 Introduction

2 Graph drawing

3 Clustering the vertices of a graph

4 “All-in-one” method: Self-Organizing Maps for graphs

(3)

A multidisciplinary project: “Graph-Comp”

Laboratoire d’histoire Institut deMathématiquesde Toulouse

(Univ. Le Mirail & CNRS) (Univ. de Toulouse & CNRS)

Institut de Recherche enInformatique Laboratoire d’Informatique

de Toulouse de Nantes Atlantique

(Univ. de Toulouse & CNRS) (Univ. de Nantes & CNRS)

Florent Hautefeuille Bertrand Jouve Romain Boulet

FRAMESPA (Univ. Le Mirail) IMT (Univ. Le Mirail) IMT (Univ. Le Mirail)

Pascale Kuntz Fabien Picarougne Bleuenn Le Goffic

LINA (Univ. Nantes) LINA (Univ. Nantes) LINA (Univ. Nantes)

Taoufiq Dkaki Quoc-Dinh Truong Fabrice Rossi

IRIT (Univ. Le Mirail) IRIT (Univ. Le Mirail) INRIA Rocquencourt

(4)

A multidisciplinary project: “Graph-Comp”

Florent Hautefeuille Bertrand Jouve Romain Boulet

FRAMESPA (Univ. Le Mirail) IMT (Univ. Le Mirail) IMT (Univ. Le Mirail)

Pascale Kuntz Fabien Picarougne Bleuenn Le Goffic

LINA (Univ. Nantes) LINA (Univ. Nantes) LINA (Univ. Nantes)

Taoufiq Dkaki Quoc-Dinh Truong Fabrice Rossi

IRIT (Univ. Le Mirail) IRIT (Univ. Le Mirail) INRIA Rocquencourt

Florent Hautefeuille Bertrand Jouve Romain Boulet

FRAMESPA (Univ. Le Mirail) IMT (Univ. Le Mirail) IMT (Univ. Le Mirail)

Pascale Kuntz Fabien Picarougne Bleuenn Le Goffic

LINA (Univ. Nantes) LINA (Univ. Nantes) LINA (Univ. Nantes)

Taoufiq Dkaki Quoc-Dinh Truong Fabrice Rossi

IRIT (Univ. Le Mirail) IRIT (Univ. Le Mirail) INRIA Rocquencourt

(5)

A multidisciplinary project: “Graph-Comp”

Florent Hautefeuille Bertrand Jouve Romain Boulet

FRAMESPA (Univ. Le Mirail) IMT (Univ. Le Mirail) IMT (Univ. Le Mirail)

Pascale Kuntz Fabien Picarougne Bleuenn Le Goffic

LINA (Univ. Nantes) LINA (Univ. Nantes) LINA (Univ. Nantes)

(6)

Study of a medieval corpus

Work already presented in MASHS 2007 : [Boulet et al., 2007]

A huge corpus of medieval documents

In the archives of Cahors (Lot), corpus of5000 agrarian contracts. These contracts

come fromfour seignories(about 10 villages) of South-West of France;

were established between1240 and 1520(before and after the Hundred Years’ war);

This corpus interests the historians because:

only a few documents from middle ages deal withpeasants’ life; it permits to studya priorithe evolution of the social network before and after theHundred Years’ War.

(7)

Study of a medieval corpus

Work already presented in MASHS 2007 : [Boulet et al., 2007]

A huge corpus of medieval documents

In the archives of Cahors (Lot), corpus of5000 agrarian contracts. These contracts

come fromfour seignories(about 10 villages) of South-West of France;

were established between1240 and 1520(before and after the Hundred Years’ war);

This corpus interests the historians because:

only a few documents from middle ages deal withpeasants’ life; it permits to studya priorithe evolution of the social network before and after theHundred Years’ War.

(8)

Study of a medieval corpus

Work already presented in MASHS 2007 : [Boulet et al., 2007]

A huge corpus of medieval documents

In the archives of Cahors (Lot), corpus of5000 agrarian contracts. These contracts

come fromfour seignories(about 10 villages) of South-West of France;

were established between1240 and 1520(before and after the Hundred Years’ war);

This corpus interests the historians because:

only a few documents from middle ages deal withpeasants’ life; it permits to studya priorithe evolution of the social network before and after theHundred Years’ War.

(9)

Study of a medieval corpus

Work already presented in MASHS 2007 : [Boulet et al., 2007]

A huge corpus of medieval documents

In the archives of Cahors (Lot), corpus of5000 agrarian contracts. These contracts

come fromfour seignories(about 10 villages) of South-West of France;

were established between1240 and 1520(before and after the Hundred Years’ war);

This corpus interests the historians because:

only a few documents from middle ages deal withpeasants’ life;

it permits to studya priorithe evolution of the social network before and after theHundred Years’ War.

(10)

Methodology

Each contract of the corpus is recorded in a database (still to be finished) thought by Fabien Picarougne:

Parts of this database can be accessed on the web site

(11)

A large graph for the medieval social network

From part of the database (1000 contractsbeforethe Hundred Years’

War), we built aweighted graph:

vertices: the peasants found in the contracts (nobilities are removed);

two peasants arelinked togetherif:

they appear together in the same contract;

they appear in two different contracts which differ from less than 15 years and on which they are related to the same lord or to the same notary.

the graph thus haveweights(wi,j)i,j=1...,n which are the number of contracts satisfying one of these conditions. They are such that:

wi,j =wj,i ≥0 wi,i =0.

Providing tools to help historiansunderstanding the structure of this social network.

(12)

A large graph for the medieval social network

From part of the database (1000 contractsbeforethe Hundred Years’

War), we built aweighted graph:

vertices: the peasants found in the contracts (nobilities are removed);

two peasants arelinked togetherif:

they appear together in the same contract;

they appear in two different contracts which differ from less than 15 years and on which they are related to the same lord or to the same notary.

the graph thus haveweights(wi,j)i,j=1...,n which are the number of contracts satisfying one of these conditions. They are such that:

wi,j =wj,i ≥0 wi,i =0.

Providing tools to help historiansunderstanding the structure of this social network.

(13)

A large graph for the medieval social network

From part of the database (1000 contractsbeforethe Hundred Years’

War), we built aweighted graph:

vertices: the peasants found in the contracts (nobilities are removed);

two peasants arelinked togetherif:

they appear together in the same contract;

they appear in two different contracts which differ from less than 15 years and on which they are related to the same lord or to the same notary.

the graph thus haveweights(wi,j)i,j=1...,n which are the number of contracts satisfying one of these conditions. They are such that:

wi,j =wj,i ≥0 wi,i =0.

Providing tools to help historiansunderstanding the structure of this social network.

(14)

A large graph for the medieval social network

From part of the database (1000 contractsbeforethe Hundred Years’

War), we built aweighted graph:

vertices: the peasants found in the contracts (nobilities are removed);

two peasants arelinked togetherif:

they appear together in the same contract;

they appear in two different contracts which differ from less than 15 years and on which they are related to the same lord or to the same notary.

the graph thus haveweights(wi,j)i,j=1...,n which are the number of contracts satisfying one of these conditions. They are such that:

wi,j =wj,i ≥0 wi,i =0.

Providing tools to help historiansunderstanding the structure of this social network.

(15)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100, is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases. Caracteristics similar to those found formodern social networks. But ! How to visualize and/or simplify this graph to interpret it ?

(16)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100,

is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases. Caracteristics similar to those found formodern social networks. But ! How to visualize and/or simplify this graph to interpret it ?

(17)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100, is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases. Caracteristics similar to those found formodern social networks. But ! How to visualize and/or simplify this graph to interpret it ?

(18)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100, is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases.

Caracteristics similar to those found formodern social networks. But ! How to visualize and/or simplify this graph to interpret it ?

(19)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100, is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases.

Caracteristics similar to those found formodern social networks.

But ! How to visualize and/or simplify this graph to interpret it ?

(20)

First description of the graph

Thelargest connected componentof the medieval social network:

has615 vertices(i.e., 615 different peasants were found in the contracts),

has4193 edgeswhose weights sum at 40 329 but 50% of the edges have a weight 1 and less than 2% have a weight greater than 100, is a“small world graph”with a small density (2.2%) and a high local connectivity (77%),

is a“scale free graph”: the number of vertices having a number of links greater thank is decreasingexponentiallywhenk increases.

Caracteristics similar to those found formodern social networks.

But ! How to visualize and/or simplify this graph to interpret it ?

(21)

Table of contents

1 Introduction

2 Graph drawing

3 Clustering the vertices of a graph

4 “All-in-one” method: Self-Organizing Maps for graphs

(22)

What is graph drawing ?

Graph drawing aims at the arrangement of the vertices and edges in order to make therepresentation of the graph understandable and aesthetics.

SeeGraph Visualization Software Referenceswebsite:

http://www.polytech.univ-nantes.fr/GVSR/

(LINA,[Pinaud et al., 2007]).

Representation of the medieval network by force-directed algorithms

“GEM” “Spring Electrical”

(23)

What is graph drawing ?

Graph drawing aims at the arrangement of the vertices and edges in order to make therepresentation of the graph understandable and aesthetics.

SeeGraph Visualization Software Referenceswebsite:

http://www.polytech.univ-nantes.fr/GVSR/

(LINA,[Pinaud et al., 2007]).

Here,Tulip.

Enablesforce-directed algorithms:

gradient-descent minimization of an energy function.

Representation of the medieval network by force-directed algorithms

“GEM” “Spring Electrical”

(24)

What is graph drawing ?

Graph drawing aims at the arrangement of the vertices and edges in order to make therepresentation of the graph understandable and aesthetics.

Representation of the medieval network by force-directed algorithms

“GEM” “Spring Electrical”

(25)

Table of contents

1 Introduction

2 Graph drawing

3 Clustering the vertices of a graph

4 “All-in-one” method: Self-Organizing Maps for graphs

(26)

Aims of the clustering

We want to underlinehomogeneous social groupsthat are fewly connected to each others

[Newman and Girvan, 2004]:“reducing [the] level of complexity [of a network] to one that can be interpreted readily by the human eye, will be invaluable in helping us to understand the large-scale structure of these new network data”

Review on clustering of the vertices of a graph in[Schaeffer, 2007]: How tomeasure the qualityof a graph clustering?

Presentation of global or local algorithms based on

a similarity measure and the adaptation of a clustering algorithm to similarity data;

mapping of the graph on a euclidean space; the minimization of a cluster fitness measures. Several kinds:

batch online

hierarchical (divisive or agglomerative)

(27)

Aims of the clustering

We want to underlinehomogeneous social groupsthat are fewly connected to each others

[Newman and Girvan, 2004]:“reducing [the] level of complexity [of a network] to one that can be interpreted readily by the human eye, will be invaluable in helping us to understand the large-scale structure of these new network data”

Review on clustering of the vertices of a graph in[Schaeffer, 2007]:

How to measure the quality of a graph clustering?

Presentation of global or local algorithms based on

a similarity measure and the adaptation of a clustering algorithm to similarity data;

mapping of the graph on a euclidean space;

the minimization of a cluster fitness measures.

Several kinds:

Review on clustering of the vertices of a graph in[Schaeffer, 2007]: How tomeasure the qualityof a graph clustering?

Presentation of global or local algorithms based on

a similarity measure and the adaptation of a clustering algorithm to similarity data;

mapping of the graph on a euclidean space; the minimization of a cluster fitness measures. Several kinds:

batch online

hierarchical (divisive or agglomerative)

(28)

Aims of the clustering

We want to underlinehomogeneous social groupsthat are fewly connected to each others

[Newman and Girvan, 2004]:“reducing [the] level of complexity [of a network] to one that can be interpreted readily by the human eye, will be invaluable in helping us to understand the large-scale structure of these new network data”

Review on clustering of the vertices of a graph in[Schaeffer, 2007]:

How tomeasure the qualityof a graph clustering?

Presentation of global or local algorithms based on

a similarity measure and the adaptation of a clustering algorithm to similarity data;

mapping of the graph on a euclidean space;

the minimization of a cluster fitness measures.

Several kinds:

batch online

(29)

Spectral clustering [von Luxburg, 2007]

For a graph with verticesV ={x1, . . . ,xn}having positive weights (wi,j)i,j=1,...,nLaplacian: L= (Li,j)i,j=1,...,n where

Li,j=

( −wi,j ifi,j di ifi=j ;

Graph cut optimization

If the graph is connected, clustering the vertices intok groupsA1, . . . ,Ak that minimize

Cut(A1, . . . ,Ak)= Xk

i=1

X

j∈Ai,j0<Ai

wi,i0

'findh1, . . . ,hk ∈Rn that minimize (continuous approximation)

Xk

i=1

hiTLhisubject to(h1. . .hk)(h1. . .hk)T =In

(30)

Spectral clustering [von Luxburg, 2007]

For a graph with verticesV ={x1, . . . ,xn}having positive weights (wi,j)i,j=1,...,nLaplacian: L= (Li,j)i,j=1,...,n where

Li,j=

( −wi,j ifi,j di ifi=j ;

Graph cut optimization

If the graph is connected, clustering the vertices intok groupsA1, . . . ,Ak that minimize

Cut(A1, . . . ,Ak)= Xk

i=1

X

j∈Ai,j0<Ai

wi,i0

'findh1, . . . ,hk ∈Rn that minimize (continuous approximation)

Xk

i=1

hiTLhisubject to(h1. . .hk)(h1. . .hk)T =In

(31)

Spectral clustering [von Luxburg, 2007]

For a graph with verticesV ={x1, . . . ,xn}having positive weights (wi,j)i,j=1,...,nLaplacian: L= (Li,j)i,j=1,...,n where

Li,j=

( −wi,j ifi,j di ifi=j ;

Graph cut optimization

If the graph is connected, clustering the vertices intok groupsA1, . . . ,Ak that minimize

Cut(A1, . . . ,Ak)= Xk

i=1

X

j∈Ai,j0<Ai

wi,i0

⇔find(h1, . . . ,hk)∈Qk

i=1

0, 1

|Ai|

n

that minimizes

'findh1, . . . ,hk ∈Rn that minimize (continuous approximation)

k

X

i=1

hiTLhisubject to(h1. . .hk)(h1. . .hk)T =In

(32)

Spectral clustering [von Luxburg, 2007]

For a graph with verticesV ={x1, . . . ,xn}having positive weights (wi,j)i,j=1,...,nLaplacian: L= (Li,j)i,j=1,...,n where

Li,j=

( −wi,j ifi,j di ifi=j ;

Graph cut optimization

If the graph is connected, clustering the vertices intok groupsA1, . . . ,Ak that minimize

Cut(A1, . . . ,Ak)= Xk

i=1

X

j∈Ai,j0<Ai

wi,i0

'findh1, . . . ,hk ∈Rn that minimize (continuous approximation)

Xk

i=1

hiTLhisubject to(h1. . .hk)(h1. . .hk)T =In

(33)

Spectral clustering on medieval social network

Algorithm

1 Compute theeigenvectors,v1, . . . ,vk ∈RnofLassociated with thek smallest positive eigenvalues.

2 Use the rows of the matrix(v1. . .vk)has amapping of the graph in Rk and perform a clustering algorithm on them.

Representation of the clustering(k-means, 50 clusters + force directed algorithm):

2 big clusters ofcentral peoplehighly connected;

Identification of individuals that help toconnect the network;

isolated individuals around.

But: Size of the biggest cluster: 268 !! 16 clusters have size 1

more than 50% of the clusters have a size less than 2

(34)

Spectral clustering on medieval social network

Algorithm

1 Compute theeigenvectors,v1, . . . ,vk ∈RnofLassociated with thek smallest positive eigenvalues.

2 Use the rows of the matrix(v1. . .vk)has amapping of the graph in Rk and perform a clustering algorithm on them.

Representation of the clustering(k-means, 50 clusters + force directed algorithm):

2 big clusters ofcentral peoplehighly connected;

Identification of individuals that help toconnect the network;

isolated individuals around.

But: Size of the biggest cluster: 268 !! 16 clusters have size 1

more than 50% of the clusters have a size less than 2

(35)

Spectral clustering on medieval social network

Algorithm

1 Compute theeigenvectors,v1, . . . ,vk ∈RnofLassociated with thek smallest positive eigenvalues.

2 Use the rows of the matrix(v1. . .vk)has amapping of the graph in Rk and perform a clustering algorithm on them.

Representation of the clustering(k-means, 50 clusters + force directed algorithm):

2 big clusters ofcentral peoplehighly connected;

Identification of individuals that help toconnect the network;

isolated individuals around.

Representation of the clustering(k-means, 50 clusters + force directed algorithm):

2 big clusters ofcentral peoplehighly connected;

Identification of individuals that help toconnect the network;

isolated individuals around.

But: Size of the biggest cluster: 268 !! 16 clusters have size 1

more than 50% of the clusters have a size less than 2

(36)

Spectral clustering on medieval social network

Algorithm

1 Compute theeigenvectors,v1, . . . ,vk ∈RnofLassociated with thek smallest positive eigenvalues.

2 Use the rows of the matrix(v1. . .vk)has amapping of the graph in Rk and perform a clustering algorithm on them.

Representation of the clustering(k-means, 50 clusters + force directed algorithm):

2 big clusters ofcentral peoplehighly connected;

Identification of individuals that help toconnect the network;

isolated individuals around.

But: Size of the biggest cluster: 268 !!

16 clusters have size 1

more than 50% of the clusters have a size

(37)

A regularized version of the Laplacian: the heat kernel

From the diffusion matrix to the heat kernel Diffusion matrix: forβ >0,Kβ=e−βL.

kβ: V×V → R (xi,xj) → Ki,jβ is thediffusion kernel(or heat kernel).

Mapping of the graph on a euclidean space: kβis the scalar product associated with the mapping

φ:xi ∈V →(v1. . .vn)i∈Rnλ

where(vl)l are the eigenvectors ofL andRnλ denotes then-dimensional space with norm weighted by(e−βλl)l(λ≡eigenvalues ofL).

(38)

A regularized version of the Laplacian: the heat kernel

From the diffusion matrix to the heat kernel Diffusion matrix: forβ >0,Kβ=e−βL.

kβ: V×V → R (xi,xj) → Ki,jβ

is thediffusion kernel(or heat kernel).

Intuitive interpretation: kβ(xi,xj)'quantity of energy accumulated inxj after a given time if energy is injected inxiat time 0 and if diffusion is done along the edges (βcontrol the intensity of the diffusion).

Mapping of the graph on a euclidean space: kβis the scalar product associated with the mapping

φ:xi ∈V →(v1. . .vn)i∈Rnλ

where(vl)l are the eigenvectors ofL andRnλ denotes then-dimensional space with norm weighted by(e−βλl)l(λ≡eigenvalues ofL).

(39)

A regularized version of the Laplacian: the heat kernel

From the diffusion matrix to the heat kernel Diffusion matrix: forβ >0,Kβ=e−βL.

kβ: V×V → R (xi,xj) → Ki,jβ

is thediffusion kernel(or heat kernel).

Mapping of the graph on a euclidean space: kβis the scalar product associated with the mapping

φ:xi ∈V →(v1. . .vn)i∈Rnλ

where(vl)l are the eigenvectors ofL andRnλ denotes then-dimensional

−βλ λ

(40)

Spectral clustering vs Kernel k -means

Spectral Clustering Kernelk-means

Max size: 268 242

Nb of clusters of size 1: 16 17

Median size: 2 2

(41)

Table of contents

1 Introduction

2 Graph drawing

3 Clustering the vertices of a graph

4 “All-in-one” method: Self-Organizing Maps for graphs

(42)

General principle of SOM for graphs

The vertices of the graph aremapped on a euclidean space(byL:

“spectral SOM” or byK: “kernel SOM”).

p1 p2

p3

1 2

3

Each neuronjof the map is represented by aprototypepj.

Couples(j,pj)and(xi,f(xi))depend from each others and areiteratively updated in order to approach the minimization of theenergyof the map:

En = Xn

j=1

XM

i=1

h(d(f(xj),i))kφ(xi)−pjk2.

(43)

General principle of SOM for graphs

Each vertexxiis affected to aneuron(a cluster) of theKohonen map,f(xi). Neurons are related to each others by aneighborhood relationship

(“distance”:d).

p1 p2

p3

1 2

3

Each neuronjof the map is represented by aprototypepj.

Couples(j,pj)and(xi,f(xi))depend from each others and areiteratively updated in order to approach the minimization of theenergyof the map:

En =

n

X

j=1 M

X

i=1

h(d(f(xj),i))kφ(xi)−pjk2.

(44)

General principle of SOM for graphs

p1 p2

p3

1 2

3

Each neuronjof the map is represented by aprototypepj.

Couples(j,pj)and(xi,f(xi))depend from each others and areiteratively updated in order to approach the minimization of theenergyof the map:

En =

n

X

j=1 M

X

i=1

h(d(f(xj),i))kφ(xi)−pjk2.

(45)

Spectral SOM

Number of clusters29 Q-modularity: Pk

i=1(ei−ai2) Number of clusters of size 111

Maximum size of the clusters325 Qmodul=0.433

Median size2 (vs 0.420 & 0.425 for clusterings)

(46)

Kernel SOM [Boulet et al., 2008]

Number of clusters35 Q-modularity: Pk

i=1(ei−ai2) Number of clusters of size 113

Maximum size of the clusters255 Qmodul=0.551 Median size3

(47)

Quelques cartes thématiques

1 Dates

2 Lieux

(48)

Force directed algorithm for clustered graphs

[Truong et al., 2007, Truong et al., 2008]

La Suite...

By adding constrains on force-directed algorithms

(49)

By adding constrains on force-directed algorithms

(50)

By adding constrains on force-directed algorithms

(51)

Perspectives

Several perspectives:

Improving theglobal representationof the network (hierarchical algorithms, improving algorithms for clustered graphs drawing, others algorithms such as simulated annealing, . . . )

Understanding theevolutionof the social network through time (before/during/after Hundred Years’ War): specific tools have to be built in order to

understand what become the dominant families (“Aliquier”, “Fornie”, . . . ),

make a comparison despite the fact that thevertices are not the same.

(52)

Perspectives

Several perspectives:

Improving theglobal representationof the network (hierarchical algorithms, improving algorithms for clustered graphs drawing, others algorithms such as simulated annealing, . . . )

Understanding theevolutionof the social network through time (before/during/after Hundred Years’ War): specific tools have to be built in order to

understand what become the dominant families (“Aliquier”, “Fornie”, . . . ),

make a comparison despite the fact that thevertices are not the same.

(53)

References

Boulet, R., Hautefeuille, F., Jouve, B., Kuntz, P., Le Goffic, B., Picarougne, F., and Villa, N. (2007).

Sur l’analyse de réseaux de sociabilité dans la société paysanne médiévale.

InMASHS 2007, Brest, France.

Boulet, R., Jouve, B., Rossi, F., and Villa, N. (2008).

Batch kernel SOM and related laplacian methods for social network analysis.

Neurocomputing, 71(7-9):1257–1273.

Newman, M. and Girvan, M. (2004).

Finding and evaluating community structure in networks.

Physical Review, E, 69:026113.

Pinaud, B., Kuntz, P., and Picarougne, F. (2007).

The website for graph visualization software references (GVSR).

InGraph Drawing: 14thInternational Symposium on Graph Drawing 2006, volume 4372 ofLecture Notes in Computer Science, pages 440–441, Berlin, Heidelberg. Springer.

Schaeffer, S. (2007).

Graph clustering.

Computer Science Review, 1(1):27–64.

Truong, Q., Dkaki, T., and Charrel, P. (2007).

An energy model for the drawing of clustered graphs.

InProceedings of Vème colloque international VSST, Marrakech, Maroc.

Truong, Q., Dkaki, T., and Charrel, P. (2008).

Références

Documents relatifs

In this section, we provide k-means accounts for judgmental contrast observed on multiple distribution shapes, and for a variety of interaction effects pertaining to the number of

The above theorem states that the clustering scheme is strongly stable for F 2 provided the absolute margin condition holds.. Here, we briefly argue that this result is op- timal in

If we use a basic clustering algorithm based on the minimization of the empirical distortion (such as the k-means algorithm) when we deal with noisy data, the expected criterion

Here, we extend our earlier work on quantum computing for unsupervised learning [15] towards the problem of kernel k = 2 means clustering and discuss how to solve it via

Sadly, publications using k-means clustering lack a number of important points, such as justification of the chosen number of clusters and sample size, selection of variables and

In this paper, we show a direct equivalence between the embedding computed in spectral clustering and the mapping computed with kernel PCA, and how both are special cases of a

As the data set has to be described by its dissimilarity measure, the choice of a good dissimilarity is critical: we propose here to investigate the combination of the

In view of the limits identified previously in related studies and in NVIDIA solutions, we propose a strategy for parallelizing the complete spectral clustering chain on