International Master of Research in Computer Science: Computer Aided Decision Support
Graph Embedding Probem
Jean-Yves Ramel – Romain Raveaux
Laboratoire Informatique de Tours - FRANCE Presented by :
Romain Raveaux
International Master of Research in Computer Science: Computer Aided Decision Support
Content
1.
Graph Embedding problem
1.
G à RN
2.
Kernel
3.
Kernel trick
Statistical vs Structural Pattern Recognition
symbolic data structure numeric feature vector
Yes No
No Yes
Yes No
No Yes
Data structure
Representational strength Fixed dimensionality
Sensitivity to noise
Efficient computational tools
Pa#ern Recogni-on
Structural Sta-s-cal
Feature space
Structutal PR
Expressive, convenient, powerful but computationally expensive
representations
Statistical PR
Mathematically sound, mature,
less expensive and computationally efficient models Graph embedding
Explicit GEM
§ embeds each input graph into a numeric feature vector
§ provides more useful methods of GEM for PR
§ can be employed in a standard dot product for defining an implicit graph embedding function
Implicit GEM
§ computes scalar product of two graphs in an implicitly existing vector space, by using graph kernels
§ does not permit all the operations that could be defined on vector spaces
Explicit GEM
Topological Descriptors
• Principle
• Map each graph to a feature vector
• Use distances and metrics on vectors for learning on graphs
• Advantages
• Reuses known and efficient tools for feature vectors
• Disadvantages
• Efficiency comes at a price: feature vector transformation leads to loss of topological information (or includes
subgraph isomorphism as one step)
Implicit GEM
Polynomial Alternatives
• Graph kernels
• Compare substructures of graphs that are computable in polynomial time.
• Criteria for a good graph kernel
• Expressive
• Efficient to compute
• Positive definite
• Applicable to wide range of graphs
Implicit Graph Embedding
• Graph kernel
• What is a Kernel?
• Map two objects x and x′ via mapping φ into feature space H.
• Measure their similarity in H as <φ(x), φ(x′)>.
• Kernel Trick: Compute inner product in H as kernel in input space
• k(x, x′) = <φ(x), φ(x′)>.
Graph Kernels
• General case:
• Directed
• Labels on both vertex and edges
• Loops and cycles are allowed (not in all algorithms)
• Particular cases easily derived from the general one:
• Non-directed
• No label on edge, no label on vertex
Theoretical issues
• To design a kernel taking the whole graph structure into account amounts to build a complete graph kernel that
distinguishes between 2 graphs only if they are not isomorphic
• Complete graph kernel design is theoretically possible … but practically infeasible (NP-complex)
• Approximations are therefore necessary:
• Common local subtrees kernels
• Common (label) walk kernels (most popular)
12
ϕ : G→ Rn ϕ( g) = (x1,…., xn)’
Many informa8on can be extracted :
ànodes, cliques, paths, walk, …
Very Simple Graph Kernel
Graph Kernel / Graph Embedding [Bunke09]
Example
Graph Embedding
Implicit Graph Embedding
Graph kernels based on Common Walks
• Walk = (possibly infinite) sequence of labels obtained by following edges on the graph
• Path = walk with no vertex visited twice
• Important concept: direct product of two graphs G1xG2
• V(G1xG2)={(v1,v2), v1 and v2: same labels)
• E(G1xG2)={(e1,e2): e1, e2: same labels, p(e1) and p(e2) same labels, n(e1) and n(e2) same labels}
e
p(e) n(e)
Same labels : Difficulty to deal with numeric attributes
Random Walks:
Explanation : Direct Graph Product
Random Walk
Walks of lengh 2
Random Walks:
Explanation : Idea
Random Walks:
Explanation : a better idea
Random Walks:
Explanation : Diffusion Kernels
Random Walks:
Explanation : Graph Comparison
Efficient computation ?
Part 2
•
Feature space
Explicit Graph Embedding
• Graph probing based methods
• Spectral based graph embedding
Graph probing
• Feature extraction from a graph
• [Papadopoulos et al., 1999]
• [Lopresti 2003]
Spectral based graph embedding
• [Harchaoui, 2007] [Luo et al., 2003] [Robleskelly and Hancock, 2007]
• Often limited to graph databases where all graphs have the same number of nodes.
1
1 1
1 1
1 1 1
1 1
Spectral graph theory employing the adjacency and Laplacien matrices
Eigen values and Eigen vectors PCA, ICA, MDS
Spectral based graph embedding
• [Shokoufandeh 2006], a "topological signature vector"
Trends and recent ideas
• Muzzamil Luqman [Luqman09]
• Sidère [Sidère09]
Reconnaissance des Formes à base de graphes
Graph Embedding « flou » [Luqman09]
ϕ( g) = (x1,…., xn)’
Fonction d’appartenance floue (définissant la contribution d’une primitive en fonction de sa valeur )
Lexique des motifs topologiques Fréquences d’apparition des motifs
→ Construction d’un vecteur ou matrice de caractéristiques statistiques
Quelle transformation ϕ ? Quelle mesure de similarité ?
Pas d’information sur les appariements entre sommets des deux graphes
Reconnaissance des Formes à base de graphes
Graph Embedding [Sidère09]
Exemple
Problèmes avec le embedding, probing, kernel
Graphe avec attributs
Why I did not speak about :
• Graphlet Kernel (B., Petri, et al., MLG 2007)
• Principle
• Count subgraphs of limited size k in G and G‘
• These subgraphs are referred to as graphlets (Przulj, Bioinformatics 2007)
• Define graph kernel that counts isomorphic graphlets in two graphs
Why I did not speak about :
• Combine graph kernels with graphical models (Bach, ICML 2008)
• Presents a new kernel for 2D or 3D point clouds
• Compares local subsets of the point clouds
• Considers subsets based on subtrees and walks
• Uses a specific factorized form for the local kernels between subtrees.
• Combine graph kernels with group theory (Kondor and B., ICML 2008)
• Represent graph as a function over the symmetric group
• Derive invariants for that function called the skew spectrum
• Use subset of these invariants that is computable in O(n3) as feature representation of the graph.
Big Open Questions
• Comparing paths in two different graphs is polynomial
• Subgraph isomorphism is known to be NP-hard
• Computing the so-called universal graph distance which counts all common subgraphs of two graphs is harder than subgraph
isomorphism
• When we compare any other subgraphs e.g. simple paths (where vertices do not repeat)
• Cycles
• Trees
We seem to lose polynomial run-time
• Are there other subgraphs for which efficient computation is possible?
Conclusion
To take the stock
• Provide a very high representa8on
– Topology, structure, composi8on, ...
– Choice of representa8on (node or arc)
– Choice of aRributes (symbolic, numerical, ...) – Time of construc8on, size, ...
• Treatments graphs
– Local analysis ➔ segmenta8on, localiza8on – See Opera8onal Research (many algorithms?)
• Comparison graphs
– many changes
– Very high complexity – Use heuris8c
– Transforma8on vectors sta8s8cs – Loss of interest?
• The last of the rules RdF J. C. Simon:
– Above all, never despair!
Literrature
• Towards the unifica8on of structural and sta8s8cal paRern recogni8on
– Horst Bunke, Kaspar Riesen