• Aucun résultat trouvé

solution

N/A
N/A
Protected

Academic year: 2022

Partager "solution"

Copied!
2
0
0

Texte intégral

(1)

SOLUTIONS - KERNEL METHODS

STATISTICAL LEARNING COURSE, ECOLE NORMALE SUP ´ERIEURE

Rapha¨el Berthier [email protected]

1. Examples of positive definite kernels (1) (a) Denotek=k1+k2. Let α1, . . . , αn∈R and x1, . . . , xn∈ X. Then

X

i,j

αiαjk(xi, xj) =X

i,j

αiαjk1(xi, xj)

| {z }

≥0

+X

i,j

αiαjk2(xi, xj)

| {z }

≥0

≥0.

(b) Denote k = k1k2. Let x1, . . . , xn ∈ X and K1, K2 the Gram matrices associated to kernels k1, k2 at the pointsx1, . . . , xn. We show that K = K1K2, the Gram matrix associated tok, is positive definite. Here, denotes the Hadamard product (i.e., the pointwise product). As K1 is a symmetric positive semi-definite matrix, one can diagonalize K1=P

iλiuiuTi . Then K =X

i

λiuiuTi K2

But for any vector u :

X

ij

αiαj(uuT K2)ij =X

ij

αiαj(K2)ijuiuj = (αu)TK2(αu)≥0

Thus by summing non-negative terms, P

i,jαiαjKij ≥0.

(2) Consider H=L2(R) andφ(x) =1{t≤x}. (3) Similarly,

1 x+y =

Z 1 0

tx−12ty−12dt=hφ(x), φ(y)iL2([0,1]).

Thus x+y1 is a positive definite kernel. Nowxy is the standard scalar product onR, and by productk is a positive definite kernel.

(4) Denotenthe cardinal ofX. ForA⊂X, denote Φ(A) the indicator function ofA. Then

|A∩B|=φ(A)Tφ(B),

thus|A∩B|is a positive definite kernel. Further, denotingActhe complement ofA,

(2)

2 STATISTICAL LEARNING COURSE, ECOLE NORMALE SUP ´ERIEURE

1

|A∪B| = 1 n− |Ac∩Bc|

= 1

n

1−|Ac∩Bn c|

= 1

n(1−φ(Ac)Tnφ(Bc))

= 1 n

X

i=0

(φ(Ac)Tφ(Bc) n )i

which is a positive definite kernel by sum and products of positive definite kernels.

Finally, by a final product, K is a positive definite kernel.

(5)

GCD(n, m) =Y

pi

pmin(ψi i(m),ψi(n)),

where the pi are the prime numbers and where ψi(m) give the power of pi in the decomposition of m. We see this as a product of kernels : indeed, consider the feature map

2. Distance in the feature space (1) (a)

kφ(x)−φ(y)k2=k(x, x)−2k(x, y) +k(y, y) (b)

kφ(x)−φ(y)k2 = (x−y)2 x+y . (2) (a)

kφ(x)−µ+k2=k(x, x) + 1 n2+

X

i,yi=1

X

j,yj=1

k(xi, xj)− 2 n+

X

i,yi=1

k(x, xi) (b) We outputy= +1 if

1 n2+

X

i,yi=1

X

j,yj=1

k(xi, xj)− 2 n+

X

i,yi=1

k(x, xi)≤ 1 n2

X

i,yi=−1

X

j,yj=−1

k(xi, xj)− 2 n

X

i,yi=−1

k(x, xi) and −1 otherwise.

(c) kφ(x)− 1

n+ X

i,yi=1

φ(xi)k2 ≤ kφ(x)− 1 n

X

i,yi=−1

φ(xi)k2⇔ X

i,yi=1

k(x, xi)≥ X

i,yi=−1

k(x, xi)

(d) The application is straightforward. However, it sheds light on the importance of the choice of the kernelk. It decides how samples influence the classification rule of other points. The choice of the kernel can thus be seen as a choice of “similarity between points”.

Références

Documents relatifs

La conante de conneivit´e du r´eseau hexagonal a ´et´e calcul´ee par Hugo Duminil-Copin et Stanislav Smirnov en  [  ], r´esolvant ainsi une conje ure formul´ee

La conante de conneivit´e du r´eseau hexagonal a ´et´e calcul´ee par Hugo Duminil-Copin et Stanislav Smirnov en  [], r´esolvant ainsi une conjeure formul´ee en

STATISTICAL LEARNING COURSE, ECOLE NORMALE SUP ´ ERIEURE.. Rapha¨ el

INRIA - Ecole Normale Sup´ erieure Telecom Paris. NIPS -

Pour les syst` emes continus dans le plan, le th´ eor` eme de Poincar´ e-Bendixon, ´ enonc´ e ci- dessous et d´ emontr´ e dans [8], affirme qu’en plus de ces deux r´

Rïîð²R;ñŸòRó5ô;NQM*ëÿV èURfM}çíô ä ñIæ S&}XTaTE^UBAmõ÷ö

For those zonotopes, Ziegler proves that the space of tilings can be directed so as to get a graded poset (with a unique minimal element and a unique maximal element), for c ≤ 4. To

We explain and demonstrate why considering φ instruction replacement and renaming constraints together results in an improved coalescing of variables, thus reducing the number of