• Aucun résultat trouvé

Supplement Materials for “Large-scale Kernel RankSVM”

N/A
N/A
Protected

Academic year: 2022

Partager "Supplement Materials for “Large-scale Kernel RankSVM”"

Copied!
2
0
0

Texte intégral

(1)

Supplement Materials for “Large-scale Kernel RankSVM”

I Another Evaluation Criterion

We further consider mean average precision (MAP), which is the mean of average precision (AP) of each query.

Mean Average Precision≡ P

q∈SAP(q)

|S|

Average precision is the average of precisions at all possible recall levels.

AP(q)≡Xlq

m=1Precision@m·∆r(m)

where ∆r(m) is the change in recall from positionm−1 to m.

To calculate the precision, a relevance level that is less than the medium of all relevance levels is considered to be negative and others are considered to be positive.

Figures (I) and (II) show the experimental results.

102 103 104

0.4 0.45 0.5 0.55 0.6 0.65

Time (sec.)

Mean Average Precision

SVMlight TRON SVMrank

(a) Mean average precision

Figure (I): Experimental results on MQ2007

II Best Parameters for Each Method and Each Criterion

In Section 4, we mentioned that a grid search was con- ducted to decide the parameters to be used. The best parameters found for each method and each criterion are listed in Tables (III)-(V).

III Comparison of Performances Between Different Formulations

Because each method compared in our experiments considers different optimization problems, we conduct

Data set TRON SVMlight SVMrank RV-SVM MQ2007 (−4,−4) (−2,−5) (−1,−4) NA MQ2008 (1,−6) (−1,−5) (−4,3) (4,−2) MQ2007-list (−4,−5) (1,−5) (−5,−6) NA Table (III): Best parameter (log2C,log2γ) for each method on pairwise accuracy.

Data set TRON SVMlight SVMrank RV-SVM MQ2007 (−2,−5) (−2,−4) (−1,−4) NA MQ2008 (3,−5) (3,−6) (−4,−6) (0,−2) Table (IV): Best parameter (log2C,log2γ) for each method on mean NDCG. Note that mean NDCG is not available for MQ2007-list because the overflow of 2π(i)¯ caused by the large k.

more examinations on their performances using other commonly considered evaluation criteria. Here we consider precision@5 and mean reciprocal rank. For a given query q, its mean reciprocal rank is defined as follows.

Mean reciprocal rank≡ 1 lq

lq

X

i=1

1

|{j|yπ(j)¯ > yπ(π¯ −1(i))}|. We report the average of this value among allq∈S.

Our purpose here is to compare the performance of different formulations in spite of the optimization methods. Thus, after selecting the best parameters for each criterion, we run each package for a long enough training time (min(24 hours, the training time to fulfill the default stopping condition)) to obtain stable results.

The best parameters found for each method and each criterion are listed in Tables (VI)-(VII). The results are reported in Tables (VIII) and (IX). As mentioned earlier, the package RV-SVMfailed to run on MQ2007 Data set TRON SVMlight SVMrank RV-SVM MQ2007 (−4,−4) (1,−6) (3,−2) NA MQ2008 (1,−4) (3,−6) (−4,−6) (−4,−2) Table (V): Best parameter (log2C,log2γ) for each method on mean average precision. Note that we follow LETOR 4.0not to consider MAP onMQ2007-list.

(2)

andMQ2007-listbecause the required memory is beyond our machine capacity.

From the results, we can see that the optimization problem considered in our approach is comparable to or better than state of the art. This indicates that our method does not sacrifice the final performance to obtain a faster training time.

Data set TRON SVMlight SVMrank RV-SVM MQ2007 (−1,−4) (0,−5) (−3,−2) NA MQ2008 (0,−3) (−3,−2) (−5,−5) (0,−1) Table (VI): Best parameter (log2C,log2γ) for each method on precision@5. Note that we follow LETOR 4.0not to consider precision@5 on MQ2007-list.

Data set TRON SVMlight SVMrank RV-SVM MQ2007 (−2,−4) (2,−5) (3,−6) NA MQ2008 (1,−4) (4,−6) (−2,−5) (−5,−2) MQ2007-list (5,−4) (1,−5) (−5,−3) NA Table (VII): Best parameter (log2C,log2γ) for each method on mean reciprocal rank.

Data set TRON SVMlight SVMrank RV-SVM MQ2007 0.3154 0.3125 0.2262 NA MQ2008 0.3974 0.4103 0.4038 0.4038 MQ2007-list 0.8095 0.4613 0.7054 NA Table (VIII): Comparison of precision@5 between differ- ent formulations. We run each package long enough to exclude the difference resulted from optimization meth- ods.

IV The Relation Between Data Size and Training Time

To check the performance of the proposed method when the number of instances grows, we sub-sampled different number of instances fromMQ2007, and report the training time required. The experimental result is shown in Figure (III). The X-axis is the number of instances, and the Y-axis is the training time required.

Note that both axes are log-scaled, and the results show that the proposed method remains efficient as the number of instances grows.

Data set TRON SVMlight SVMrank RV-SVM MQ2007 0.7503 0.7436 0.6629 NA MQ2008 0.8127 0.8045 0.7966 0.8147 MQ2007-list 0.9579 0.8457 0.9300 NA Table (IX): Comparison of mean reciprocal rank be- tween different formulations. We run each package long enough to exclude the difference resulted from optimiza- tion methods.

101 102 103

0.5 0.55 0.6 0.65 0.7 0.75 0.8 0.85 0.9

Time (sec.)

Mean Average Precision

SVMlight TRON RV−SVM SVMrank

(a) Mean average precision

Figure (II): Experimental results on MQ2008

103 104 105

100 101 102 103

Number of Instances

CPU Time (secs)

Proposed Method

Figure (III): Experimental results on the relation be- tween data size and training time.

Références

Documents relatifs

A first example of this approach could be the extension of an organization’s knowledge base to documents related to its territorial and social context: in this way the

Simultaneous approximation, parametric geometry of numbers, function fields, Minkowski successive minima, Mahler duality, compound bodies, Schmidt and Summerer n–systems, Pad´

By applying nonlinear steepest descent tech- niques to an associated 3 × 3-matrix valued Riemann–Hilbert problem, we find an explicit formula for the leading order asymptotics of

Among existing approaches for learning to rank, rankSVM [7] is a commonly used method extended from the popular support vector machine (SVM) [2, 6] for data classification.. In

Signal-dependent time-frequency representations for classification using a radially gaussian kernel and the alignment criterion.. IEEE workshop on Statistical Signal Pro- cessing

In the setting of Lie groups of polynomial growth, a large time upper estimate for the kernel has been obtained by Alexopoulos (cf. [1])... In this paper we study the

Motivated by applications in mathematical imaging, asymptotic expansions of the heat generated by single, or well separated, small particles are derived in [5, 7] based on

LTA will provide a certifed, flexible and harmonised modular curriculum of the experts in real- time intralingual subtitling, with a focus on four working contexts: cultural