• Aucun résultat trouvé

Bilingual Information Retrieval with DesIRe and Internet Translation Services

N/A
N/A
Protected

Academic year: 2022

Partager "Bilingual Information Retrieval with DesIRe and Internet Translation Services"

Copied!
2
0
0

Texte intégral

(1)

with DesIRe

and Internet Translation Services

Norbert Gövert

University of Dortmund, Germany

1 Introduction

DesIReistheDortmundextensiblestructuredInformationRetrieval engine 1

. Itsextensi-

bilityisbasedonthe implementationof physicaldata independence; it'squeryinterface

consistsof datatypeswith respective search predicates. This concept enabledus toadd

bilingualsearchpredicatesforthedatatypesText::English andText::German(forEn-

glish and German text, respectively). Our implementation uses free Internet resources

fortranslating topics fromEnglish to Germanand vice versa.

2 Search predicates for bilingual retrieval

Havinga system whichis extensiblew.r.t. datatypesand theirrespectivesearchpredi-

cates wedecided toextend the Text::English and Text::German datatypes by search

predicatsfor bilingualtextretrieval. Thesepredicates needed topeformthe translation

of topics and queries from German to English in case of datatype Text::English and

viceversa in case of datatypeText::German.

For translation of queries we adopted two rather naive, but fully automatic ap-

proaches. In both approaches we used free internetresources:

Approach1usestheBabelshtranslationservice 2

ofAltavista. Thisserviceallows

totranslatepassages in asource language toagiven target language. Besides the

translationfromGermantoEnglishand viceversa, Babelshiscapableofvarious

otherlanguages.

<[email protected]>

1

http://ls6-www.cs.uni-dortmund.de/ir/projects/DesIRe/

2

(2)

Approach 2 uses an ordinary online dictionary for word-by-word translations.

We chose the Leo Dictionary service 3

for this purpose. Leo provides for a En-

glish/German dictionary with about 223900 entries. Translations can be done

in both directions. Since also composed words and phrases are included in the

dictionary, we exploited this by not translating the original topics word-by-word

butbyinterpretingeachtwoneighbouringtermsasphrases. Adoptingarealnaive

approachweeven didn'ttakemeasuresinordertotackletheword disambiguation

problem.

bilingual

search predicate wrapper Internet translation

service tranlated query

source/target language Query

index query result

1

2

3

Figure1: bilingualsearch predicates

Figure 1 shows the general scheme of our search predicates for bilingual text re-

trieval. The user gives the query in a source language, which is translated by means of

a translation wrapper. The task of the wrapper is to give a uniform interface to free

translation resources on the internet: It accepts the query as given by the user plus

sourceand targetlanguage and then handles the translation through the service itwas

implementedfor.

3

Références

Documents relatifs

(Within these notes, for simplicity, we equate GOF patterns with design patterns; we use Small Cap- itals for the names of patterns.) These programs are parametrized along

It attempts to show how Arabic represents paragraph cues and considers how readers can identify paragraph boundaries if, as it is often claimed Arabic discourse has no overt

The main problems associated with dictionary-based CLIR are 1) phrase identification and translation, 2) source language ambiguity, 3) translation ambiguity, 4) the coverage

The queries, formulated in Spanish, were translated into English by means of a commercial automatic translation system; the terms extracted from the resulting translations were

The first phase constructs the Estimated Mutual Information matrix of the property values to identify simple property matches and to suggest complex matches, while the

Andererseits aber dennoch: Nicht die Reorganisation der Repräsentationen bringt Wissen, sondern ein mehr an Mäeutik, die Aufdeckung von implizitem

The translation: thanks God, I jlawlessly accomplished my mission. The suggested translation: thanks God, I accomplished my mission in the best way. The sentence is correct but

In fact, the fundamental linguistic insight of our article lies in the importance we need to give to teaching lexical items, what they mean, how they are used in