The Return of the Probability of Relevance

Norbert Fuhr

The University of Duisburg-Essen Duisburg, Germany

norbert.fuhr@uni-due.de

Abstract

The probability ranking principle (PRP) proves that ranking documents by decreasing probability of relevance yields optimum retrieval quality. Most research on probabilistic models has focused only on producing a probabilistic ranking, without estimating the actual probabilities. In this talk, we discuss models for three types of modern IR applications which rely on calibrated values of the probability of relevance.

1. Vertical search deals with the aggregation of documents of different types or media (e.g., Web pages, news, tweets, videos, images) in response to a query. Based on the probabilistic estimation of the number of relevant documents per resource, the decision-theoretic selection model describes the optimum solution for this problem.

2. The optimum clustering framework not only provides the first theoretic foundation for document clustering, but also proves the clustering hypothesis. Its key idea is to base cluster analysis and evaluation on a set of queries, by defining documents as being similar if they are relevant to the same queries.

3. The interactive PRP generalizes the classical PRP for interactive retrieval. It characterizes each situation in interactive retrieval as a list of choices, where each choice is described by the effort for evaluating it, the probability that the user will accept it, and the benefit resulting from acceptance. By developing appropriate parameter estimation methods, we can describe interactive retrieval by Markov models, which allow for a number of predictions.
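The choice-list view in the third item can be made concrete with a small sketch. Each choice carries the three parameters named above (effort, acceptance probability, benefit). The scoring rule used here, expected net benefit computed as probability times benefit minus effort, is an illustrative assumption and not necessarily the ranking criterion derived in the talk; the class name `Choice`, the function names, and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Choice:
    """One option offered to the user in a given interaction situation."""
    label: str
    effort: float    # cost of evaluating the choice (same unit as benefit)
    p_accept: float  # estimated probability that the user accepts it
    benefit: float   # gain obtained if the choice is accepted


def expected_net_benefit(choice: Choice) -> float:
    # Assumed scoring rule for illustration only:
    # expected gain from acceptance minus the effort always spent.
    return choice.p_accept * choice.benefit - choice.effort


def rank_choices(choices: list[Choice]) -> list[Choice]:
    # Present choices in order of decreasing expected net benefit.
    return sorted(choices, key=expected_net_benefit, reverse=True)


# Hypothetical situation with three choices.
choices = [
    Choice("read document summary", effort=1.0, p_accept=0.5, benefit=4.0),
    Choice("follow query suggestion", effort=0.5, p_accept=0.25, benefit=8.0),
    Choice("open full text", effort=3.0, p_accept=0.75, benefit=4.0),
]
ranked = rank_choices(choices)
for c in ranked:
    print(f"{c.label}: {expected_net_benefit(c):.2f}")
```

The sketch also shows why calibrated probabilities matter in this setting: because `p_accept` is multiplied by a benefit and compared against an effort, a merely rank-preserving score would not be enough.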

With these models, it becomes possible to implement approaches based on solid theoretic foundations, which are more transparent than heuristic approaches, thus allowing for theory-guided adaptation and tuning.
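As an illustration of how such a model might be turned into code, the query-based similarity idea of the optimum clustering framework can be sketched as follows. Each document is represented by its estimated probabilities of relevance to a fixed set of queries. Assuming (for this sketch only) that relevance of two documents to a query is independent, the expected number of queries to which both are relevant is the sum of the products of their probabilities. The function name and the example values are hypothetical.

```python
def expected_shared_queries(p_doc_a: list[float], p_doc_b: list[float]) -> float:
    """Expected number of queries to which both documents are relevant.

    p_doc_a[i] and p_doc_b[i] are the estimated probabilities that the
    respective document is relevant to query i. Independence between the
    two documents is an assumption made for this sketch.
    """
    if len(p_doc_a) != len(p_doc_b):
        raise ValueError("both documents must be scored against the same queries")
    return sum(pa * pb for pa, pb in zip(p_doc_a, p_doc_b))


# Hypothetical probabilities of relevance to three queries.
doc_a = [1.0, 0.5, 0.0]
doc_b = [1.0, 0.5, 0.0]
doc_c = [0.0, 0.5, 1.0]

sim_ab = expected_shared_queries(doc_a, doc_b)  # relevant to the same queries
sim_ac = expected_shared_queries(doc_a, doc_c)  # relevant to mostly different queries
print(sim_ab, sim_ac)
```

Documents that tend to be relevant to the same queries obtain a higher value, which is the notion of similarity the framework builds on.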

About the Speaker

Dr. Norbert Fuhr is a full professor in the Department of Computer Science at the University of Duisburg-Essen. He obtained his Ph.D. in Computer Science from the Technical University of Darmstadt in 1986, where he served as an assistant professor. He became Associate Professor in the computer science department of the University of Dortmund in 1991, before taking up his current position in 2002.

He has published more than 300 papers in the fields of IR, databases and digital libraries. His current research interests are retrieval models, networked digital library architectures, user-oriented retrieval methods and the evaluation of digital libraries.

He has served as a regular PC member of many major international conferences related to information retrieval and digital libraries, such as ACM-SIGIR, CIKM, ECIR, SPIRE, ICDL, ECDL, ICADL, and FQAS. He was PC chair of ECIR 2002, IR track chair of CIKM 2005 and Co-Chair of SIGIR 2007. For the German IR group GI-FGIR, he served as Chair from 1992 to 2008. He is also a member of the editorial boards of the journals Information Retrieval, ACM Transactions on Information Systems, International Journal of Digital Libraries, and Foundations and Trends in Information Retrieval.

In 2012, he received the prestigious Gerard Salton Award in recognition of his significant, sustained and continuing contributions to research in information retrieval.

The committee particularly emphasised his “pioneering contributions to the theoretical foundations of information retrieval and database systems. His work describing how learning methods can be used with retrieval models and indexing anticipated the current interest in learning ranking functions, his development of probabilistic retrieval models for database systems and XML was ground-breaking, and his recent work on retrieval models for interactive retrieval has inspired new research. His rigorous approach to research and research methods is an outstanding example for our field.”
