• Aucun résultat trouvé

Multimedia Information Retrieval

N/A
N/A
Protected

Academic year: 2022

Partager "Multimedia Information Retrieval"

Copied!
1
0
0

Texte intégral

(1)

Multimedia Information Retrieval (An Introduction)

c Apostolos Papadopoulos Yannis Manolopoulos

Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece

1 Abstract

Information Retrieval (IR) is an active research area for many years. Initially, the field focused on the efficient and effective processing of user information needs ex- pressed as a set of keywords. In fact, this model is still active today (e.g., web search engines). However, large volumes of information are available in non-textual form, such as images, audio files and videos. The research field of Multimedia Information Retrieval (MIR) deals with the efficient and effective processing of queries involv- ing multimedia objects. The big challenge here is to pro- vide ”retrieval by content”, which means that we do not just want to provide results based on metadata or textual descriptions of the objects, but based on the content of these objects. In this talk, we perform a gentle introduc- tion to the field, starting from stabilized methods used in text-based retrieval, and then moving on to the fun- damentals and challenges of multimedia-based retrieval.

We discuss the representation of multimedia objects as vectors in a multi-dimensional space, the organization of these representations by means of indexing schemes and the retrieval of similar objects based on user queries.

The outline of the talk is as follows:

• text representation

• information retrieval vs data retrieval

• components of an IR system

• indexing text databases

• boolean and vector-based retrieval

• efficiency and effectiveness

• similarity queries

• feature extraction (images, audio, video)

• indexing multimedia objects (R-trees and X-trees)

• index-based processing

• current trends

• conclusions

• bibliography

Proceedings of the Spring Young Researcher’s Colloquium on Database and Information Systems, Saint-Petersburg, Rus- sia, 2008

1.1 Basic Bibliography

1. R. Baeza-Yates and B. Ribeiro-Neto. ”Modern In- formation Retrieval”, Addison Wesley, 1999.

2. S. Berchtold, D.A. Keim, and H.-P. Kriegel. ”The X-tree: An Index Structure for High Dimensional Data”, VLDB Conference, 1996.

3. C. Faloutsos: ”Searching Multimedia Databases by Content”, Kluwer Academic Publishers, 1996.

4. B. Furht (Ed): ”Handbook of Multimedia Comput- ing”, CRC Press, 1999.

5. A. Guttman: ”R-tree: A Dynamic Index Struc- ture for Spatial Searching”, ACM SIGMOD Con- ference, 1984.

6. O. Marques and B. Furht: ”Content-Based Image and Video Retrieval”, Kluwer Academic Publishers, 2002.

7. N. Roussopoulos, S. Kelly, and F. Vincent: ”Nearest Neighbor Queries”, ACM SIGMOD Conference, 1995.

8. R.C. Veltkamp, M. Tanase: ”Content-Based Image Retrieval Systems: A Survey”, 2000

Références

Documents relatifs

It amalgamates qualitative inductive preferences that are known from daily life with a quantitative query language using a machine- based learning algorithm?. These

The information flow includes various multimedia data in different formats, diverse metadata for content or technical description, context information, and the community impacts..

An ontology layer is used to define (in terms of a fuzzy DLR-Lite like description logic) the relevant abstract concepts and relations of the application domain, while a

Main features (so far) of DLMedia are that: (i) it uses a DLR-Lite(D) like language as query and ontology representation language; (ii) it supports queries about the form and content

As depicted in Figure 1, the REVEAL THIS system comprises a number of single-media and multimedia technologies that can be grouped, for presentation purposes,

Techniques for attempting to bridge the semantic gap in multimedia retrieval have mostly used an auto-annotation approach, in which keyword annotations are applied to

We have used the generated MPEG-7 OWL ontology as an “upper-ontology” for multimedia metadata, where three different music schemas have been linked.. Thus, it has been possible

In other applied research, global access to multilingual video news requires integration of a broad set of image processing (e.g., keyframe detec- tion, face identifi cation, scene