Image logo% WIDTH=598 HEIGHT=74

1st Workshop on the Integration of Information Retrieval and Databases (WIRD'04)

The first SIGIR workshop on Information Retrieval and Databases (WIRD'04) has explored research topics where data retrieval meets information retrieval. Special interest of this workshop has been in research and applications focusing on search in intranets in the enterprise setting.


Information retrieval (IR) is associated by many with document retrieval because of its past (and still ongoing) focus on text document retrieval. Database (DB) research is associated with (object-) relational data modelling, SQL, transaction-based processing, and many more aspects of databases. Whereas DB research has been driven for years by structured languages and the idea of data modelling and abstraction, IR focused on measuring retrieval quality for large (mostly text) collections.

Nowadays, multimedia and XML collections are a driving force for the integration of IR and DB approaches. IR yields the methods for relevance-based ranking, while DB research provides methods for dealing with structured, and, increasingly, semi-structured data.


David Hawking, A Panoptic View on Databases, XML and IR (presentation).

Accepted Papers

Ingo Frommholz, Ulrich Thiel, and Thomas Kamps, Annotation-based Document Retrieval with Four-Valued Probabilistic Datalog (paper, presentation).

Vojkan Mihajlović, Djoerd Hiemstra, Henk Ernst Blok, and Peter M. G. Apers, An XML-IR-DB Sandwich: Is it Better With an Algebra in Between? (paper, presentation).

Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, and ChengXiang Zhai, Toward Entity Retrieval over Structured and Text Data (paper, presentation).

Further information

This workshop was part of the Joint Workshop on XML and IR+DB (workshop proceedings), together with the 3rd XML and Information Retrieval Workshop organised by Ricardo Baeza-Yates and Yoelle Maarek.

A write-up of both workshops, including the final group discussion What do we gain (loose) with XML, and IR+DB? will be published in SIGIR Forum.


Thomas Roelleke, Queen Mary University London, London, UK
Arjen P. de Vries, CWI, Amsterdam, The Netherlands

Programme Committee

Norbert Fuhr, University of Duisburg-Essen, Germany
Djoerd Hiemstra, University of Twente, The Netherlands
Michael Taylor, Microsoft Research, Cambridge, UK
Udo Kruschwitz, University of Essex, UK
Torsten Grabs, Microsoft, Redmond, USA
Apostel (Paul) Natsev, IBM TJ Watson Research Cente, USA
Dan Suciu, University of Washington, Seattle, USA
Ingo Frommholz, Fraunhofer IPSI, Germany
Surajit Chaudhuri, Microsoft Research, Redmond, USA
Jayavel Shanmugasundaram, Cornell University, USA