Ralf Krestel - LAWEB 09

LAWEB 09

An Architecture for Finding Entities on the Web

Abstract

Abstract—Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.

Full Paper

LAWEB09.pdf

Conference Homepage

LA-WEB 2009

BibTex Entry

@InProceedings{krestel-laweb09,
  author = {Gianluca Demartini and Claudiu-S Firan and Mihai Georgescu and Tereza Iofciu and Ralf Krestel and Wolfgang Nejdl},
  title = {{An Architecture for Finding Entities on the Web}},
  booktitle = {LA-WEB '09: Proceedings of the 2009 Latin American Web Congress},
  isbn = {978-0-7695-3856-3},
  doi = {http://dx.doi.org/10.1109/LA-WEB.2009.14},
  pages = {230--237},
  location = {Yucátan, Mérida, México},
  month = {November 9--11},
  year = {2009},
  publisher = {IEEE Computer Society}
}