Websites: Tools: Introduction to Information Extraction and Retrieval: - Sunita Sarawagi: Information Extraction. Foundations and Trends in Databases, 2008.
- Ricardo A. Baeza-Yates, Berthier A. Ribeiro-Neto: Modern Information Retrieval. ACM Press / Addison-Wesley 1999, ISBN 0-201-39829-X
- Introduction to Information Retrieval. C.D. Manning, P. Raghavan, H. Schütze. Cambridge UP, 2008 (http://nlp.stanford.edu/IR-book/information-retrieval-book.html)
Entity and Fact Extraction: - Oren Etzioni, Michael J. Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, Alexander Yates: Unsupervised named-entity extraction from the Web: An experimental study. Artif. Intell. 165(1): 91-134 (2005)
- Eirinaios Michelakis, Rajasekar Krishnamurthy, Peter J. Haas, Shivakumar Vaithyanathan: Uncertainty management in rule-based information extraction systems. SIGMOD 2009: 101-114
Disambiguation and Duplicate Detection: - Xianpei Han, Jun Zhao: Named entity disambiguation by leveraging wikipedia semantic knowledge. CIKM 2009: 215-224
- Risto Gligorov, Warner ten Kate, Zharko Aleksovski, Frank van Harmelen: Using Google distance to weight approximate ontology matches. WWW 2007: 767-776
Document Classification: - Eser Kandogan, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan, Huaiyu Zhu: Avatar semantic search: a database approach to information retrieval. SIGMOD Conference 2006: 790-792
- Pável Calado, Marco Cristo, Edleno Silva de Moura, Nivio Ziviani, Berthier A. Ribeiro-Neto, Marcos André Gonçalves: Combining link-based and content-based methods for web document classification. CIKM 2003: 394-401
Storage and Indexing:
- Gerhard Weikum, Gjergji Kasneci, Maya Ramanath, Fabian M. Suchanek: Database and information-retrieval methods for knowledge discovery. Commun. ACM 52(4): 56-64 (2009)
- Atanas Kiryakov, Borislav Popov, Ivan Terziev, Dimitar Manov, Damyan Ognyanoff: Semantic annotation, indexing, and retrieval. J. Web Sem. 2(1): 49-79 (2004)
Ranking Entities and Homepages: - Tao Cheng, Xifeng Yan, Kevin Chen-Chuan Chang: EntityRank: Searching Entities Directly and Holistically. VLDB 2007: 387-398
- Panagiotis G. Ipeirotis, Eugene Agichtein, Pranay Jain, Luis Gravano: To search or to crawl?: towards a query optimizer for text-centric tasks. SIGMOD Conference 2006: 265-276
|