Catálogo de publicaciones - libros

Compartir en
redes sociales

Título de Acceso Abierto

Entity-Oriented Search

Krisztian Balog

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Information Storage and Retrieval; Artificial Intelligence (incl. Robotics); Probability and Statistics in Computer Science; Information Systems Applications (incl. Internet)

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No requiere	2018	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-319-93933-9

ISBN electrónico

978-3-319-93935-3

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2018

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Ingeniería eléctrica, electrónica e informática

Tabla de contenidos

doi: 10.1007/978-3-319-93935-3_1

Introduction

Krisztian Balog

Entity-oriented search is the search paradigm of organizing and accessing information centered around entities, and their attributes and relationships. This introductory chapter defines what an entity is, identifies prominent contexts for entity-oriented search, presents a number of specific tasks, puts the subject into a historical perspective, and lays the foundations for the rest of the book.

Pp. 1-23

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_2

Meet the Data

Krisztian Balog

This chapter introduces the different types of data sources, from unstructured to structured, that will be used in the remainder of the book. Specifically, we discuss the web, Wikipedia, and knowledge bases. We further introduce standard datasets and provide pointers to tools and resources.

Pp. 25-53

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_3

Term-Based Models for Entity Ranking

Krisztian Balog

Ad hoc entity retrieval is the task of answering a free text query with a ranked list of entities. The main idea behind our approaches in this chapter can be summarized as follows: If textual representations can be constructed for entities, then the ranking of these representations (“entity descriptions”) becomes straightforward by building on traditional document retrieval techniques. Accordingly, the bulk of the work presented in this chapter revolves around assembling term-based entity representations from various sources, ranging from unstructured documents to structured knowledge bases. We also discuss evaluation methodology and standard test collections.

Part I - Entity Ranking | Pp. 57-99

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_4

Semantically Enriched Models for Entity Ranking

Krisztian Balog

Perhaps the most exciting challenge and opportunity in entity retrieval is how to leverage entity-specific properties—attributes, types, and relationships—to improve retrieval performance. In this chapter, we take a departure from purely term-based approaches toward semantically enriched retrieval models. We look at a number of specific entity retrieval tasks that have been studied at various benchmarking campaigns. Specifically, these tasks are ad hoc entity retrieval, list search, related entity finding, and similar entity search. Additionally, we also consider measures of (static) entity importance.

Part I - Entity Ranking | Pp. 101-143

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_5

Entity Linking

Krisztian Balog

Being able to identify entities in a document is a key step toward understanding what the document is about. Entity linking refers to the process of annotating an input text with entity identifiers from a reference knowledge repository. We present a canonical pipeline approach to entity linking that consists of mention detection, candidate selection, and disambiguation components. Then, we look at each of these components in detail. We further discuss evaluation methodology, test collections, and publicly available entity linking systems.

Part II - Bridging Text and Structure | Pp. 147-188

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_6

Populating Knowledge Bases

Krisztian Balog

Knowledge base population refers to the task of discovering new facts about entities from a large text corpus, and augmenting a knowledge base with these facts. We start this chapter by giving a brief overview of the broader problem area of extracting structured information from unstructured data. Then, we present a two-step approach that facilitates knowledge base population. In step one, an incoming document stream is filtered to identify documents that potentially contain new facts about a given entity. In step two, the filtered documents are processed for extracting new facts.

Part II - Bridging Text and Structure | Pp. 189-222

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_7

Understanding Information Needs

Krisztian Balog

Understanding what the user is looking for is at the heart of delivering a quality search experience. The focus of this chapter is on obtaining semantically enriched representations of search queries with the help of knowledge repositories. Specifically, we (1) identify the types or categories of entities that are targeted by the query, (2) recognize specific entity mentions in queries and annotate them with unique identifiers from the underlying knowledge repository, and (3) automatically generate query templates from a search log, which then can provide structured interpretations of queries.

Part III - Semantic Search | Pp. 225-267

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_8

Leveraging Entities in Document Retrieval

Krisztian Balog

This chapter focuses on the classic problem of ad hoc document retrieval and discusses how entities may be leveraged to improve retrieval performance. Entities facilitate a semantic understanding of both the user’s information need, as expressed by the keyword query, and of the document’s content. We present three different families of approaches: (1) expansion-based methods, which utilize entities as a source of expansion terms to enrich the representation of the query; (2) projection-based methods, which treat entities as a latent layer, while leaving the original document/query representations intact; and (3) entity-based methods, which consider explicitly the entities that are recognized in documents, and embrace entity-based representations in “duet” with traditional term-based representations.

Part III - Semantic Search | Pp. 269-297

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_9

Utilizing Entities for an Enhanced Search Experience

Krisztian Balog

This chapter presents a selection of topics, where entities are utilized with the overall aim of improving the users’ search experiences. First, we discuss techniques for assisting users with articulating their information needs, including query assistance services and specialized query building interfaces. Next, we turn to the question of result presentation and introduce entity cards. Finally, we study entity recommendation methods that present users with contextual suggestions, encourage exploration, and allow for serendipitous discoveries.

Part III - Semantic Search | Pp. 299-336

Accedé al texto completo

doi: 10.1007/978-3-319-93935-3_10

Conclusions and Future Directions

Krisztian Balog

Today, the importance of entities has been broadly recognized and entities have become first-class citizens in many information access systems, including web, mobile, and enterprise search; question answering; and personal digital assistants. Entities have also become a meeting point for several research communities, including that of information retrieval, natural language processing, databases, and the Semantic Web. This final chapter concludes the book by summarizing progress, discussing limitations of current approaches, and pointing out potential future research directions.

Part III - Semantic Search | Pp. 337-348

Accedé al texto completo