Catálogo de publicaciones - libros
Título de Acceso Abierto
Entity-Oriented Search
Krisztian Balog
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Information Storage and Retrieval; Artificial Intelligence (incl. Robotics); Probability and Statistics in Computer Science; Information Systems Applications (incl. Internet)
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No requiere | 2018 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-319-93933-9
ISBN electrónico
978-3-319-93935-3
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2018
Información sobre derechos de publicación
© The Editor(s) (if applicable) and the Author(s) 2018
Tabla de contenidos
Introduction
Krisztian Balog
Entity-oriented search is the search paradigm of organizing and accessing information centered around entities, and their attributes and relationships. This introductory chapter defines what an entity is, identifies prominent contexts for entity-oriented search, presents a number of specific tasks, puts the subject into a historical perspective, and lays the foundations for the rest of the book.
Pp. 1-23
Meet the Data
Krisztian Balog
This chapter introduces the different types of data sources, from unstructured to structured, that will be used in the remainder of the book. Specifically, we discuss the web, Wikipedia, and knowledge bases. We further introduce standard datasets and provide pointers to tools and resources.
Pp. 25-53
Term-Based Models for Entity Ranking
Krisztian Balog
Ad hoc entity retrieval is the task of answering a free text query with a ranked list of entities. The main idea behind our approaches in this chapter can be summarized as follows: If textual representations can be constructed for entities, then the ranking of these representations (“entity descriptions”) becomes straightforward by building on traditional document retrieval techniques. Accordingly, the bulk of the work presented in this chapter revolves around assembling term-based entity representations from various sources, ranging from unstructured documents to structured knowledge bases. We also discuss evaluation methodology and standard test collections.
Part I - Entity Ranking | Pp. 57-99
Semantically Enriched Models for Entity Ranking
Krisztian Balog
Perhaps the most exciting challenge and opportunity in entity retrieval is how to leverage entity-specific properties—attributes, types, and relationships—to improve retrieval performance. In this chapter, we take a departure from purely term-based approaches toward semantically enriched retrieval models. We look at a number of specific entity retrieval tasks that have been studied at various benchmarking campaigns. Specifically, these tasks are ad hoc entity retrieval, list search, related entity finding, and similar entity search. Additionally, we also consider measures of (static) entity importance.
Part I - Entity Ranking | Pp. 101-143
Entity Linking
Krisztian Balog
Being able to identify entities in a document is a key step toward understanding what the document is about. Entity linking refers to the process of annotating an input text with entity identifiers from a reference knowledge repository. We present a canonical pipeline approach to entity linking that consists of mention detection, candidate selection, and disambiguation components. Then, we look at each of these components in detail. We further discuss evaluation methodology, test collections, and publicly available entity linking systems.
Part II - Bridging Text and Structure | Pp. 147-188
Populating Knowledge Bases
Krisztian Balog
Knowledge base population refers to the task of discovering new facts about entities from a large text corpus, and augmenting a knowledge base with these facts. We start this chapter by giving a brief overview of the broader problem area of extracting structured information from unstructured data. Then, we present a two-step approach that facilitates knowledge base population. In step one, an incoming document stream is filtered to identify documents that potentially contain new facts about a given entity. In step two, the filtered documents are processed for extracting new facts.
Part II - Bridging Text and Structure | Pp. 189-222
Understanding Information Needs
Krisztian Balog
Understanding what the user is looking for is at the heart of delivering a quality search experience. The focus of this chapter is on obtaining semantically enriched representations of search queries with the help of knowledge repositories. Specifically, we (1) identify the types or categories of entities that are targeted by the query, (2) recognize specific entity mentions in queries and annotate them with unique identifiers from the underlying knowledge repository, and (3) automatically generate query templates from a search log, which then can provide structured interpretations of queries.
Part III - Semantic Search | Pp. 225-267
Leveraging Entities in Document Retrieval
Krisztian Balog
This chapter focuses on the classic problem of ad hoc document retrieval and discusses how entities may be leveraged to improve retrieval performance. Entities facilitate a semantic understanding of both the user’s information need, as expressed by the keyword query, and of the document’s content. We present three different families of approaches: (1) expansion-based methods, which utilize entities as a source of expansion terms to enrich the representation of the query; (2) projection-based methods, which treat entities as a latent layer, while leaving the original document/query representations intact; and (3) entity-based methods, which consider explicitly the entities that are recognized in documents, and embrace entity-based representations in “duet” with traditional term-based representations.
Part III - Semantic Search | Pp. 269-297
Utilizing Entities for an Enhanced Search Experience
Krisztian Balog
This chapter presents a selection of topics, where entities are utilized with the overall aim of improving the users’ search experiences. First, we discuss techniques for assisting users with articulating their information needs, including query assistance services and specialized query building interfaces. Next, we turn to the question of result presentation and introduce entity cards. Finally, we study entity recommendation methods that present users with contextual suggestions, encourage exploration, and allow for serendipitous discoveries.
Part III - Semantic Search | Pp. 299-336
Conclusions and Future Directions
Krisztian Balog
Today, the importance of entities has been broadly recognized and entities have become first-class citizens in many information access systems, including web, mobile, and enterprise search; question answering; and personal digital assistants. Entities have also become a meeting point for several research communities, including that of information retrieval, natural language processing, databases, and the Semantic Web. This final chapter concludes the book by summarizing progress, discussing limitations of current approaches, and pointing out potential future research directions.
Part III - Semantic Search | Pp. 337-348