Catálogo de publicaciones - libros
Information Retrieval Technology: Second Asia Information Retrieval Symposium, AIRS 2005, Jeju Island, Korea, October 13-15, 2005, Proceedings
Gary Geunbae Lee ; Akio Yamada ; Helen Meng ; Sung Hyon Myaeng (eds.)
En conferencia: 2º Asia Information Retrieval Symposium (AIRS) . Jeju Island, South Korea . October 13, 2005 - October 15, 2005
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Information Storage and Retrieval; Library Science; Theory of Computation; Information Systems Applications (incl. Internet); Algorithm Analysis and Problem Complexity; Data Structures
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2005 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-29186-2
ISBN electrónico
978-3-540-32001-2
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2005
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2005
Cobertura temática
Tabla de contenidos
doi: 10.1007/11562382_61
KNM: A Novel Intelligent User Interface for Webpage Navigation
Shiguang Liu; Hua-Jun Zeng; Zheng Chen; Chuangbai Xiao; Wei-Ying Ma
In order to help readers grasp key information from web pages within limited time, a novel intelligent user interface for webpage navigation: Keyphrase Navigation Map (KNM) is proposed in this paper, which presents a map to the readers, assisting them to navigate their webpage. First, the key phrases are generated from the webpage and ranked by the keyphrase extraction engine, then they are clustered by keyphrase relevancy, and finally they are shown on the thumbnail according to their relative coordinates in the full page. The usability test proves that the KNM which presents an overview of the corresponding web page to readers can really help them to find information they are interested in effectively.
- Poster and Demo Session 2 | Pp. 641-646
doi: 10.1007/11562382_62
Towards Construction of Evaluation Framework for Query Expansion
Masaharu Yoshioka
Query expansion is an important technique for achieving higher retrieval performance. However, a good framework for evaluating this technique in isolation does not exist because the effect of query expansion depends on the quality of the initial query. Feature quantities that characterize the quality of the initial query are defined in this study. Correlations between these quantities and retrieval performance were analyzed using the NTCIR-4 web test collection.
- Poster and Demo Session 2 | Pp. 647-652
doi: 10.1007/11562382_63
An Efficient Incremental Nearest Neighbor Algorithm for Processing -Nearest Neighbor Queries with Visal and Semantic Predicates in Multimedia Information Retrieval System
Dong-Ho Lee; Dong-Joo Park
Recently, advanced multimedia applications, such as geographic information system, and content-based image/video retrieval system, require the efficient processing of -nearest neighbor queries with semantic predicates as well as visual predicates. In this paper, we propose an integrated index structure, so-called SPY-TEC+, that provides an efficient method for indexing visual and semantic feature information at the same time using the SPY-TEC and the signature file. We also propose an efficient incremental nearest neighbor algorithm for processing the -nearest neighbor queries with visual and semantic predicates on the SPY-TEC+.
- Poster and Demo Session 2 | Pp. 653-658
doi: 10.1007/11562382_64
Generating Term Transliterations Using Contextual Information and Validating Generated Results Using Web Corpora
Jin-Shea Kuo
Transliterating foreign entities into Chinese is usually done through direct-style approach. The direct-style approach transliterates each syllable rendered from foreign terms into Chinese directly. Not every syllable can be rendered. An approach utilizing contextual information for term transliteration is proposed in this paper to attack this problem. Traditionally, evaluating transliteration performances always uses character and word error rates. However, many transliteration variants of the same term always found. Validating the generated results using Web corpora is more suitable and is proposed in this paper. Using the proposed evaluation method, experiments on term transliteration were conducted. From the experimental results show that taking contextual information is helpful to term transliteration and validating the generated results using Web corpora can provide more concrete evidences to transliteration evaluation.
- Poster and Demo Session 2 | Pp. 659-665
doi: 10.1007/11562382_65
Handling Orthographic Varieties in Japanese IR: Fusion of Word-, N-Gram-, and Yomi-Based Indices Across Different Document Collections
Nina Kummer; Christa Womser-Hacker; Noriko Kando
Orthographic varieties are common in the Japanese language and represent a serious problem for Japanese information retrieval (IR), as IR systems run the risk of missing documents that contain variant forms of the search term. We propose two different strategies for handling orthographic varieties: pronunciation or yomi-based indexing and “Fuzzy Querying”, comparing katakana terms based on edit distance. Both strategies were integrated into our multiple index and fusion system [1] and tested using two different test collections, newspaper articles (Mainichi Shimbun ’98) and scientific abstracts (NTCIR-1), to compare their performance across text genres.
- Poster and Demo Session 2 | Pp. 666-672
doi: 10.1007/11562382_66
A Term Weighting Approach for Text Categorization
Kyung-Chan Lee; Seung-Shik Kang; Kwang-Soo Hahn
It is common that representative words in a document are identified and discriminated by their statistical distribution of their frequency statistics. We assume that evaluating the confidence measure of terms through content-based document analysis leads to a better performance than the parametric assumptions of the standard frequency-based method. In this paper, we propose a new approach of term weighting method that replaces the frequency-based probabilistic methods. Experiments on Naïve Bayesian classifiers showed that our approach achieved an improvement compared to the frequency-based method on each point of the evaluation.
- Poster and Demo Session 2 | Pp. 673-678
doi: 10.1007/11562382_67
A LF Based Answer Indexing Method for Encyclopedia Question-Answering System
Hyeon-Jin Kim; Ji-Hyun Wang; Chang-Ki Lee; Chung-Hee Lee; Myung-Gil Jang
This paper proposes a fast and effective question-answer system for encyclopedia domain using a new answer indexing method. We define about 160 answer types. The indexer generates AIU(Answer Index Unit) structures between answer candidates and content words within LF(Logical Form) and sentence boundary. We select essential terms among question terms using syntactic information for ranking the answer candidates. Experiments show our new method is good for the encyclopedia question-answering system.
- Poster and Demo Session 2 | Pp. 679-684
doi: 10.1007/11562382_68
Approximate Phrase Match to Compile Synonymous Translation Terms for Korean Medical Indexing
Jae Sung Lee; Hye Mi Yun
The medical thesaurus, MeSH, has been used to index medical documents. A Korean MeSH also has been developed, but it does not include many of the synonymous translations for the English terms. The coverage of synonymous translation is important to index medical documents correctly. In this paper, we propose an approximate phrase match method to extract synonymous translations from Korean medical documents, where parentheses are used to include English terms, or English keywords are used in the keyword field. The approximate phrase match is to handle the unregistered terms in a bilingual dictionary. The empirical evaluation showed that the proposed methods are very effective to compile translation phrase pairs.
- Poster and Demo Session 2 | Pp. 685-690
doi: 10.1007/11562382_69
Protein Function Classification Based on Gene Ontology
Dae-Won Park; Hyoung-Sam Heo; Hyuk-Chul Kwon; Hea-Young Chung
Most proteins interact with other proteins, cells, tissues or diseases. They have biological functions and can be classified according to their functions. With the functions and the functional relations of proteins, we can explain many biological phenomena and obtain answers in solving biological problems. Therefore, it is important to determine the functions of proteins. In this paper we present a protein function classification method for the function prediction of proteins. With human proteins assigned to GO molecular function terms, we measure the similarity of proteins to function classes using the functional distribution.
- Poster and Demo Session 2 | Pp. 691-696
doi: 10.1007/11562382_70
Extracting and Utilizing of IS-A Relation Patterns for Question Answering Systems
Bojun Shim; Youngjoong Ko; Jungyun Seo
Most of existing open domain question answering systems predefine the conceptual category to which answers can belong. So, they cannot generate appropriate answers in every case or must use a strategy that handles exceptions when the concept requested in the question is not prepared in the system. In this paper, we suggest a flexible strategy that can generate the candidate answers which correspond to any nominal target concepts. The proposed question answering system is equipped with general patterns that can extract hyponyms of the nominal target concept with their confidence scores. Therefore, it can create a set of candidate answers from the dynamically generated ontology when a user requests any nominal concept.
- Poster and Demo Session 2 | Pp. 697-702