Catálogo de publicaciones - libros
Web Information Systems Engineering: WISE 2005: 6th International Conference on Web Information Systems Engineering, New York, NY, USA, November 20-22, 2005, Proceedings
Anne H. H. Ngu ; Masaru Kitsuregawa ; Erich J. Neuhold ; Jen-Yao Chung ; Quan Z. Sheng (eds.)
En conferencia: 6º International Conference on Web Information Systems Engineering (WISE) . New York, NY, USA . November 20, 2005 - November 22, 2005
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Popular Computer Science; Information Systems Applications (incl. Internet); Information Storage and Retrieval; Database Management; Artificial Intelligence (incl. Robotics); Computers and Society
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2005 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-30017-5
ISBN electrónico
978-3-540-32286-3
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2005
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2005
Cobertura temática
Tabla de contenidos
doi: 10.1007/11581062_51
List Data Extraction in Semi-structured Document
Hui Xu; Juan-Zi Li; Peng Xu
The amount of semi-structured documents is tremendous online, such as business annual reports, online airport listings, catalogs, hotel directories, etc. List, which has structured characteristics, is used to store highly structured and database-like information in many semi-structured documents. This paper is about list data extraction from semi-structured documents. By list data extraction, we mean extracting data from lists and grouping it by rows and columns. List data extraction is of benefit to text mining applications on semi-structured documents. Recently, several methods are proposed to extract list data by utilizing the word layout and arrangement information [1, 2]. However, in the research community, few previous studies has so far sufficiently investigated the problem of making use of not only layout and arrangement information, but also the semantic information of words, to the best of our knowledge. In this paper, we propose a clustering based method making use of both the layout information and the semantic information of words for this extraction task. We show experimental results on plain-text annual reports from Shanghai Stock Exchange, in which 73.49% of the lists were extracted correctly.
- Poster Flash Session 1 | Pp. 584-585
doi: 10.1007/11581062_52
Optimization Issues for Keyword Search over Tree-Structured Documents
Sujeet Pradhan; Katsumi Tanaka
In this paper, we discuss one of several optimization issues regarding our algebraic query model for keyword search over tree-structured documents. In particular, we focus on the properties of a class of filters. The filters in this class not only restrict the size of query results, but also are capable of reducing the cost of query processing.
- Poster Flash Session 1 | Pp. 586-587
doi: 10.1007/11581062_53
Semantic Integration of Schema Conforming XML Data Sources
Dimitri Theodoratos; Theodore Dalamagas; I-Ting Liu
A challenging problem in Web engineering is the integration of XML data sources. Even if these data sources conform to schemas, they may have their schemas and the correspongind XML documents structured differently. In this paper, we address the problem of integrating XML data sources (a) by adding semantic information to document schemas, and (b) by using a query language that allows a partial specification of tree patterns. The semantic information allows the grouping of elements into the so called schema dimensions . Our approach allows querying data sources with different schemas in an integrated way. Users posing queries have the flexibility to specify structural constraints fully, partially or not at all. Our approach was initially developed for arbitrarily structured data sources [1]. Here, we show how this approach can be applied to tree-structured data sources that comply to schemas.
- Poster Flash Session 1 | Pp. 588-589
doi: 10.1007/11581062_54
Extract Salient Words with WordRank for Effective Similarity Search in Text Data
Xiaojun Wan; Jianwu Yang
We propose a method named WordRank to extract a few salient words from the target document and then use these words to retrieve similar documents based on popular retrieval functions. The set of extracted words is a concise and topic-oriented representation of the target document and reduces the ambiguous and noisy information in the document, so as to improve the retrieval performance. Experiments and results demonstrate the high effectiveness of the proposed approach.
- Poster Flash Session 1 | Pp. 590-591
doi: 10.1007/11581062_55
Intensional P2P Mappings Between RDF Ontologies
Zoran Majkić
We consider the Peer-To-Peer (P2P) database system with RDF ontologies and with the semantic characterization of P2P mappings based on logical views over local peer’s ontology. Such kind of virtual-predicate based mappings needs an embedding of RDF ontologies into a predicate first-order logic, or at some of its sublanguages as, for example, logic programs for deductive databases. We consider a peer as a local epistemic logic system with its own belief based on RDF tuples, independent from other peers and their own beliefs. This motivates the need of a semantic characterization of P2P mappings based not on the extension but on the meaning of concepts used in the mappings, that is, based on intensional logic. We show that it adequately models robust weakly-coupled framework of RDF ontologies and supports decidable query answering.The approach to use conventional first order logic (FOL) as the semantic underpinning for RDF has many advantages: FOL is well established and well understood. We will consider an RDF-ontology as finite set of triples < r , p , v >, where r is a resource name (for class, an instance or a value), p is a property (InstanceOf or Property in RDF, or Subclass or Property in RDFS), and v is a value (which could also be a resource name). We denote by $\mathcal{T}$ the set of all triples which satisfy such requirements.
- Poster Flash Session 1 | Pp. 592-594
doi: 10.1007/11581062_56
Meta-modeling of Educational Practices for Adaptive Web Based Education Systems
Manuel Caeiro-Rodríguez; Martín Llamas-Nistal; Luis Anido-Rifón
This paper proposes a component-based architecture for adaptive Web-based education systems that support a particular kind of EML models. Our work is concerned with the development of an EML meta-model to provide an enhanced support for the modeling of collaborative practices. The proposal is based on the identification of perspectives.
- Poster Flash Session 1 | Pp. 595-596
doi: 10.1007/11581062_57
An On-line Intelligent Recommendation System for Digital Products Using Fuzzy Logic
Yukun Cao; Yunfeng Li; Chengliang Wang
Developing an intelligent recommendation system is a good way to overcome the problem of products information overload. We believe that the personalized recommendation system should be build according the special features of a certain kind of products, thereby forming professional recommendation systems for different products. In the paper, we propose a system for digital products, such as laptop, digital camera, PDA, etc. The approach utilizes fuzzy logic to retrieve optimal products based on the consumer’s current preferences from the system-user interactions. Experimental results show the promise of our systems.
- Poster Flash Session 1 | Pp. 597-598
doi: 10.1007/11581062_58
Consensus Making on the Semantic Web: Personalization and Community Support
Anna V. Zhdanova; Francisco Martín-Recuerda
We propose a framework for ontology-based consensus making, which is grounded on personalization and community support. Corresponding software is designed to be naturally deployed in community Web environments.
Palabras clave: Community Support; Ontology Alignment; Ontology View; Ontology Management; Consensus Framework.
- Poster Flash Session 1 | Pp. 599-600
doi: 10.1007/11581062_59
Dictionary-Based Voting Text Categorization in a Chemistry-Focused Search Engine
Chunyan Liang; Li Guo; Zhaojie Xia; Xiaoxia Li; Zhangyuan Yang
A chemistry-focused search engine, named ChemEngine, is developed to help chemists to get chemical information more conveniently and precisely on Internet. Text Categorization is used in ChemEngine to facilitate users’ search. The semantic similarity and noisy data in chemical web pages make traditional classifier perform poorly on them. To classify chemical web pages more accurately, a new text categorization approach based on dictionary and voting is proposed and integrated into the ChemEngine.
Palabras clave: Semantic Similarity; Traditional Classifier; Result List; Train Data; Vote Method.
- Poster Flash Session 1 | Pp. 601-602
doi: 10.1007/11581062_60
An Approach to Securely Interconnect Geo Web Services
Adrian Spalka; Stefan Schulz
Web Services play a growing role in the geographic community. Efforts to establish a Spatial Data Infrastructure (SDI) are coordinated by the Open Geospatial Consortium (OGC). However, as the infrastructure gets established, and more content providers wish to offer their products, questions of security arise. Geographical services often stand to gain tremendously from composition and delegation of requests, where one piece of data is constructed from several sets of data, using different sources. However, current standards by the OGC regulate only message transfers, without taking considerations of access control, security and privacy into account.
- Poster Flash Session 2 | Pp. 603-604