Catálogo de publicaciones - libros

Compartir en
redes sociales


Transactions on Rough Sets IV

James F. Peters ; Andrzej Skowron (eds.)

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Artificial Intelligence (incl. Robotics); Mathematical Logic and Formal Languages; Computation by Abstract Devices; Database Management; Image Processing and Computer Vision

Disponibilidad
Institución detectada Año de publicación Navegá Descargá Solicitá
No detectada 2005 SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-29830-4

ISBN electrónico

978-3-540-32016-6

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

Información sobre derechos de publicación

© Springer-Verlag Berlin Heidelberg 2005

Tabla de contenidos

A Treatise on Rough Sets

Zdzisław Pawlak

This article presents some general remarks on rough sets and their place in general picture of research on vagueness and uncertainty – concepts of utmost interest, for many years, for philosophers, mathematicians, logicians and recently also for computer scientists and engineers particularly those working in such areas as AI, computational intelligence, intelligent systems, cognitive science, data mining and machine learning. Thus this article is intended to present some philosophical observations rather than to consider technical details or applications of rough set theory. Therefore we also refrain from presentation of many interesting applications and some generalizations of the theory.

Palabras clave: Sets; fuzzy sets; rough sets; antinomies; vagueness.

- Regular Papers | Pp. 1-17

On Optimization of Decision Trees

Igor V. Chikalov; Mikhail Ju. Moshkov; Maria S. Zelentsova

In the paper algorithms are considered which allow to consecutively optimize decision trees for decision tables with many-valued decisions relatively different complexity measures such as number of nodes, weighted depth, average weighted depth, etc. For decision tables over an arbitrary infinite restricted information system [5] these algorithms have (at least for the three mentioned measures) polynomial time complexity depending on the length of table description. For decision tables over one of such information systems experimental results of decision tree optimization are described.

Palabras clave: Decision trees; complexity measures; optimization.

- Regular Papers | Pp. 18-36

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

Dan Li; Jitender Deogun; William Spaulding; Bill Shuart

Missing data, commonly encountered in many fields of study, introduce inaccuracy in the analysis and evaluation. Previous methods used for handling missing data (e.g., deleting cases with incomplete information, or substituting the missing values with estimated mean scores), though simple to implement, are problematic because these methods may result in biased data models. Fortunately, recent advances in theoretical and computational statistics have led to more flexible techniques to deal with the missing data problem. In this paper, we present missing data imputation methods based on clustering, one of the most popular techniques in Knowledge Discovery in Databases (KDD). We combine clustering with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply fuzzy and rough clustering algorithms to deal with incomplete data. The experiments show that a hybridization of fuzzy set and rough set theories in missing data imputation algorithms leads to the best performance among our four algorithms, i.e., crisp K-means, fuzzy K-means, rough K-means, and rough-fuzzy K-means imputation algorithms.

Palabras clave: Missing data; imputation; K-means clustering; fuzzy sets; rough sets; rough-fuzzy hybridization.

- Regular Papers | Pp. 37-57

Characteristic Relations for Incomplete Data: A Generalization of the Indiscernibility Relation

Jerzy W. Grzymala-Busse

This paper shows that attribute-value pair blocks, used for many years in rule induction, may be used as well for computing indiscernibility relations for completely specified decision tables. Much more importantly, for incompletely specified decision tables, i.e., for data with missing attribute values, the same idea of attribute-value pair blocks is a convenient tool to compute characteristic sets, a generalization of equivalence classes of the indiscernibility relation, and also characteristic relations, a generalization of the indiscernibility relation. For incompletely specified decision tables there are three different ways lower and upper approximations may be defined: singleton, subset and concept. Finally, it is shown that, for a given incomplete data set, the set of all characteristic relations for the set of all congruent decision tables is a lattice.

Palabras clave: Characteristic Relation; Decision Table; Rule Induction; Indiscernibility Relation; Rule Induction Algorithm.

- Regular Papers | Pp. 58-68

Supervised Learning in the Gene Ontology Part I: A Rough Set Framework

Herman Midelfart

Prediction of gene function introduces a new learning problem where the decision classes associated with the objects (i.e., genes) are organized in a directed acyclic graph (DAG). Rough set theory, on the other hand, assumes that the classes are unrelated cannot handle this problem properly. To this end, we introduce a new rough set framework. The traditional decision system is extended into DAG decision system which can represent the DAG. From this system we develop several new operators, which can determine the known and the potential objects of a class and show how these sets can be combined with the usual rough set approximations. The properties of these operators are also investigated.

Palabras clave: Gene Ontology; Directed Acyclic Graph; Decision Class; Information Vector; Gene Ontology Consortium.

- Regular Papers | Pp. 69-97

Supervised Learning in the Gene Ontology Part II: A Bottom-Up Algorithm

Herman Midelfart

Prediction of gene function for expression profiles introduces a new problem for supervised learning algorithms. The decision classes are taken from an ontology, which defines relationships between the classes. Supervised algorithms, on the other hand, assumes that the classes are unrelated. Hence, we introduce a new algorithm which can take these relationships into account. This is tested on a microarray data set created from human fibroblast cells and on several artificial data sets. Since standard performance measures do not apply to this problem, we also introduce several new measures for measuring classification performance in an ontology.

Palabras clave: Gene Ontology; Irrelevant Attribute; Decision Class; Training Accuracy; Conditional Attribute.

- Regular Papers | Pp. 98-124

Comparative Analysis of Deterministic and Nondeterministic Decision Tree Complexity Local Approach

Mikhail Ju. Moshkov

For problems over arbitrary information system we study the relationships among the complexity of a problem description, the minimal complexity of a decision tree solving this problem deterministically, and the minimal complexity of a decision tree solving this problem nondeterministically. We consider the local approach to investigation of decision trees where only attributes from a problem description are used for construction of decision trees solving this problem.

Palabras clave: Decision tree; rough set theory; complexity.

- Regular Papers | Pp. 125-143

A Fast Host-Based Intrusion Detection System Using Rough Set Theory

Sanjay Rawat; V. P. Gulati; Arun K. Pujari

Intrusion Detection system has become the main research focus in the area of information security. Last few years have witnessed a large variety of technique and model to provide increasingly efficient intrusion detection solutions. We advocate here that the intrusive behavior of a process is highly localized characteristics of the process. There are certain smaller episodes in a process that make the process intrusive in an otherwise normal stream. As a result it is unnecessary and most often misleading to consider the whole process in totality and to attempt to characterize its abnormal features. In the present work we establish that subsequences of reasonably small length of sequence of system calls would suffice to identify abnormality in a process. We make use of rough set theory to demonstrate this concept. Rough set theory also facilitates identifying rules for intrusion detection. The main contributions of the paper are the following- (a) It is established that very small subsequence of system call is sufficient to identify intrusive behavior with high accuracy. We demonstrate our result using DARPA’98 BSM data; (b) A rough set based system is developed that can extract rules for intrusion detection; (c) An algorithm is presented that can determine the status of a process as either normal or abnormal on-line.

Palabras clave: Data mining; Decision Table; Rough Set; Intrusion Detection; Anomaly; Misuse.

- Regular Papers | Pp. 144-161

Incremental Learning and Evaluation of Structures of Rough Decision Tables

Wojciech Ziarko

Rough decision tables were introduced by Pawlak in the context of rough set theory. A rough decision table represents, a non-functional in general, relationship between two groups of properties of objects, referred to as condition and decision attributes, respectively. In practical applications, the rough decision tables are normally learned from data. In this process, for better coverage of the domain of interest, they can be structured into hierarchies. To achieve convergence of the learned hierarchy of rough decision tables to a stable final state, it is desirable to avoid total regeneration of the learned structure after new objects, not represented in the hierarchy, are encountered. This can be accomplished through an incremental learning process in which the structure of rough decision tables is updated, rather than regenerated, after new observations appeared. The introduction and the investigation of this incremental learning process within the framework of the rough set model is the main theme of the article. The article is also concerned with evaluation of learned decision tables and their structures by introducing the absolute gain function to measure the quality of information represented by the tables.

Palabras clave: Boundary Area; Decision Table; Incremental Learning; Decision Attribute; Information Table.

- Regular Papers | Pp. 162-177

A Framework for Reasoning with Rough Sets

Aida Vitória

Rough sets framework has two appealing aspects. First, it is a mathematical approach to deal with vague concepts. Second, rough set techniques can be used in data analysis to find patterns hidden in the data. The number of applications of rough sets to practical problems in different fields demonstrates the increasing interest in this framework and its applicability. This thesis proposes a language that caters for implicit definitions of rough sets obtained by combining different regions of other rough sets. In this way, concept approximations can be derived by taking into account domain knowledge. A declarative semantics for the language is also discussed. It is then shown that programs in the proposed language can be compiled to extended logic programs under the paraconsistent stable model semantics. The equivalence between the declarative semantics of the language and the declarative semantics of the compiled programs is proved. This transformation provides the computational basis for implementing our ideas. A query language for retrieving information about the concepts represented through the defined rough sets is also discussed. Several motivating applications are described. Finally, an extension of the proposed language with numerical measures is presented. This extension is motivated by the fact that numerical measures are an important aspect in data mining applications.

Palabras clave: Decision Rule; Logic Program; Integrity Constraint; Decision Table; Predicate Symbol.

- Dissertations and Monographs | Pp. 178-276