Catálogo de publicaciones - libros
Transactions on Rough Sets IV
James F. Peters ; Andrzej Skowron (eds.)
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Artificial Intelligence (incl. Robotics); Mathematical Logic and Formal Languages; Computation by Abstract Devices; Database Management; Image Processing and Computer Vision
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2005 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-29830-4
ISBN electrónico
978-3-540-32016-6
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2005
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2005
Tabla de contenidos
doi: 10.1007/11574798_1
A Treatise on Rough Sets
Zdzisław Pawlak
This article presents some general remarks on rough sets and their place in general picture of research on vagueness and uncertainty – concepts of utmost interest, for many years, for philosophers, mathematicians, logicians and recently also for computer scientists and engineers particularly those working in such areas as AI, computational intelligence, intelligent systems, cognitive science, data mining and machine learning. Thus this article is intended to present some philosophical observations rather than to consider technical details or applications of rough set theory. Therefore we also refrain from presentation of many interesting applications and some generalizations of the theory.
Palabras clave: Sets; fuzzy sets; rough sets; antinomies; vagueness.
- Regular Papers | Pp. 1-17
doi: 10.1007/11574798_2
On Optimization of Decision Trees
Igor V. Chikalov; Mikhail Ju. Moshkov; Maria S. Zelentsova
In the paper algorithms are considered which allow to consecutively optimize decision trees for decision tables with many-valued decisions relatively different complexity measures such as number of nodes, weighted depth, average weighted depth, etc. For decision tables over an arbitrary infinite restricted information system [5] these algorithms have (at least for the three mentioned measures) polynomial time complexity depending on the length of table description. For decision tables over one of such information systems experimental results of decision tree optimization are described.
Palabras clave: Decision trees; complexity measures; optimization.
- Regular Papers | Pp. 18-36
doi: 10.1007/11574798_3
Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories
Dan Li; Jitender Deogun; William Spaulding; Bill Shuart
Missing data, commonly encountered in many fields of study, introduce inaccuracy in the analysis and evaluation. Previous methods used for handling missing data (e.g., deleting cases with incomplete information, or substituting the missing values with estimated mean scores), though simple to implement, are problematic because these methods may result in biased data models. Fortunately, recent advances in theoretical and computational statistics have led to more flexible techniques to deal with the missing data problem. In this paper, we present missing data imputation methods based on clustering, one of the most popular techniques in Knowledge Discovery in Databases (KDD). We combine clustering with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply fuzzy and rough clustering algorithms to deal with incomplete data. The experiments show that a hybridization of fuzzy set and rough set theories in missing data imputation algorithms leads to the best performance among our four algorithms, i.e., crisp K-means, fuzzy K-means, rough K-means, and rough-fuzzy K-means imputation algorithms.
Palabras clave: Missing data; imputation; K-means clustering; fuzzy sets; rough sets; rough-fuzzy hybridization.
- Regular Papers | Pp. 37-57
doi: 10.1007/11574798_4
Characteristic Relations for Incomplete Data: A Generalization of the Indiscernibility Relation
Jerzy W. Grzymala-Busse
This paper shows that attribute-value pair blocks, used for many years in rule induction, may be used as well for computing indiscernibility relations for completely specified decision tables. Much more importantly, for incompletely specified decision tables, i.e., for data with missing attribute values, the same idea of attribute-value pair blocks is a convenient tool to compute characteristic sets, a generalization of equivalence classes of the indiscernibility relation, and also characteristic relations, a generalization of the indiscernibility relation. For incompletely specified decision tables there are three different ways lower and upper approximations may be defined: singleton, subset and concept. Finally, it is shown that, for a given incomplete data set, the set of all characteristic relations for the set of all congruent decision tables is a lattice.
Palabras clave: Characteristic Relation; Decision Table; Rule Induction; Indiscernibility Relation; Rule Induction Algorithm.
- Regular Papers | Pp. 58-68
doi: 10.1007/11574798_5
Supervised Learning in the Gene Ontology Part I: A Rough Set Framework
Herman Midelfart
Prediction of gene function introduces a new learning problem where the decision classes associated with the objects (i.e., genes) are organized in a directed acyclic graph (DAG). Rough set theory, on the other hand, assumes that the classes are unrelated cannot handle this problem properly. To this end, we introduce a new rough set framework. The traditional decision system is extended into DAG decision system which can represent the DAG. From this system we develop several new operators, which can determine the known and the potential objects of a class and show how these sets can be combined with the usual rough set approximations. The properties of these operators are also investigated.
Palabras clave: Gene Ontology; Directed Acyclic Graph; Decision Class; Information Vector; Gene Ontology Consortium.
- Regular Papers | Pp. 69-97
doi: 10.1007/11574798_6
Supervised Learning in the Gene Ontology Part II: A Bottom-Up Algorithm
Herman Midelfart
Prediction of gene function for expression profiles introduces a new problem for supervised learning algorithms. The decision classes are taken from an ontology, which defines relationships between the classes. Supervised algorithms, on the other hand, assumes that the classes are unrelated. Hence, we introduce a new algorithm which can take these relationships into account. This is tested on a microarray data set created from human fibroblast cells and on several artificial data sets. Since standard performance measures do not apply to this problem, we also introduce several new measures for measuring classification performance in an ontology.
Palabras clave: Gene Ontology; Irrelevant Attribute; Decision Class; Training Accuracy; Conditional Attribute.
- Regular Papers | Pp. 98-124
doi: 10.1007/11574798_7
Comparative Analysis of Deterministic and Nondeterministic Decision Tree Complexity Local Approach
Mikhail Ju. Moshkov
For problems over arbitrary information system we study the relationships among the complexity of a problem description, the minimal complexity of a decision tree solving this problem deterministically, and the minimal complexity of a decision tree solving this problem nondeterministically. We consider the local approach to investigation of decision trees where only attributes from a problem description are used for construction of decision trees solving this problem.
Palabras clave: Decision tree; rough set theory; complexity.
- Regular Papers | Pp. 125-143
doi: 10.1007/11574798_8
A Fast Host-Based Intrusion Detection System Using Rough Set Theory
Sanjay Rawat; V. P. Gulati; Arun K. Pujari
Intrusion Detection system has become the main research focus in the area of information security. Last few years have witnessed a large variety of technique and model to provide increasingly efficient intrusion detection solutions. We advocate here that the intrusive behavior of a process is highly localized characteristics of the process. There are certain smaller episodes in a process that make the process intrusive in an otherwise normal stream. As a result it is unnecessary and most often misleading to consider the whole process in totality and to attempt to characterize its abnormal features. In the present work we establish that subsequences of reasonably small length of sequence of system calls would suffice to identify abnormality in a process. We make use of rough set theory to demonstrate this concept. Rough set theory also facilitates identifying rules for intrusion detection. The main contributions of the paper are the following- (a) It is established that very small subsequence of system call is sufficient to identify intrusive behavior with high accuracy. We demonstrate our result using DARPA’98 BSM data; (b) A rough set based system is developed that can extract rules for intrusion detection; (c) An algorithm is presented that can determine the status of a process as either normal or abnormal on-line.
Palabras clave: Data mining; Decision Table; Rough Set; Intrusion Detection; Anomaly; Misuse.
- Regular Papers | Pp. 144-161
doi: 10.1007/11574798_9
Incremental Learning and Evaluation of Structures of Rough Decision Tables
Wojciech Ziarko
Rough decision tables were introduced by Pawlak in the context of rough set theory. A rough decision table represents, a non-functional in general, relationship between two groups of properties of objects, referred to as condition and decision attributes, respectively. In practical applications, the rough decision tables are normally learned from data. In this process, for better coverage of the domain of interest, they can be structured into hierarchies. To achieve convergence of the learned hierarchy of rough decision tables to a stable final state, it is desirable to avoid total regeneration of the learned structure after new objects, not represented in the hierarchy, are encountered. This can be accomplished through an incremental learning process in which the structure of rough decision tables is updated, rather than regenerated, after new observations appeared. The introduction and the investigation of this incremental learning process within the framework of the rough set model is the main theme of the article. The article is also concerned with evaluation of learned decision tables and their structures by introducing the absolute gain function to measure the quality of information represented by the tables.
Palabras clave: Boundary Area; Decision Table; Incremental Learning; Decision Attribute; Information Table.
- Regular Papers | Pp. 162-177
doi: 10.1007/11574798_10
A Framework for Reasoning with Rough Sets
Aida Vitória
Rough sets framework has two appealing aspects. First, it is a mathematical approach to deal with vague concepts. Second, rough set techniques can be used in data analysis to find patterns hidden in the data. The number of applications of rough sets to practical problems in different fields demonstrates the increasing interest in this framework and its applicability. This thesis proposes a language that caters for implicit definitions of rough sets obtained by combining different regions of other rough sets. In this way, concept approximations can be derived by taking into account domain knowledge. A declarative semantics for the language is also discussed. It is then shown that programs in the proposed language can be compiled to extended logic programs under the paraconsistent stable model semantics. The equivalence between the declarative semantics of the language and the declarative semantics of the compiled programs is proved. This transformation provides the computational basis for implementing our ideas. A query language for retrieving information about the concepts represented through the defined rough sets is also discussed. Several motivating applications are described. Finally, an extension of the proposed language with numerical measures is presented. This extension is motivated by the fact that numerical measures are an important aspect in data mining applications.
Palabras clave: Decision Rule; Logic Program; Integrity Constraint; Decision Table; Predicate Symbol.
- Dissertations and Monographs | Pp. 178-276