Catálogo de publicaciones - libros
Fuzzy Systems and Knowledge Discovery: Second International Conference, FSKD 2005, Changsha, China, August 27-29, 2005, Proceedings, Part II
Lipo Wang ; Yaochu Jin (eds.)
En conferencia: 2º International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) . Changsha, China . August 27, 2005 - August 29, 2005
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Theory of Computation; Artificial Intelligence (incl. Robotics); Mathematical Logic and Formal Languages; Computation by Abstract Devices; Algorithm Analysis and Problem Complexity; Image Processing and Computer Vision
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2005 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-28331-7
ISBN electrónico
978-3-540-31828-6
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2005
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2005
Tabla de contenidos
doi: 10.1007/11540007_3
Difference-Similitude Matrix in Text Classification
Xiaochun Huang; Ming Wu; Delin Xia; Puliu Yan
Text classification can greatly improve the performance of information retrieval and information filtering, but high dimensionality of documents baffles the applications of most classification approaches. This paper proposed a Difference-Similitude Matrix (DSM) based method to solve the problem. The method represents a pre-classified collection as an item-document matrix, in which documents in same categories are described with similarities while documents in different categories with differences. Using the DSM reduction algorithm, simpler and more efficient than rough set reduction, we reduced the dimensionality of document space and generated rules for text classification.
- Dimensionality Reduction | Pp. 21-30
doi: 10.1007/11540007_4
A Study on Feature Selection for Toxicity Prediction
Gongde Guo; Daniel Neagu; Mark T. D. Cronin
The increasing amount and complexity of data used in predictive toxicology calls for efficient and effective feature selection methods in data pre-processing for data mining. In this paper, we propose a kNN model-based feature selection method (kNNMFS) aimed at overcoming the weaknesses of ReliefF method. It modifies the ReliefF method by: (1) using a kNN model as the starter selection aimed at choosing a set of more meaningful representatives to replace the original data for feature selection; (2) integration of the Heterogeneous Value Difference Metric to handle heterogeneous applications – those with both ordinal and nominal features; and (3) presenting a simple method of difference function calculation. The performance of kNNMFS was evaluated on a toxicity data set Phenols using a linear regression algorithm. Experimental results indicate that kNNMFS has a significant improvement in the classification accuracy for the trial data set.
- Dimensionality Reduction | Pp. 31-34
doi: 10.1007/11540007_5
Application of Feature Selection for Unsupervised Learning in Prosecutors’ Office
Peng Liu; Jiaxian Zhu; Lanjuan Liu; Yanhong Li; Xuefeng Zhang
Feature selection is effective in removing irrelevant data. However, the result of feature selection in unsupervised learning is not as satisfying as that in supervised learning. In this paper, we propose a novel methodology ULAC (Feature Selection for Unsupervised Learning Based on Attribute Correlation Analysis and Clustering Algorithm) to identify important features for unsupervised learning. We also apply ULAC into prosecutors’ office to solve the real world application for unsupervised learning.
- Dimensionality Reduction | Pp. 35-38
doi: 10.1007/11540007_6
A Novel Field Learning Algorithm for Dual Imbalance Text Classification
Ling Zhuang; Honghua Dai; Xiaoshu Hang
Fish-net algorithm is a novel field learning algorithm which derives classification rules by looking at the range of values of each attribute instead of the individual point values. In this paper, we present a Feature Selection Fish-net learning algorithm to solve the Dual Imbalance problem on text classification. Dual imbalance includes the instance imbalance and feature imbalance. The instance imbalance is caused by the unevenly distributed classes and feature imbalance is due to the different document length. The proposed approach consists of two phases: (1) select a feature subset which consists of the features that are more supportive to difficult minority class; (2) construct classification rules based on the original Fish-net algorithm. Our experimental results on Reuters21578 show that the proposed approach achieves better balanced accuracy rate on both majority and minority class than Naive Bayes MultiNomial and SVM.
- Dimensionality Reduction | Pp. 39-48
doi: 10.1007/11540007_9
Robust Nonlinear Dimension Reduction: A Self-organizing Approach
Yuexian Hou; Liyue Yao; Pilian He
Most NDR algorithms need to solve large-scale eigenvalue problems or some variation of eigenvalue problems, which is of quadratic complexity of time and might be unpractical in case of large-size data sets. Besides, current algorithms are global, which are often sensitive to noise and disturbed by ill-conditioned matrix. In this paper, we propose a novel self-organizing NDR algorithm: SIE. The time complexity of SIE is O(NlogN). The main computing procedure of SIE is local, which improves the robustness of the algorithm remarkably.
- Dimensionality Reduction | Pp. 67-72
doi: 10.1007/11540007_12
Palmprint Identification Algorithm Using Hu Invariant Moments
Jin Soo Noh; Kang Hyeon Rhee
Recently, Biometrics-based personal identification is regarded as an effective method of person’s identity with recognition automation and high performance. In this paper, the palmprint recognition method based on Hu invariant moment is proposed. And the low-resolution (75dpi) palmprint image (135×135 Pixel) is used for the small scale database of the effectual palmprint recognition system. The proposed system is consists of two parts: firstly, the palmprint fixed equipment for the acquisition of the correctly palmprint image and secondly, the algorithm of the efficient processing for the palmprint recognition.
- Pattern Recognition and Trend Analysis | Pp. 91-94
doi: 10.1007/11540007_13
Generalized Locally Nearest Neighbor Classifiers for Object Classification
Wenming Zheng; Cairong Zou; Li Zhao
In this paper, we extend the locally nearest neighbor classifiers to tackle the nonlinear classification problems via the kernel trick. The better performance is confirmed by the handwritten zip code digits classification experiments on the US Postal Service (USPS) database.
- Pattern Recognition and Trend Analysis | Pp. 95-99
doi: 10.1007/11540007_15
A PPM Prediction Model Based on Web Objects’ Popularity
Lei Shi; Zhimin Gu; Yunxia Pei; Lin Wei
Web prefetching technique is one of the primary solutions used to reduce Web access latency and improve the quality of service. This paper makes use of Zipf’s 1st law and Zipf’s 2nd law to model the Web objects’ popularity, where Zipf’s 1st law is employed to model the high frequency Web objects and 2nd law for the low frequency Web objects, and proposes a PPM prediction model based on Web objects’ popularity for Web prefetching. A performance evaluation of the model is presented using real server logs. Trace-driven simulation results show that not only the model is easily to be implemented, but also can achieve a high prediction precision at the cost of relative low storage complexity and network traffic.
- Pattern Recognition and Trend Analysis | Pp. 110-119
doi: 10.1007/11540007_17
Axial Representation of Character by Using Wavelet Transform
Xinge You; Bin Fang; Yuan Yan Tang; Luoqing Li; Dan Zhang
Axial representation plays a significant role in character recognition. The strokes of a character may consist of two regions, i.e. singular and regular regions. Therefore, a method to extract the central axis of a character requires two different processes to compute the axis in theses two different regions. The major problem of most traditional algorithms is that the extracted central axis in the singular region may be distorted by artifacts and branches. To overcome this problem, the wavelet-based amendment processing technique is developed to link the primary axis, so that the central axis in the singular region can be produced. Combining with our previously developed method for computing the primary axis in the regular region, we develop a novel scheme of extracting the central axis of character based on the wavelet transform (WT). Experimental results show that the final axis obtained from the proposed scheme closely resembles the human perceptions. It is applicable to both binary image and gray-level image as well. The axis representation is robust against noise.
- Pattern Recognition and Trend Analysis | Pp. 130-139
doi: 10.1007/11540007_19
A Hybrid Artificial Intelligent-Based Criteria-Matching with Classification Algorithm
Alex T. H. Sim; Vincent C. S. Lee
Classifying dynamic behavioural based events, for example human behaviour profile, is a non-trivial task. In this paper, we propose an AI-based criteria-matching with classification algorithm which can be used to classify preference based decision outcome. The proposed algorithm is mathematically justified and with more practical benefits than a conventional multivariate discriminant analysis algorithm which is widely used for prediction tasks. Real world (Singapore) diamond dataset test results revealed the practical usefulness of our proposed algorithm to diamond sellers in Singapore.
- Pattern Recognition and Trend Analysis | Pp. 150-159