Catálogo de publicaciones - libros
Independent Component Analysis and Signal Separation: 7th International Conference, ICA 2007, London, UK, September 9-12, 2007. Proceedings
Mike E. Davies ; Christopher J. James ; Samer A. Abdallah ; Mark D Plumbley (eds.)
En conferencia: 7º International Conference on Independent Component Analysis and Signal Separation (ICA) . London, UK . September 9, 2007 - September 12, 2007
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
No disponibles.
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2007 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-74493-1
ISBN electrónico
978-3-540-74494-8
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2007
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2007
Cobertura temática
Tabla de contenidos
Image Compression by Redundancy Reduction
Carlos Magno Sousa; André Borges Cavalcante; Denner Guilhon; Allan Kardec Barros
Image compression is achieved by reducing redundancy between neighboring pixels but preserving features such as edges and contours of the original image. Deterministic and statistical models are usually employed to reduce redundancy. Compression methods that use statistics have heavily been influenced by neuroscience research. In this work, we propose an image compression system based on the concept derived from neural information processing models. The system performance is compared with principal component analysis (PCA) and the discrete cosine transform (DCT) at several compression ratios (CR). Evaluation through both visual inspection and objective measurements showed that the proposed system is more robust to distortions such as ringing and block artifacts than PCA and DCT.
- Sparse Methods | Pp. 422-429
Complex Nonconvex Norm Minimization for Underdetermined Source Separation
Emmanuel Vincent
Underdetermined source separation methods often rely on the assumption that the time-frequency source coefficients are independent and Laplacian distributed. In this article, we extend these methods by assuming that these coefficients follow a generalized Gaussian prior with shape parameter . We study mathematical and experimental properties of the resulting complex nonconvex norm optimization problem in a particular case and derive an efficient global optimization algorithm. We show that the best separation performance for three-source stereo convolutive speech mixtures is achieved for small .
- Sparse Methods | Pp. 430-437
Sparse Component Analysis in Presence of Noise Using an Iterative EM-MAP Algorithm
Hadi Zayyani; Massoud Babaie-Zadeh; G. Hosein Mohimani; Christian Jutten
In this paper, a new algorithm for source recovery in under-determined Sparse Component Analysis (SCA) or atomic decomposition on over-complete dictionaries is presented in the noisy case. The algorithm is essentially a method for obtaining sufficiently sparse solutions of under-determined systems of linear equations with additive Gaussian noise. The method is based on iterative Expectation-Maximization of a Maximum A Posteriori estimation of sources (EM-MAP) and a new steepest-descent method is introduced for the optimization in the M-step. The solution obtained by the proposed algorithm is compared to the minimum ℓ-norm solution achieved by Linear Programming (LP). It is experimentally shown that the proposed algorithm is about one order of magnitude faster than the interior-point LP method, while providing better accuracy.
- Sparse Methods | Pp. 438-445
Mutual Interdependence Analysis (MIA)
Heiko Claussen; Justinian Rosca; Robert Damper
Functional Data Analysis (FDA) is used for datasets that are more meaningfully represented in the functional form. Functional principal component analysis, for instance, is used to extract a set of functions of maximum variance that can represent the data. In this paper, a method of Mutual Interdependence Analysis (MIA) is proposed that can extract an equally correlated function with a set of inputs. Formally, the MIA criterion defines the function whose mean variance of correlations with all inputs is minimized. The meaningfulness of the MIA extraction is proven on real data. In a simple text independent speaker verification example, MIA is used to extract a signature function per each speaker, and results in an equal error rate of 2.9 % in the set of 168 speakers.
- Speech and Audio Applications | Pp. 446-453
Modeling Perceptual Similarity of Audio Signals for Blind Source Separation Evaluation
Brendan Fox; Andrew Sabin; Bryan Pardo; Alec Zopf
Existing perceptual models of audio quality, such as PEAQ, were designed to measure audio codec performance and are not well suited to evaluation of audio source separation algorithms. The relationship of many other signal quality measures to human perception is not well established. We collected subjective human assessments of distortions encountered when separating audio sources from mixtures of two to four harmonic sources. We then correlated these assessments to 18 machine-measurable parameters. Results show a strong correlation (r=0.96) between a linear combination of a subset of four of these parameters and mean human assessments. This correlation is stronger than that between human assessments and several measures currently in use.
- Speech and Audio Applications | Pp. 454-461
Beamforming Initialization and Data Prewhitening in Natural Gradient Convolutive Blind Source Separation of Speech Mixtures
Malay Gupta; Scott C. Douglas
Successful speech enhancement by convolutive blind source separation (BSS) techniques requires careful design of all aspects of the chosen separation method. The conventional strategy for system initialization in both time- and frequency-domain BSS involves a FIR filter matrix and no data preprocessing; however, this strategy may not be the best for any chosen separation algorithm. In this paper, we experimentally evaluate two different approaches for potentially-improving the performance of time-domain and frequency-domain natural gradient speech separation algorithms – of the signal mixtures, and beamforming initialization for the separation system – to determine which of the two classes of algorithms benefit most from them. Our results indicate that frequency-domain-based natural gradient BSS methods generally need geometric information about the system to obtain any reasonable separation quality. For time-domain natural gradient separation algorithms, either beamforming initialization or prewhitening improves separation performance, particularly for larger-scale problems involving three or more sources and sensors.
- Speech and Audio Applications | Pp. 462-470
Blind Vector Deconvolution: Convolutive Mixture Models in Short-Time Fourier Transform Domain
Atsuo Hiroe
For short-time Fourier Transform (STFT) domain ICA, dealing with reverberant sounds is a significant issue. It often invites a dilemma on STFT frame length: frames shorter than reverberation time (short frames) generate incomplete instantaneous mixtures, while too long frames may disturb the separation.
To improve the separation of such reverberant sounds, the authors propose a new framework which accounts for STFT with short frames. In this framework, time domain convolutive mixtures are transformed to STFT domain convolutive mixtures. For separating the mixtures, an approach of applying another STFT is presented so as to treat them as instantaneous mixtures.
The authors experimentally confirmed that this framework outperforms the conventional STFT domain ICA.
- Speech and Audio Applications | Pp. 471-479
A Batch Algorithm for Blind Source Separation of Acoustic Signals Using ICA and Time-Frequency Masking
Eugen Hoffmann; Dorothea Kolossa; Reinhold Orglmeister
The problem of (BSS) of convolved acoustic signals is of great interest for many classes of applications such as in-car speech recognition, hands-free telephony or hearing devices. Due to the convolutive mixing process, the source separation is performed in the frequency domain, using (ICA). However the quality of solution of the ICA-algorithms can be improved by applying . In this paper we present a batch-algorithm for time-frequency masking using the time-frequency structure of separated signals.
- Speech and Audio Applications | Pp. 480-487
The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
Maria G. Jafari; Mark D. Plumbley
In this paper, we investigate the importance of the high frequencies in the problem of convolutive blind source separation (BSS) of speech signals. In particular, we focus on frequency domain blind source separation (FD-BSS), and show that when separation is performed in the low frequency bins only, the recovered signals are similar in quality to those extracted when all frequencies are taken into account. The methods are compared through informal listening tests, as well as using an objective measure.
- Speech and Audio Applications | Pp. 488-494
Signal Separation by Integrating Adaptive Beamforming with Blind Deconvolution
Kostas Kokkinakis; Philipos C. Loizou
In this paper, we present a broadband two-microphone blind spatial separation technique by efficiently combining adaptive beamforming (ABF) with multichannel blind deconvolution (MBD). First, the inaccessible source signal streams are partially identified by simple time-delay steering and then are spatially separated through an MBD structure. The proposed spatio-temporal ABF-MBD algorithm exhibits fast convergence properties and high computational efficiency. Numerical experiments illustrate the practical appeal of the proposed method in separating convolutive mixtures of speech within nearly anechoic and also highly reverberant enclosures.
- Speech and Audio Applications | Pp. 495-503