Catálogo de publicaciones - libros

Compartir en
redes sociales

Computer Vision: ECCV 2002: 7th European Conference on Computer Vision Copenhagen, Denmark, May 28-31, 2002 Proceedings, Part III

Anders Heyden ; Gunnar Sparr ; Mads Nielsen ; Peter Johansen (eds.)

En conferencia: 7º European Conference on Computer Vision (ECCV) . Copenhagen, Denmark . May 28, 2002 - May 31, 2002

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Image Processing and Computer Vision; Computer Graphics; Pattern Recognition; Artificial Intelligence

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No detectada	2002	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-43746-8

ISBN electrónico

978-3-540-47977-2

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2002

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Tabla de contenidos

Verificá que desde tu institución tengas acceso para descargar o solicitar el libro completo o alguno de sus capítulos.

doi: 10.1007/3-540-47977-5_31

Probabilistic and Voting Approaches to Cue Integration for Figure-Ground Segmentation

Eric Hayman; Jan-Olof Eklundh

This paper describes techniques for fusing the output of multiple cues to robustly and accurately segment foreground objects from the background in image sequences. Two different methods for cue integration are presented and tested. The first is a probabilistic approach which at each pixel computes the likelihood of observations over all cues before assigning pixels to foreground or background layers using Bayes Rule. The second method allows each cue to make a decision independent of the other cues before fusing their outputs with a weighted sum. A further important contribution of our work concerns demonstrating how models for some cues can be learnt and subsequently adapted online. In particular, regions of coherent motion are used to train distributions for colour and for a simple texture descriptor. An additional aspect of our framework is in providing mechanisms for suppressing cues when they are believed to be unreliable, for instance during training or when they disagree with the general consensus. Results on extended video sequences are presented.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 469-486

doi: 10.1007/3-540-47977-5_32

Bayesian Estimation of Layers from Multiple Images

Y. Wexler; A. Fitzgibbon; A. Zisserman

When estimating foreground and background layers (or equivalently an ), it is often the case that pixel measurements contain mixed colours which are a combination of foreground and background. Object boundaries, especially at thin sub-pixel structures like hair, pose a serious problem.

In this paper we present a multiple view algorithm for computing the alpha matte. Using a Bayesian framework, we model each pixel as a combined sample from the foreground and background and compute a MAP estimate to factor the two. The novelties in this work include the incorporation of three different types of priors for enhancing the results in problematic scenes. The priors used are inequality constraints on colour and alpha values, spatial continuity, and the probability distribution of alpha values.

The combination of these priors result in accurate and visually satisfying estimates. We demonstrate the method on real image sequences with varying degrees of geometric and photometric complexity. The output enables virtual objects to be added between the foreground and background layers, and we give examples of this augmentation to the original sequences.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 487-501

doi: 10.1007/3-540-47977-5_33

A Stochastic Algorithm for 3D Scene Segmentation and Reconstruction

Feng Han; Zhouwen Tu; Song-Chun Zhu

In this paper, we present a stochastic algorithm by effective Markov chain Monte Carlo (MCMC) for segmenting and reconstructing 3D scenes. The objective is to segment a range image and its associated reflectance map into a number of surfaces which fit to various 3D surface models and have homogeneous reflectance (material) properties. In comparison to previous work on range image segmentation, the paper makes the following contributions. Firstly, it is aimed at generic natural scenes, indoor and outdoor, which are often much complexer than most of the existing experiments in the “polyhedra world”. Natural scenes require the algorithm to automatically deal with multiple types (families) of surface models which compete to explain the data. Secondly, it integrates the range image with the reflectance map. The latter provides material properties and is especially useful for surface of high specularity, such as glass, metal, ceramics. Thirdly, the algorithm is designed by reversible jump and diffusion Markov chain dynamics and thus achieves globally optimal solutions under the Bayesian statistical framework. Thus it realizes the cue integration and multiple model switching. Fourthly, it adopts two techniques to improve the speed of the Markov chain search: One is a coarse-to-fine strategy and the other are data driven techniques such as edge detection and clustering. The data driven methods provide important information for narrowing the search spaces in a probabilistic fashion. We apply the algorithm to two data sets and the experiments demonstrate robust and satisfactory results on both. Based on the segmentation results, we extend the reconstruction of surfaces behind occlusions to fill in the occluded parts.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 502-516

doi: 10.1007/3-540-47977-5_34

Normalized Gradient Vector Diffusion and Image Segmentation

Zeyun Yu; Chandrajit Bajaj

In this paper, we present an approach for image segmentation, based on the existing and . Our algorithm includes initial segmentation using and region merging based on . We use a set of heat diffusion equations to generate a vector field over the image domain, which provides us with a natural way to define seeds as well as an external force to attract the active snakes. Then an initial segmentation of the original image can be obtained by a similar idea as seen in active snake model. Finally an -based region merging technique is used to find the true segmentation as desired. The experimental results show that our -based region merging algorithm overcomes some problems as seen in classic active snake model. We will also see that our has several advantages over the traditional gradient vector diffusion.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 517-530

doi: 10.1007/3-540-47977-5_35

Spectral Partitioning with Indefinite Kernels Using the Nyström Extension

Serge Belongie; Charless Fowlkes; Fan Chung; Jitendra Malik

Fowlkes et al. [] recently introduced an approximation to the Normalized Cut (NCut) grouping algorithm [] based on random subsampling and the . As presented, their method is restricted to the case where , the weighted adjacency matrix, is positive definite. Although many common measures of image similarity (i.e. kernels) are positive definite, a popular example being Gaussian-weighted distance, there are important cases that are not. In this work, we present a modification to Nyström-NCut that does not require to be positive definite. The modification only affects the orthogonalization step, and in doing so it necessitates one additional () operation, where is the number of random samples used in the approximation. As such it is of interest to know which kernels are positive definite and which are indefinite. In addressing this issue, we further develop connections between NCut and related methods in the kernel machines literature. We provide a proof that the Gaussian-weighted chi-squared kernel is positive definite, which has thus far only been conjectured. We also explore the performance of the approximation algorithm on a variety of grouping cues including contour, color and texture.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 531-542

doi: 10.1007/3-540-47977-5_36

A Framework for High-Level Feedback to Adaptive, Per-Pixel, Mixture-of-Gaussian Background Models

Michael Harville

Time-Adaptive, Per-Pixel Mixtures Of Gaussians (TAPP-MOGs) have recently become a popular choice for robust modeling and removal of complex and changing backgrounds at the pixel level. However, TAPPMOG-based methods cannot easily be made to model dynamic backgrounds with highly complex appearance, or to adapt promptly to sudden “uninteresting” scene changes such as the repositioning of a static object or the turning on of a light, without further undermining their ability to segment foreground objects, such as people, where they occlude the background for too long. To alleviate tradeoffs such as these, and, more broadly, to allow TAPPMOG segmentation results to be tailored to the specific needs of an application, we introduce a general framework for guiding pixel-level TAPPMOG evolution with feedback from “high-level” modules. Each such module can use pixel-wise maps of positive and negative feedback to attempt to impress upon the TAPPMOG some definition of foreground that is best expressed through “higher-level” primitives such as image region properties or semantics of objects and events. By pooling the foreground error corrections of many high-level modules into a shared, pixel-level TAPPMOG model in this way, we improve the quality of the foreground segmentation and the performance of all modules that make use of it. We show an example of using this framework with a TAPPMOG method and high-level modules that all rely on dense depth data from a stereo camera.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 543-560

doi: 10.1007/3-540-47977-5_37

Multivariate Saddle Point Detection for Statistical Clustering

Dorin Comaniciu; Visvanathan Ramesh; Alessio Del Bue

Decomposition methods based on nonparametric density estimation define a cluster as the basin of attraction of a local maximum (mode) of the density function, with the cluster borders being represented by valleys surrounding the mode. To measure the significance of each delineated cluster we propose a test statistics that compares the estimated density of the mode with the estimated maximum density on the cluster boundary. While for a given kernel bandwidth the modes can be safely obtained by using the mean shift procedure, the detection of maximum density points on the cluster boundary (i.e., the saddle points) is not straightforward for multivariate data. We therefore develop a gradient-based iterative algorithm for saddle point detection and show its effectiveness in various data decomposition tasks. After finding the largest density saddle point associated with each cluster, we compute significance measures that allow formal hypothesis testing of cluster existence. The new statistical framework is extended and tested for the task of image segmentation.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 561-576

doi: 10.1007/3-540-47977-5_38

Parametric Distributional Clustering for Image Segmentation

Lothar Hermes; Thomas Zöller; Joachim M. Buhmann

Unsupervised Image Segmentation is one of the central issues in Computer Vision. From the viewpoint of exploratory data analysis, segmentation can be formulated as a clustering problem in which pixels or small image patches are grouped together based on local feature information. In this contribution, parametrical distributional clustering (PDC) is presented as a novel approach to image segmentation. In contrast to noise sensitive point measurements, local distributions of image features provide a statistically robust description of the local image properties. The segmentation technique is formulated as a generative model in the maximum likelihood framework. Moreover, there exists an insightful connection to the novel information theoretic concept of the (Tishby et al. []), which emphasizes the compromise between efficient coding of an image and preservation of characteristic information in the measured feature distributions.

The search for good grouping solutions is posed as an optimization problem, which is solved by techniques. In order to further increase the computational efficiency of the resulting segmentation algorithm, a multi-scale optimization scheme is developed. Finally, the performance of the novel model is demonstrated by segmentation of color images from the Corel data base.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 577-591

doi: 10.1007/3-540-47977-5_39

Probabalistic Models and Informative Subspaces for Audiovisual Correspondence

John W. Fisher; Trevor Darrell

We propose a probabalistic model of single source multi-modal generation and show how algorithms for maximizing mutual information can find the correspondences between components of each signal. We show how non-parametric techniques for finding informative subspaces can capture the complex statistical relationship between signals in different modalities. We extend a previous technique for finding informative subspaces to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model can find the relationship between audio speech and video of facial motion, and partially segment out background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched even when no prior model of the speaker’s voice or appearance is available.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 592-603

doi: 10.1007/3-540-47977-5_40

Volterra Filtering of Noisy Images of Curves

Jonas August

How should one filter very noisy images of curves? While blurring with a Gaussian reduces noise, it also reduces contour contrast. Both non-homogeneous and anisotropic diffusion smooth images while preserving contours, but these methods assume a single local orientation and therefore they can merge or distort nearby or crossing contours. To avoid these difficulties, we view curve enhancement as a statistical estimation problem in the three-dimensional ()-space of positions and directions, where our prior is a probabilistic model of an ideal edge/line map known as the curve indicator random field (). Technically, this random field is a superposition of local times of Markov processes that model the individual curves; intuitively, it is an idealized artist’s sketch, where the value of the field is the amount of ink deposited by the artist’s pen. After reviewing the cirf framework and our earlier formulas for the CIRF cumulants, we compute the minimum mean squared error () estimate of the embedded in large amounts of Gaussian white noise. The derivation involves a perturbation expansion in an infinite noise limit, and results in linear, quadratic, and cubic (Volterra) filters for enhancing images of contours. The self-avoidingness of smooth curves in () simplified our analysis and the resulting algorithms, which run in ( log ) time, where is the size of the input. This suggests that the Gestalt principle of good continuation may not only express the likely smoothness of contours, but it may have a computational basis as well.

- Texture Shading and Colour / Grouping and Segmentation / Object Recognition | Pp. 604-620