Catálogo de publicaciones - libros

Compartir en
redes sociales


Image Analysis: 15th Scandinavian Conference, SCIA 2007, Aalborg, Denmark, June 10-14, 2007

Bjarne Kjær Ersbøll ; Kim Steenstrup Pedersen (eds.)

En conferencia: 15º Scandinavian Conference on Image Analysis (SCIA) . Aalborg, Denmark . June 10, 2007 - June 14, 2007

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Image Processing and Computer Vision; Pattern Recognition; Computer Graphics

Disponibilidad
Institución detectada Año de publicación Navegá Descargá Solicitá
No detectada 2007 SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-73039-2

ISBN electrónico

978-3-540-73040-8

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

Información sobre derechos de publicación

© Springer-Verlag Berlin Heidelberg 2007

Tabla de contenidos

On Reasoning over Tracking Events

Daniel Rowe; Jordi Gonzàlez; Ivan Huerta; Juan J. Villanueva

High-level understanding of motion events is a critical task in any system which aims to analyse dynamic human-populated scenes. However, current tracking techniques still do not address complex interaction events among multiple targets. In this paper, a principled event-management framework is proposed, and it is included in a hierarchical and modular tracking architecture. Multiple-target interaction events, and a proper scheme for tracker instantiation and removal according to scene events, are considered. Multiple-target group management allows the system to switch among different operation modes. Robust and accurate tracking results have been obtained in both indoor and outdoor scenarios, without considering a-priori knowledge about either the scene or the targets based on a previous training period.

Pp. 502-511

FPGA Implementation of NN Classifier Based on Wavelet Transform and Partial Distance Search

Yao-Jung Yeh; Hui-Ya Li; Wen-Jyi Hwang; Chiung-Yao Fang

A novel algorithm for field programmable gate array (FPGA) realization of NN classifier is presented in this paper. The algorithm identifies first closest vectors in the design set of a NN classifier for each input vector by performing the partial distance search (PDS) in the wavelet domain. It employs subspace search, bitplane reduction and multiple-coefficient accumulation techniques for the effective reduction of the area complexity and computation latency. The proposed implementation has been embedded in a softcore CPU for physical performance measurement. Experimental results show that the implementation provides a cost-effective solution to the FPGA realization of NN classification systems where both high throughput and low area cost are desired.

Pp. 512-521

Affine Illumination Compensation for Multispectral Images

Pedro Latorre Carmona; Reiner Lenz; Filiberto Pla; Jose M. Sotoca

We apply a general form of affine transformation model to compensate illumination variations in a series of multispectral images of a static scene and compare it to a particular affine and a diagonal transformation models. These models operate in the original multispectral space or in a lower-dimensional space obtained by Singular Value Decomposition () of the set of images. We use a system consisting of a multispectral camera and a light dome that allows the measurement of multispectral data under carefully controlled illumination conditions to generate a series of multispectral images of a static scene under varying illumination conditions. We evaluate the compensation performance using the colour difference between images. The experiments show that the first 2 models perform satisfactorily in the original and lower dimensional spaces.

Pp. 522-531

GPU-Based Edge-Directed Image Interpolation

Martin Kraus; Mike Eissele; Magnus Strengert

The rendering of lower resolution image data on higher resolution displays has become a very common task, in particular because of the increasing popularity of webcams, camera phones, and low-bandwidth video streaming. Thus, there is a strong demand for real-time, high-quality image magnification. In this work, we suggest to exploit the high performance of programmable graphics processing units (GPUs) for an adaptive image magnification method. To this end, we propose a GPU-friendly algorithm for image up-sampling by edge-directed image interpolation, which avoids ringing artifacts, excessive blurring, and staircasing of oblique edges. At the same time it features gray-scale invariance, is applicable to color images, and allows for real-time processing of full-screen images on today’s GPUs.

Pp. 532-541

Graph-Based Range Image Registration Combining Geometric and Photometric Features

Ikuko Shimizu; Akihiro Sugimoto; Radim Šára

We propose a coarse registration method of range images using both geometric and photometric features. The framework of existing methods using multiple features first defines a single similarity distance summing up each feature based evaluations, and then minimizes the distance between range images for registration. In contrast, we formulate registration as a graph-based optimization problem, where we independently evaluate geometric feature and photometric feature and consider only the order of point-to-point matching quality. We then find as large consistent matching as possible in the sense of the matching-quality order. This is solved as one global combinatorial optimization problem. Our method thus does not require any good initial estimation and, at the same time, guarantees that the global solution is achieved.

Pp. 542-552

Automatic Identification and Validation of Tie Points on Hyperspectral Satellite Images from CHRIS/PROBA

André R. S. Marçal

There is great interest in the development of automatic geometric correction systems for satellite images. A fully automatic system, based exclusively on the identification of tie points (image to image control points) by image matching needs to use efficient selection and validation methods. Four Tie Point Suitability Indices (TPSI) are proposed to select the most suitable areas in an image to search for tie points. Three tie point validation parameters are also proposed. The validation parameters make use of the various spectral bands available in hyperspectral and multispectral satellite images. The proposed TPSIs and validation parameters were tested with hyperspectral high-resolution satellite images from the CHRIS/PROBA sensor.

Pp. 553-561

Boneless Pose Editing and Animation

J. Andreas Bærentzen; Kristian Evers Hansen; Kenny Erleben

In this paper, we propose a pose editing and animation method for triangulated surfaces based on a user controlled partitioning of the model into deformable parts and rigid parts which are denoted handles. In our pose editing system, the user can sculpt a set of poses simply by transforming the handles for each pose. Using Laplacian editing, the deformable parts are deformed to match the handles. In our animation system the user can constrain one or several handles in order to define a new pose. New poses are interpolated from the examples poses, by solving a small non-linear optimization problem in order to obtain the interpolation weights. While the system can be used simply for building poses, it is also an animation system. The user can specify a path for a given constraint and the model is animated correspondingly.

Pp. 562-571

Text Driven Face-Video Synthesis Using GMM and Spatial Correlation

Dereje Teferi; Maycel I. Faraj; Josef Bigun

Liveness detection is increasingly planned to be incorporated into biometric systems to reduce the risk of spoofing and impersonation. Some of the techniques used include detection of motion of the head while posing/speaking, iris size in varying illumination, fingerprint sweat, text-prompted speech, speech-to-lip motion synchronization etc. In this paper, we propose to build a biometric signal to test attack resilience of biometric systems by creating a text-driven video synthesis of faces. We synthesize new realistic looking video sequences from real image sequences representing utterance of digits. We determine the image sequences for each digit by using a GMM based speech recognizer. Then, depending on system prompt (sequence of digits) our method regenerates a video signal to test attack resilience of a biometric system that asks for random digit utterances to prevent play-back of pre-recorded data representing both audio and images. The discontinuities in the new image sequence, created at the connection of each digit, are removed by using a frame prediction algorithm that makes use of the well known block matching algorithm. Other uses of our results include web-based video communication for electronic commerce and frame interpolation for low frame rate video.

Pp. 572-580

Accurate 3D Left-Right Brain Hemisphere Segmentation in MR Images Based on Shape Bottlenecks and Partial Volume Estimation

Lu Zhao; Jussi Tohka; Ulla Ruotsalainen

Current automatic methods based on mid-sagittal plane to segment left and right human brain hemispheres in 3D magnetic resonance (MR) images simply use a planar surface. However, the two brain hemispheres, in fact, can not be separated by just a simple plane properly. A novel automatic method to segment left and right brain hemispheres in MR images is proposed in this paper, which is based on an extended shape bottlenecks algorithm and a fast and robust partial volume estimation approach. In this method, brain tissues firstly are extracted from the MR image of human head. Then the information potential map is generated, according to which a brain hemisphere mask with the same size of the original image is created. 10 simulated and 5 real T1-weighted MR images were used to evaluate this method, and much more accurate segmentation of human brain hemispheres was achieved comparing with the segmentation with mid-sagittal plane.

Pp. 581-590

Image Inpainting by Cooling and Heating

David Gustavsson; Kim S. Pedersen; Mads Nielsen

We discuss a method suitable for inpainting both large scale geometric structures and stochastic texture components. We use the well-known FRAME model for inpainting. We introduce a temperature term in the learnt FRAME Gibbs distribution. By using a fast cooling scheme a MAP-like solution is found that can reconstruct the geometric structure. In a second step a heating scheme is used that reconstruct the stochastic texture. Both steps in the reconstruction process are necessary, and contribute in two very different ways to the appearance of the reconstruction.

Pp. 591-600