Publications catalog - books
Image Analysis and Recognition: 4th International Conference, ICIAR 2007, Montreal, Canada, August 22-24, 2007. Proceedings
Mohamed Kamel; Aurélio Campilho (eds.)
Conference: 4th International Conference on Image Analysis and Recognition (ICIAR). Montreal, QC, Canada. August 22, 2007 - August 24, 2007
Abstract/Description – provided by the publisher
Not available.
Keywords – provided by the publisher
Pattern Recognition; Image Processing and Computer Vision; Biometrics; Artificial Intelligence (incl. Robotics); Computer Graphics; Algorithm Analysis and Problem Complexity
Availability
| Detected institution | Publication year | Browse | Download | Request |
|---|---|---|---|---|
| Not detected | 2007 | SpringerLink | | |
Information
Resource type:
books
Print ISBN
978-3-540-74258-6
Electronic ISBN
978-3-540-74260-9
Publisher
Springer Nature
Country of publication
United Kingdom
Publication date
2007
Publication rights information
© Springer-Verlag Berlin Heidelberg 2007
Table of contents
Use of Adaptive Still Image Descriptors for Annotation of Video Frames
Andrea Kutics; Akihiko Nakagawa; Kazuhiro Shindoh
This paper presents a novel method for annotating videos taken from the TRECVID 2005 data using only static visual features and metadata of still image frames. The method is designed to provide the user with annotation or tagging tools to incorporate multimedia data such as video or still images as well as text into searching or other combined applications running either on the web or on other networks. It mainly uses MPEG-7-based visual features and metadata of prototype images and allows the user to select either a prototype or a training set. It also adaptively adjusts the weights of the visual features the user finds most adequate to bridge the semantic gap. The user can also detect relevant regions in video frames by using a self-developed segmentation tool and can carry out region-based annotation with the same video frame set. The method provides satisfactory results even when the annotations of the TRECVID 2005 video data greatly vary considering the semantic level of concepts. It is simple and fast, using a very small set of training data and little or no user intervention. It also has the advantage that it can be applied to any combination of visual and textual features.
- Image Retrieval and Indexing | Pp. 686-697
Data Hiding on H.264/AVC Compressed Video
Sung Min Kim; Sang Beom Kim; Youpyo Hong; Chee Sun Won
An important issue in embedding watermark bits in a compressed video stream is keeping the bit rate unchanged after watermarking. This is a very difficult problem for highly efficient compression methods such as H.264/AVC, because altering just one bit in a highly compressed bitstream may widely affect the video content. In this paper we solve this problem by embedding the watermark bit in the sign bit of the Trailing Ones in the Context Adaptive Variable Length Coding (CAVLC) of H.264/AVC. The algorithm yields no bit-rate change after the data hiding. We can also easily balance the capacity of the watermark bits against the fidelity of the video. The simplicity of the proposed algorithm is an added bonus for real-time applications. Our experiments show that the PSNRs of the video sequences after the data hiding are higher than 43 dB.
- Image and Video Coding and Encryption | Pp. 698-707
Reduced Uneven Multi-hexagon-grid Search for Fast Integer Pel Motion Estimation in H.264/AVC
Cheong-Ghil Kim; In-Jik Lee; Shin-Dug Kim
A reduced uneven multi-hexagon-grid search algorithm for fast integer pel motion estimation in H.264/AVC is presented. The objective is to reduce the number of candidates for block matching by predicting the area in which the minimum sum of absolute differences (SAD) is likely to be found. For this purpose, the proposed algorithm employs directionally partial hexagon search patterns that use the motion vectors computed in previous stages, which supply the spatial correlation between adjacent macroblocks and the temporal correlation between video frames. Experimental results show that the proposed method can save 39% to 69% of the computational complexity compared with the original algorithm, at the cost of negligible degradation in RD performance.
- Image and Video Coding and Encryption | Pp. 708-714
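The SAD criterion and the idea of a directionally reduced candidate set, as described in the abstract above, can be illustrated with a minimal sketch. This is not the authors' algorithm: the `HEXAGON` pattern, the `partial_pattern` filter (which keeps only candidates pointing roughly the same way as a predicted motion vector) and all function names are assumptions made for illustration.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized 2D blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def get_block(frame, top, left, size):
    """Extract a size x size block whose top-left corner is (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def partial_pattern(pattern, predicted):
    """Hypothetical reduction: keep offsets not opposed to the predicted vector."""
    py, px = predicted
    if (py, px) == (0, 0):
        return list(pattern)
    return [(dy, dx) for dy, dx in pattern if dy * py + dx * px >= 0]


def best_motion_vector(cur, ref, top, left, size, candidates):
    """Return (dy, dx, sad) for the in-frame candidate with the lowest SAD."""
    target = get_block(cur, top, left, size)
    height, width = len(ref), len(ref[0])
    best = None
    for dy, dx in candidates:
        y, x = top + dy, left + dx
        if y < 0 or x < 0 or y + size > height or x + size > width:
            continue  # candidate block would fall outside the reference frame
        cost = sad(target, get_block(ref, y, x, size))
        if best is None or cost < best[2]:
            best = (dy, dx, cost)
    return best


# Assumed pattern: the center plus six hexagon points, as (dy, dx) offsets.
HEXAGON = [(0, 0), (0, 2), (0, -2), (2, 1), (2, -1), (-2, 1), (-2, -1)]

# Toy frames: the current frame is the reference shifted left by two pixels.
ref = [[y * y * 7 + x * x * 3 + x * y for x in range(8)] for y in range(8)]
cur = [[ref[y][min(x + 2, 7)] for x in range(8)] for y in range(8)]

# A predicted vector pointing right prunes the left-pointing candidates.
result = best_motion_vector(cur, ref, 2, 2, 2, partial_pattern(HEXAGON, (0, 1)))
```

In this toy case the pruned pattern evaluates four candidates instead of seven and still finds the true displacement, which is the kind of saving the abstract refers to.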
Reversible Data Hiding for JPEG Images Based on Histogram Pairs
Guorong Xuan; Yun Q. Shi; Zhicheng Ni; Peiqi Chai; Xia Cui; Xuefeng Tong
This paper proposes a lossless data hiding technique for JPEG images based on histogram pairs. It embeds data into the JPEG quantized 8x8 block DCT coefficients and can achieve good performance in terms of PSNR versus payload by manipulating histogram pairs with an optimum threshold and an optimum region of the JPEG DCT coefficients. It can obtain a higher payload than the prior art. In addition, the increase in JPEG file size after data embedding remains unnoticeable. These results have been verified by our extensive experiments.
- Image and Video Coding and Encryption | Pp. 715-727
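The reversibility claimed in the abstract above can be illustrated with a simplified sketch of the general histogram-pair idea: values above a chosen value `a` are shifted up by one so that the histogram bin `a + 1` becomes empty, and each coefficient equal to `a` then carries one bit. The optimum threshold and region selection described by the authors are omitted, and the function names and the flat list of coefficients are assumptions for illustration only.

```python
def embed(coeffs, bits, a):
    """Embed bits into coefficients equal to a, after emptying the bin a + 1."""
    if len(bits) > coeffs.count(a):
        raise ValueError("payload exceeds the number of coefficients equal to a")
    remaining = iter(bits)
    marked = []
    for c in coeffs:
        if c > a:
            marked.append(c + 1)                   # shift up to free bin a + 1
        elif c == a:
            marked.append(c + next(remaining, 0))  # pad unused slots with 0
        else:
            marked.append(c)
    return marked


def extract(marked, a):
    """Recover the embedded bits and restore the original coefficients."""
    bits, restored = [], []
    for c in marked:
        if c == a:
            bits.append(0)
            restored.append(a)
        elif c == a + 1:
            bits.append(1)
            restored.append(a)
        elif c > a + 1:
            restored.append(c - 1)                 # undo the shift
        else:
            restored.append(c)
    return bits, restored


original = [0, 1, -1, 1, 2, 0, 1, 3]
payload = [1, 0, 1]
marked = embed(original, payload, a=1)
recovered_bits, restored = extract(marked, a=1)
```

Here `restored` equals `original` and `recovered_bits` equals `payload`, so the embedding is lossless for the cover data in this sketch.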
Iterated Fourier Transform Systems: A Method for Frequency Extrapolation
Gregory S. Mayer; Edward R. Vrscay
In this paper we introduce a fractal-based method over (complex-valued) Fourier transforms of functions with compact support. This method of “iterated Fourier transform systems” (IFTS) has a natural mathematical connection with the fractal-based method of “iterated function systems with greyscale maps” (IFSM) in the spatial domain [6,7]. A major motivation for our formulation is the problem of resolution enhancement of band-limited magnetic resonance images. In an attempt to minimize sampling/transform artifacts, it is our desire to work directly with the raw frequency data provided by an MR imager as much as possible before “returning” to the spatial domain. In this paper, we show that our fractal-based IFTS method can be tailored to perform frequency extrapolation.
- Image and Video Coding and Encryption | Pp. 728-739
Adaptive Deblocking Algorithm Based on Image Characteristics for Low Bit-Rate Video
Jongho Kim; Minseok Choi; Jechang Jeong
Reconstructed images in low bit-rate video coding often have visually annoying artifacts, such as blocking artifacts and corner outliers, usually due to block-based quantization and motion-compensated prediction. In this paper, we propose an adaptive deblocking algorithm based on local image characteristics. We remove ringing artifacts using a simple low-pass filter in the preprocessing. Among four filtering modes, an appropriate filter is selected adaptively, removing noticeable blocking artifacts in flat areas while preserving details without blurring in highly complex areas. The proposed filter contains a non-symmetric filter and a corner outlier removal filter. Simulation results show the effectiveness of the proposed deblocking algorithm in objective and subjective quality compared with the MPEG-4 deblocking filter.
- Image and Video Coding and Encryption | Pp. 740-751
Ingredient Separation of Natural Images: A Multiple Transform Domain Method Based on Sparse Coding Strategy
Xi Tan
Sparse coding is a ubiquitous strategy employed in the sensory information processing system of mammals. This strategy aims to find a representation of data in which the components of the representation are only rarely significantly active. This paper presents a model and demonstrates that it may be used to separate natural images into different ingredients based on the sparse coding strategy. In this model an overcomplete dictionary is constructed by combining different types of complete or overcomplete systems that can each deal with different image ingredients. Based on a sparse prior restriction, decomposition coefficients are inferred by maximizing a posterior distribution. The resulting coefficients belonging to different systems correspond to different image ingredients. The proposed model provides a flexible framework for image ingredient separation which allows one to extract image structure of special interest.
- Image and Video Coding and Encryption | Pp. 752-760
Optimal Algorithm for Lossy Vector Data Compression
Alexander Kolesnikov
An algorithm for lossy compression of vector data (vector maps, vector graphics, contours of shapes) was developed. The algorithm is based on optimal polygonal approximation for error measure and dynamic quantization of the vector data. The algorithm includes optimal distribution of the approximation line segments among the vector objects, optimal polygonal approximation of the objects with dynamic quantization and construction of the optimal variable-rate vector quantizer. The developed algorithm can be used for lossy compression of one-dimensional signals and multidimensional vector data.
- Image and Video Coding and Encryption | Pp. 761-771
MPEG Video Watermarking Using Tensor Singular Value Decomposition
Emad E. Abdallah; A. Ben Hamza; Prabir Bhattacharya
In this paper, we introduce a new watermarking algorithm to embed an invisible watermark into the intra-frames of an MPEG video sequence. Unlike previous methods where each video frame is marked separately, our proposed technique uses high-order tensor decomposition of videos. The key idea behind our approach is to represent a fixed number of the intra-frames as a 3D tensor with two dimensions in space and one dimension in time. Then we modify the singular values of the 3D tensor, which have a good stability and represent the video properties. The main attractive features of this approach are simplicity and robustness. The experimental results show the robustness of the proposed scheme against the most common attacks including geometric transformations, adaptive random noise, low pass filtering, histogram equalization, frame dropping, frame swapping, and frame averaging.
- Image and Video Coding and Encryption | Pp. 772-783
Embedding Quality Measures in PIFS Fractal Coding
Andrea Abate; Michele Nappi; Daniel Riccio
Fractal image coding is a relatively recent technique based on the representation of an image by a map of self-similarities. In recent years, most researchers have focused on the problem of speeding up the fractal coding process, while paying little attention to possible improvements of the objective and subjective image quality. In this paper, we investigate image quality measures that could represent a reasonable alternative to the RMSE when finding a suitable map of similarities. Subjective assessments have been performed in order to compare the performance of the selected quality metrics. Experimental results bear witness to the superiority of a quality metric based on Fourier coefficients.
- Image and Video Coding and Encryption | Pp. 784-793
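The role of the RMSE mentioned in the abstract above can be illustrated with a minimal sketch: the search for the best-matching domain block takes the quality measure as a parameter, so another measure could be swapped in. This is only an illustration of the idea, not the authors' coder; real PIFS coding also fits a contrast and brightness transform per block, which is omitted here, and the function names are assumptions.

```python
import math


def rmse(block_a, block_b):
    """Root mean square error between two equally sized 2D blocks."""
    squared = [(a - b) ** 2
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b)]
    return math.sqrt(sum(squared) / len(squared))


def best_domain(range_block, domain_blocks, measure=rmse):
    """Index of the domain block that minimizes the given distortion measure."""
    return min(range(len(domain_blocks)),
               key=lambda i: measure(range_block, domain_blocks[i]))


range_block = [[10, 12], [11, 13]]
domains = [
    [[0, 0], [0, 0]],
    [[10, 12], [11, 14]],   # closest to the range block
    [[20, 20], [20, 20]],
]
chosen = best_domain(range_block, domains)
```

Passing a different function as `measure` changes which domain block is selected without touching the search loop, which is where an alternative quality metric would plug in.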