Publications catalog - books
Advances in Multimedia Information Processing: 6th Pacific Rim Conference on Multimedia, Jeju Island, Korea, November 11-13, 2005, Proceedings, Part I
Yo-Sung Ho ; Hyoung Joong Kim (eds.)
Conference: 6th Pacific-Rim Conference on Multimedia (PCM). Jeju Island, South Korea. November 13, 2005 - November 16, 2005
Abstract/Description – provided by the publisher
Not available.
Keywords – provided by the publisher
Multimedia Information Systems; Information Storage and Retrieval; Computer Communication Networks; Information Systems Applications (incl. Internet); Computer Graphics; Image Processing and Computer Vision
Availability
Detected institution | Publication year | Browse | Download | Request |
---|---|---|---|---|
Not detected | 2005 | SpringerLink | | |
Information
Resource type:
books
Print ISBN
978-3-540-30027-4
Electronic ISBN
978-3-540-32130-9
Publisher
Springer Nature
Country of publication
United Kingdom
Publication date
2005
Publication rights information
© Springer-Verlag Berlin Heidelberg 2005
Subject coverage
Table of contents
doi: 10.1007/11581772_41
High Speed JPEG Coder Based on Modularized and Pipelined Architecture with Distributed Control
Fahad Ali Mujahid; Eun-Gu Jung; Dong-Soo Har; Jun-Hee Hong; Hoi-Jeong Lim
This paper presents the design of an efficient, reusable IP-based Extended JPEG encoder. The encoder uses user-defined quantization and Huffman tables that can be reconfigured at run time. It has a modularized and pipelined architecture with distributed control for each block. A simple interface makes integrating the modules into various systems straightforward. Targeted to an FPGA, the design operates at up to 90 MHz; mapped to a 0.25 μm CMOS process, it can operate at over 450 MHz, which is faster than any similar JPEG encoder design reported.
Keywords: Intellectual Property; Coder Unit; Joint Photographic Expert Group; Pipeline Architecture; Intellectual Property Core.
Pp. 466-476
doi: 10.1007/11581772_42
Efficient Distribution of Feature Parameters for Speech Recognition in Network Environments
Jae Sam Yoon; Gil Ho Lee; Hong Kook Kim
In network or ubiquitous environments, it is difficult for a small device to perform large-vocabulary speech recognition because of its limited power. Therefore, an approach known as distributed speech recognition (DSR), which distributes the processing modules of automatic speech recognition between a device and a server, has been attractive. Of all the processing modules in DSR, quantization of feature parameters plays a main role in terms of transmission bandwidth and recognition performance. In this paper, we propose an efficient quantizer of feature parameters that incorporates the correlation between successive analysis frames of speech. The proposed quantizer is based on a predictive multi-stage vector quantization scheme and is designed at different bit rates by trading off bit rate against speech recognition performance. Speech recognition experiments show that a DSR system employing the proposed quantization method can reduce the bit rate by 20% with recognition performance comparable to the ETSI DSR standard.
Keywords: Speech Recognition; Vector Quantizer; Automatic Speech Recognition; Speech Enhancement; Speech Recognition System.
Pp. 477-488
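The predictive multi-stage vector quantization idea named in the abstract above can be illustrated with a minimal sketch. This is not the authors' quantizer: the first-order predictor with coefficient alpha, the two tiny codebooks, and all function names are assumptions chosen only to show how a prediction residual is quantized in successive stages.

```python
def nearest(codebook, vector):
    """Index of the codeword with the smallest squared error to vector."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vector)),
    )


def encode_frames(frames, stage1, stage2, alpha):
    """Encode frames as (stage-1 index, stage-2 index) pairs.

    Each frame is predicted as alpha times the previous reconstructed
    frame (hypothetical first-order predictor); only the prediction
    residual is quantized, in two stages.
    """
    prev = [0.0] * len(frames[0])
    indices = []
    for frame in frames:
        predicted = [alpha * p for p in prev]
        residual = [x - p for x, p in zip(frame, predicted)]
        i1 = nearest(stage1, residual)
        # The second stage quantizes what the first stage left over.
        leftover = [r - c for r, c in zip(residual, stage1[i1])]
        i2 = nearest(stage2, leftover)
        indices.append((i1, i2))
        # Track the decoder's reconstruction so both sides stay in sync.
        prev = [p + a + b for p, a, b in zip(predicted, stage1[i1], stage2[i2])]
    return indices


def decode_frames(indices, stage1, stage2, alpha, dim):
    """Rebuild frames from the index pairs produced by encode_frames."""
    prev = [0.0] * dim
    frames = []
    for i1, i2 in indices:
        frame = [alpha * p + a + b for p, a, b in zip(prev, stage1[i1], stage2[i2])]
        frames.append(frame)
        prev = frame
    return frames
```

Because consecutive frames are correlated, the residual tends to be smaller than the frame itself, which is why a predictive scheme can spend fewer bits for similar distortion.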
doi: 10.1007/11581772_43
Magnitude-Sign Split Quantization for Bandwidth Scalable Wideband Speech Codec
Ji-Hyuk You; Chul-Man Park; Jung-Il Lee; Chang-Beom Ahn; Seoung-Jun Oh; Hochong Park
A new quantization method based on a magnitude-sign split scheme for a bandwidth-scalable wideband speech codec is proposed. In the high-band codec, the signal is band-pass filtered and each band is transformed independently into the DCT domain. The DCT coefficients are split into magnitude and sign, and each is quantized separately based on its unique characteristics. In addition, the quantized gain parameter in the low-band codec is utilized in the high-band codec for enhanced performance. A 19.8 kbps bandwidth-scalable wideband codec consisting of G.729E for the low band and the proposed codec for the high band is developed, and it is confirmed that the proposed codec has better subjective performance than 24 kbps G.722.1.
Keywords: Channel Capacity; Sign Quantization; Voice Packet; Speech Codec; IEEE ICASSP.
Pp. 489-499
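The magnitude-sign split described in the abstract above can be sketched in a few lines. The abstract does not specify the quantizers, so the uniform scalar quantizer with a fixed step used here is purely a hypothetical stand-in, as are the function names; only the split itself follows the text.

```python
import math


def dct_ii(samples):
    """Unnormalized DCT-II of a list of samples."""
    n = len(samples)
    return [
        sum(samples[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        for k in range(n)
    ]


def split_magnitude_sign(coeffs):
    """Split coefficients into magnitudes and signs (+1 or -1)."""
    magnitudes = [abs(c) for c in coeffs]
    signs = [-1 if c < 0 else 1 for c in coeffs]
    return magnitudes, signs


def quantize_magnitudes(magnitudes, step):
    """Hypothetical uniform quantizer: magnitude -> integer index."""
    return [round(m / step) for m in magnitudes]


def reconstruct(indices, signs, step):
    """Recombine quantized magnitudes with their signs."""
    return [s * q * step for q, s in zip(indices, signs)]
```

Handling the two parts separately lets each be coded according to its own statistics: magnitudes are non-negative and vary with signal energy, while signs are a single bit per coefficient.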
doi: 10.1007/11581772_44
Self-timed Interconnect with Layered Interface Based on Distributed and Modularized Control for Multimedia SoCs
Eun-Gu Jung; Eon-Pyo Hong; Kyoung-Son Jhang; Jeong-A Lee; Dong-Soo Har
In this paper, a high-performance asynchronous on-chip bus designed in a Globally Asynchronous Locally Synchronous (GALS) style is proposed. To achieve high performance, the asynchronous on-chip bus handles multiple outstanding transactions with in-order completion, implemented with distributed and modularized control units in a layered interface. The architecture of the asynchronous on-chip bus is discussed and implemented for simulation. Simulation results show that the throughput of the proposed bus with multiple outstanding transactions and in-order transaction completion increases by 31.3%, while the power consumption overhead is only 6.76%, compared with an asynchronous on-chip bus with a single outstanding transaction.
Pp. 500-511
doi: 10.1007/11581772_45
Enhanced Downhill Simplex Search for Fast Video Motion Estimation
Hwai-Chung Fei; Chun-Jen Chen; Shang-Hong Lai
Block-based motion estimation can be regarded as a function minimization problem in a finite two-dimensional space. Therefore, fast block-based motion estimation can be achieved by using an efficient function minimization algorithm instead of using a predefined search pattern, such as the diamond search. The downhill simplex search algorithm is an efficient derivative-free function minimization algorithm. In this paper, we propose several enhanced schemes to improve the efficiency of applying the downhill simplex search algorithm to motion estimation. The proposed enhanced schemes include a new initialization process, a special rounding method, and an early-stop error function evaluation procedure. Experimental results on several benchmarking videos show superior performance of the proposed algorithm over some existing fast block matching methods.
Keywords: Motion Vector; Motion Estimation; Full Search; Diamond Search; Simplex Search.
Pp. 512-523
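The abstract above mentions an early-stop error function evaluation without giving details. A common technique of that kind, shown here as an assumption rather than as the authors' procedure, abandons a sum-of-absolute-differences (SAD) computation as soon as the partial sum can no longer beat the best cost found so far. The function names and the row-by-row check are illustrative; the downhill simplex search itself is not shown.

```python
def sad_early_stop(block, candidate, best_so_far):
    """SAD between two equally sized blocks (lists of rows).

    Returns None as soon as the partial sum reaches best_so_far,
    because the candidate can no longer improve on the best match.
    """
    total = 0
    for row_a, row_b in zip(block, candidate):
        for a, b in zip(row_a, row_b):
            total += abs(a - b)
        if total >= best_so_far:
            return None  # abandoned early
    return total


def best_match(block, candidates):
    """Return (index, SAD) of the candidate block with the lowest SAD."""
    best_index, best_cost = None, float("inf")
    for index, candidate in enumerate(candidates):
        cost = sad_early_stop(block, candidate, best_cost)
        if cost is not None:
            best_index, best_cost = index, cost
    return best_index, best_cost
```

In a search that treats block matching as function minimization, each cost evaluation is the expensive step, so cutting evaluations short reduces work without changing which candidate wins.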
doi: 10.1007/11581772_46
Camera Motion Detection in Video Sequences Using Motion Cooccurrences
Hyun-Ho Jeon; Andrea Basso; Peter F. Driessen
In this paper, we propose a camera motion detection method that can identify pan, tilt and zoom in a video sequence. The proposed method exploits motion features based on the motion cooccurrence matrix, which provides dominant motion characteristics between two images, such as the size of the homogeneous motion area and the direction of motion. We show that motion cooccurrence matrices differ considerably for different types of motion and can be used to effectively identify simple camera motion such as pan, tilt and zoom in video sequences. Our method does not rely on a parametric motion model and can be used to qualitatively detect camera motion. The performance of the proposed method is evaluated experimentally on a set of test sequences.
Keywords: Video Sequence; Motion Estimation; Camera Motion; Global Motion; Cooccurrence Matrix.
Pp. 524-534
doi: 10.1007/11581772_47
A Hybrid Motion Compensated 3-D Video Coding System for Blocking Artifacts Reduction
Cho-Chun Cheng; Wen-Liang Hwang; Zuowei Shen; Tao Xia
We compare, both objectively and subjectively, the performance of various advanced motion compensation methods, including overlapped block motion compensation (OBMC) and control grid interpolation (CGI), in a 3-D wavelet-based coding system. Our results indicate that an OBMC sequence usually has a higher PSNR than those of the other methods, while a CGI sequence usually has the fewest blocking artifacts. We combine these two methods in a hybrid system to achieve better visual quality while maintaining satisfactory coding efficiency. The objective and subjective results indicate that the proposed hybrid method removes more than 50% of the blocking artifacts of an OBMC sequence while maintaining high PSNR performance.
Keywords: Discrete Wavelet Transform; Motion Vector; Visual Quality; Motion Compensation; Block Match Algorithm.
Pp. 535-546
doi: 10.1007/11581772_48
Fast Panoramic Image Generation Method Using Morphological Corner Detection
Jungho Lee; Woongho Lee; Ikhwan Cho; Dongseok Jeong
We present a method of building a panoramic image from adjacent images. The panoramic image is constructed from several images taken by adjacent cameras or parsed from a video, and is used for photogrammetry or many computer graphics applications. The perspective transformation, which is estimated from appropriate corresponding pairs of images, can be used to construct the panoramic image without unwarranted distortion. We used corner points as the corresponding features, and morphological structures were utilized for fast and robust corner detection. We used a corner strength criterion, which guarantees robust detection of corners in most situations. For the transformation, 8 parameters were estimated from perspective equations, and bilinear color blending was used to construct a seamless panoramic video. The experiments showed that the proposed method yields fast results with good quality under various conditions.
Keywords: Corner Point; Template Match; Panoramic Image; Corner Detection; Image Mosaic.
Pp. 547-558
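The abstract above refers to a perspective transformation with 8 parameters but does not write out the equations. The sketch below assumes the usual planar projective parameterization, x' = (a x + b y + c) / (g x + h y + 1) and y' = (d x + e y + f) / (g x + h y + 1); the parameter order and the function name are assumptions for illustration only, and the estimation of the parameters from corner correspondences is not shown.

```python
def apply_perspective(params, x, y):
    """Map point (x, y) with an 8-parameter perspective transform.

    params is (a, b, c, d, e, f, g, h) in the assumed order above.
    """
    a, b, c, d, e, f, g, h = params
    w = g * x + h * y + 1.0
    if w == 0:
        raise ValueError("point maps to infinity under this transform")
    return (a * x + b * y + c) / w, (d * x + e * y + f) / w
```

For example, the parameters (1, 0, 5, 0, 1, -2, 0, 0) reduce to a pure translation by (5, -2), while non-zero g or h introduce the perspective foreshortening that distinguishes this model from an affine one.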
doi: 10.1007/11581772_49
Generation of 3D Building Model Using 3D Line Detection Scheme Based on Line Fitting of Elevation Data
Dong-Min Woo; Seung-Soo Han; Young-Kee Jung; Kyu-Won Lee
This paper presents a new 3D line segment extraction method that can be used to generate a 3D rooftop model. The core of our method is that a 3D line segment is extracted by fitting a line to the elevation data along the 2D line coordinates of an ortho-image. To use elevation in line fitting, the elevation itself must be reliable. To measure the reliability of elevation, we employ the concept of self-consistency. We test the effectiveness of the proposed method with a quantitative accuracy analysis using synthetic images generated from the Avenches data set of Ascona aerial images. Experimental results indicate that our method generates 3D line segments almost 10 times more accurate than raw elevations obtained by an area-based method. Our proposed method also shows much improved accuracy over the cooperative hybrid stereo method. Using a simple 3D line grouping scheme, the 3D line segments are shown to generate a precise 3D building model effectively.
Keywords: Line Segment; Suggested Method; Elevation Data; Synthetic Image; Stereo Match.
Pp. 559-569
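The line fitting of elevation data described in the abstract above can be illustrated as follows. The abstract does not state the fitting method, so ordinary least squares is assumed here, with elevation modeled as a linear function of the distance t along the 2D segment; the self-consistency reliability check is not shown, and all names are hypothetical.

```python
import math


def fit_elevation_line(samples):
    """Least-squares fit z = slope * t + intercept.

    samples is a list of (t, z) pairs: t is the distance along the
    2D line from its first endpoint, z is the elevation at that point.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_t = sum(t for t, _ in samples) / n
    mean_z = sum(z for _, z in samples) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in samples)
    if var_t == 0:
        raise ValueError("samples must span more than one position")
    cov = sum((t - mean_t) * (z - mean_z) for t, z in samples)
    slope = cov / var_t
    return slope, mean_z - slope * mean_t


def lift_segment_to_3d(p0, p1, slope, intercept):
    """Attach fitted elevations to the 2D endpoints p0 and p1."""
    length = math.hypot(p1[0] - p0[0], p1[1] - p0[1])
    return (p0[0], p0[1], intercept), (p1[0], p1[1], slope * length + intercept)
```

Fitting one line through many elevation samples averages out per-pixel elevation noise, which is consistent with the abstract's claim that fitted segments are more accurate than raw elevations.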
doi: 10.1007/11581772_50
Segmentation of the Liver Using the Deformable Contour Method on CT Images
Seong-Jae Lim; Yong-Yeon Jeong; Yo-Sung Ho
Automatic liver segmentation from abdominal computed tomography (CT) images is one of the most important steps in computer-aided diagnosis (CAD) for liver CT. However, the liver must be separated manually or semi-automatically, since surface features of the liver and partial-volume effects make automatic discrimination from adjacent organs or tissues very difficult. In this paper, we present an unsupervised liver segmentation algorithm with three steps. In the preprocessing step, we simplify the input CT image by estimating the liver position using prior knowledge about the location of the liver and by performing multilevel thresholding on the estimated liver position. The proposed scheme applies a multiscale morphological filter recursively, with region labeling and clustering, to detect the search range for deformable contouring. Most of the liver contours are positioned within the search range. To perform an accurate segmentation, we produce a gradient-label map, which represents the gradient magnitude in the search range. The proposed algorithm performs deformable contouring on the gradient-label map by using regular patterns of the liver boundary. Experimental results are comparable to those of manual tracing by radiologists, and the algorithm is shown to be efficient.
Keywords: Liver segmentation; Morphological filtering; Deformable contouring; Computer-Aided Diagnosis (CAD).
Pp. 570-581