Publications catalog - books

Semantic Multimedia: Second International Conference on Semantic and Digital Media Technologies, SAMT 2007, Genoa, Italy, December 5-7, 2007. Proceedings

Bianca Falcidieno ; Michela Spagnuolo ; Yannis Avrithis ; Ioannis Kompatsiaris ; Paul Buitelaar (eds.)

In conference: 2nd International Conference on Semantic and Digital Media Technologies (SAMT). Genoa, Italy. December 5, 2007 - December 7, 2007

Abstract/Description - provided by the publisher

Not available.

Keywords - provided by the publisher

Popular Computer Science; Multimedia Information Systems; Computer Communication Networks; Information Systems Applications (incl. Internet); Data Mining and Knowledge Discovery; Document Preparation and Text Processing

Availability

Detected institution	Publication year	Browse	Download	Request
Not detected	2007	SpringerLink

Information

Resource type:

books

Print ISBN

978-3-540-77033-6

Electronic ISBN

978-3-540-77051-0

Publisher

Springer Nature

Country of publication

United Kingdom

Publication date

2007

Publication rights information

Subject coverage

Electrical, electronic and computer engineering

Table of contents

doi: 10.1007/978-3-540-77051-0_31

Document Layout Substructure Discovery

Claudio Andreatta

In this paper we present DoLSuD (Document Layout Substructure Discovery), a system for the automatic discovery of relevant substructures in a document layout. DoLSuD extracts, analyzes and describes the visual content of structured documents, such as catalogs, in order to discover repeating and distinctive substructures in the document layout and to establish relations between textual and image content. The paper presents the system along with experimental results and the web-based service that uses the analysis results.

- Short Papers | Pp. 268-271

doi: 10.1007/978-3-540-77051-0_32

Recognition of JPEG Compressed Face Images Based on AdaBoost

Chunmei Qing; Jianmin Jiang

This paper presents an advanced face recognition system based on the AdaBoost algorithm in the JPEG compressed domain. First, the dimensionality is reduced by truncating some of the block-based DCT coefficients, and nonuniform illumination variations are alleviated by discarding the DC coefficient of each block. Next, an improved AdaBoost.M2 algorithm, which uses Euclidean Distance (ED) to eliminate non-effective weak classifiers, is proposed to select the most discriminative DCT features from the truncated DCT coefficient vectors. Finally, LDA is used as the final classifier. Experiments on Yale face databases show that the proposed approach is superior to other methods in terms of recognition accuracy, efficiency, and illumination robustness.

- Short Papers | Pp. 272-275
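
The first step named in the abstract above (truncate the block DCT coefficients and discard each block's DC term) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the zigzag truncation rule and the `keep` parameter are assumptions made for illustration. A real compressed-domain system would read the coefficients from the JPEG bitstream rather than recompute them from pixels.

```python
import math

N = 8  # JPEG block size


def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of lists."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for x in range(N):
                for y in range(N):
                    s += (block[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * s
    return out


def zigzag_indices(n=N):
    """(row, col) pairs in JPEG zigzag order, from low to high frequency."""
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        # Walk anti-diagonals; odd diagonals by row, even diagonals by column.
        key=lambda rc: (rc[0] + rc[1],
                        rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )


def block_features(block, keep=10):
    """Hypothetical feature vector: first `keep` AC coefficients, DC dropped."""
    coeffs = dct2(block)
    order = zigzag_indices()
    ac = [coeffs[r][c] for r, c in order[1:]]  # order[0] is the DC term
    return ac[:keep]
```

Because a uniform brightness offset changes only the DC coefficient of a block, the features returned by `block_features` are unchanged when a constant is added to every pixel. This is the intuition behind discarding DC to reduce illumination effects.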

doi: 10.1007/978-3-540-77051-0_33

Camera Motion Analysis Towards Semantic-Based Video Retrieval in Compressed Domain

Ying Weng; Jianmin Jiang

To reduce the semantic gap between low-level visual features and the richness of human semantics, this paper proposes new algorithms that combine camera motion descriptors with multiple thresholds to automatically retrieve semantic concepts, i.e., close-up and panorama, directly in the MPEG compressed domain based on camera motion analysis. Extensive experiments illustrate that the proposed algorithms provide promising retrieval results in a real-time application scenario and without human intervention.

- Short Papers | Pp. 276-279
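
The abstract above does not give the actual descriptors or thresholds, so the following Python sketch only illustrates the general idea of labelling a shot by applying several thresholds to camera motion estimates. Every name and value here (`label_shot`, `pan_thresh`, `zoom_thresh`, `min_frames`, and the per-frame pan and zoom inputs) is a hypothetical assumption, not the authors' method.

```python
def longest_run(values, predicate):
    """Length of the longest consecutive run of values satisfying predicate."""
    best = run = 0
    for v in values:
        run = run + 1 if predicate(v) else 0
        best = max(best, run)
    return best


def label_shot(pan, zoom, pan_thresh=2.0, zoom_thresh=0.05, min_frames=5):
    """Assign illustrative semantic labels to one shot.

    pan, zoom: hypothetical per-frame camera motion estimates, e.g. derived
    from compressed-domain motion vectors (assumed inputs).
    """
    labels = []
    # Sustained horizontal camera movement is taken to suggest a panorama.
    if longest_run(pan, lambda v: abs(v) >= pan_thresh) >= min_frames:
        labels.append("panorama")
    # Sustained zoom-in is taken to suggest a close-up.
    if longest_run(zoom, lambda v: v >= zoom_thresh) >= min_frames:
        labels.append("close-up")
    return labels
```

Requiring a minimum run length in addition to a magnitude threshold is one simple way to avoid labelling brief camera jitter as a deliberate pan or zoom.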

doi: 10.1007/978-3-540-77051-0_34

Challenges in Supporting Faceted Semantic Browsing of Multimedia Collections

Daniel Alexander Smith; Alisdair Owens; m. c. schraefel; Patrick Sinclair; Paul André; Max L. Wilson; Alistair Russell; Kirk Martinez; Paul Lewis

We discuss three approaches (3store, D2R and MySQL) that we have explored to support efficient querying of multimedia data sources via mSpace, a rich UI. Our results underline key research challenges facing the development of high-performance RDF query layers to support complex real-time UIs.

- Short Papers | Pp. 280-283

doi: 10.1007/978-3-540-77051-0_35

A Study of Vocabularies for Image Annotation

Allan Hanbury

In order to evaluate image annotation and object categorisation algorithms, ground truth in the form of a set of images correctly annotated with text describing each image is required. Statistics on the WordNet categories of keywords collected from recent automated image annotation and object categorisation publications and evaluation campaigns are presented. These statistics provide a snapshot of keywords used to train and test current image annotation systems as well as information on the usefulness of WordNet for categorising them.

- Short Papers | Pp. 284-287

doi: 10.1007/978-3-540-77051-0_36

Towards a Cross-Media Analysis of Spatially Co-located Image and Text Regions in TV-News

Thierry Declerck; Andreas Cobet

In this poster/short paper we describe ongoing work on the extraction and semantic interpretation of text regions in television news programmes. We present some of the data we consider in this work, the technologies currently in use and where they have to be improved. Finally, we briefly discuss a possible innovative and valuable approach to the establishment of a cross-media analysis framework.

- Short Papers | Pp. 288-291

doi: 10.1007/978-3-540-77051-0_37

Towards Person Google: Multimodal Person Search and Retrieval

Lutz Goldmann; Amjad Samour; Thomas Sikora

Content-based multimedia retrieval systems have been proposed to allow for automatic and efficient indexing and retrieval of the increasing amount of audiovisual data (image, video and audio clips). The search for specific persons within this data is an important subtopic due to its large range of applications. This article describes an original system for multimodal person search and provides some initial performance results that demonstrate the efficiency of the system.

- K-Space Awarded PhD Papers | Pp. 292-295

doi: 10.1007/978-3-540-77051-0_38

Event Detection in Pedestrian Detection and Tracking Applications

Philip Kelly; Noel E. O’Connor; Alan F. Smeaton

In this paper, we present a system framework for event detection in pedestrian detection and tracking applications. The system is built upon a robust computer vision approach to detecting and tracking pedestrians in unconstrained crowded scenes. Upon this framework we propose a pedestrian indexing scheme and a suite of tools for detecting events or retrieving data from a given scenario.

- K-Space Awarded PhD Papers | Pp. 296-299

doi: 10.1007/978-3-540-77051-0_39

SAMMI

Marco Paleari; Benoit Huet; Brian Duffy

Multimedia indexing is about developing techniques allowing people to effectively find media. Content-based methods become necessary when dealing with big databases. Current technology allows exploring the emotional space which is known to carry very interesting semantic information. In this paper we state the need for an integrated method which extracts reliable affective information and attaches this semantic information to the medium itself. We present a list of possible applications and advantages that the emotional information can bring about together with a framework called SAMMI and the preliminary results of this newly initiated research work.

- K-Space Awarded PhD Papers | Pp. 300-303