Catálogo de publicaciones - libros

Compartir en
redes sociales


Semantic Multimedia: Second International Conference on Semantic and Digital Media Technologies, SAMT 2007, Genoa, Italy, December 5-7, 2007. Proceedings

Bianca Falcidieno ; Michela Spagnuolo ; Yannis Avrithis ; Ioannis Kompatsiaris ; Paul Buitelaar (eds.)

En conferencia: 2º International Conference on Semantic and Digital Media Technologies (SAMT) . Genoa, Italy . December 5, 2007 - December 7, 2007

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Popular Computer Science; Multimedia Information Systems; Computer Communication Networks; Information Systems Applications (incl. Internet); Data Mining and Knowledge Discovery; Document Preparation and Text Processing

Disponibilidad
Institución detectada Año de publicación Navegá Descargá Solicitá
No detectada 2007 SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-77033-6

ISBN electrónico

978-3-540-77051-0

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

Información sobre derechos de publicación

© Springer-Verlag Berlin Heidelberg 2007

Tabla de contenidos

Outline of the MX Standard

Luca A. Ludovico

MX is a new XML-based format to describe comprehensively heterogeneous music contents. In a single MX file, music symbols, printed scores, audio tracks, computer-driven performances, catalogue metadata, text and graphic contents related to a single music piece are linked and mutually synchronised within the same framework. Heterogeneous contents are organised in a multilayered structure that supports different encoding formats and a number of digital objects for each layer.

- MX: The IEEE Standard for Interactive Music | Pp. 196-199

Automatic Synchronisation Between Audio and Score Musical Description Layers

Antonello D’Aguanno; Giancarlo Vercellesi

This work describes algorithms dedicated to score and audio alignment using the MX / IEEE P1599 format. The format allows description of the score, and management of synchronisation points, linking them with different versions of the performed music. An algorithm is proposed here that allows alignment of an MX score and its execution, coded in PCM format, which produces an output for the MX Spine that contains synchronisation between notes and audio signal. The proposed architecture is based on two different steps: the first deals with the audio level and the extraction of features like pitch, onset and the like, while the second determines the alignment between the features extracted in the first step and the notes present in the MX score. For each step, different algorithms are proposed and discussed, and analysis and comparison of synchronisation capabilities are provided.

- MX: The IEEE Standard for Interactive Music | Pp. 200-210

Interacting at the Symbolic and Structural Levels

Adriano Baratè

This article presents an approach to music interaction and creativity at the and . While common music interaction occurs at the symbolic level, on a sequence of notes, it is the structural level, the focus of this writing, that allows interaction with music contents at a higher degree. This requires identification and creation of in existing and new compositions, then the resulting set of music themes can be treated with Petri Nets, even in real time. Algorithms can be applied to modify the music synthesised or played, and, thanks to MX, this interaction is performed simultaneously at different levels. After an introduction of the main concepts, the article describes a case study based on “Peaches en Regalia”, by Frank Zappa, presented at the , , which took place in Rome on 9-10 June 2006. Finally, advanced MX scenarios will be presented.

- MX: The IEEE Standard for Interactive Music | Pp. 211-221

Interacting with Csound Timbral Textures

Elisa Russo

The possibility of interconnecting with allows creation and manipulation of timbres in an exhaustive way. Thanks to Csound features, it is possible to let MX files sound with any synthesis algorithm. This approach supports all the connections to current files through MIDI, real-time controls, analysis and synthesis thanks to the direct interaction with files linked in MX.

- MX: The IEEE Standard for Interactive Music | Pp. 222-225

A Semantic Web Environment for Digital Shapes Understanding

Leila De Floriani; Annie Hui; Laura Papaleo; May Huang; James Hendler

In the last few years, the volume of multimedia content available on the Web significantly increased. This led to the need for techniques to handle such data. In this context, we see a growing interest in considering the Semantic Web in action and in the definition of tools capable of analyzing and organizing digital shape models. In this paper, we present a Semantic Web environment, , for inspecting 3D shapes and for structuring and annotating such shapes according to ontology-driven metadata. Specifically, we describe in details the first module of , the and the algorithms we have developed for extracting geometrical and topological information from 3D shapes. We also describe the second module, the which produces a graph-based representation of the decomposition of the shape into manifold components. This is successively modified by the third and the fourth modules, which perform the automatic and manual segmentation of the manifold parts.

- Semantic Multimedia Annotation II | Pp. 226-239

Modeling Linguistic Facets of Multimedia Content for Semantic Annotation

Massimo Romanelli; Paul Buitelaar; Michael Sintek

We provide an integrated ontological framework offering coverage for deep semantic content, including ontological representation of multimedia based on the MPEG-7 standard. We link the deep semantic level with the media-specific semantic level to operationalize multimedia information. Through the link between multimedia representation and the semantics of specific domains we approach the Semantic Gap. The focus of the paper is on the linguistic features of multimedia, the annotation of these features and their analysis.

- Semantic Multimedia Annotation II | Pp. 240-251

Leveraging Ontologies, Context and Social Networks to Automate Photo Annotation

Fergal Monaghan; David O’Sullivan

This paper presents an approach to semi-automate photo annotation. Instead of using content-recognition techniques this approach leverages context information available at the scene of the photo such as time and location in combination with existing photo annotations to provide suggestions to the user. An algorithm exploits a number of technologies including Global Positioning System (GPS), Semantic Web, Web services and Online Social Networks, considering all information and making a best-effort attempt to suggest both people and places depicted in the photo. The user then selects which of the suggestions are correct to annotate the photo. This process accelerates the photo annotation process dramatically which in turn aids photo search for a wide range of query tools that currently trawl the millions of photos on the Web.

- Short Papers | Pp. 252-255

Use Cases of Scalable Video Based Summarization and Adaptation Within MPEG-21 DIA

Luis Herranz; José M. Martínez

Frame selection techniques based on semantic analysis are widely used to provide content-based summaries of video sequences. Video adaptation is also required to enable the consumption of these summaries through an increasing variety of devices and networks. Scalable video coding provides embedded coding of video where video adaptation is very simple and efficient. In this paper, we describe a model where both video summarization and adaptation are integrated within the MPEG-21 DIA framework, along with some use cases.

- Short Papers | Pp. 256-259

Ontology for Semantic Integration in a Cognitive Surveillance System

Carles Fernández; Jordi Gonzàlez

The increasing interest in Cognitive Vision Systems (CVS) motivates the apparition of ad-hoc stages designed for the integration of multiple kinds of knowledge. This paper proposes a novel ontology to restrict and integrate high-level semantics for Human Sequence Evaluation (HSE), which targets multilingual capabilities and multipurpose end-user interfaces. The main contributions of this paper are the conception of a neutral semantic layer, which allows to link vision and linguistic domains; and the use of instead of verbs as basic elements for an ontological categorization of occurrences. In our approach, the domain has been restricted to outdoor surveilled scenarios, involving interactions among pedestrians, static objects, and vehicular traffic.

- Short Papers | Pp. 260-263

Conditional Random Fields for High-Level Part Correlation Analysis in Images

Giuseppe Passino; Ebroul Izquierdo

A novel approach to model the semantic knowledge associated to objects detected in images is presented. The model is aimed at the classification of such objects according to contextual information combined to the extracted features. The system is based on Conditional Random Fields, a probabilistic graphical model used to model the conditional a-posteriori probability of the object classes, thus avoiding problems related to source modelling and features independence constraints. The novelty of the approach is in the addressing of the high-level, semantically rich objects interrelationships among image parts. This paper presents the application of the model to this new problem class and a first implementation of the system.

- Short Papers | Pp. 264-267