Catálogo de publicaciones - libros

Compartir en
redes sociales

Pattern Recognition and Image Analysis: Second Iberian Conference, IbPRIA 2005, Estoril, Portugal, June 7-9, 2005, Proceeding, Part II

Jorge S. Marques ; Nicolás Pérez de la Blanca ; Pedro Pina (eds.)

En conferencia: 2º Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA) . Estoril, Portugal . June 7, 2005 - June 9, 2005

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Pattern Recognition; Image Processing and Computer Vision; Artificial Intelligence (incl. Robotics); Document Preparation and Text Processing; Computer Graphics

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No detectada	2005	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-26154-4

ISBN electrónico

978-3-540-32238-2

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2005

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Ingeniería eléctrica, electrónica e informática

Tabla de contenidos

Verificá que desde tu institución tengas acceso para descargar o solicitar el libro completo o alguno de sus capítulos.

doi: 10.1007/11492542_71

Cryptographic-Speech-Key Generation Architecture Improvements

L. Paola García-Perera; Juan A. Nolazco-Flores; Carlos Mex-Perera

In this work we show a performance improvement of our system by taking into account the weights of the mixture of Gaussians of the Hidden Markov Model. Furthermore and independently tunning of each of the phoneme Support Vector Machine (SVM) parameters is performed. In our system the user utters a pass phrase and the phoneme waveform segments are found using the Automatic Speech Recognition Technology. Given the speech model and the phoneme information in the segments, a set of features are created to train an SVM that could generate a cryptographic key. Applying our method to a set of 10, 20, and 30 speakers from the YOHO database, the results show a good improvement compared with our last configuration, improving the robustness in the generation of the cryptographic key.

Palabras clave: Support Vector Machine; Hide Markov Model; Continuous Hide Markov Model; Phoneme Information; Automatic Speech Recogniser.

VIII - Speech Recognition | Pp. 579-585

doi: 10.1007/11492542_72

Performance of a SCFG-Based Language Model with Training Data Sets of Increasing Size

Joan Andreu Sánchez; José Miguel Benedí; Diego Linares

In this paper, a hybrid language model which combines a word-based n-gram and a category-based Stochastic Context-Free Grammar (SCFG) is evaluated for training data sets of increasing size. Different estimation algorithms for learning SCFGs in General Format and in Chomsky Normal Form are considered. Experiments on the UPenn Treebank corpus are reported. These experiments have been carried out in terms of the test set perplexity and the word error rate in a speech recognition experiment.

Palabras clave: Syntactic Structure; Word Error Rate; Terminal Symbol; Statistical Language Modeling; Perplexity Result.

VIII - Speech Recognition | Pp. 586-594

doi: 10.1007/11492542_73

Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages

Juan A. Nolazco-Flores; Luis R. Salgado-Garza; Marco Peña-Díaz

The purpose of this work is to show the results obtained when the latest technological advances in the area of Automatic Speech Recognition (ASR) are applied to the Western-Huastec Náhuatl and Huastec languages. Western-Huastec Náhuatl and Huastec are not only native (indigenous) languages in México, but also minority languages, and people who speak these languages usually are analphabetic. A speech database was created by recording the voice of native speaker when reading a set of documents used for native bilingual primary school in the official mexican state education system. A pronunciation dictionary was created for each language. A continuous Hidden Markov Models (HMM) were used for acoustical modeling, and bigrams were used for language Modeling. A Viterbi decoder was used for recognition. The word error rate of this task is below 8.621% for Western-Huastec Náhuatl language and 10.154% for Huastec language.

Palabras clave: Automatic Speech Recognition; Minority Language; Speech Recognition System; Word Error Rate; Automatic Speech Recognition System.

VIII - Speech Recognition | Pp. 595-602

doi: 10.1007/11492542_74

Phrase-Based Alignment Models for Statistical Machine Translation

Jesús Tomás; Jaime Lloret; Francisco Casacuberta

The first pattern recognition approaches to machine translation were based on single-word models. However, these models present an important deficiency; they do not take contextual information into account for the translation decision. The phrase-based approach consists in translating a multiword source sequence into a multiword target sequence, instead of a single source word into a single target word. We present different methods to train the parameters of this kind of model. In the evaluation phase of this approach, we obtained interesting results in comparison with other statistical models.

IX - Natural Language Analysis | Pp. 605-613

doi: 10.1007/11492542_75

Automatic Segmentation of Bilingual Corpora: A Comparison of Different Techniques

Ismael García Varea; Daniel Ortiz; Francisco Nevado; Pedro A. Gómez; Francisco Casacuberta

Segmentation of bilingual text corpora is a very important issue to deal with in machine translation. In this paper we present a new method to perform bilingual segmentation of a parallel corpus, SPBalign , which is based on phrase-based statistical translation models. The new technique proposed here is compared with other two existing techniques, which are also based on statistical translation methods: the RECalign technique, which is based on the concept of recursive alignment, and the GIATIalign technique, which is based on simple word alignments. Experimental results are obtained for the EuTrans-I English-Spanish task, in order to create new, shorter bilingual segments to be included in a translation memory database. The evaluation of these three methods has been performed comparing the bilingual segmentations obtained by these techniques with respect to a manually segmented bilingual test corpus. These results show us that the new method proposed here outperforms in all cases the two already proposed bilingual segmentation techniques.

IX - Natural Language Analysis | Pp. 614-621

doi: 10.1007/11492542_76

Word Translation Disambiguation Using Multinomial Classifiers

Jesús Andrés; José R. Navarro; Alfons Juan; Francisco Casacuberta

This work focuses on a hybrid machine translation system from Spanish into Catalan called SisHiTra. In particular, we focus on its word translation disambiguation module, which has to decide on the correct translation of each ambiguous input word in accordance with its context. We propose the use of statistical pattern recognition techniques for this task and, in particular, multinomial Naive Bayes text classifiers. Extensive empirical results on the use of these classifiers are presented, in which the influence of the window (context) size and parameter smoothing are carefully studied.

Palabras clave: Window Size; Machine Translation; Ambiguous Word; Smoothing Technique; Statistical Machine Translation.

IX - Natural Language Analysis | Pp. 622-629

doi: 10.1007/11492542_77

Different Approaches to Bilingual Text Classification Based on Grammatical Inference Techniques

Jorge Civera; Elsa Cubel; Alfons Juan; Enrique Vidal

Bilingual documentation has become a common phenomenon in many official institutions and private companies. In this scenario, the categorization of bilingual text is a useful tool, that can be also applied in the machine translation field. To tackle this classification task, different approaches will be proposed. On the one hand, two finite-state transducer algorithms from the grammatical inference domain will be discussed. On the other hand, the well-known naive Bayes approximation will be presented along with a possible modelization based on n -gram language models. Experiments carried out on a bilingual corpus have demonstrated the adequacy of these methods and the relevance of a second information source in text classification, as supported by classification error rates. Relative reduction of 29% with respect to the best previous results on the monolingual version of the same task has been obtained.

Palabras clave: Machine Translation; Input Sentence; Grammatical Inference; Regular Grammar; Bilingual Corpus.

IX - Natural Language Analysis | Pp. 630-637

doi: 10.1007/11492542_78

Semantic Similarity Between Sentences Through Approximate Tree Matching

Francisco Jose Ribadas; Manuel Vilares; Jesus Vilares

We describe an algorithm to measure the similarity between sentences, integrating the edit distance between trees and single-term similarity techniques, and also allowing the pattern to be defined approximately, omitting some structural details. A technique of this kind is of interest in a variety of applications, such as information extraction/retrieval or question answering, where error-tolerant recognition allows incomplete sentences to be integrated in the computation process.

Palabras clave: Semantic Similarity; Pattern Tree; Edit Distance; Parse Tree; Semantic Distance.

IX - Natural Language Analysis | Pp. 638-646

doi: 10.1007/11492542_79

A Text Categorization Approach for Music Style Recognition

Carlos Pérez-Sancho; José M. Iñesta; Jorge Calera-Rubio

The automatic classification of music files into styles is one challenging problem in music information retrieval and for music style perception understanding. It has a number of applications, like the indexation and exploration of musical databases. Some techniques used in text classification can be applied to this problem. The key point is to establish a music equivalent to the words in texts. A number of works use the combination of intervals and duration ratios for music description. In this paper, different statistical text recognition algorithms are applied to style recognition using this kind of melody representation, exploring their performance for different word sizes and statistical models.

Palabras clave: Near Neighbour; Average Mutual Information; Word Size; Musical Genre; Music Information Retrieval.

X - Applications | Pp. 649-657

doi: 10.1007/11492542_80

The MORFO3D Foot Database

José García-Hernández; Stella Heras; Alfons Juan; Roberto Paredes; Beatriz Nácher; Sandra Alemany; Enrique Alcántara; Juan Carlos González

A foot database comprising 3D foot shapes and footwear fitting reports of more than 300 participants is presented. It was primarily acquired to study footwear fitting, though it can also be used to analyse anatomical features of the foot. In fact, we present a technique for automatic detection of several foot anatomical landmarks, together with some empirical results.

Palabras clave: Mean Square Error; Anatomical Landmark; Automatic Detection; Medial Malleolus; Adhesive Marker.

X - Applications | Pp. 658-665