Catálogo de publicaciones - libros

Compartir en
redes sociales

Text, Speech and Dialogue: 8th International Conference, TSD 2005, Karlovy Vary, Czech Republic, September 12-15, 2005, Proceedings

Václav Matoušek ; Pavel Mautner ; Tomáš Pavelka (eds.)

En conferencia: 8º International Conference on Text, Speech and Dialogue (TSD) . Karlovy Vary, Czech Republic . September 12, 2005 - September 15, 2005

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

Language Translation and Linguistics; Artificial Intelligence (incl. Robotics); Information Storage and Retrieval; Information Systems Applications (incl. Internet)

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No detectada	2005	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-28789-6

ISBN electrónico

978-3-540-31817-0

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2005

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Ingeniería eléctrica, electrónica e informática

Lenguas y literatura

Tabla de contenidos

Verificá que desde tu institución tengas acceso para descargar o solicitar el libro completo o alguno de sus capítulos.

doi: 10.1007/11551874_21

Equilibrium Points of Single-Layered Neural Networks with Feedback and Applications in the Analysis of Text Documents

Alexander Shmelev; Vyacheslav Avdeychik

This report describes a technology of coding of the text documents used within the framework of a working breadboard model of meta-search system developed in ”Bineuro” with context-dependent processing and with classification of search results. The technology of automatic analysis and coding of text documents is based on text corpus representation in the form of an associative semantic network. Code vectors are generated as the equilibrium points of neural network with feedback and with parallel dynamics, such as Hopfild network with an asymmetrical matrix of feedback. These code vectors can be used for tasks of ranging, automatic cluster analysis of documents and in many other tasks connected to automatic post processing of search results by the search engines on the Internet. As an example of application of code vectors we will consider below a task of classification of text documents.

- Text | Pp. 164-170

doi: 10.1007/11551874_22

A Syntax and Semantics Linking Algorithm for the Chinese Language

Zhou Qiang; Dang Zhengfa

Many statistics-based approaches have been proposed to label semantic roles automatically. For language lack of large-scale semantically annotated corpora, e.g. Chinese, these approaches do not work well. In this paper we proposed an unsupervised syntax and semantic linking algorithm, and make full use of current Chinese language resources to equip it with detailed linking knowledge. Therefore, the semantic role labeling task can be attributed to a data-driven application of the linking algorithm. Some preliminary experiment results demonstrate the ability of the linking algorithm for the automatic semantic role labeling on current Chinese treebank.

- Text | Pp. 171-178

doi: 10.1007/11551874_23

Fuzzy Information Retrieval Indexed by Concept Identification

Bo-Yeong Kang; Dae-Won Kim; Hae-Jung Kim

To retrieve relevant information, indexing should be achieved using the concepts of the document that a writer intends to highlight. Moreover, the user involvement is increasingly required to extract relevant information from information sources. Therefore, in the present work we propose a fuzzy retrieval model indexed by concept identification: (1) a concept identification based indexing and (2) a novel fuzzy ranking model. The concept based indexing identifies index terms by considering the concepts of a document, and a novel fuzzy ranking model based on the user preference is presented, which is able to calculates the relevance ranking based on the user preference.

- Text | Pp. 179-186

doi: 10.1007/11551874_24

A Theme Allocation for a Sentence Based on Head Driven Patterns

Bo-Yeong Kang; Sung-Hyon Myaeng

Since sentences are the basic propositional units of text, knowing their themes should help various tasks requiring the knowledge about the semantic content of text. In this paper, we examine the notion of sentence theme and propose an automatic scheme where head-driven patterns are used for theme assignment. We tested our scheme with sentences in encyclopedia articles and obtained a promising result of 98.96% in F-score for training data and 88.57% for testing data, which outperform the baseline.

- Text | Pp. 187-194

doi: 10.1007/11551874_25

A Hybrid Approach to Statistical Language Modeling with Multilayer Perceptrons and Unigrams

Fernando Blat; María José Castro; Salvador Tortajada; Joan Andreu Sánchez

In language engineering, language models are employed in order to improve system performance. These language models are usually -gram models which are estimated from large text databases using the occurrence frequencies of these -grams. An alternative to conventional frequency-based estimation of -gram probabilities consists on using neural networks to this end. In this paper, an approach to language modeling with a hybrid language model is presented as a linear combination of a connectionist -gram model, which is used to represent the global relations between certain linguistic categories, and a stochastic model of word distribution into such categories. The hybrid language model is tested on the corpus of the Wall Street journal processed in the Penn Treebank project.

- Speech | Pp. 195-202

doi: 10.1007/11551874_26

Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech

Petr Cerva; Jan Nouza

This paper deals with the problem of efficient speaker adaptation in large vocabulary continuous speech recognition (LVCSR) systems. The main goal is to adapt acoustic models of speech and to increase the recognition accuracy of these systems in tasks, where only one user is expected (e.g. voice dictation) or where the speaking person can be identified automatically (e.g. broadcast news transcription). For this purpose, we propose several modifications of the well known MLLR (Maximum Likelihood Linear Regression) method and we combine them with the MAP (Maximum A Posteriori) method. The results from a series of experiments show that the error rate of our 300K-word Czech recogniser can be reduced by about 9.9 % when only 30 seconds of supervised data are used for adaptation or by about 9.6 % when unsupervised adaptation on the same data is performed.

- Speech | Pp. 203-210

doi: 10.1007/11551874_27

Modelling Lexical Stress

Rogier C. van Dalen; Pascal Wiggers; Leon J. M. Rothkrantz

Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propose that different spectral features are needed for different phonemes and that, besides vowels, consonants should be taken into account.

- Speech | Pp. 211-218

doi: 10.1007/11551874_28

The Sound Database Formation for the Allophone-Based Model for English Concatenative Speech Synthesis

Karina Evgrafova

The goal of this paper is to describe the development of the sound database for the allophone-based model for English concatenative speech synthesis. The procedure of the sound unit inventory construction is described and its main results are presented. At present moment the optimized sound units inventory of the allophonic database for English concatenative speech synthesis contains 1200 elements (1000 vowel allophones and 200 consonant allophones). The smoothness of junctions between the allophones shows high quality of the segmentation made. The decrease in the number of the database components in the result of optimization does not affect the quality of the resulting synthesized speech. At the level of segments it can be evaluated as fairly high in terms of both naturalness and intelligibility.

- Speech | Pp. 219-225

doi: 10.1007/11551874_29

Using Artificially Reverberated Training Data in Distant-Talking ASR

Tino Haderlein; Elmar Nöth; Wolfgang Herbordt; Walter Kellermann; Heinrich Niemann

Automatic Speech Recognition (ASR) in reverberant rooms can be improved by choosing training data from the same acoustical environment as the test data. In a real-world application this is often not possible. A solution for this problem is to use speech signals from a close-talking microphone and reverberate them artificially with multiple room impulse responses. This paper shows results on recognizers whose training data differ in size and percentage of reverberated signals in order to find the best combination for data sets with different degrees of reverberation. The average error rate on a close-talking and a distant-talking test set could thus be reduced by 29% relative.

- Speech | Pp. 226-233

doi: 10.1007/11551874_30

French–German Bilingual Acoustic Modeling for Embedded Voice Driven Applications

Jozef Ivanecký; Volker Fischer; Siegfried Kunzmann

Multilingual access to information and services is a key requirement in any pervasive or ubiquitous computing environment. In this paper we describe our efforts towards multilingual speech recognition with a focus on applications that are designed to run on embedded devices, like e.g. a commercially available PDA. We give an overview on speech recognition techniques suited for the special requirements of the expected phonetic and acoustic environments and explore the ability to create multilingual acoustic models and applications that are able to run on embedded devices in real-time.

- Speech | Pp. 234-240