Catálogo de publicaciones - libros
Text, Speech and Dialogue: 8th International Conference, TSD 2005, Karlovy Vary, Czech Republic, September 12-15, 2005, Proceedings
Václav Matoušek ; Pavel Mautner ; Tomáš Pavelka (eds.)
En conferencia: 8º International Conference on Text, Speech and Dialogue (TSD) . Karlovy Vary, Czech Republic . September 12, 2005 - September 15, 2005
Resumen/Descripción – provisto por la editorial
No disponible.
Palabras clave – provistas por la editorial
Language Translation and Linguistics; Artificial Intelligence (incl. Robotics); Information Storage and Retrieval; Information Systems Applications (incl. Internet)
Disponibilidad
Institución detectada | Año de publicación | Navegá | Descargá | Solicitá |
---|---|---|---|---|
No detectada | 2005 | SpringerLink |
Información
Tipo de recurso:
libros
ISBN impreso
978-3-540-28789-6
ISBN electrónico
978-3-540-31817-0
Editor responsable
Springer Nature
País de edición
Reino Unido
Fecha de publicación
2005
Información sobre derechos de publicación
© Springer-Verlag Berlin Heidelberg 2005
Tabla de contenidos
doi: 10.1007/11551874_21
Equilibrium Points of Single-Layered Neural Networks with Feedback and Applications in the Analysis of Text Documents
Alexander Shmelev; Vyacheslav Avdeychik
This report describes a technology of coding of the text documents used within the framework of a working breadboard model of meta-search system developed in ”Bineuro” with context-dependent processing and with classification of search results. The technology of automatic analysis and coding of text documents is based on text corpus representation in the form of an associative semantic network. Code vectors are generated as the equilibrium points of neural network with feedback and with parallel dynamics, such as Hopfild network with an asymmetrical matrix of feedback. These code vectors can be used for tasks of ranging, automatic cluster analysis of documents and in many other tasks connected to automatic post processing of search results by the search engines on the Internet. As an example of application of code vectors we will consider below a task of classification of text documents.
- Text | Pp. 164-170
doi: 10.1007/11551874_22
A Syntax and Semantics Linking Algorithm for the Chinese Language
Zhou Qiang; Dang Zhengfa
Many statistics-based approaches have been proposed to label semantic roles automatically. For language lack of large-scale semantically annotated corpora, e.g. Chinese, these approaches do not work well. In this paper we proposed an unsupervised syntax and semantic linking algorithm, and make full use of current Chinese language resources to equip it with detailed linking knowledge. Therefore, the semantic role labeling task can be attributed to a data-driven application of the linking algorithm. Some preliminary experiment results demonstrate the ability of the linking algorithm for the automatic semantic role labeling on current Chinese treebank.
- Text | Pp. 171-178
doi: 10.1007/11551874_23
Fuzzy Information Retrieval Indexed by Concept Identification
Bo-Yeong Kang; Dae-Won Kim; Hae-Jung Kim
To retrieve relevant information, indexing should be achieved using the concepts of the document that a writer intends to highlight. Moreover, the user involvement is increasingly required to extract relevant information from information sources. Therefore, in the present work we propose a fuzzy retrieval model indexed by concept identification: (1) a concept identification based indexing and (2) a novel fuzzy ranking model. The concept based indexing identifies index terms by considering the concepts of a document, and a novel fuzzy ranking model based on the user preference is presented, which is able to calculates the relevance ranking based on the user preference.
- Text | Pp. 179-186
doi: 10.1007/11551874_24
A Theme Allocation for a Sentence Based on Head Driven Patterns
Bo-Yeong Kang; Sung-Hyon Myaeng
Since sentences are the basic propositional units of text, knowing their themes should help various tasks requiring the knowledge about the semantic content of text. In this paper, we examine the notion of sentence theme and propose an automatic scheme where head-driven patterns are used for theme assignment. We tested our scheme with sentences in encyclopedia articles and obtained a promising result of 98.96% in F-score for training data and 88.57% for testing data, which outperform the baseline.
- Text | Pp. 187-194
doi: 10.1007/11551874_25
A Hybrid Approach to Statistical Language Modeling with Multilayer Perceptrons and Unigrams
Fernando Blat; María José Castro; Salvador Tortajada; Joan Andreu Sánchez
In language engineering, language models are employed in order to improve system performance. These language models are usually -gram models which are estimated from large text databases using the occurrence frequencies of these -grams. An alternative to conventional frequency-based estimation of -gram probabilities consists on using neural networks to this end. In this paper, an approach to language modeling with a hybrid language model is presented as a linear combination of a connectionist -gram model, which is used to represent the global relations between certain linguistic categories, and a stochastic model of word distribution into such categories. The hybrid language model is tested on the corpus of the Wall Street journal processed in the Penn Treebank project.
- Speech | Pp. 195-202
doi: 10.1007/11551874_26
Supervised and Unsupervised Speaker Adaptation in Large Vocabulary Continuous Speech Recognition of Czech
Petr Cerva; Jan Nouza
This paper deals with the problem of efficient speaker adaptation in large vocabulary continuous speech recognition (LVCSR) systems. The main goal is to adapt acoustic models of speech and to increase the recognition accuracy of these systems in tasks, where only one user is expected (e.g. voice dictation) or where the speaking person can be identified automatically (e.g. broadcast news transcription). For this purpose, we propose several modifications of the well known MLLR (Maximum Likelihood Linear Regression) method and we combine them with the MAP (Maximum A Posteriori) method. The results from a series of experiments show that the error rate of our 300K-word Czech recogniser can be reduced by about 9.9 % when only 30 seconds of supervised data are used for adaptation or by about 9.6 % when unsupervised adaptation on the same data is performed.
- Speech | Pp. 203-210
doi: 10.1007/11551874_27
Modelling Lexical Stress
Rogier C. van Dalen; Pascal Wiggers; Leon J. M. Rothkrantz
Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propose that different spectral features are needed for different phonemes and that, besides vowels, consonants should be taken into account.
- Speech | Pp. 211-218
doi: 10.1007/11551874_28
The Sound Database Formation for the Allophone-Based Model for English Concatenative Speech Synthesis
Karina Evgrafova
The goal of this paper is to describe the development of the sound database for the allophone-based model for English concatenative speech synthesis. The procedure of the sound unit inventory construction is described and its main results are presented. At present moment the optimized sound units inventory of the allophonic database for English concatenative speech synthesis contains 1200 elements (1000 vowel allophones and 200 consonant allophones). The smoothness of junctions between the allophones shows high quality of the segmentation made. The decrease in the number of the database components in the result of optimization does not affect the quality of the resulting synthesized speech. At the level of segments it can be evaluated as fairly high in terms of both naturalness and intelligibility.
- Speech | Pp. 219-225
doi: 10.1007/11551874_29
Using Artificially Reverberated Training Data in Distant-Talking ASR
Tino Haderlein; Elmar Nöth; Wolfgang Herbordt; Walter Kellermann; Heinrich Niemann
Automatic Speech Recognition (ASR) in reverberant rooms can be improved by choosing training data from the same acoustical environment as the test data. In a real-world application this is often not possible. A solution for this problem is to use speech signals from a close-talking microphone and reverberate them artificially with multiple room impulse responses. This paper shows results on recognizers whose training data differ in size and percentage of reverberated signals in order to find the best combination for data sets with different degrees of reverberation. The average error rate on a close-talking and a distant-talking test set could thus be reduced by 29% relative.
- Speech | Pp. 226-233
doi: 10.1007/11551874_30
French–German Bilingual Acoustic Modeling for Embedded Voice Driven Applications
Jozef Ivanecký; Volker Fischer; Siegfried Kunzmann
Multilingual access to information and services is a key requirement in any pervasive or ubiquitous computing environment. In this paper we describe our efforts towards multilingual speech recognition with a focus on applications that are designed to run on embedded devices, like e.g. a commercially available PDA. We give an overview on speech recognition techniques suited for the special requirements of the expected phonetic and acoustic environments and explore the ability to create multilingual acoustic models and applications that are able to run on embedded devices in real-time.
- Speech | Pp. 234-240