"Speech and Language Technologies" ed. by Ivo Ipšić
InTeO | 2011 | ISBN: 9533073225 9789533073224 | 354 pages | PDF | 20 MB
This book addresses state-of-the-art systems and achievements across the research field of speech and language technologies. The chapters are organized into sections covering the diverse problems that must be solved in speech recognition and language understanding systems.
The first section presents machine translation systems based on large parallel corpora, using rule-based and statistical translation methods. Its third chapter presents work on real-time, two-way speech-to-speech translation systems. In the second section, two papers explore the use of speech technologies in language learning. The third section presents work on language modeling for speech recognition. The chapters in the fourth section, Text to Speech Systems and Emotional Speech, describe corpus-based speech synthesis and highlight the importance of speech prosody in speech recognition. The fifth section addresses the problem of speaker diarization. The last section presents various topics in speech technology applications, such as audio-visual speech recognition and lip-reading systems.
Contents
Preface
Part 1 Machine Translation
1 Towards Efficient Translation Memory Search Based on Multiple Sentence Signatures
2 Sentence Alignment by Means of Cross-Language Information Retrieval
3 The BBN TransTalk Speech-to-Speech Translation System
Part 2 Language Learning
4 Automatic Feedback for L2 Prosody Learning
5 Exploring Speech Technologies for Language Learning
Part 3 Language Modeling
6 N-Grams Model For Polish
Part 4 Text to Speech Systems and Emotional Speech
7 Multilingual and Multimodal Corpus-Based Text-to-Speech System - PLATTOS -
8 Estimation of Speech Intelligibility Using Perceptual Speech Quality Scores
9 Spectral Properties and Prosodic Parameters of Emotional Speech in Czech and Slovak
10 Speech Interface Evaluation on Car Navigation System - Many Undesirable Utterances and Severe Noisy Speech -
Part 5 Speaker Diarization
11 A Review of Recent Advances in Speaker Diarization with Bayesian Methods
12 Discriminative Universal Background Model Training for Speaker Recognition
Part 6 Applications
13 Building a Visual Front-end for Audio-Visual Automatic Speech Recognition in Vehicle Environments
14 Visual Speech Recognition
15 Towards Augmentative Speech Communication
16 Soccer Event Retrieval Based on Speech Content: A Vietnamese Case Study
17 Voice Interfaces in Art - an Experimentation with Web Open Standards as a Model to Increase Web Accessibility and Digital Inclusion