Text-to-Speech Technology: A Survey of German Speech Synthesis Systems (original) (raw)
Related papers
Design and Development of a Text-To-Speech Synthesizer System
This paper describes the design and development of TTS. This paper describes the overview of different types of synthesis system. One approach to the generation of natural-sounding synthesized speech waveforms is to select and concatenate units from a large speech database. The system used the Syllabication procedure and Phones and Diphones. I. Introduction Speech synthesizer or Text to speech Synthesizer is most widely used system in speech technology. We have various text to speech synthesizer systems available like Festival, Multilingual and Flite etc. A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether it was directly introduced in the computer by an operator or scanned and submitted to an Optical Character Recognition (OCR) system. As such, the process of TTS conversion allows the transformation of a string of phonetic and prosodic symbols into a synthetic speech signal. The quality of the result produced by a TTS sy...
Text - To - Speech Synthesis (TTS)
Speech is one of the oldest and most natural means of information exchange between human. Over the years, Attempts have been made to develop vocally interactive computers to realise voice/speech synthesis. Obviously such an interface would yield great benefits. In this case a computer can synthesize text and give out a speech. Text-To-Speech Synthesis is a Technology that provides a means of converting written text from a descriptive form to a spoken language that is easily understandable by the end user (Basically in English Language). It runs on JAVA platform, and the methodology used was Object Oriented Analysis and Development Methodology; while Expert System was incorporated for the internal operations of the program. This design will be geared towards providing a one-way communication interface whereby the computer communicates with the user by reading out textual document for the purpose of quick assimilation and reading development.
Text-to-speech technology in human computer interaction
CHISA 2006
Speech, or verbal communication, is one of the most important features which distinguish humans from other animals. Researchers in speech technology are still working on getting machines to interact with humans the same way human-to-human communication occurs. Human-computer interaction is a discipline concerned with the design, evaluation and implementation of interactive computing systems for human use . This paper investigates the use of text-to-speech technology and ways of making this technology acceptable and 'userfriendly' on interactive basis. This can be achieved through a design which takes into consideration user expectations. Evaluation is also of importance in order to get feedback and assess if the design meets the user's expectations. It is from this process that a conclusion is drawn on the desirable properties of text-to-speech systems for human-computer interaction.
except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights.
A Novel Digital Signal Processing Software Text-To-Speech
IEEE Transactions on Signal Processing, 1992
This paper is a novel digital signal processing software of the advanced conversion of text-to-speech synthesis technology, which has been available as a range of hardware products for more than ten years, to software. It was initially created as a replacement for character cell terminals and telephony applications, but it is now also used to give people who are visually impaired access to information. With a digital formant synthesizer used to mimic the human vocal tract, text-to-speech quality is very high in both understandability and naturalness. The computational requirements of this synthesizer put a tremendous amount of strain on a workstation in the days before extremely fast processors. Multiple text streams were simultaneously converted to speech for this study using a Digital Equipment AlphaModel 600 workstation. Modern RISC processor power enables applications to freely output speech. A text-to-speech application programming interface (API) is now necessary as a result of this capability. The TTS software API we created is compatible with a wide range of hardware and operating systems. The architecture of the TTS software is described in this paper. Additionally, the API is mentioned. Finally, we describe our experience porting the TTS code base from the previous hardware platforms.