Research Review on Text-to-Speech Systems and Speech Synthesizing Techniques
Related papers
A text-to-speech (TTS) conversion system enables the user to enter text in the Gujarati language and generates the equivalent sound as output. Such a system is particularly useful for illiterate and vision-impaired people, who can hear and understand the content. TTS systems still struggle to produce emotional speech like that of a human being, and researchers are working to give synthesized speech emotion and feeling. This shows that further research can enhance the efficiency of TTS systems. TTS systems are under development for many languages other than Gujarati, so we propose a TTS system for Gujarati. The input to the proposed system is typed or scanned Gujarati text, and the output is the equivalent Gujarati speech with a smooth flow. We are also trying to add a new feature that superimposes the user's own voice frequency on the prerecorded synthesized speech, so that any text can be heard in the user's own voice. The paper starts with an introduction to the fundamental concepts of TTS synthesis, which will be useful for readers who are less familiar with this area of research.
2012
The basic goal of a Text To Speech (TTS) system is to synthesize speech for a given input text. TTS is thus an automatic counterpart of a human being reading written text aloud. For people with visual impairments, TTS systems are helpful for communicating with others. A limited-domain TTS, as the name suggests, is built to serve a specific purpose, for example a TTS used for announcement-related queries. An unrestricted TTS system is capable of synthesizing good-quality speech in different domains. The Festival framework has been used for building the TTS system. In this paper, an overview of the technique of generic TTS is given in the first section. Section II contains the details of the speech corpus used for developing the TTS. The basic prerequisites and language-specific issues involved in building a TTS for Gujarati are explained. The technique called Hidden Markov Model (HMM)-based speech synthesis has been demonstrated to be very effective in synthesizing acce...
Recent Trends in Text to Speech Synthesis of Indian Languages
Helix- The Scientific explorer, 2019
A Text To Speech (TTS) synthesizer is a computer application capable of converting arbitrary input text into speech. This conversion broadly involves two steps, namely text processing and speech synthesis. Text processing converts the entered text into a sequence of synthesis units, while speech synthesis generates an acoustic waveform corresponding to each of these units. Naturalness and intelligibility are the most important qualities expected from a TTS system. In this paper we aim to provide an overview of various techniques for text-to-speech synthesis, discuss their characteristics, and summarize and compare their advantages and drawbacks. We also list various text-to-speech synthesis frameworks developed and implemented at different Indian institutes.
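The two steps named in this abstract can be sketched in a few lines of Python. This is only an illustrative toy, not any of the surveyed systems: the function names, the sample rate, the choice of single letters as "synthesis units", and the sine-tone placeholder for each unit are all assumptions made for the example.

```python
import math

SAMPLE_RATE = 16000  # samples per second (assumed value)


def text_to_units(text):
    """Text processing: normalise the text and split it into synthesis units.

    A unit here is just a lowercase letter, standing in for the phonemes
    or syllables a real system would produce.
    """
    return [ch for ch in text.lower() if ch.isalpha()]


def synthesize_unit(unit, duration_ms=50):
    """Speech synthesis: generate a waveform for one unit.

    A sine tone whose pitch depends on the letter is a placeholder for
    a recorded or modelled speech segment.
    """
    freq = 200.0 + 20.0 * (ord(unit) - ord("a"))
    n_samples = SAMPLE_RATE * duration_ms // 1000
    return [math.sin(2 * math.pi * freq * i / SAMPLE_RATE)
            for i in range(n_samples)]


def tts(text):
    """Run both stages and join the per-unit waveforms end to end."""
    wave = []
    for unit in text_to_units(text):
        wave.extend(synthesize_unit(unit))
    return wave


units = text_to_units("Hi, TTS!")
wave = tts("Hi, TTS!")
```

Keeping the two stages as separate functions mirrors the split described above: the text front end can be replaced for a new language without touching the waveform generator.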
The Main Principles of Text-to-Speech Synthesis System
2010
Abstract: In this paper, the main principles of a text-to-speech synthesis system are presented. Associated problems that arise when developing a speech synthesis system are described. The approaches used and their application in speech synthesis systems for the Azerbaijani language are shown. Keywords: synthesis of Azerbaijani language, morphemes, phonemes, sounds, sentence, speech synthesizer, intonation, accent, pronunciation.
A survey on speech synthesis techniques in Indian languages
Multimedia Systems, 2020
Text-to-speech technology has achieved significant progress during the past decade and is an active area of research and development for building human-computer interactive systems. Although a number of speech synthesis models are available for different languages, each focusing on its domain requirements and motivating applications, no source of information on current trends in Indian language speech synthesis has been available to date. This makes it difficult for beginners to initiate research on TTS systems for low-resourced languages. This paper provides a review of the contributions made by different researchers in the field of Indian language speech synthesis, along with a study of Indian language characteristics and the associated challenges in designing TTS systems. A set of available applications and tools resulting from projects undertaken by different organizations, along with a set of possible future developments, is also discussed. The aim is to provide a single reference to an important strand of research in speech synthesis, which may benefit anyone interested in initiating research in this area.
Text to Speech Synthesis of Hindi Language using Polysyllable Units
2020
A Text To Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud. TTS technology is therefore essential for people who are visually impaired, and it also plays a very important role in telecommunication, industrial and educational applications. TTS has been developed for foreign languages and is well established there. Because Indian language characters are complex in nature, building a TTS system for Indian languages is not as straightforward as it is for English. India is a multilingual country, and Hindi is one of its 23 official languages. Hence this paper discusses the development of a Hindi TTS system. Syllable units are a better choice for Hindi than any other unit, because each character in Hindi is close to a syllable, which is in the form of
Tools for the development of a Hindi Speech Synthesis System
Fifth ISCA Workshop …, 2004
We describe in detail a Grapheme-to-Phoneme (G2P) converter required for the development of a good-quality Hindi Text-to-Speech (TTS) system. The Festival framework is chosen for developing the Hindi TTS system. Since Festival does not provide complete language processing support specific to various languages, it needs to be augmented to facilitate the development of TTS systems in certain new languages. Because of this, a generic G2P converter has been developed. In the customized Hindi G2P converter, we have handled schwa deletion and compound word extraction. In the experiments carried out to test the Hindi G2P on a text segment of 3485 words, a word phonetisation accuracy of 97.67% is obtained. This Hindi G2P has been used for phonetising large text corpora, which in turn are used in designing an inventory of phonetically rich sentences. The sentences ensured good coverage of the phonetically valid diphones using only 0.3% of the complete text corpora.
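To make the idea of schwa deletion concrete, here is a minimal Python sketch of a Devanagari grapheme-to-phoneme step. It is not the converter described in this abstract, whose rules are not given here. It assumes a deliberately tiny character table, a romanised phone set invented for the example, and only the simplest rule: drop a word-final inherent schwa when the word contains another vowel. Medial schwa deletion and compound word handling are not attempted.

```python
# Tiny illustrative tables (assumed, far from a complete Hindi inventory).
CONSONANTS = {"क": "k", "म": "m", "ल": "l", "र": "r",
              "न": "n", "स": "s", "त": "t", "प": "p"}
MATRAS = {"\u093e": "aa", "\u093f": "i", "\u0940": "ii", "\u0941": "u",
          "\u0942": "uu", "\u0947": "e", "\u094b": "o"}  # dependent vowel signs
VOWELS = {"अ": "a", "आ": "aa", "इ": "i", "ई": "ii",
          "उ": "u", "ए": "e", "ओ": "o"}                  # independent vowels
VIRAMA = "\u094d"  # suppresses the inherent vowel of a consonant
SCHWA = "a"        # inherent vowel carried by a bare consonant
VOWEL_PHONES = set(MATRAS.values()) | set(VOWELS.values())


def hindi_g2p(word):
    """Convert one Devanagari word to a list of phones (simplified)."""
    phones = []
    last_is_inherent_schwa = False
    i = 0
    while i < len(word):
        ch = word[i]
        nxt = word[i + 1] if i + 1 < len(word) else ""
        if ch in CONSONANTS:
            phones.append(CONSONANTS[ch])
            last_is_inherent_schwa = False
            if nxt in MATRAS:            # explicit vowel sign replaces the schwa
                phones.append(MATRAS[nxt])
                i += 1
            elif nxt == VIRAMA:          # virama: no vowel at all
                i += 1
            else:                        # bare consonant: inherent schwa
                phones.append(SCHWA)
                last_is_inherent_schwa = True
        elif ch in VOWELS:
            phones.append(VOWELS[ch])
            last_is_inherent_schwa = False
        else:
            raise ValueError(f"character not in the toy table: {ch!r}")
        i += 1

    # Word-final schwa deletion, kept only if it is the word's sole vowel.
    vowel_count = sum(1 for p in phones if p in VOWEL_PHONES)
    if last_is_inherent_schwa and vowel_count > 1:
        phones.pop()
    return phones


kamal = hindi_g2p("कमल")
ram = hindi_g2p("राम")
```

With these tables, the written form कमल (literally ka-ma-la) comes out as the phones for "kamal", which is the behaviour final schwa deletion is meant to capture.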
Punjabi Text-To-Speech Synthesis System
A speech-based interface can play a vital role in the successful implementation of computerized systems for the masses. As a tool for this purpose, an effort has been made to develop a Text-To-Speech (TTS) synthesis system for the Punjabi language written in Gurmukhi script. The concatenative method has been used to develop this TTS system. Syllables have been reported as a good choice of speech unit for the speech databases of many languages. Since Punjabi is a syllabic language, the syllable has been selected as the basic speech unit for this TTS system, which preserves within-unit co-articulation effects. The system involves the development of algorithms for pre-processing, schwa deletion and syllabification of the input Punjabi text, as well as a speech database for Punjabi. A syllable-based Punjabi speech database has been developed that stores articulations of syllable sounds at the starting, middle and end positions of the word, in order to produce natural-sounding synthesized speech.
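The position-dependent syllable database described in this abstract can be pictured with a small Python sketch. Everything specific in it is an assumption for illustration: the database is a plain dictionary keyed by (syllable, position), the syllables are hypothetical romanised labels, the "recordings" are short lists of numbers, and a one-syllable word is arbitrarily given the "start" variant.

```python
def syllable_positions(count):
    """Label each syllable of a word as 'start', 'middle' or 'end'."""
    if count <= 1:
        return ["start"] * count  # assumption: lone syllable uses 'start'
    return ["start"] + ["middle"] * (count - 2) + ["end"]


def synthesize_word(syllables, database):
    """Concatenate the position-specific recording of each syllable."""
    wave = []
    for syllable, position in zip(syllables,
                                  syllable_positions(len(syllables))):
        wave.extend(database[(syllable, position)])
    return wave


# Hypothetical database: each value stands in for recorded audio samples.
toy_database = {
    ("pan", "start"): [0.1, 0.2],
    ("ja", "middle"): [0.3],
    ("bi", "end"): [0.4, 0.5],
}

word_wave = synthesize_word(["pan", "ja", "bi"], toy_database)
```

Storing a separate recording per position is what lets a concatenative system pick the variant whose onset and offset match its place in the word, instead of reusing one recording everywhere.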
Design and Development of a Text-To-Speech Synthesizer System
This paper describes the design and development of a TTS system and gives an overview of the different types of synthesis system. One approach to the generation of natural-sounding synthesized speech waveforms is to select and concatenate units from a large speech database. The system uses a syllabication procedure together with phones and diphones. I. Introduction. The speech synthesizer, or text-to-speech synthesizer, is the most widely used system in speech technology. Various text-to-speech synthesizer systems are available, such as Festival, Multilingual and Flite. A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether it was directly introduced into the computer by an operator or scanned and submitted to an Optical Character Recognition (OCR) system. As such, the process of TTS conversion allows the transformation of a string of phonetic and prosodic symbols into a synthetic speech signal. The quality of the result produced by a TTS sy...
A Novel Approach for Designing a Speech Synthesis System for Indian Languages
Abstract: The design of speech synthesizers for Indian languages does not have a long history; it started about a decade ago. Different theoretical approaches have been put forward in this regard. We are designing a Text-To-Speech system for Oriya, Hindi and Bangla. Our present study is based upon the rules of ancient Indian philology derived from Paniniyashikshya and Pratishakshya.