Using Hidden-Markov Model in Speech-Based Education System for the Visually Impaired Learner (original) (raw)
Related papers
Speech user interfaces and contents for e-learning
The system of electronic training on the basis of speech recognition and synthes is is offered in paper. More than 10 years textbooks for blind people were not published in Kazakhstan. Therefore there is a problem of creating speech user interfaces and content for e-learning individuals with weakened vision or limited mobility. Speech recognition will be realized with use of limited dictionary consisting of set of user interface's commands, and speech synthesis will be realized without restriction on text structure and volume, including text of all training material. The speech user interface will allow people weakened vision or limited mobility to overcome classical interface barriers and, by receiving quality education, raise their living standards.
Multimedia Tools and Applications, 2018
In general, visually impaired students need of another person's to teach them with the help of computers and book. However, a number of students are not aware of using the computers and understanding the concepts by self. In order to solve this issue, a speech to speech interaction system is developed on the basis of a novel dialogue management system. This interaction is developed by combining Multimedia tools and Partially Observable Markov Decision Process (POMDP) with agenda based model used in the proposed dialogue management system to learn the speech signals from user and system will reply accordingly. The proposed system helps visually impaired students to learn easily using a novel dialogue Multimed Tools Appl
Intelligent Voice-Based E-Education System: A Framework and Evaluation
International Journal of Computing, 2010
Voice-based web e-Education is a technology-supported learning paradigm that allows phone-access of learners to e-Learning web-based applications. These applications are designed mainly for the visually impaired. They are however lacking in attributes of adaptive and reusable learning objects, which are emerging requirements for applications in these domain. This paper presents a framework for developing intelligent voice-based applications in the context of e-Education. The framework presented supports intelligent components such as adaptation and recommendation services. A prototype Intelligent Voice-based E-Education System (iVEES) was developed and subjected to test by visually impaired users. A usability study was carried out using the International Standard Organization’s (ISO) 9241-11 specification to determine the level of effectiveness, efficiency and user satisfaction. Report of our findings shows that the application is of immense benefit, based on the system’s inherent c...
Human-centric Computing and Information Sciences, 2013
Despite many assistive tools available for browsing the web, blind persons are not able to perform the tasks using internet that are done by persons without such disability. Even the futuristic social networking sites and other websites using the features of web 2.0 indicate a lesser accessible/responsible web. In this paper, we propose a framework, which can be used by the websites of public interest to make their important utilities better accessible and usable to blind web users. The approach is based on providing an alternate access system on the fly using one single website. The framework makes use of existing technologies like JavaScript, available speech APIs etc. and therefore provides a lightweight and robust solution to the accessibility problem. As a case study, we demonstrate the usefulness of the proposed framework by showing its working on a key functionality of the Indian Railways Reservation Website.
An Interactive Intelligent Web-Based Text-to-speech System for the Visually Impaired
2021
Text-to-speech is an AI-based system that converts text into speech. It uses Natural Language Processing and Digital Signal Processing to achieve its goal. The paper describes a web-based system that converts text to speech. It was developed using JavaScript and is compatible with a variety of speech synthesis libraries. The programming language used in this project is light-weight and can support development of web apps. It is also able to convert plain text files into an audio format. The system is able to convert plain text files to audio. This software can be used by users all around the world at the same time and its supported on major desktop and hand-held devices once it is linked to the web. This system will be of great help to lecturers and students in the classroom, e-libraries and computer aided learning for visually impaired users.
Enhanced Speech-Enabled Tools for Intelligent and Mobile E-Learning Applications
Concepts, Methodologies, Tools and Applications
Web-based learning is rapidly becoming the preferred way to quickly, efficiently, and economically create and deliver training or educational content through various communication media. This chapter presents systems that use speech technology to emulate the one-on-one interaction a student can get from a virtual instructor. A Web-based learning tool, the Learn IN Context (LINC+) system, designed and used in a real mixed-mode learning context for a computer (C++ language) programming course taught at the Université de Moncton (Canada) is described here. It integrates an Internet Voice Searching and Navigating (IVSN) system that helps learners to search and navigate both the web and their desktop environment through voice commands and dictation. LINC+ also incorporates an Automatic User Profile Building and Training (AUPB&T) module that allows users to increase speech recognition performance without having to go through the long and fastidious manual training process. New Automated S...
Cued Speech: A visual communication mode for the deaf society
IEICE Electronics Express, 2010
Cued Speech is a visual mode of communication that uses handshapes and placements in combination with the mouth movements of speech to make the phonemes of a spoken language look different from each other and clearly understandable to deaf individuals. The aim of Cued Speech is to overcome the problems of lip reading and thus enable deaf persons to wholly understand spoken language. In this study, automatic phoneme recognition in Cued Speech for French based on hidden Markov model (HMMs) is introduced. The phoneme correct for a normal-hearing cuer was 82.9%, and for a deaf 81.5%. The results also showed, that creating cuer-independent HMMs should not face any specific difficulties, other than those occured in audio speech recognition.