Sahar Rauf | University of Engineering & Technology Lahore, Pakistan (original) (raw)

Papers by Sahar Rauf

Research paper thumbnail of A Multi-Genre Urdu Broadcast Speech Recognition System

2021 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2021

This paper reports the development of a multi-genre Urdu Broadcast (BC) corpus and a Large Vocabu... more This paper reports the development of a multi-genre Urdu Broadcast (BC) corpus and a Large Vocabulary Continuous Speech Recognition (LVCSR) system. BC speech corpus of 98 hours from 453 speakers is collected and annotated. For acoustic modeling, Time-delay Neural Network (TDNN) is developed with prior Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) training and alignments. For the language model, 3-gram, 4-gram and Recurrent Neural Network (RNN) based models are developed on a text corpus of 188 million words. The developed models are tested on 4.3 hours of unseen BC multi-genre speech dataset and the best Word Error Rate (WER) 18.59% is achieved using RNN based Language Model (LM). Moreover, a detailed word error analysis is carried out to compare the errors made by humans and the Automatic Speech Recognition (ASR) System. The results showed a similar behavior of word misrecognitions by both humans and ASR.

Research paper thumbnail of Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis

IEEE Access, 2021

Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews ana... more Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews analysis, popularity analysis of electoral candidates, hate speech detection and similar applications. Sentiment analysis on tweets encounters challenges such as highly skewed classes, high dimensional feature vectors and highly sparse data. In this study, we have analyzed the improvement achieved by successively addressing these problems in order to determine their severity for sentiment analysis of tweets. Firstly, we prepared a comprehensive data set consisting of Urdu Tweets for sentiment analysis-based hate speech detection. To improve the performance of the sentiment classifier, we employed dynamic stop words filtering, Variable Global Feature Selection Scheme (VGFSS) and Synthetic Minority Optimization Technique (SMOTE) to handle the sparsity, dimensionality and class imbalance problems respectively. We used two machine learning algorithms i.e., Support Vector Machines (SVM) and Multinomial Naïve Bayes' (MNB) for investigating performance in our experiments. Our results show that addressing class skew along with alleviating the high dimensionality problem brings about the maximum improvement in the overall performance of the sentiment analysis-based hate speech detection. INDEX TERMS Sentiment analysis, hate speech, data sparsity, highly skewed classes, high-dimensional feature vector.

Research paper thumbnail of Multitier Annotation of Urdu Speech Corpus

This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.

Research paper thumbnail of District names speech corpus for Pakistani Languages

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

This paper presents a speech corpus that is developed for Urdu automatic speech recognition (ASR)... more This paper presents a speech corpus that is developed for Urdu automatic speech recognition (ASR) system. The corpus comprises of single word utterances fixed vocabulary consisting of district names of Pakistan. The data is recorded over a telephone channel from all over Pakistan to cover six major accents; Punjabi, Urdu, Saraiki, Pashto, Sindhi, and Balochi. The data was collected in challenging acoustic environments; the major issues were silence, background noise and alternate pronunciations, which can affect the performance of the system. In order to address these issues, comprehensive data verification and cleaning guidelines are presented. The proposed process serves as a data preprocessing step for the development of ASR, which is successfully integrated in an Urdu dialog system to provide weather information of Pakistan.

Research paper thumbnail of Multitier Annotation of Urdu Speech Corpus

This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.

Research paper thumbnail of Enhancing Large Vocabulary Continuous Speech Recognition System for Urdu-English Conversational Code-Switched Speech

2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2020

This paper presents first step towards Large Vocabulary Continuous Speech Recognition (LVCSR) sys... more This paper presents first step towards Large Vocabulary Continuous Speech Recognition (LVCSR) system for Urdu-English code-switched conversational speech. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. English, on the other hand, is official language of Pakistan and commonly mixed with Urdu in daily communication. Urdu, being under-resourced language, have no substantial Urdu-English code-switched corpus in hand to develop speech recognition system. In this research, readily available spontaneous Urdu speech corpus (25 hours) is revised to use it for enhancement of read speech Urdu LVCSR to recognize code-switched speech. This data set is split into 20 hours of train and 5 hours of test set. 10 hours of Urdu BroadCast (BC) data are collected and annotated in a semi-supervised way to enhance the system further. For acoustic modeling, state-of-the-art DNN-HMM modeling technique is used without any prior GMM-HMM training and alignments...

Research paper thumbnail of URDU Speech Corpora for Banking Sector in Pakistan

2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, 2018

This research describes an effort to build Urdu speech corpora for the banking sector in Pakistan... more This research describes an effort to build Urdu speech corpora for the banking sector in Pakistan. We have designed speech corpora to develop debit card activation ASR and these corpora are comprised of eight types of corpora mainly debit card number corpus, expiry date corpus, last four digit corpus, months' name, date of birth corpus, account type and Urdu-counting corpus. These corpora contain telephone speech in read style obtained from more than 400 speakers specifically in Punjabi accent in both outdoor and indoor environments, including offices, homes, banks, and universities. The speech is automatically annotated and manually verified at sentence tier and reports 98% inter-annotator accuracy. In this paper, we report the design, recording and annotation process of speech corpora that serve as a data development step for ASR, and will be integrated in debit card activation service in banking sector of Pakistan.

Research paper thumbnail of Corpus of Aspect-based Sentiment for Urdu Political Data

We present a corpus of Urdu political data annotated at aspect and sentiment level. The corpus co... more We present a corpus of Urdu political data annotated at aspect and sentiment level. The corpus contains 8760 tweets regarding four different aspects (Members, Projects, Party and Actions) of three political parties (PTI, PMLN and PPP) of Pakistan. We also present the results of a baseline system developed using the corpus for analyzing its reliability. It can be seen that the classifiers have achieved reasonable scores for aspects categorization and sentiment classification tasks.

Research paper thumbnail of A Sentiment Lexicon for Urdu

Sentiment analysis is a data mining technique, which measures the inclination of people’s opinion... more Sentiment analysis is a data mining technique, which measures the inclination of people’s opinions. Recent studies have shown that the sentiment lexicon can be developed using automatic and manual tagging techniques. The seminal works on Urdu lexicon done so far do not actually denote a broad Lickert scale for data tagging and also do not cover all the open word classes. The current study aims to develop a sentiment lexicon and test its validity using manual and automatic methods. The dictionary-based method is used to design this lexicon using three authentic Urdu dictionaries. The data was tagged on a five point lickert scale i.e. -2 to +2 using the formulated guidelines. The lexicon is composed of four-word classes namely nouns, verbs, adjectives and adverbs. Once the lexicon was developed using manual tagging techniques it was tested both manually and automatically. The manual testing yielded an inter annotator agreement of 75% while the automatic testing included the comparing ...

Research paper thumbnail of Urdu speech corpus for travel domain

Speech corpus is a collection of recorded speech data. It is a fundamental component to develop a... more Speech corpus is a collection of recorded speech data. It is a fundamental component to develop any speech based applications. This paper presents the design and development of an Urdu speech corpus for travel domain. The corpus consists of a total of 250 vocabulary items including city names, days, time and numbers. The corpus has been used to develop a speech recognition system with a laboratory accuracy of 95.6% and a field accuracy of 87.21%. The corpus can be used for domestic flight queries, domestic flight reservation, train inquiry service and reservation and bus information and reservation.

Research paper thumbnail of Acoustic Investigation of /l, j, v/ as Approximants in Urdu

This presented research work aims to investigate the acoustic properties of /l, j, and v/ in Urdu... more This presented research work aims to investigate the acoustic properties of /l, j, and v/ in Urdu as approximants. For the acoustic analysis of /l, j, and v/, data has been recorded from 4 native speakers of Urdu (2 males and 2 females) and total 280 utterances of approximants have been recorded at three positions of word i.e word initial, middle and final. Two experiments have been conducted using PRAAT to investigate the acoustic properties of approximants in Urdu; first experiment is based on the spectrogram analysis of approximants and second experiment analyzes the periodicity level of these approximants. The second experiment is conducted by calculating the median of Harmonicity to Noise Ratio (HNR) values of these sounds. The analysis indicates that approximants in Urdu also behave like fricatives. Moreover, this research also explores the controversial issue about the existence of aspirated approximants i.e. / l, j, v /. Results indicate that /j/ is no more used by Urdu spea...

Research paper thumbnail of Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis

Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews ana... more Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews analysis, popularity analysis of electoral candidates, hate speech detection and similar applications. Sentiment analysis on tweets encounters challenges such as highly skewed classes, high dimensional feature vectors and highly sparse data. In this study, we have analyzed the improvement achieved by successively addressing these problems in order to determine their severity for sentiment analysis of tweets. Firstly, we prepared a comprehensive data set consisting of Urdu Tweets for sentiment analysis-based hate speech detection. To improve the performance of the sentiment classifier, we employed dynamic stop words filtering, Variable Global Feature Selection Scheme (VGFSS) and Synthetic Minority Optimization Technique (SMOTE) to handle the sparsity, dimensionality and class imbalance problems respectively. We used two machine learning algorithms i.e., Support Vector Machines (SVM) and Multinomial Naïve Bayes’ (MNB) for investigating performance in our experiments. Our results show that addressing class skew along with alleviating the high dimensionality problem brings about the maximum improvement in the overall performance of the sentiment analysis-based hate speech detection.

Research paper thumbnail of Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

Interspeech 2019

Development of Large Vocabulary Continuous Speech Recognition (LVCSR) system is a cumbersome task... more Development of Large Vocabulary Continuous Speech Recognition (LVCSR) system is a cumbersome task, especially for low resource languages. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. Due to resource scarcity, limited work has been done in the domain of Urdu speech recognition. In this paper, collection of Urdu speech corpus and development of Urdu speech recognition system is presented. Urdu LVCSR is developed using 300 hours of read speech data with a vocabulary size of 199K words. Microphone speech is recorded from 1671 Urdu and Punjabi speakers in both indoor and outdoor environments. Different acoustic modeling techniques such as Gaussian Mixture Models based Hidden Markov Models (GMM-HMM), Time Delay Neural Networks (TDNN), Long-Short Term Memory (LSTM) and Bidirectional Long-Short Term Memory (BLSTM) networks are investigated. Cross entropy and Lattice Free Maximum Mutual Information (LF-MMI) objective functions are employed during acoustic modeling. In addition, Recurrent Neural Network Language Model (RNNLM) is also being used for re-scoring. Developed speech recognition system has been evaluated on 9.5 hours of collected test data and a minimum Word Error Rate (%WER) of 13.50% is achieved.

Research paper thumbnail of A Multi-Genre Urdu Broadcast Speech Recognition System

2021 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2021

This paper reports the development of a multi-genre Urdu Broadcast (BC) corpus and a Large Vocabu... more This paper reports the development of a multi-genre Urdu Broadcast (BC) corpus and a Large Vocabulary Continuous Speech Recognition (LVCSR) system. BC speech corpus of 98 hours from 453 speakers is collected and annotated. For acoustic modeling, Time-delay Neural Network (TDNN) is developed with prior Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) training and alignments. For the language model, 3-gram, 4-gram and Recurrent Neural Network (RNN) based models are developed on a text corpus of 188 million words. The developed models are tested on 4.3 hours of unseen BC multi-genre speech dataset and the best Word Error Rate (WER) 18.59% is achieved using RNN based Language Model (LM). Moreover, a detailed word error analysis is carried out to compare the errors made by humans and the Automatic Speech Recognition (ASR) System. The results showed a similar behavior of word misrecognitions by both humans and ASR.

Research paper thumbnail of Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis

IEEE Access, 2021

Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews ana... more Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews analysis, popularity analysis of electoral candidates, hate speech detection and similar applications. Sentiment analysis on tweets encounters challenges such as highly skewed classes, high dimensional feature vectors and highly sparse data. In this study, we have analyzed the improvement achieved by successively addressing these problems in order to determine their severity for sentiment analysis of tweets. Firstly, we prepared a comprehensive data set consisting of Urdu Tweets for sentiment analysis-based hate speech detection. To improve the performance of the sentiment classifier, we employed dynamic stop words filtering, Variable Global Feature Selection Scheme (VGFSS) and Synthetic Minority Optimization Technique (SMOTE) to handle the sparsity, dimensionality and class imbalance problems respectively. We used two machine learning algorithms i.e., Support Vector Machines (SVM) and Multinomial Naïve Bayes' (MNB) for investigating performance in our experiments. Our results show that addressing class skew along with alleviating the high dimensionality problem brings about the maximum improvement in the overall performance of the sentiment analysis-based hate speech detection. INDEX TERMS Sentiment analysis, hate speech, data sparsity, highly skewed classes, high-dimensional feature vector.

Research paper thumbnail of Multitier Annotation of Urdu Speech Corpus

This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.

Research paper thumbnail of District names speech corpus for Pakistani Languages

2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

This paper presents a speech corpus that is developed for Urdu automatic speech recognition (ASR)... more This paper presents a speech corpus that is developed for Urdu automatic speech recognition (ASR) system. The corpus comprises of single word utterances fixed vocabulary consisting of district names of Pakistan. The data is recorded over a telephone channel from all over Pakistan to cover six major accents; Punjabi, Urdu, Saraiki, Pashto, Sindhi, and Balochi. The data was collected in challenging acoustic environments; the major issues were silence, background noise and alternate pronunciations, which can affect the performance of the system. In order to address these issues, comprehensive data verification and cleaning guidelines are presented. The proposed process serves as a data preprocessing step for the development of ASR, which is successfully integrated in an Urdu dialog system to provide weather information of Pakistan.

Research paper thumbnail of Multitier Annotation of Urdu Speech Corpus

This paper describes the multi-level annotation process of Urdu speech corpus and its quality ass... more This paper describes the multi-level annotation process of Urdu speech corpus and its quality assessment using PRAAT. The annotation of speech corpus has been done at phoneme, word, syllable and break index levels. Phoneme, word and break index level annotation has been done manually by trained linguists whereas syllable-tier annotation has been done automatically using template matching algorithm. The mean accuracy achieved at phoneme and break index label and boundary identification is 79.07% and 89.67% respectively. The quality assessment of word and syllable tiers is still under investigation.

Research paper thumbnail of Enhancing Large Vocabulary Continuous Speech Recognition System for Urdu-English Conversational Code-Switched Speech

2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2020

This paper presents first step towards Large Vocabulary Continuous Speech Recognition (LVCSR) sys... more This paper presents first step towards Large Vocabulary Continuous Speech Recognition (LVCSR) system for Urdu-English code-switched conversational speech. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. English, on the other hand, is official language of Pakistan and commonly mixed with Urdu in daily communication. Urdu, being under-resourced language, have no substantial Urdu-English code-switched corpus in hand to develop speech recognition system. In this research, readily available spontaneous Urdu speech corpus (25 hours) is revised to use it for enhancement of read speech Urdu LVCSR to recognize code-switched speech. This data set is split into 20 hours of train and 5 hours of test set. 10 hours of Urdu BroadCast (BC) data are collected and annotated in a semi-supervised way to enhance the system further. For acoustic modeling, state-of-the-art DNN-HMM modeling technique is used without any prior GMM-HMM training and alignments...

Research paper thumbnail of URDU Speech Corpora for Banking Sector in Pakistan

2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, 2018

This research describes an effort to build Urdu speech corpora for the banking sector in Pakistan... more This research describes an effort to build Urdu speech corpora for the banking sector in Pakistan. We have designed speech corpora to develop debit card activation ASR and these corpora are comprised of eight types of corpora mainly debit card number corpus, expiry date corpus, last four digit corpus, months' name, date of birth corpus, account type and Urdu-counting corpus. These corpora contain telephone speech in read style obtained from more than 400 speakers specifically in Punjabi accent in both outdoor and indoor environments, including offices, homes, banks, and universities. The speech is automatically annotated and manually verified at sentence tier and reports 98% inter-annotator accuracy. In this paper, we report the design, recording and annotation process of speech corpora that serve as a data development step for ASR, and will be integrated in debit card activation service in banking sector of Pakistan.

Research paper thumbnail of Corpus of Aspect-based Sentiment for Urdu Political Data

We present a corpus of Urdu political data annotated at aspect and sentiment level. The corpus co... more We present a corpus of Urdu political data annotated at aspect and sentiment level. The corpus contains 8760 tweets regarding four different aspects (Members, Projects, Party and Actions) of three political parties (PTI, PMLN and PPP) of Pakistan. We also present the results of a baseline system developed using the corpus for analyzing its reliability. It can be seen that the classifiers have achieved reasonable scores for aspects categorization and sentiment classification tasks.

Research paper thumbnail of A Sentiment Lexicon for Urdu

Sentiment analysis is a data mining technique, which measures the inclination of people’s opinion... more Sentiment analysis is a data mining technique, which measures the inclination of people’s opinions. Recent studies have shown that the sentiment lexicon can be developed using automatic and manual tagging techniques. The seminal works on Urdu lexicon done so far do not actually denote a broad Lickert scale for data tagging and also do not cover all the open word classes. The current study aims to develop a sentiment lexicon and test its validity using manual and automatic methods. The dictionary-based method is used to design this lexicon using three authentic Urdu dictionaries. The data was tagged on a five point lickert scale i.e. -2 to +2 using the formulated guidelines. The lexicon is composed of four-word classes namely nouns, verbs, adjectives and adverbs. Once the lexicon was developed using manual tagging techniques it was tested both manually and automatically. The manual testing yielded an inter annotator agreement of 75% while the automatic testing included the comparing ...

Research paper thumbnail of Urdu speech corpus for travel domain

Speech corpus is a collection of recorded speech data. It is a fundamental component to develop a... more Speech corpus is a collection of recorded speech data. It is a fundamental component to develop any speech based applications. This paper presents the design and development of an Urdu speech corpus for travel domain. The corpus consists of a total of 250 vocabulary items including city names, days, time and numbers. The corpus has been used to develop a speech recognition system with a laboratory accuracy of 95.6% and a field accuracy of 87.21%. The corpus can be used for domestic flight queries, domestic flight reservation, train inquiry service and reservation and bus information and reservation.

Research paper thumbnail of Acoustic Investigation of /l, j, v/ as Approximants in Urdu

This presented research work aims to investigate the acoustic properties of /l, j, and v/ in Urdu... more This presented research work aims to investigate the acoustic properties of /l, j, and v/ in Urdu as approximants. For the acoustic analysis of /l, j, and v/, data has been recorded from 4 native speakers of Urdu (2 males and 2 females) and total 280 utterances of approximants have been recorded at three positions of word i.e word initial, middle and final. Two experiments have been conducted using PRAAT to investigate the acoustic properties of approximants in Urdu; first experiment is based on the spectrogram analysis of approximants and second experiment analyzes the periodicity level of these approximants. The second experiment is conducted by calculating the median of Harmonicity to Noise Ratio (HNR) values of these sounds. The analysis indicates that approximants in Urdu also behave like fricatives. Moreover, this research also explores the controversial issue about the existence of aspirated approximants i.e. / l, j, v /. Results indicate that /j/ is no more used by Urdu spea...

Research paper thumbnail of Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis

Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews ana... more Sentiment Analysis is a technique that is being used abundantly nowadays for customer reviews analysis, popularity analysis of electoral candidates, hate speech detection and similar applications. Sentiment analysis on tweets encounters challenges such as highly skewed classes, high dimensional feature vectors and highly sparse data. In this study, we have analyzed the improvement achieved by successively addressing these problems in order to determine their severity for sentiment analysis of tweets. Firstly, we prepared a comprehensive data set consisting of Urdu Tweets for sentiment analysis-based hate speech detection. To improve the performance of the sentiment classifier, we employed dynamic stop words filtering, Variable Global Feature Selection Scheme (VGFSS) and Synthetic Minority Optimization Technique (SMOTE) to handle the sparsity, dimensionality and class imbalance problems respectively. We used two machine learning algorithms i.e., Support Vector Machines (SVM) and Multinomial Naïve Bayes’ (MNB) for investigating performance in our experiments. Our results show that addressing class skew along with alleviating the high dimensionality problem brings about the maximum improvement in the overall performance of the sentiment analysis-based hate speech detection.

Research paper thumbnail of Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

Interspeech 2019

Development of Large Vocabulary Continuous Speech Recognition (LVCSR) system is a cumbersome task... more Development of Large Vocabulary Continuous Speech Recognition (LVCSR) system is a cumbersome task, especially for low resource languages. Urdu is the national language and lingua franca of Pakistan, with 100 million speakers worldwide. Due to resource scarcity, limited work has been done in the domain of Urdu speech recognition. In this paper, collection of Urdu speech corpus and development of Urdu speech recognition system is presented. Urdu LVCSR is developed using 300 hours of read speech data with a vocabulary size of 199K words. Microphone speech is recorded from 1671 Urdu and Punjabi speakers in both indoor and outdoor environments. Different acoustic modeling techniques such as Gaussian Mixture Models based Hidden Markov Models (GMM-HMM), Time Delay Neural Networks (TDNN), Long-Short Term Memory (LSTM) and Bidirectional Long-Short Term Memory (BLSTM) networks are investigated. Cross entropy and Lattice Free Maximum Mutual Information (LF-MMI) objective functions are employed during acoustic modeling. In addition, Recurrent Neural Network Language Model (RNNLM) is also being used for re-scoring. Developed speech recognition system has been evaluated on 9.5 hours of collected test data and a minimum Word Error Rate (%WER) of 13.50% is achieved.