Development of Speaker-Independent Automatic Speech Recognition System for Kannada Language

Kumar, Praveen; Jayanna, H S

Indian Literature Database on Communication Disorders

Home

Categories &
Resource Types

Author

Title

Year

Subject

Please use this identifier to cite or link to this item: http://localhost:8080//handle/123456789/3006

Title:	Development of Speaker-Independent Automatic Speech Recognition System for Kannada Language
Authors:	Kumar, Praveen Jayanna, H S
Keywords:	DNN;Continuous speech;HMM;Kannada dialect;Kaldi toolkit;monophone;triphone;WER
Issue Date:	2022
Journal Name:	Indian Journal of Science and Technology
Volume No.:	15
Issue No.:	8
Pages:	333-342
Abstract:	Objectives: The primary goal is to address attempts to establish a Continuous Speech Recognition (CSR) framework for recognising continuous speech in Kannada. It is a difficult challenge to deal with a local language such as Kannada, which lacks the resources of a single language database. Methods: Modelling techniques such as monophone, triphone, deep neural network (DNN)-hidden Markov model (HMM) and Gaussian Mixture Model (GMM)- HMM-based models were implemented in Kaldi toolkit and used for continuous Kannada speech recognition (CKSR). To extract feature vectors from speech data, the Mel frequency Cepstral (MFCC) coefficient technique is used. The continuous Kannada speech database consists of 2800 speakers (1680 males and 1120 females) belong to the age group 8 years to 80 years. The training and testing data are in the ratio 80:20. In this paper the hybrid modelling techniques are implemented to recognize the spoken words. Findings: The model efficiency is determined based on the word error rate (WER) and the obtained results are assessed with the well-known datasets such as TIMIT and Aurora-4. This study found that using Kaldi-based features ex- traction recipes for monophone, triphone, DNN-HMM and GMM-HMM acoustic models had a word error rate (WER) of 8.23%, 5.23%, 4.05% and 4.64% respectively. The experimental results suggest that the rate of recognition of Kannada speech data has increased higher than that of state-of-the-art databases. Novelty : We propose a novel automatic speech recognition system for Kannada language. The main reason for developing the automatic speech recognition system for Kannada language is that there are only limited sources of standard continuous Kannada speech are available. We created large vocabulary Kannada database. We implemented monophone, triphone, Subspace Gaussian mixture model (SGMM) and hybrid modelling techniques to develop the automatic speech recognition system for Kannada language.
URI:	http://localhost:8080//handle/123456789/3006
ISSN:	0974-5645
Appears in Resource:	Journal Articles

Files in This Item:

There are no files associated with this item.

Show full item record

Indian Literature Databaseon Communication Disorders

Indian Literature Database
on Communication Disorders