Deep Learning Based Automatic Speech Recognition for Turkish

Using Deep Neural Networks (DNN) as an advanced Artificial Neural Networks (ANN) has become widespread with the development of computer technology. Although DNN has been applied for solving Automatic Speech Recognition (ASR) problem in some languages, DNN-based Turkish Speech Recognition has not bee...

Full description

Saved in:
Bibliographic Details
Main Authors: Hamit Erdem, Burak Tombaloğlu
Format: Article
Language:English
Published: Sakarya University 2020-08-01
Series:Sakarya Üniversitesi Fen Bilimleri Enstitüsü Dergisi
Subjects:
Online Access:https://dergipark.org.tr/tr/download/article-file/1210062
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Using Deep Neural Networks (DNN) as an advanced Artificial Neural Networks (ANN) has become widespread with the development of computer technology. Although DNN has been applied for solving Automatic Speech Recognition (ASR) problem in some languages, DNN-based Turkish Speech Recognition has not been studied extensively. Turkish language is an agglutinative and a phoneme-based language. In this study, a Deep Belief Network (DBN) based Turkish phoneme and speech recognizer is developed. The proposed system recognizes words in the system vocabulary and phoneme components of out of vocabulary (OOV) words. Sub-word (morpheme) based language modelling is implemented into the system. Each phoneme of Turkish language is also modelled as a sub-word in the model. Sub-word (morpheme) based language model is widely used for agglutinative languages to prevent excessive vocabulary size. The performance of the suggested DBN based ASR system is compared with the conventional recognition method, GMM (Gaussian Mixture Method) based Hidden Markov Model (HMM). Regarding to performance metrics, the recognition rate of Turkish language is improved in compare with previous studies.
ISSN:2147-835X