A Spoken Word Boundaries Detection Strategy for Voice Command Recognition

Título: A Spoken Word Boundaries Detection Strategy for Voice Command Recognition

Autores: Peretta, Igor S.; Lima, Gerson F. M.; Tavares, Josimeire A.; Yamanaka, Keiji

Resumo: The use of voice commands as a new way of interaction between man and machine is the subject of several researches in recent years and has already been produced commercial and freeware applications. However, considering the achieved results, there is still a great development potential in this area, particularly in Brazilian Portuguese language. This work proposes: 1. an efficient method of detecting spoken word boundaries from a recorded signal, using Teager Energy Operator and FIR Filter; 2. the use of wavelet transform and wavelet packet filter bank as a main tool for feature extraction to feed a multi-layer artificial neural network to recognize a limited vocabulary of voice commands. The system was developed using a dataset of spoken words from 50 speakers, using normal pronunciation speed and in an environment without any noise control. Tests with the system show a very good classification rate and noise robustness.

Palavras-chave: Voice command recognition; spoken word boundaries detection; teager energy operator; discrete wavelet transform; wavelet packet filter bank; artificial neural network

Páginas: 9

Código DOI: 10.21528/lmln-vol8-no3-art3

Artigo em PDF: vol8-no3-art3.pdf

Arquivo BibTex: vol8-no3-art3.bib