Welcome to EPrints UTCC

Amomkul, P. and Kumhom, P. and Chanmongthai, K. (2005) N-Best Decision for Thai Stressed Speech Recognition with Parallel Hidden Makov Model. UTCC Engineering Research Papers. ISSN 1906-1625


Download (21kB) | Preview


In integrating multi-isolated-word recognizers into a speech recognition for various stressed speeches, the best likelihood scopes as outputs of each recognizer are not guaranteed a correct recognition result. Since training sometimes does not cover all speakers, likelihood score of the correct recognition result is not the best and causes misrecognition. Moreover, the difference among recognizers also leads to mis-understanding. This paper proposes a decision-making method for Thai stressed speech recognition with parallel hidden Markov model. In this method, a voting scheme is applied on the words with the N-best likelihood score. Firstly, if the score margin between the first and the second-best is greater than a threshold, the voting is applied on the words with highest scores from each recognizer. If there is no clear winner, decided by considering the voting score, the next best score are included into the voting scheme. The process goes on until a winner is found or there is a tied score, its which case the average of the likelihood score of each tied word is used to decide the winner. The experiments were conducted with 4-stress speeches including angry, toward, loud, and neutral. It showed that the proposed method helped increase the recognition rate to 96.545% comparing with previous decision making techniques.

Item Type: Article
Subjects: Engineering > Electronics and Telecommunication Engineering
Divisions: School > School of Engineering
Depositing User: Kitikun Pongsak
Date Deposited: 16 May 2014 21:46
Last Modified: 16 May 2014 21:46
URI: http://eprints.utcc.ac.th/id/eprint/918

Actions (login required)

View Item View Item