• Aucun résultat trouvé

Improving Speech Recognition through Automatic Selection of Age Group Specific Acoustic Models

N/A
N/A
Protected

Academic year: 2021

Partager "Improving Speech Recognition through Automatic Selection of Age Group Specific Acoustic Models"

Copied!
13
0
0

Texte intégral

Loading

Figure

Table  1.  The  main  statistics  of  the  data  used  to  train  the  age  group  classifier  for  children  (CNG), young to middle-aged adults (BD-PUBLICO and YMA-a) and the elderly (EASR), and  to optimise acoustic models for children (CNG) and the elde
Table  4. The acoustic feature set used in the age group classifiers: 65 Low Level Descriptors  (LLDs)
Fig.  1. Age group classificatio for the SVM models trained w features
Table 8 illustrates the ASR results for the experimental set-ups described in Section 4.3

Références

Documents relatifs

h ] is above a given threshold for one word or for time segment containing a sequence of words and pauses, then a new set of features is computed in that segment and recognition is

Specific probabilities are introduced that a phoneme hypothesis is correct given the phonemes that would be hypothesized with two different feature sets and their posterior

The improvement is 1% relative to the system using the additional voicing measure which is less than the improvement obtained when combining the spectrum derivative feature solely

To measure the spectral mismatch between adult and children’s speech caused by differences in vocal tract length, we computed the mean warping factor for speakers in the ChildIt

Linguistic features model the information given by the transcription of the spoken words or by word hy- potheses obtained from automatic speech recognition (AS R) of the

On the other hand, the effect of the vocal tract shape on the intrinsic variability of the speech signal between different speakers has been widely studied and many solutions to

The methodology consists in introducing semantic information by using a class-based statistical language model for which classes directly correspond to IF entries.. With

Acoustic Model Merging Using Acoustic Models from Multilingual Speakers for Automatic Speech Recognition.. Tien Ping Tan, Laurent Besacier,