Index Terms—Speech recognition

New Paradigm in Speech Recognition: Deep Neural Networks

... Index Terms: speech recognition, deep neural network, acoustic modeling. I. INTRODUCTION. More and more information appear on Internet each day. And more and more information is asked by users. ...

8

Belief Hidden Markov Model for speech recognition

... Index Terms: Speech recognition, HMM, Belief functions, Belief ... automatic speech recognition is a domain of science that attracts the attention of the ... The speech ...

6

Incorporating Named Entity Recognition into the Speech Transcription Process

... Index Terms: Named Entity Recognition, Automatic Speech Recognition, language modeling, ASR vocabulary ... Entity Recognition (NER) from speech is mainly performed by ...

6

Fast Two-Level-Dynamic-Programming Algorithm For Speech Recognition

... fast speech recognition Without doubting the accuracy or effectiveness of Stage 1 (DSP), the focus of this paper is to have the second stage running a fast TLDP (to help find phone boundaries [8]), but ...

6

Closed-loop auditory-based representation for robust speech recognition

... Two major differences between the closed-loop model and an open-loop procedure can be distinguished: namely, the dynamic range window (DRW) and the gain control. In fact, the ...

96

Speech and Speaker Recognition for Home Automation: Preliminary Results

... detected speech events were misclassified as everyday life sound, and some noise were misclassified as speech (bell ring, music, ...two speech data sets (manual ...the speech synthesizer, 10 ...

11

Multimodal Mathematical Expressions Recognition: Case of Speech and Handwriting

... 2 Global Overview of the Proposed Method. We propose in this work a combined system composed of two specialized ones: an online handwritten ME system and a speech ...

11

Using Speech to Search: Comparing Built-in and Ambient Speech Search in Terms of Privacy and User Experience

... Prototype, Participants and Procedure For this experiment a prototypical interactive TV system (programmed in Unity3D) with functions like live TV, electronic program guide (EPG), video on demand (VoD) or weather ...

10

Automatic refinement of hidden Markov models for speech recognition

... Before pronunciation network generation was performed, a recognizer equipped with TIMIT-based models was applied to the Susan training sentences and the resulting tra ...

62

Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition

... truth speech activity detection (SAD) labels from Track 3 of the DIHARD-II speaker diarization challenge [23] are used, as these are more reliable than the SAD labels originally provided in ...the speech ...

6

Amharic Speech Recognition for Speech Translation

... work Speech recognition and translation is a field which has been and being researched for more than a decade for most of the resourced languages like English and most European ...Amharic speech ...

12

Automatic acquisition of language models for speech recognition

... Another very surprising result from Table 5-6 is the substantial difference in word accuracy for the word network and trigram N-best integration. The direct word network ...

141

Semantic Similarity for Detecting Recognition Errors in Automatic Speech Transcripts

... We did a pre-screening of the various semantic similarity measures in order to choose the one measure of each type (dictionary-based and corpus-based) that seemed most promising for our task of detecting semantic ...

10

Deep neural network adaptation for children's and adults' speech recognition

... best recognition performances when the operating (or testing) conditions are consistent with the training ... on speech data from all groups of speakers can then be used directly as initialisation to ...

6

Applications of broad class knowledge for noise robust speech recognition

... acoustically motivated broad classes in designing a robust landmark detection and segmentation algorithm, while Chapter 5 discusses using broad class knowledge in isla ...

164

Advances in deep learning methods for speech recognition and understanding

... 1.3 The Role of Speech Processing in the Development of Artificial Intelligence. The verbal skills is one of the definitive feature of human intelligence. Without the development of complex communication via ...

108

Automatic Speech Recognition for African Languages with Vowel Length Contrast

... in speech recognition is concerned, 12 modeled word duration at the acoustic modeling ... the speech recognition system by rescoring N-best lists with the duration models ...

9

Relationship between the privacy index and the speech privacy class

... of speech intelligibility (and thereby distraction) is usually ... Articulation Index (AI) (or rather its complementary metric, the Privacy Index, PI) determined according to ASTM E1130 ...

9

Speech Communication Automatic speech emotion recognition using an optimal combination of features based on EMD-TKEO

... data, it is possible for large inputs to slow down the learning and convergence, in some cases prevents the used classifier from effectively learning for the classification problem. The effect of speaker normalization ...

31

Measurement of speech privacy of closed rooms using ASTM E2638 and setting criteria in terms of speech privacy class

... of speech privacy provided by an enclosed room refers to the extent to which conversations occurring inside are protected from overhearing by people outside, in the adjoining building ...the Speech Privacy ...

5
