• Aucun résultat trouvé

Portability and Language Dependencies

Large Vocabulary Speech Recognition Based on Statistical Methods

5.8 Portability and Language Dependencies

5VCVKUVKECNN[DCUGF URGGEJ TGEQIPKVKQP VGEJPQNQI[ JCU DGGP UWEEGUUHWNN[ GORNQ[GF HQT C XCTKGV[ QH VCUMU CPF NCPIWCIGU 6JG RQTVKPI QH C .8%54 U[UVGO VQ C PGY VCUM QT CPQVJGT NCPIWCIG TGSWKTGU VJG CXCKNCDKNKV[ QH UWHſEKGPV COQWPVU QH VTCPUETKDGF VTCKPKPI FCVC CPF KPXQNXGU UWDUVCPVKCN GHHQTV VQ EQPUVTWEV VJG CEQWUVKE CPF NCPIWCIG OQFGNU CPF VQ FGXGNQR VJG TGEQIPKVKQP NGZKEQP 1HVGP JQYGXGT VJG PGEGUUCT[ TG UQWTEGU CTG PQV CXCKNCDNG CPF IGPGTCVKPI VJGO ECP DG NQPI CPF GZRGPUKXG

4GEGPV GHHQTVU JCXG DGGP FKTGEVGF CV FGXGNQRKPI IGPGTKE TGEQIPKVKQP OQFGNU CPF VJG WUG QH WPCPPQVCVGF FCVC HQT VTCKPKPI RWTRQUGU KP CP CKO VQ TGFWEG VJG TGNKCPEG QP OCPWCNN[ CPPQVCVGF VTCKPKPI EQTRQTC CPF TGFWEKPI FGXGNQROGPV EQUVU =? /GVJQFU VQ KORTQXG IGPGTCNKV[ QH VJG OQFGNU CTG WPFGT KPXGUVKICVKQP DWV VJG RTQDNGO KU HCT HTQO DGKPI UQNXGF

#NVJQWIJ 'PINKUJ JCU DGGP VJG RTGFQOKPCPV NCPIWCIG HQT VJG EQORWVGT YQTNF VJGTG JCU DGGP C NCTIG ITQYVJ KP VJG COQWPV QH KPHQTOCVKQP CXCKNCDNG KP GNGEVTQPKE HQTO KP OCP[ QH VJG YQTNFŏU NCPIWCIGU $WKNFKPI C TGEQIPK\GT HQT CPQVJGT NCPIWCIG KU PQV UQ FKHHGTGPV VJCP DWKNFKPI C TGEQIPK\GT HQT C PGY VCUM RCTVKEWNCTN[ HQT ENQUG NCPIWCIGU .CPIWCIGFGRGPFGPV U[UVGO EQORQPGPVU UWEJ CU VJG RJQPG UGV VJG PGGF HQT RTQ PWPEKCVKQP CNVGTPCVKXGU QT RJQPQNQIKECN TWNGU GXKFGPVN[ OWUV DG EJCPIGF 1VJGT NCPIWCIG FGRGPFGPV HCEVQTU CTG TGNCVGF VQ VJG FGſPKVKQP CPF CEQWUVKE EQPHWUKDKNKV[ QH VJG YQTFU KP VJG NCPIWCIG UWEJ CU JQOQRJQPG OQPQRJQPG CPF EQORQWPF YQTF TCVGU CPF VJG YQTF EQXGTCIG QH C IKXGP UK\G TGEQIPKVKQP XQECDWNCT[ 6CMKPI KPVQ CEEQWPV NCPIWCIG URGEKſEKVKGU ECP GXKFGPVN[ KORTQXG TGEQIPKVKQP RGTHQTOCPEG (QT GZCORNG VQPCN NCPIWCIGU UWEJ CU %JKPGUG OC[ DGPGſV HTQO GZRNKEKV OQFGNKPI QH RKVEJ YJKEJ KP VWTP OC[ TGSWKTG OQFKſECVKQPU VQ VJG HGCVWTG CPCN[UKU WUGF

6JGTG CTG VYQ RTGFQOKPCPV CRRTQCEJGU HQT DQQVUVTCRRKPI CEQWUVKE OQFGNU HQT CPQVJGT NCPIWCIG 6JG ſTUV KU VQ WUG CEQWUVKE OQFGNU HTQO CP GZKUVKPI TGEQIPK\GT CPF C RTQPWPEKCVKQP FKEVKQPCT[ VQ UGIOGPV OCPWCNN[ CPPQVCVGF VTCKPKPI FCVC HQT VJG VCTIGV NCPIWCIG +H TGEQIPK\GTU HQT UGXGTCN NCPIWCIGU CTG CXCKNCDNG VJG UGGF OQFGNU ECP DG UGNGEVGF D[ VCMKPI VJG ENQUGUV OQFGN KP QPG QH VJG CXCKNCDNG NCPIWCIGURGEKſE UGVU #P CNVGTPCVKXG CRRTQCEJ KU VQ WUG C UGV QH INQDCN CEQWUVKE OQFGNU VJCV EQXGT C YKFG PWODGT QH RJQPGOGU = ? 6JKU CRRTQCEJ QHHGTU VJG CFXCPVCIG QH DGKPI CDNG VQ WUG OWNVKNKPIWCN CEQWUVKE OQFGNU VQ RTQXKFG CFFKVKQPCN VTCKPKPI FCVC YJKEJ KU RCTVKEWNCTN[ KPVGTGUVKPI YJGP QPN[ NKOKVGF COQWPVU QH FCVC JQWTU HQT VJG VCTIGV NCPIWCIG CTG CXCKNCDNG

/KPKOK\KPI VJG TGSWKTGF VTCKPKPI FCVC QT FGVGTOKPKPI JQY VQ QRVKOCNN[ CESWKTG UWEJ FCVC TGOCKPU CP QWVUVCPFKPI EJCNNGPIG 5VCPFCTF *// VTCKPKPI TGSWKTGU CP CNKIP OGPV DGVYGGP VJG CWFKQ UKIPCN CPF VJG RJQPG OQFGNU YJKEJ WUWCNN[ TGNKGU QP CP QTVJQITCRJKE VTCPUETKRVKQP QH VJG URGGEJ FCVC CPF C IQQF RJQPGOKE NGZKEQP 6JG QTVJQITCRJKE VTCPUETKRVKQP KU WUWCNN[ EQPUKFGTGF CU ITQWPF VTWVJ VJCV KU VJG YQTF UG SWGPEG VJCV UJQWNF DG J[RQVJGUK\GF D[ VJG URGGEJ TGEQIPK\GT YJGP EQPHTQPVGF YKVJ VJG UCOG URGGEJ UGIOGPV 1PG ECP KOCIKPG VTCKPKPI CEQWUVKE OQFGNU KP C NGUU UWRGT XKUGF OCPPGT KP YJKEJ TGNCVGF NKPIWKUVKE KPHQTOCVKQP CDQWV VJG CWFKQ UCORNG ECP DG

WUGF KP RNCEG QH VJG OCPWCN VTCPUETKRVKQPU TGSWKTGF HQT CNKIPOGPV D[ KPEQTRQTCVKPI VJKU KPHQTOCVKQP KP C NCPIWCIG OQFGN 6JKU NCPIWCIG OQFGN ECP DG WUGF YKVJ CEQWU VKE OQFGNU FGXGNQRGF HQT CPQVJGT VCUM VQ CWVQOCVKECNN[ VTCPUETKDG VJG VCUMURGEKſE VTCKPKPI FCVC #NVJQWIJ KP VJG DGIKPPKPI VJG GTTQT TCVG QP PGY FCVC KU NKMGN[ VQ DG TCVJGT JKIJ VJKU URGGEJ FCVC ECP DG WUGF VQ TGVTCKP VJG OQFGNU QH VJG TGEQIPKVKQP U[UVGO #P KVGTCVKXG RTQEGFWTG ECP UWEEGUUKXGN[ TGſPG VJG OQFGNU CPF VJG VTCPUETKR VKQP = ? 6JKU CRRTQCEJ KU RCTVKEWNCTN[ RTQOKUKPI HQT VJG VTCPUETKRVKQP QH TGCFKN[ CXCKNCDNG CWFKQ UQWTEGU UWEJ CU TCFKQ CPF VGNGXKUKQP PGYU DTQCFECUVU VJCV ECP RTQXKFG CP GUUGPVKCNN[ WPNKOKVGF UWRRN[ QH CEQWUVKE VTCKPKPI FCVC

References

=? JVVREQTGVGZKVEKV

=? & #DDGTNG[ & -KTD[ 5 4GPCNU CPF 6 4QDKPUQPThe THISL Broadcast News Retrieval System, 2TQE '5%# '649 QP #EEGUUKPI +PHQTOCVKQP KP 5RQMGP

#WFKQ %CODTKFIG 7- #RTKN

=? # #PFTGQWO 6 -COO CPF , %QJGPExperiments in Vocal Tract Normaliza-tion,2TQE %#+2 9QTMUJQR (TQPVKGTU KP 5RGGEJ 4GEQIPKVKQP ++

=? 6 #PCUVCUCMQU , /E&QPQWIJ 4 5EJYCTV\ CPF , /CMJQWN A Compact Model for Speaker Adaptation Training,2TQE +%5.2ŏ 2JKNCFGN RJKC 2# 1EVQDGT

=? : #WDGTVOne Pass Cross Word Decoding for Large Vocabularies Based on a Lexical Tree Search Organization,2TQE '5%# 'WTQURGGEJŏ4

$WFCRGUV *WPICT[ 5GRVGODGT

=? 5 #WUVKP 4 5EJYCTV\ CPF 2 2NCEGYC[The Forward-Backward Search Strat-egy for Real-Time Speech Recognition, 2TQE +''' +%#552 6QTQPVQ %CPCFC /C[

=? .4 $CJN ,- $CMGT 25 %QJGP 04 &KZQP ( ,GNKPGM 4. /GTEGT CPF

*( 5KNXGTOCPPreliminary results on the performance of a system for the au-tomatic recognition of continuous speech,2TQE +''' +%#552 2JKNCFGN RJKC 2# #RTKN

=? .4 $CJN 2 $TQYP 2 FG 5QW\C 4. /GTEGT CPF / 2KEJGP[ Acoustic Markov Models used in the Tangora Speech Recognition System,2TQE +'''

+%#5521 0GY ;QTM 0; #RTKN

=? .4 $CJN ( ,GNKPGM CPF 4. /GTEGT A Maximum Likelihood Approach to Continuous Speech Recognition,+''' 6TCPU 2CVVGTP #PCN[UKU /CEJKPG +P

VGNNKIGPEGPAMI-5 /CTEJ

=? .4 $CJN 28 FG 5QW\C 25 )QRCNCMTKUJPCP & 0CJCOQQ CPF / 2KEJGP[

A Fast Match for Continuous Speech Recognition Using Allophonic Models, 2TQE +''' +%#552 %#1 5CP (TCPEKUEQ %# /CTEJ

=? , $CMGT , $CMGT 2 $CODGTI - $KUJQR . )KNNKEM 8 *GNOCP < *WCPI ; +VQ 5 .QYG $ 2GUMKP 4 4QVJ CPF ( 5ECVVQPGLarge Vocabulary Recogni-tion of Wall Street Journal Sentences at Dragon Systems,2TQE &#42#5RGGEJ 0CVWTCN .CPIWCIG 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? $CWO .' 6 2GVTKG ) 5QWNGU CPF 0 9GKUUA maximization technique oc-curring in the statistical analysis of probabilistic functions of Markov chains

#PP /CVJ 5VCV41

=? ' $QEEJKGTKVector quantization for efficient computation of continuous den-sity likelihoods, 2TQE +''' +%#5522 /KPPGCRQNKU /0 /C[

=? ( $TWIPCTC / %GVVQNQ / (GFGTKEQ CPF & )KWNKCPKA Baseline for the Tran-scription of Italian Broadcast News,2TQE +''' +%#552 +UVCPDWN 6WTMG[

,WPG

=? . %JCUG Word and acoustic confidence annotation for large vocabulary speech recognition 2TQE '5%# 'WTQURGGEJŏ 4JQFGU )TGGEG 5GRVGODGT

=? . %JCUG 4 4QUGPDGTI # *CWRVOCPP / 4CXKUJCPMCT ' 6JC[GT 2 2NCEG YC[ 4 9GKFG CPF % .WImprovements in Language, Lexical and Phonetic Modeling in Sphinx-II,2TQE #42# 5RQMGP .CPIWCIG 5[UVGOU 6GEJPQNQI[

9QTMUJQR #WUVKP 6: ,CPWCT[

=? 5( %JGP CPF , )QQFOCPAn empirical study of smoothing techniques for language modeling,%QORWVGT 5RGGEJ .CPIWCIG13 1EVQDGT

=? 55 %JGP CPF 25 )QRCNCMTKUJPCP Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion 2TQE &#42#$TQCFECUV 0GYU 6TCPUETKRVKQP 7PFGTUVCPFKPI 9QTMUJQR .CPFUFQYPG 8# (GDTWCT[

=? ;. %JQY 4 5EJYCTV\ 5 4QWMQU 1 -KODCNN 2 2TKEG ( -WDCNC /1

&WPJCO / -TCUPGT CPF , /CMJQWNThe Role of Word-Dependent Coartic-ulatory Effects in a Phoneme-Based Speech Recognition System 2TQE +'''

+%#5523 6QM[Q ,CRCP #RTKN

=? 2 %NCTMUQP CPF 4 4QUGPHGNFStatistical Language Modelling using CMU-Cambridge Toolkit,2TQE '5%# 'WTQ5RGGEJŏ 4JQFGU )TGGEG 5GRVGODGT

=? 5 &CXKU CPF 2 /GTOGNUVGKPComparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences, +''' 6TCPU #EQWUVKEU 5RGGEJ 5KIPCN 2TQEGUUKPI28

=? &GORUVGT #2 // .CKTF CPF &$ 4WDKPMaximum Likelihood from In-complete Data via the EM Algorithm ,QWTPCN QH VJG 4Q[CN 5VCVKUVKECN 5QEKGV[

5GTKGU $ OGVJQFQNQIKECN39

=? 0 &GUJOWMJ # )CPCRCVJKTCLW 4, &WPECP CPF , 2KEQPGHuman Speech Recognition Performance on the 1995 CSR Hub-3 Corpus2TQE #42# 5RGGEJ 4GEQIPKVKQP 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? 8 &KICNCMKU CPF * /WTXGKVGenones: Optimization the Degree of Tying in a Large Vocabulary HMM-based Speech Recognizer,2TQE +''' +%#552 1 #FGNCKFG #WUVTCNKC #RTKN

=? 8 &KICNCMKU & 4VKEJGX CPF .) 0GWOG[GTSpeaker adaptation using con-strained estimation of Gaussian mixtures +''' 6TCPU QP 5RGGEJ #WFKQ 3 5GRVGODGT

=? , &TG[HWU)TCHSonograph and Sound Mechanics,, #EQWUV 5QE #OGTKEC 22

=? * &WFNG[ CPF 5 $CNCUJGM Automatic Recognition of Phonetic Patterns in Speech,, #EQWUV 5QE #OGTKEC30

=? 9, 'DGN CPF , 2KEQPGHuman Speech Recognition Performance on the 1994 CSR Spoke 10 Corpus 2TQE #42# 5RQMGP .CPIWCIG 5[UVGOU 6GEJPQNQI[

9QTMUJQR #WUVKP 6: ,CPWCT[

=? 5 (WTWKComparison of speaker recognition methods using statistical features and dynamic features,+''' 6TCPU QP #EQWUVKEU 5RGGEJ 5KIPCN 2TQEGUUKPI ASSP-29

=? /,( )CNGU CPF 5, ;QWPIAn improved approach to hidden Markov model decomposition of speech and noise, 2TQE +''' +%#552 5CP (TCPEKUEQ %# /CTEJ

=? /,( )CNGU CPF 5, ;QWPIRobust Continuous Speech Recognition using Parallel Model Combination,%QORWVGT 5RGGEJ .CPIWCIG9 1EVQDGT

=? /,( )CNGU Cluster Adaptive Training for Speech Recognition, 2TQE +%

5.2ŏ 5[FPG[ #WUVTCNKC 0QXGODGT

=? /,( )CNGU Semi-Tied Covariance Matrices for Hidden Markov Models, +''' 6TCPU QP 5RGGEJ CPF #WFKQ7 /C[

=? ,. )CWXCKP ) #FFC . .COGN CPF / #FFC&GEMGTTranscribing Broad-cast News: The LIMSI Nov96 Hub4 System,2TQE #42# 5RGGEJ 4GEQIPKVKQP 9QTMUJQR %JCPVKNN[ 8# (GDTWCT[

=? ,. )CWXCKP 5 $GPPCEGH . &GXKNNGTU . .COGN CPF 4 4QUUGVSpoken Lan-guage component of the MASK KioskKP - 8CTIJGUG 5 2ƀGIGT'FU *WOCP

%QOHQTV CPF UGEWTKV[ QH KPHQTOCVKQP U[UVGOU 5RTKPIGT8GTNCI #NUQ KP

2TQE *WOCP %QOHQTV CPF 5GEWTKV[ 9QTMUJQR $TWUUGNU $GNIWKO 1EVQDGT

=? ,. )CWXCKP ,, )CPIQNH CPF . .COGNSpeech Recognition for an Informa-tion Kiosk,2TQE +%5.2ŏ Ō 2JKNCFGNRJKC 2# 1EVQDGT

=? ,. )CWXCKP . .COGN CPF ) #FFCPartitioning and Transcription of Broad-cast News Data,2TQE +%5.2ŏ5 5[FPG[ #WUVTCNKC &GEGODGT

=? ,. )CWXCKP .( .COGN CPF / #FFC&GEMGTDevelopments in Continuous Speech Dictation using the ARPA WSJ Task,2TQE +''' +%#552

&GVTQKV /+ /C[

=? ,. )CWXCKP CPF %* .GGMaximum a Posteriori Estimation for Multivari-ate Gaussian Mixture Observations of Markov Chains,+''' 6TCPU 5RGGEJ

#WFKQ 2TQEGUUKPI2 #RTKN

=? ,. )CWXCKP . .COGN CPF ) #FFCThe LIMSI Broadcast News Transcription

System 5RGGEJ %QOOWPKECVKQP37 /C[

=? . )KNNKEM CPF 4 4QVJ A Rapid Match Algorithm for Continuous Speech Recognition,2TQE &#42#5RGGEJ 0CVWTCN .CPIWCIG 9QTMUJQR

*KFFGP 8CNNG[ 2# ,WPG

=? . )KNNKEM ; +VQ CPF , ;QWPIA Probabilistic Approach to Confidence Mea-sure Estimation and Evaluation 2TQE +''' +%#552 /WPKEJ )GTOCP[ #RTKN

=? ,4 )NCUU 6, *C\GP CPF + . *GVJGTKPIVQP Real-time Telephone-based Speech Recognition in the Jupiter Domain,2TQE +''' +%#5521 2JQGPKZ #< /CTEJ

=? , )QFHTG[ ' *QNNKOCP CPF , /E&CPKGNSWITCHBOARD: Telephone Speech Corpus for Research and Development,2TQE +''' +%#552 5CP (TCPEKUEQ %# /CTEJ

=? +, )QQFThe Population Frequencies of Species and the Estimation of

Popu-lation Parameters $KQOVGTKMC40

=? 25 )QRCNCMTKUJPCP .4 $CJN CPF 4. /GTEGT A tree search strategy for large-vocabulary continuous speech recognition,2TQE +''' +%#552 1 &GVTQKV /+ /C[

=? 4 *CGD7ODCEJ CPF * 0G[ Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, 2TQE +%#5521 /CTEJ

=? 6 *CKP 5' ,QJPUQP # 6WGTM 2% 9QQFNCPF CPF 5, ;QWPI Segment Generation and Clustering in the HTK Broadcast News Transcription System, 2TQE &#42#$TQCFECUV 0GYU 6TCPUETKRVKQP 7PFGTUVCPFKPI 9QTMUJQR .CPFUFQYPG 8# (GDTWCT[

=? #) *CWRVOCPP / 9KVDTQEM CPF / %JTKUVGN News-on-Demand-’An Ap-plication of Informedia Technology’,&KIKVCN .KDTCTKGU /CIC\KPG 5GRVGODGT

=? %6 *GORJKNN ,, )QFHTG[ CPF )4 &QFFKPIVQPThe ATIS Spoken Language Systems Pilot Corpus,2TQE &#42#5RGGEJ 0CVWTCN .CPIWCIG 9QTMUJQR 2KVVUDWTIJ 2# ,WPG

=? * *GTOCPUM[ Perceptual linear predictive (PLP) analysis of speech, ,

#EQWUV 5QE #OGTKEC87

=? // *QEJDGTI 5, 4GPCNU #, 4QDKPUQP CPF & -GTUJCYLarge vocabu-lary continuous speech recognition using a hybrid connectionist-HMM system, 2TQE +%5.2ŏ ;QMQJCOC ,CRCP 5GRVGODGT

=? /, *WPV Signal Representation %JCRVGT QH VJG 5VCVG QH VJG

#TV KP *WOCP .CPIWCIG 6GEJPQNQI[ %QNG GV CN GFU JVVRYYYEUGQIKGFW%5.7*.6UWTXG[EJPQFGJVON

=? / *YCPI CPF : *WCPISubphonetic Modeling with Markov States - Senone, 2TQE +''' +%#5521 5CP (TCPEKUEQ %# /CTEJ

=? /; *YCPI : *WCPI CPF ( #NNGXC Predicting Unseen Triphones with Senones,2TQE +''' +%#552II /KPPGCRQNKU /0 #RTKN

=? ( ,GNKPGMContinuous Speech Recognition by Statistical Methods,2TQE QH VJG

+'''64 #RTKN

=? ( ,GNKPGMStatistical Methods for Speech Recognition,%CODTKFIG /+6 2TGUU

=? ( ,GNKPGM $ /GTKCNFQ 5 4QWMQU CPF / 5VTCWUU A Dynamic Language Model for Speech Recognition, 2TQE &#42# 5RGGEJ 0CVWTCN .CPIWCIG 9QTMUJQR 2CEKſE )TQXG %# (GDTWCT[

=? ( FG,QPI ,. )CWXCKP , FGD *CTVQI CPF - 0GVVGT 1.+8': Speech Based Video Retrieval,2TQE %$/+ŏ 6QWNQWUG (TCPEG 1EVQDGT

=? ,WCPI $* Maximum-Likelihood Estimation for Mixture Multivariate Stochastic Observations of Markov Chains #66 6GEJPKECN ,QWTPCN64

=? 5/ -CV\ Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer,+''' 6TCPU #EQWUVKEU 5RGGEJ 5KIPCN 2TQEGUUKPIASSP-35 /CTEJ

=? 6 -GOR CPF # 9CKDGNUnsupervised Training of a Speech Recognizer: Re-cent Experiments,2TQE '5%# 'WTQURGGEJŏ6 $WFCRGUV *WP ICT[ 5GRVGODGT

=? & -GTUJCY #, 4QDKPUQP CPF 5, 4GPCNU The 1995 Abbot hybrid connectionist-HMM large-vocabulary recognition system,2TQE #42# 5RGGEJ 4GEQIPKVKQP 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? 4 -PGUGT CPF * 0G[Improved Clustering Techniques for Class-Based Statis-tical Language Modelling,2TQE 'WTQURGGEJŏ $GTNKP 5GRVGODGT

=? 4 -PGUGT CPF * 0G[Improved backing-off for n-gram language modeling, 2TQE +''' +%#5521 &GVTQKV /+ /C[

=? ( -WDCNC Design of the 1994 CSR Benchmark Tests, 2TQE #42# 5RQMGP .CPIWCIG 5[UVGOU 6GEJPQNQI[ 9QTMUJQR #WUVKP 6: ,CPWCT[

=? ( -WDCNC 6 #PCUVCUCMQU * ,KP , /CMJQWN . 0IW[GP 4 5EJYCTV\ CPF 0

;WCPToward Automatic Recognition of Broadcast News,2TQE &#42#5RGGEJ 4GEQIPKVKQP 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? 0 -WOCT CPF #) #PFTGQWHeteroscedastic discriminant analysis and re-duced rank HMMs for improved speech recognition,5RGGEJ %QOOWPKECVKQP 26 &GEGODGT

=? 4 -WJP 2 0IW[GP ,% ,WPSWC . )QNFYCUUGT 0 0KGF\KGNUMK 5 (KPEMG CPF - (KGNF / %QPVQNKPK Eigenvoices for Speaker Adaptation, 2TQE +%

5.2ŏ 5[FPG[ 0QXGODGT

=? .( .COGN CPF ) #FFCOn Designing Pronunciation Lexicons for Large Vo-cabulary, Continuous Speech Recognition,2TQE +%5.2ŏ1 2JKNCFGN RJKC 2# 1EVQDGT

=? .( .COGN CPF 4 &G/QTKSpeech Recognition of European Languages,2TQE +''' #WVQOCVKE 5RGGEJ 4GEQIPKVKQP 9QTMUJQR 5PQYDKTF 7VCJ &G EGODGT

=? .( .COGN CPF ,. )CWXCKPContinuous Speech Recognition at LIMSI,2TQE

#42# 9QTMUJQR QP %QPVKPWQWU 5RGGEJ 4GEQIPKVKQP 5VCPHQTF %#

5GRVGODGT

=? .( .COGN CPF ,. )CWXCKP A Phone-based Approach to Non-Linguistic Speech Feature Identification, %QORWVGT 5RGGEJ .CPIWCIG 9 ,CPWCT[

=? . .COGN ,. )CWXCKP CPF ) #FFCLightly Supervised and Unsupervised Acoustic Model Training %QORWVGT 5RGGEJ .CPIWCIG16 ,CP WCT[

=? .( .COGN 5 4QUUGV 5- $GPPCEGH * $QPPGCW/C[PCTF . &GXKNNGTU CPF ,. )CWXCKP Development of Spoken Language Corpora for Travel Infor-mation 2TQE '5%# 'WTQURGGEJŏ3 /CFTKF 5RCKP 5GRVGODGT

=? -( .GGLarge-vocabulary speaker-independent continuous speech recogni-tion: The SPHINX system,2J& 6JGUKU %CTPGIKG /GNNQP 7PKXGTUKV[

=? . .GG CPF 4% 4QUGSpeaker Normalization Using Efficient Frequency Warp-ing Procedures 2TQE +''' +%#5521 #VNCPVC )# /C[

=? %, .GIIGVVGT CPF 2% 9QQFNCPFMaximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models,%QORWVGT

5RGGEJ .CPIWCIG9 #RTKN

=? .KRQTCEG .4 Maximum Likelihood Estimation for Multivariate Observa-tions of Markov Sources +''' 6TCPUCEVKQPU QP +PHQTOCVKQP 6JGQT[ IT-28

=? 42 .KRROCPPSpeech recognition by machines and humans,5RGGEJ %QO

OWPKECVKQP22 ,WN[

=? & .KW CPF ( -WDCNC Fast Speaker Change Detection for Broadcast News Transcription and Indexing 2TQE '5%# 'WTQ5RGGEJŏ3 $W FCRGUV *WPICT[ 5GRVGODGT

=? /CFEQY Multi-site Data Collection for a Spoken Language Corpus, 2TQE

&#42#5RGGEJ 0CVWTCN .CPIWCIG 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? . /CPIW ' $TKNN # 5VQNEMG Finding Consensus in Speech Recognition:

Word Error Minimization and Other Applications of Confusion Networks,

%QORWVGT 5RGGEJ CPF .CPIWCIG 1EVQDGT

=? $ /CM CPF ' $QEEJKGTK Subspace distribution clustering for continuous observation density hidden Markov models, 2TQE 'WTQURGGEJŏ 4JQFGU )TGGEG 5GRVGODGT

=? ,, /CTKCPKSpoken Language Processing and Human-Machine Communica-tion in the European Union Programs,KP ) 8CTKNG GF 'WTQURGGEJŏ '7 5RGGEJ 2TQLGEVU &C[ TGRQTV 4JQFGU )TGGEG 5GRVGODGT

=? ,, /CTKCPK CPF .( .COGNAn overview of EU programs related to conver-sational/interactive systems, 2TQE &#42# $TQCFECUV 0GYU 6TCPUETKRVKQP 7PFGTUVCPFKPI 9QTMUJQR .CPFUFQYPG 8# (GDTWCT[

=? 5 /CTVKP , .KGTOCPP CPF * 0G[Algorithms for Bigram and Trigram Clus-tering,2TQE 'WTQURGGEJŏ /CFTKF 5RCKP 5GRVGODGT

=? / /C[DWT[ GFNews on Demand,5RGEKCN 5GEVKQP KP VJG %QOOWPKECVKQPU QH VJG #%/43 (GDTWCT[

=? & /KNNGT 4 5EJYCTV\ 4 9GKUEJGFGN CPF 4 5VQPGNamed Entity Extrac-tion from Broadcast News,2TQE &#42#$TQCFECUV 0GYU 9QTMUJQR

*GTPFQP 8# (GDTWCT[

=? / /QJTK / 4KNG[ & *KPFNG # .LQNKG CPF ( 2GTGKTCFull Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition,2TQE +''' +%#552 5GCVVNG 9# /C[

=? * /WTXGKV , $WV\DGTIGT 8 &KICNCMKU CPF / 9GKPVTCWDLarge-Vocabulary Dictation using SRI’s Decipher Speech Recognition System: Progressive

Search Techniques, 2TQE +''' +%#552 II /KPPGCRQNKU /0

#RTKN

=? * 0G[The Use of a One-Stage Dynamic Programming Algorithm for Con-nected Word Recognition,+''' 6TCPU #EQWUVKEU 5RGGEJ CPF 5KIPCN 2TQEGUU

KPIASSP-32 #RTKN

=? * 0G[ 4 *CGD7ODCEJ $* 6TCP CPF / 1GTFGTImprovements in Beam Search for 10000-Word Continuous Speech Recognition,2TQE +''' +%#552 I 5CP (TCPEKUEQ %# /CTEJ

=? . 0IW[GP CPF 4 5EJYCTV\ Single-Tree Method for Grammar-Directed Search,2TQE +''' +%#5522 2JQGPKZ #< /CTEJ

=? ,, 1FGNNThe Use of Decision Trees with Context Sensitive Phoneme Mod-elling,/2JKN 6JGUKU %CODTKFIG 7PKXGTUKV[ 'PIKPGGTKPI &GRV

=? ,, 1FGNN 8 8CNVEJGX 2% 9QQFNCPF CPF 5, ;QWPIA One Pass Decoder Design for Large Vocabulary Recognition, 2TQE #42# *WOCP .CPIWCIG 6GEJPQNQI[ 9QTMUJQR 2TKPEGVQP 0, /CTEJ

=? - 1JVUWMK 5 (WTWK 0 5CMWTCK # +YCUCMK CPF <2 <GCPIRecent Advances in Japanese Broadcast News Transcription, 2TQE '5%# 'WTQURGGEJŏ 2 $WFCRGUV *WPICT[ 5GRVGODGT

=? 2# 1NUGP CPF 4# )QRKPCVJModeling Inverse Covariance Matrices by Ba-sis Expansion,2TQE +''' +%#552 1TNCPFQ (.

=? 5 1TVOCPPU * 0G[ CPF # 'KFGP Language-model look-ahead for large vocabulary speech recognition,2TQE +%5.2ŏ 2JKNCFGNRJKC 2#

1EVQDGT

=? 5 1TVOCPPU * 0G[ CPF : #WDGTVA Word Graph Algorithm for Large Vo-cabulary Continuous Speech Recognition,%QORWVGT 5RGGEJ CPF .CPIWCIG 11 ,CPWCT[

=? $6 1UJKMC 89 <WG 48 9GGMU * 0GW CPF , #WTDCEJ The Role of Phonological Rules in Speech Understanding Research,+''' 6TCPU #EQWU VKEU 5RGGEJ 5KIPCN 2TQEGUUKPIASSP-23

=? / 1UVGPFQTH # -CPPCP 1 -KODCNN CPF ,4 4QJNKEGM Continuous Word Recognition Based on the Stochastic Segment Model,2TQE #42# 9QTMUJQR QP %QPVKPWQWU 5RGGEJ 4GEQIPKVKQP 5VCPHQTF %# 5GRVGODGT

=? &5 2CNNGVV ,) (KUEWU 9/ (KUJGT ,5 )CTQHQNQ $# .WPF CPF /#

2T[\DQEMK1993 Benchmark Tests for the ARPA Spoken Language Program, 2TQE #42# *WOCP .CPIWCIG 6GEJPQNQI[ 9QTMUJQR 2TKPEGVQP 0, /CTEJ

=? &5 2CNNGVV ,) (KUEWU 9/ (KUJGT ,5 )CTQHQNQ $# .WPF #( /CTVKP CPF /# 2T\[DQEMK1994 Benchmark Tests for the ARPA Spoken Language

Program,2TQE #42# 5RQMGP .CPIWCIG 5[UVGOU 6GEJPQNQI[ 9QTMUJQR #WUVKP 6: ,CPWCT[

=? &5 2CNNGVV ,) (KUEWU 9/ (KUJGT ,5 )CTQHQNQ #( /CTVKP CPF /#

2T\[DQEMK1995 Hub-3 Multiple Microphone Corpus Benchmark Tests,2TQE

#42# 5RGGEJ 4GEQIPKVKQP 9QTMUJQR *CTTKOCP 0; (GDTWCT[

=? &5 2CNNGVV ,) (KUEWU ,5 )CTQHQNQ #( /CTVKP CPF /# 2T\[DQEMK1998 Broadcast News Benchmark Test Results: English and Non-English Word Er-ror Rate Performance Measures, 2TQE &#42# $TQCFECUV 0GYU 9QTMUJQR *GTPFQP 8# (GDTWCT[

=? &$ 2CWN An efficient A stack decoder algorithm for continuous speech recognition with a stochastic language model,2TQE +''' +%#552 5CP (TCPEKUEQ %# /CTEJ

=? & 2QXG[ CPF 2 9QQFNCPFImproved Discriminative Training Techniques For Large Vocabulary Continuous Speech Recognition 2TQE +''' +%#552 5CNV .CMG %KV[ /C[

=? 2 2TKEG Evaluation of Spoken Language Systems: The ATIS Domain,2TQE

&#42#5RGGEJ CPF 0CVWTCN .CPIWCIG 9QTMUJQR *KFFGP 8CNNG[ 2#

,WPG

=? .4 4CDKPGT CPF $* ,WCPI An Introduction to Hidden Markov Models +''' #EQWUVKEU 5RGGEJ CPF 5KIPCN 2TQEGUUKPI /CIC\KPG ASSP-3 ,CPWCT[

=? /- 4CXKUJCPMCTEfficient Algorithms for Speech Recognition,2J& 6JGUKU

%CTPGIKG /GNNQP 7PKXGTUKV[

=? /& 4KNG[ 9 $[TPG / (KPMG 5 -JWFCPRW # .LQLNG , /E&QPQWIJ

* 0QEM / 5CTCENCT % 9QQVGTU CPF ) <CXCNKCIMQUStochastic pronuncia-tion modelling from hand-labelled phonetic corpora,5RGGEJ %QOOWPKECVKQP 29 0QXGODGT

=? 4 4QUGPHGNF CPF : *WCPIImprovements in Stochastic Language Modeling, 2TQE &#42#9QTMUJQR QP 5RGGEJ 0CVWTCN .CPIWCIG *CTTKOCP 0; (GDTWCT[

=? 4 4QUGPHGNFAdaptive Statistical Language Modeling,2J& 6JGUKU %CTPGIKG /GNNQP 7PKXGTUKV[ CNUQ 6GEJ TGR %/7%5

=? 4 4QUGPHGNFTwo Decades of Statistical Language Modeling: Where Do We Go From Here?,2TQEGGFKPIU QH VJG +''' 5RGEKCN KUUWG QP 5RQMGP .CPIWCIG

2TQEGUUKPI88 #WIWUV

=? 6 5EJWNV\C CPF # 9CKDGN Language-independent and langauge-adaptive acoustic modeling for speech recognition 5RGGEJ %QOOWPKECVKQP 35 #WIWUV

=? / 5EJWUVGTMemory-efficient LVCSR search using a one-pass stack decoder,

%QORWVGT 5RGGEJ .CPIWCIG14 ,CPWCT[

=? 4 5EJYCTV\ 5 #WUVKP ( -WDCNC CPF , /CMJQWNNew uses for N-Best Sen-tence Hypothesis, within the BYBLOS Speech Recognition System,2TQE +''' +%#552I 5CP (TCPEKUEQ %# /CTEJ

=? 4 5EJYCTV\ ; %JQY 5 4QWEQU / -TCUPGT CPF , /CMJQWNImproved Hid-den Markov Modeling of Phonemes for Continuous Speech Recognition,2TQE

+''' +%#5523 5CP &KGIQ %# /CTEJ

=? 5 5GMKPG CPF 4 )TKUJOCP NYU Language Modeling Experiments for the 1995 CSR Evaluation,2TQE #42# 5RGGEJ 4GEQIPKVKQP 9QTMUJQR

*CTTKOCP 0; (GDTWCT[

=? $ 5JCJUJCJCPK A Markov Random Field Approach to Bayesian Speaker Adaptation,2TQE +''' +%#552 &GVTQKV /+ /C[

=? 4 5EJYCTV\ * ,KP ( -WDCNC CPF 5 /CVUQWMCU Modeling Those F-Conditions – Or Not,2TQE &#42#5RGGEJ 4GEQIPKVKQP 9QTMUJQR

%JCPVKNN[ 8# (GDTWCT[

=? - 5G[OQTG CPF 4 4QUGPHGNF Scalable backoff language models 2TQE +%

5.2ŏ1 2JKNCFGNRJKC 2# 1EVQDGT

=? / 5KGINGT 7 ,CKP $ 4CL CPF 4 5VGTPAutomatic Segmentation, Classifica-tion and Clustering of Broadcast News Audio,2TQE &#42#5RGGEJ 4GEQIPK VKQP 9QTMUJQR %JCPVKNN[ 8# (GDTWCT[

=? / 5KW CPF * )KUJ Evaluation of word confidence for speech recognition systems %QORWVGT 5RGGEJ .CPIWCIG13 1EVQDGT

=? # 5VQNEMGEntropy-based Pruning of Backoff Language Models 2TQE &#42#

$TQCFECUV 0GYU 6TCPUETKRVKQP 7PFGTUVCPFKPI 9QTMUJQR .CPFU FQYPG 8# (GDTWCT[

=? 5 6CMCJCUJK CPF 5 5CIC[COCFour-level Tied Structure for Efficient Repre-sentation of Acoustic Modeling,2TQE +''' +%#552 &GVTQKV /+

/C[

=? .( 7GDGN CPF 2% 9QQFNCPF An Investigation into Vocal Tract Length Normalization,2TQE '5%# 'WTQURGGEJŏ $WFCRGUV *WPICT[

5GRVGODGT

=? &# XCP .GGWYGP .) XCP FGP $GTI CPF *,/ 5VGGPGMGPHuman Bench-marks for Speaker Independent Large Vocabulary Recognition Performance, 2TQE '5%# 'WTQURGGEJŏ /CFTKF 5RCKP 5GRVGODGT

=? 6- 8KPVU[WMSpeech discrimination by dynamic programming,-KDPGTPGVKMC 4