• Aucun résultat trouvé

Correspondence Set Pinching

Minimum Bayes-Risk Methods in Automatic Speech Recognition

2.4 Experimental Results

2.4.3 ROVER and e-ROVER for Multilingual ASR

2.4.3.1 Correspondence Set Pinching

RKPEJKPIVJTGUJQNF

TCVKQ

GŦ418'4 418'4

RKPEJKPIVJTGUJQNF

9'4

GŦ418'4 418'4

FIGURE 2.4

Top panel shows the ratio of total number of e-ROVER correspondence sets to that of ROVER correspondence sets, as a function of the pinching threshold.

Bottom panel shows the WER performance of e-ROVER for these thresholds.

TWNG 6JG NKMGNKJQQF UECNG HCEVQT YCU QDVCKPGF D[ EQPFWEKPI CP WPUWRGTXKUGF QR VKOK\CVKQP 5GEVKQP UGRCTCVGN[ HQT GCEJ U[UVGO 418'4 CPF G418'4 YGTG KORNGOGPVGF D[ EQODKPKPI VJGUG VJTGG UGVU QH J[RQVJGUGU 6JG RQUVGTKQT FKU VTKDWVKQP QXGT VJG TGUWNVKPI DGUV NKUV YCU FGTKXGF D[ UKORN[ TGPQTOCNK\KPI VJG NQINKMGNKJQQFU QH VJG UECNGF KPFKXKFWCN J[RQVJGUGU

2.4.3.1 Correspondence Set Pinching

+P G418'4 VJG EQTTGURQPFGPEG UGVU YGTG LQKPGF WUKPI VJG JGWTKUVKE RTQEGFWTG FG UETKDGF KP 5GEVKQP 6JKU RTQEGFWTG LQKPU VJG EQTTGURQPFGPEG UGVU DCUGF QP C ŒRKPEJKPI VJTGUJQNFŒ VJCV EQPUKFGTU VJG NCTIGUV RQUVGTKQT RTQDCDKNKV[ QH CP[ YQTF UVTKPI KP GCEJ EQTTGURQPFGPEG UGV # VJTGUJQNF QH TGUWNVU KP PQ LQKPKPI CV CNN YJKEJ KU GSWKXCNGPV VQ 418'4 YJKNG CP[ VJTGUJQNF CDQXG OGTIGU CNN VJG EQTTG URQPFGPEG UGVU

1WT KORNGOGPVCVKQP QH 418'4 TGUWNVGF KP C 9'4 YJKEJ KU C CD UQNWVG KORTQXGOGPV QXGT VJG DGUV /#2 YQTF GTTQT TCVG QH VJG VJTGG U[UVGOU DGKPI EQODKPGF(KIWTG UJQYU VJCV CFFKVKQPCN ICKPU ECP DG QDVCKPGF WUKPI G418'4 6JG VQR RCPGN UJQYU VJG TCVKQ QH VQVCN PWODGT QH G418'4 EQTTGURQPFGPEG UGVU VQ VQVCN PWODGT QH 418'4 EQTTGURQPFGPEG UGVU CU C HWPEVKQP QH VJG RKPEJKPI VJTGUJ QNF 6JKU TCVKQ KU HQT VJTGUJQNF XCNWG QH CPF FGETGCUGU OQPQVQPKECNN[ CU VJG VJTGUJQNF KPETGCUGU +V KU PQV CV KVU OKPKOWO HQT C VJTGUJQNF QH FWG VQ VJG RTGU GPEG QH EQTTGURQPFGPEG UGVU YJKEJ EQPVCKP QPN[ QPG YQTF VJGUG UGVU JCXG C YQTF YKVJ OCTIKPCN RTQDCDKNKV[ QH CPF TGOCKPGF RKPEJGF HQT C VJTGUJQNF XCNWG QH

6JG DQVVQO RCPGN KP (KIWTG UJQYU VJG GHHGEV QH RKPEJKPI QP 9'4 9G PQVG VJCV CNN VJTGUJQNFU TGUWNV KP DGVVGT VJCP 418'4 YQTF GTTQT TCVG 6JG VJTGUJQNF QH [KGNFU VJG DGUV RGTHQTOCPEG QH CDUQNWVG KORTQXGOGPV QXGT 418'4 CPF JGPEG C VQVCN QH CDUQNWVG QXGT VJG DGUV DCUGNKPG GTTQT TCVG 9G UGG C FGITCFCVKQP KP RGTHQTOCPEG HQT VJTGUJQNFU NCTIGT VJCP 1PG RQUUKDNG GZRNCPCVKQP KU VJG PGGF HQT JGCXKGT RTWPKPI FWG VQ VJG ITGCVN[ GPNCTIGF UGCTEJ URCEG VJCV TGUWNVU HTQO GZRCPFKPI CNN VJG UGIOGPV UGVU #PQVJGT RQUUKDKNKV[ KU VJCV VJG DGUV UVTCVGI[ KU VQ TGVCKP VJG YQTF UGIOGPVU VJCV YGTG TGEQIPK\GF YKVJ CDUQNWVG EGTVCKPV[ D[ VJG ſTUVRCUU U[UVGO

2.5 Summary

9G JCXG FGUETKDGF CWVQOCVKE URGGEJ TGEQIPKVKQP CNIQTKVJOU VJCV CVVGORV VQ OKPKOK\G VJG CXGTCIG OKUTGEQIPKVKQP EQUV WPFGT VCUM URGEKſE NQUU HWPEVKQPU 6JGUG TGEQIPK\

GTU CNVJQWIJ IGPGTCNN[ OQTG EQORWVCVKQPCNN[ EQORNGZ VJCP OQTG YKFGN[ WUGF /#2 CNIQTKVJOU ECP DG GHſEKGPVN[ KORNGOGPVGF WUKPI CP 0DGUV NKUV TGUEQTKPI RTQEGFWTG QT CU CPUGCTEJ QXGT TGEQIPKVKQP NCVVKEGU 9JKNG VJGKU IGPGTCNN[ OQTG CEEW TCVG KVU KORNGOGPVCVKQP TGSWKTGU VJCV WRRGT CPF NQYGT DQWPFU QP VJG EQUV QH RCTVKCN J[RQVJGUGU DG EQORWVGF CU VJG UGCTEJ RTQEGGFU 6JGUG OWUV DG FGTKXGF HQT GCEJ RGTHQTOCPEG ETKVGTKQP QH KPVGTGUV CPF YG JCXG IKXGP GZRTGUUKQPU HQT VJG .GXGPUJVGKP CPF MG[YQTF GTTQT TCVGU +P .8%54 GZRGTKOGPVU YG JCXG UJQYP VJCV /$4 FGEQFKPI RTQEGFWTGU ECP DG WUGF VQ VWPG #54 RGTHQTOCPEG HQT VCUM URGEKſE NQUU HWPEVKQPU 5GIOGPVCN /$4 KU FGUETKDGF CU C URGEKCN ECUG QH /$4 TGEQIPKVKQP VJCV TGUWNVU HTQO VJG UGIOGPVCVKQP QH VJG TGEQIPKVKQP UGCTEJ URCEG 6JG UGIOGPVCVKQP KU FQPG YKVJ VJG CUUWORVKQP VJCV VJG NQUU HWPEVKQP KPFWEGF KU C IQQF CRRTQZKOCVKQP VQ VJG QTKIK PCN FGUKTGF NQUU HWPEVKQP +V KU FKUEWUUGF JQY TGEQIPK\GT XQVKPI ECP DG EQPUKFGTGF KP VJG 5/$4 HTCOGYQTM CPF KP RCTVKEWNCT VJG YKFGN[WUGF 418'4 U[UVGO EQO DKPCVKQP RTQEGFWTG KU FGUETKDGF KP VJKU YC[ 6JCV 418'4 ECP DG FGUETKDGF CU CP /$4 RTQEGFWTG WPFGT C NQUU HWPEVKQP TGNCVGF VQ VJG 9'4 RTQXKFGU C RNCWUKDNG GZ RNCPCVKQP HQT VJG RGTHQTOCPEG KORTQXGOGPVU VJCV KV JCU DGGP HQWPF VQ RTQXKFG 9G VJGP FGUETKDGF G418'4 YJKEJ KU C 418'4 XCTKCPV DCUGF QP C NQUU HWPEVKQP VJCV ECP DG VWPGF VQ DGVVGT CRRTQZKOCVG VJG .GXGPUJVGKP FKUVCPEG 6JG XCNWG QH VJGUG VGEJPKSWG CTG FGOQPUVTCVGF D[ WUKPI 418'4 CPF G418'4 HQT OWNVKNKPIWCN U[UVGO EQODKPCVKQP #U JCU DGGP UJQYP KP VJGUG CPF QVJGT GZRGTKOGPVU TGEQIPK\GT XQVKPI RTQEGFWTGU ECP EQODKPG TGEQIPKVKQP J[RQVJGUGU HTQO FKXGTUG U[UVGOU VQ IGPGTCVG C UKPING J[RQVJGUKU VJCV KU DGVVGT VJCP VJG DGUV J[RQVJGUKU QH CP[ QH VJG KPFKXKFWCN U[U VGOU 6JGUG GZRGTKOGPVU YGTG DCUGF QP VJG UGIOGPVCVKQP QH 0DGUV NKUVU RTQFWEGF D[ GCEJ U[UVGO *QYGXGT UKOKNCT RTQEGFWTGU ECP DG FGTKXGF HQT NCVVKEG TGUEQTKPI CPF VJG FGXGNQROGPV QH /$4 NCVVKEG UGIOGPVCVKQP RTQEGFWTGU KU C VQRKE QH EWTTGPV TGUGCTEJ

2.6 Acknowledgements

9G VJCPM &KOKVTC 8GTI[TK HQT RTQXKFKPI VJG NCVVKEGU VJCV YGTG WUGF KP QWT GZRGTKOGPVU CPF -WOCT 5JCPMCT HQT JGNR YKVJ VJG GZRGTKOGPVU 9G CNUQ VJCPM #PFTGCU 5VQNEMG CPF .KFKC /CPIW HQT WUGHWN FKUEWUUKQPU

References

=? 2 , $KEMGN CPF - # &QMUWO Mathematical Statistics: Basic Ideas and Selected topics *QNFGP&C[ +PE 1CMNCPF %#

=? 9 $[TPG 2 $G[GTNGKP , *WGTVC 5 -JWFCPRWT $ /CTVJK , /QTICP 0 2G VGTGM , 2KEQPG & 8GTI[TK CPF 9 9CPI 6QYCTFU NCPIWCIG KPFGRGPFGPV CEQWUVKE OQFGNKPI +PIEEE Conference on Acoustics, Speech, and Signal Pro-cessing RCIGU Ō +UVCPDWN 6WTMG[

=? 0 %JKPEJQT 2 4QDKPUQP CPF ' $TQYP *WD 0COGF 'PVKV[ 6CUM &GſPK VKQP 8GTUKQP +PHub-5 Conversational Speech Recognition Workshop #XCKNCDNG CV YYYPKUVIQXURGGEJJWD

=? ) 'XGTOCPP CPF 2 9QQFNCPF 2QUVGTKQT 2TQDCDKNKV[ &GEQFKPI %QPſFGPEG 'UVKOCVKQP CPF 5[UVGO %QODKPCVKQP +PIn Proceedings of the NIST and NSA Speech Transcription Workshop %QNNGIG 2CTM /&

=? , (KUEWU # 2QUVRTQEGUUKPI 5[UVGO VQ ;KGNF 4GFWEGF 9QTF 'TTQT 4CVGU 4GE QIPK\GT 1WVRWV 8QVKPI 'TTQT 4GFWEVKQP 418'4 +PIEEE Workshop on Au-tomatic Speech Recognition and Understanding RCIGU Ō

=? 4CFW (NQTKCP CPF &CXKF ;CTQYUM[ &[PCOKE 0QPNQECN .CPIWCIG /QFGNKPI XKC *KGTCTEJKECN 6QRKE$CUGF #FCRVCVKQP +PACL99 RCIGU Ō

=? , , )QFHTG[ ' % *QNNKOCP CPF , /E&CPKGN 5YKVEJDQCTF 6GNGRJQPG 5RGGEJ %QTRWU HQT 4GUGCTEJ CPF &GXGNQROGPV +PIEEE Conference on Acous-tics, Speech, and Signal Processing XQNWOG RCIGU Ō 5CP (TCPEKUEQ

%#

=? 8 )QGN Word List With Content Word Marks #XCKNCDNG CV JVVRYYYENURLJWGFWRGQRNGXIQGN

=? 8 )QGN Minimum Bayes-Risk Automatic Speech Recognition 2J& &KUUGT VCVKQP ,QJPU *QRMKPU 7PKXGTUKV[ $CNVKOQTG /&

=? 8 )QGN CPF 9 $[TPG 6CUM &GRGPFGPV .QUU (WPEVKQPU KP 5RGGEJ 4GEQIPK VKQP 5GCTEJ QXGT 4GEQIPKVKQP .CVVKEGU +PEurospeech-99 RCIGU Ō $WFCRGUV *WPICT[

=? 8 )QGN CPF 9 $[TPG #RRNKECVKQPU QH /KPKOWO $C[GU4KUM &GEQFKPI VQ .8%54 +PIn Proceedings of the NIST and NSA Speech Transcription Work-shop %QNNGIG 2CTM /&

=? 8 )QGN CPF 9 $[TPG /KPKOWO $C[GU4KUM #WVQOCVKE 5RGGEJ 4GEQIPKVKQP Computer Speech and Language Ō

=? 8 )QGN CPF 9 $[TPG 4GEQIPK\GT 1WVRWV 8QVKPI CPF &/% KP /KPKOWO

$C[GU4KUM (TCOGYQTM +PResearch Notes No. 40, Center for Language and Speech Processing

=? 8 )QGN 9 $[TPG CPF 5 -JWFCPRWT .8%54 4GUEQTKPI 9KVJ /QFKſGF .QUU (WPEVKQPU # &GEKUKQP 6JGQTGVKE 2GTURGEVKXG +PIEEE Conference on Acous-tics, Speech, and Signal Processing XQNWOG RCIGU Ō

=? 8 )QGN 5 -WOCT CPF 9 $[TPG 5GIOGPVCN /KPKOWO $C[GU4KUM #54 8QVKPI 5VTCVGIKGU +PInternational Conference on Spoken Language Pro-cessing XQNWOG RCIGU Ō $GKLKPI %JKPC

=? 2 5 )QRCNCMTKUJPCP . 4 $CJN CPF 4 . /GTEGT # 6TGG 5GCTEJ 5VTCVGI[

HQT .CTIG 8QECDWNCT[ %QPVKPWQWU 5RGGEJ 4GEQIPKVKQP +PIEEE Conference on Acoustics, Speech, and Signal Processing XQNWOG RCIGU Ō

=? 2 ' *CTV 0 , 0KNUUQP CPF $ 4CRJCGN # (QTOCN $CUKU HQT VJG *GWTKUVKE

&GVGTOKPCVKQP QH /KPKOWO %QUV 2CVJUIEEE Transactions on Systems Science and Cybernetics 55%Ō

=? 2 ' *CTV 0 , 0KNUUQP CPF $ 4CRJCGN %QTTGEVKQP VQ Ŏ# (QTOCN $CUKU HQT VJG *GWTKUVKE &GVGTOKPCVKQP QH OKPKOWO %QUV 2CVJUŏ SIGART Newsletter Ō

=? ( ,GNKPGM # (CUV 5GSWGPVKCN &GEQFKPI #NIQTKVJO 7UKPI C 5VCEMIBM Journal of Research Development Ō

=? ( ,GNKPGM Statistical Methods for Speech Recognition 6JG /+6 2TGUU %CO DTKFIG /CUUCEJWUGVVU

=? Proceedings of the 1997 Large Vocabulary Continuous Speech Recognition Workshop #XCKNCDNG CV JVVRYYYENURLJWGFWYU

=? $* ,WCPI CPF 5 -CVCIKTK &KUETKOKPCVKXG .GCTPKPI HQT /KPKOWO 'TTQT %NCU UKſECVKQP IEEE Transactions on Signal Processing 52Ō

=? , -CKUGT $ *QTXCV CPF < -CEKE # 0QXGN .QUU (WPEVKQP HQT VJG 1XGTCNN 4KUM

%TKVGTKQP $CUGF &KUETKOKPCVKXG 6TCKPKPI QH *// /QFGNU +PInternational Conference on Spoken Language Processing XQNWOG RCIGU Ō $GK LKPI %JKPC

=? 6 -CYCJCTC %* .GG CPF $* ,WCPI %QODKPKPI -G[ 2JTCUG &GVGEVKQP CPF 5WDYQTF $CUGF 8GTKſECVKQP HQT (NGZKDNG 5RGGEJ 7PFGTUVCPFKPI +PIEEE

Conference on Acoustics, Speech, and Signal Processing XQNWOG RCIGU Ō

=? / 9 -QQ %* .GG CPF $* ,WCPI # 0GY *[DTKF &GEQFKPI #NIQTKVJO HQT 5RGGEJ 4GEQIPKVKQP CPF 7VVGTCPEG 8GTKſECVKQP +P1997 IEEE Workshop on Automatic Speech Recognition and Understanding RCIGU Ō

=? 8 + .GXGPUJVGKP $KPCT[ %QFGU %CRCDNG QH %QTTGEVKPI &GNGVKQPU +PUGTVKQPU CPF 4GXGTUCNUSoviet Phys. Dokl. Ō

=? . /CPIW ' $TKNN CPF # 5VQNEMG (KPFKPI %QPUGPUWU #OQPI 9QTFU .CVVKEG

$CUGF 9QTF 'TTQT /KPKOK\CVKQP +PEurospeech-99 RCIGU Ō $WFCRGUV

*WPICT[

=? # /CTVKP , (KUEWU / 2T\[DQEMK CPF $ (KUJGT *WD 9QTMUJQR +P HQTOCVKQP 4GVTKGXCN +P9th Hub-5 Conversational Speech Recognition Work-shop

=? # /CTVKP , (KUEWU / 2T\[DQEMK CPF $ (KUJGT *WD 9QTMUJQR 9GKIJVGF 9QTF 4GUWNVU +P 9th Hub-5 Conversational Speech Recognition Workshop

=? - 0C $ ,GQP & %JCPI 5 %JCG CPF 5 #PP &KUETKOKPCVKXG 6TCKPKPI QH

*KFFGP /CTMQX /QFGNU 7UKPI 1XGTCNN 4KUM %TKVGTKQP CPF 4GFWEGF )TCFKGPV /GVJQF +PEurospeech-95 RCIGU Ō /CFTKF 5RCKP

=? # 0CFCU # &GEKUKQP 6JGQTGVKE (QTOWNCVKQP QH VJG 6TCKPKPI 2TQDNGO KP 5RGGEJ 4GEQIPKVKQP CPF C %QORCTKUQP QH 6TCKPKPI D[ 7PEQPFKVKQPCN 8GTUWU

%QPFKVKQPCN /CZKOWO .KMGNKJQQF IEEE Transactions on Acoustics, Speech, and Signal Processing #552Ō

=? # 0CFCU 1RVKOCN 5QNWVKQP QH C 6TCKPKPI 2TQDNGO KP 5RGGEJ 4GEQIPK VKQPIEEE Transactions on Acoustics, Speech, and Signal Processing #552 Ō

=? & $ 2CWN #P 'HſEKGPV5VCEM &GEQFGT #NIQTKVJO HQT %QPVKPWQWU 5RGGEJ 4GEQIPKVKQP YKVJ C 5VQEJCUVKE .CPIWCIG /QFGN +P IEEE Conference on Acoustics, Speech, and Signal Processing XQNWOG RCIGU Ō

=? ' 4KUVCF CPF 2 ;KCPKNQU .GCTPKPI 5VTKPI 'FKV &KUVCPEG IEEE Trans. PAMI Ō

=? 4 % 4QUG CPF & $ 2CWN # *KFFGP /CTMQX /QFGN $CUGF -G[YQTF 4GEQIPK VKQP 5[UVGO +PIEEE Conference on Acoustics, Speech, and Signal Processing XQNWOG RCIGU Ō

=? # 5VQNEMG ; -QPKI CPF / 9GKPVTCWD 'ZRNKEKV 9QTF 'TTQT /KPKOK\CVKQP KP 0$GUV .KUV 4GUEQTKPI +PEurospeech-97 XQNWOG RCIGU Ō 4JQFGU )TGGEG

=? 8 8CRPKM Estimation of Dependences Based on Empirical Data 5RTKPIGT 8GTNCI 0GY ;QTM

=? ( 9GUUGN 4 5EJNWVGT CPF * 0G[ 7UKPI 2QUVGTKQT 9QTF 2TQDCDKNKVKGU (QT +ORTQXGF 5RGGEJ 4GEQIPKVKQP +PIEEE Conference on Acoustics, Speech, and Signal Processing XQNWOG RCIGU Ō +UVCPDWN 6WTMG[

=? , ) 9KNRQP . 4 4CDKPGT %* .GG CPF ' 4 )QNFOCP #WVQOCVKE 4GEQI PKVKQP QH -G[YQTFU KP 7PEQPUVTCKPGF 5RGGEJ 7UKPI *KFFGP /CTMQX /QF GNU IEEE Transactions on Acoustics, Speech, and Signal Processing #552 Ō

=? 5 ;QWPI HTK 2.1 'PVTQRKE %CODTKFIG 4GUGCTEJ .CDQTCVQT[ .VF %CO DTKFIG 7-

3

A Decision Theoretic Formulation for Robust