• Aucun résultat trouvé

Graph Interpolation Grammars: a Rule-based Approach to the Incremental Parsing of Natural Languages

N/A
N/A
Protected

Academic year: 2021

Partager "Graph Interpolation Grammars: a Rule-based Approach to the Incremental Parsing of Natural Languages"

Copied!
6
0
0

Texte intégral

(1)

HAL Id: inria-00073299

https://hal.inria.fr/inria-00073299

Submitted on 24 May 2006

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

Graph Interpolation Grammars: a Rule-based Approach

to the Incremental Parsing of Natural Languages

John Larchevêque

To cite this version:

John Larchevêque. Graph Interpolation Grammars: a Rule-based Approach to the Incremental

Pars-ing of Natural Languages. [Research Report] RR-3390, INRIA. 1998. �inria-00073299�

(2)

ISSN 0249-6399

a p p o r t

d e r e c h e r c h e

INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE

Graph Interpolation Grammars:

a rule-based approach to the incremental

parsing of natural languages

John Larchevˆeque

N ˚ 3390

Mars 1998

(3)
(4)

"$#&%'(*)+,(-./ 0 '1 '-*2  3'4 '5,6 ' "7 '5"'(5* 8&9&:;=<*>@?BAB:CED.FCHGJIC KMLJN OQPO(R,SUTWVHXYOQZ\[E]X\^`_HV L _HPPOabPc[d] L ^eVfOEg ^hPc[jidOQkgmlm_EVmV$nOO4k4gf]_HVmVm[j^hkok\[EVm]OQk p ZY_dqWOX!r!XY_Es`s t [Eumuv_HZYXMlfO(Z\O4] L OQZ\] L OwVxdRERdyEz{SU|}[dZok~4yEydSU€v~um[jiEO4k ‚„ƒ…4†‡‰ˆmŠd†d‹,Œ Zo[du L TWVHXYOQZ\u@_dsh[jX\^`_HV Œ Z\[EPcP[EZ\k[dZ\O[,lfOQ]sh[dZo[jX\^`ŽEO!_EZoP[ds`^hkoPw‘M^eX L [dVc_Eu@O4Zo[jX\^`_HVm[js k\O4Pc[EVEX\^e]4k’ KML O4^eZ“iE_E[js^hk”X\_7OQP,•fsh[jX\Oko[jse^eO4VHX–OQ[jX\•vZYOQk”_d,X L O L •mPc[EV—um[EZ\k\O4Z4g*[dVml=Vf_dXo[d˜fse™ ^hVm]4ZYOQPOQVHX\[jse^eXš™d’ KML Oum[EZ\k\^hVfi„umZ\_f]O4koklmO›@VfOQl}˜H™ Œ T Œ k^eVm]4ZYOQPO4VHX\[ds`se™“˜v•f^`shlmk[„k\™fVHX\[d]XY^h],Z\O4uœa Z\O4k\O4VHXo[jXY^e_EV{_j*[k\O4VHXYOQVm]O–[Ek(O4[E] L ko•m]4]O4kok\^`ŽEOseOfOQPO„^ekZ\O4[ElJ’”r Œ T Œ Zo•fseO–k\u@O4]^ž›@O4k[k\OX_d uv[dZokYO]_EVf›viE•mZo[jX\^`_HVmkŸX L [jX$X\Z\^eididOQZŸ^eX\k![duvufs`^h][dXY^e_EV”[dVml–[EV–_HuvOQZ\[dXY^e_EV X\_u@O4ZY_EZoP¡_EV„[Pc[jX\] L ^eVfi ]_EVœ›@iE•mZo[jX\^`_HVJ’ t •fseO4kM[dZ\O(um[dZ\XYse™„]_EVHXYOœXYabk\O4VvkY^eXY^eŽdOE¢œ£•mZYX L O4ZoP_EZ\OdgfX L O™“[dZ\OZ\OŽEO4ZokY^h˜fseOdgfPO4[dVm^eVfi X L [jX$X L O^hZ._HuvOQZ\[dXY^e_EVmk$][EV„˜@O(•mVmlf_HVfOdgœ‘ L ^h] L [ds`se_‰‘*kX L O(um[dZok\^eVfiumZ\_œ]O4kokX\_˜@O(Vf_EVvlfOX\O4ZoP^hVf^hkWa X\^e]d’ KML OQkYO(Xš‘$_£[d]XY_EZokM]_HVœO4Z*O4Vf_H•fi L OmumZ\O4kok\^`ŽEOu@_‰‘.OQZ!XY_ X L O(_EZoPc[jse^ekoP¤_HZ*um[dZok\^eVfi Vm[dX\•mZo[js sh[dVmiE•m[didO4k4’ ¥'¦H§©¨oª,« ‡‰¬…d‹ um[EZ\k\^hVfivgjVm[dX\•mZo[jsœsh[dVfiH•m[jiEO.umZ\_f]OQk\k\^eVmivg‰umk\™œ] L _dse^eVfiH•f^hkYX\^e]4kg‰se^eVmiE•f^hkYX\^e]4kgdiEZo[dPcPc[dZ4g k\™fVHX\[d]XY^h]ZYOQumZYOQkYOQVHX\[jX\^`_HVJgœs`Of^e]4[js Z\O4umZ\O4k\O4VHX\[dXY^e_EV ­¯®° ±\²´³Qµ ° ±2¶·e²b¸b¹º

Unit´e de recherche INRIA Rocquencourt

Domaine de Voluceau, Rocquencourt, BP 105, 78153 LE CHESNAY Cedex (France)

T´el´ephone : 01 39 63 55 11 - International : +33 1 39 63 55 11

(5)

" 6j4 )*-* 6=" W '„  '-* 2   ) " ) -M" ¦ …  ¦ ‹ OQk Œ Zo[dPcPc[j^hZYOQk N [ TWVHX\O4Zouv_Ese[dXY^e_EVmk”lfO Œ Zo[du L

O4k„k\_EVHX„•vV _EZoPc[jse^ekoPO}l$nOQ]sh[dZo[jX\^ž P•mVf^l•vVfO7kfnO4Pc[EVEX\^œ•fO _Hu.nO4Zo[jX\^`_HVmVfO4s`seOd’ses`OQk Žœ^ek\O4VHX

N

[ O4P,•ms`OQZn ]OQZYXo[j^hVfO4k ]4[dZo[d]XfnOQZY^hkYX\^œ•fOQk lmO s[dVv[jse™œk\O4•vZ–kY™fVHX\[jf^œ•fO L •mPc[d^eVJgMO4X”Vf_EX\[EPPOQVEX”kY_HV ][dZo[d]X N OQZYO'^hVm]ZmnOQPOQVHX\[js´’  O}uvZY_f]OQkWa ko•mk.l[EVm[jse™fkYOH•O4s`seO4k.l.nO›&Vf^ekok\O4VHX]_HVmk\X\Zo•f^`X.•mVfO2Z\O4uvZfnOQkYOQVHX\[jX\^`_HVk\™fVHX\[‰f^œ•fOM^eVv]ZfnO4PO4VHXo[jseO4PO4VHXg

N [ POQk\•mZ\Oœ•fO,] L [œ•fO(s`O N O4POO4k\X*sh•J’ [dVvk•mVfO Œ T Œ g@•mVfOZ N O4idseO(]_EPcu@_EZ\XYO•mVmO,ZYOQumZfnO4k\O4VHX\[dXY^e_EV lmO4k]_HVœ›viE•vZ\[dXY^e_EVmkl$nOQ]seO4Vm] L

[EVHXYO4k OX•mVmO!kou$nOQ]^`›@][dXY^e_EV(lfOQk_Eu$nO4Zo[jXY^e_EVvk

N [[dumufse^H•mO4Zk\•vZ]O4k]_EVœ›ma iH•mZo[jXY^e_EVvk’  O4kZ N

OidseO4k2]_HPu@_EZ\XYOQVHX•mV O4snn OQPO4VHX2]_EVHXYOœXo•fOs´¢ lfOcufsh•mk4g Oses`OQkkY_HVEXZfnOŽEO4ZokY^h˜fseO4k4g@OQV ]Ok\O4Vvk!œ•^esJO4k\X$u@_EkokY^h˜fseO(lOQV„l$nO£[j^hZ\OseO4k!O"©OXokgm]O#œ•f^ u@O4ZoPOX!•mVfO([dVv[jse™œk\O2Vm_EV“l.nOX\O4ZoP^eVf^hk\XYOd’

$ OQklfO4•œ£[E]X\O4•mZok]_EVHXoZY^h˜m•fOQVEX N [”]_EVœBnO4Z\O4Z[E•œ Œ Zo[dPcPc[j^hZYOQk N [”TWVEX\O4Zouv_Ese[dXY^e_EVmklfO Œ Z\[Eu L OQk•vV u@_E•mŽd_d^hZ!lOmumZ\O4kokY^e_EV”k\•&%–ko[dVHX*u@_E•mZ!s'[EVm[jse™fkYOlfOQkMse[EVfiE•mO4k!Vm[jXo•mZ\Oses`OQk’

( « †Q… ¨ Š)¦ ‹

k\™fVHX\[‰fOEgsh[dVfiH[jidOVm[dX\•mZ\Os´gQuvkY™f]

L

(6)

R   „) "'-$  "!# $&%('*)+'-,/.01)32546.+25,74&'-8:9;)+'<.25="8('->50 rwiEZo[du L ^hVHXYO4Zou@_dsh[jXY^e_EV'iHZ\[EPcP[EZ!^ek2[ iEZo[dPcPc[dZM_EZoP[ds`^hkoP ‘M^eX L [EV'_Eu@O4Zo[jX\^`_HVm[js k\O4Pc[EVEX\^e]4k’*r Zo•fseO^eV [ iHZ\[Eu L

^eVHX\O4Zouv_Ese[dXY^e_EV iEZo[dPcPc[dZckouvOQ]^`›vO4k”Vf_dX–_HVfse™7k\™œVHXo[d]X\^e]Z\Osh[jX\^`_HVmk ˜m•fX“[jshkY_ [EV O4s`OQPO4VHX\[EZY™–um[EZ\k\^hVfic_Eu@O4Zo[jXY^e_EV ’ t •fs`OQk[dZ\O,seOf^e]4[jse^?4O4l}^eV{X L O k\O4Vmk\O,X L [jXOQ[d] L Zo•fseOlmO4ko]Z\^e˜@O4k2X L O ]_EP,˜m^eVm[dXY_HZY™umZ\_Eu@O4Z\XY^eO4k2_j.[ seOf^h][ds@^eXYOQP”’ p [EZ\k\^hVfi,[,k\O4VHX\O4Vm]O]_EVmk\^ek\X\k.^hV”Pc[jXo] L ^hVfi,OQ[d] L seOfO4PO^eV„X L O^hVmum•fXMk\X\Z\^eVmi,‘M^eX L [ Zo•fseO^eV“X L O(iEZo[dPcPc[dZ![dVml“[duvufs`™œ^hVficX L ^hk*Zo•fseOX\_ X L O]4•mZoZYOQVEX!um[EZ\k\O(ZYOQumZYOQkYOQVHX\[jX\^`_HVJ’ Œ T Œ ašlmZ\^`ŽEO4Vum[EZ\k\^eVmiM‘$[Ek lfO4k\^eiEVfOQl‘M^`X L ^eVv]Z\O4PO4VHX\[ds`^eXš™2[EVmlA@vOœ^h˜f^es`^eXš™^eVP^hVmlJg‰k\_[dkJX\_*O4P•fsh[jXYO k\_EPOO4[dX\•mZ\O4k_d$X L O L •mPc[dVum[dZokY^hVfi'][Eum[d˜f^ese^`Xš™Eg ^eV um[EZYX\^e]4•fsh[dZ^eVv]Z\O4PO4VHX\[dsŸumZ\_œ]O4kokY^hVfi@gJO4ZoZY_HZ X\_dseO4Zo[dVm]Odgf[dVvl L [dVmlfse^hVfic_j]_EPcufseO”‘._HZ\l„_EZolfO4Zok’ TWV [}]_HPcufs`O4XYO'P_œlmOs$_j2lf^hko]_E•vZ\k\O„•mVmlmO4ZokYXo[dVmlf^hVfi@g^eVm]4ZYOQPO4VHX\[ds.um[EZ\k\^eVmi}‘$_E•fshl ˜@O”k L _‰‘*V XY_ ‘$_EZCB(^hVX\[dVvlfO4P ‘M^`X L kY_HPO!_EZoP _jJ]_HPcuv_HkY^eXY^e_EVc˜@OXš‘$OO4V uv[dZ\XY^h[jsmk\O4Pc[dVHXY^h]MZ\O4umZ\O4k\O4VHX\[dXY^e_EVmk4’ TšX ^hk2XY_”˜@O,X L _E•mi L XX L [jXX L

O]_Es`sh[d˜@_EZo[jX\^`_HV˜vO4Xš‘.O4O4V k\™fVEXo[‰[dVml}kYOQP[EVHXY^h]k*^hV}Vv[jX\•vZ\[ds lf^hk\]_E•mZok\O •vVmlfO4Zok\X\[dVvlf^eVmi(^ek£[j^hZYse™]s`_HkYOEgH[EVmlcX L [jX˜m[E]BHX\Zo[d]BH^eVmi^hV X L O2um[EZ\k\O4Z^hk£Z\Oœ•fOQVHXYse™^hVf^eXY^h[jXYOQl–˜H™[ ]se[Ek L _E•mVvl“˜@OXš‘$OOQV”k\O4Pc[EVEX\^e]2OQ[jX\•vZYOQk’ KML ^hkMZYOQuv_HZYX4g L _‰‘$OŽdOQZgH‘M^es`s Vf_EXM[jXYX\O4PcufXMX\_iE^`ŽEO2O4ŽdOQV X L OZ\_E•fi L O4k\X^hlfO4[(_d&‘ L [jXk\O4Pc[dVHX\^e]*Z\O4uvZYOQkYOQVEXo[jX\^`_HVmkk L _E•fshlcs`_œ_DBse^EBdO*[dVmlc‘M^es`s@X L OQZYO_EZ\OMOm]sh•œa k\^eŽdOse™“_f]•mk_HV}k\™œVHXo[d]X\^e]u L OQVf_EPOQVm[mg k\_EPOX\^ePOQk[dXX L Oc]_HkYX_j!kY^hPcufs`^`™œ^eVmi”X L O u L O4Vm_EPO4Vm[ [dX L [dVml ’ "!F GH>#'-8I=KJL.+%M0N)+0POQ=K). KML O(›@ZokYX*Xš‘$_„k\O4]XY^e_EVmkid^eŽdO[c£[d^eZ\s`™']_HPcufs`O4XYO,uvZYOQkYOQVEXo[jX\^`_HV'_jX L O]_HVm]OQufX\k2[dVmlumZ\_f]OQk\k\O4kM^hVœa ŽE_dseŽdOQl^hV–um[EZ\k\^hVfi(‘M^`X L [ Œ Zo[du L TWVEX\O4Zouv_Ese[dXY^e_EV Œ Zo[dPcPc[dZ4’ KML OkY™fVHX\[E]X\^e]*k\X\Zo•m]X\•mZ\O4kidO4VmO4Zo[jXYOQl ‘ L O4Vum[EZ\k\^hVfi*‘M^eX L [ Œ T Œ [dZ\OlfOQk\]4ZY^h˜@O4l(^hVRHOQ]X\^`_HVTSfgB‘ L ^es`O$X L OiEZo[dPcPc[dZ Z\•ms`OQk [dVvlX L O.um[EZ\k\^eVmi uvZY_f]OQk\k![EZYO(lfOQk\]4ZY^h˜vOQl„^hVURHO4]XY^e_EV'Rm’ KML O„VfOfX(O4‘ kYOQ]XY^e_EVvk[dumums`™X L

O–_EZoP[ds`^hkoP XY_{kYO4s`OQ]XYOQl umZY_H˜fseO4Pck’URHOQ]XY^e_EV € ]_EPcum[EZYOQk(X

L O L [dVmlfse^hVfi _d[ k\^ePcufseOOmumZ\O4kokY^e_EV3sh[dVfiH•m[jiEO}•mk\^eVmi7[ $ _EVHXYOœXV@Z\OO Œ Z\[EPPc[EZ”[EVml [ Œ Zo[du L TWVHX\O4Zouv_Ese[dXY^e_EV Œ Zo[dPcPc[dZ4gm[dVvlWRHOQ]X\^`_HVWX[EVm[jse™Y?O4k!•fX\] L $ Z\_EkokWa5RHOQZY^h[js 2OQuvOQVmlfOQVm]^eO4k4’ V^hVm[jsese™dgfX L O]_EVv]sh•mkY^e_EV”^hVmlf^h][jX\O4kMX\_Euf^h]k$_HZ$£•mZ\X L OQZMZYOQkYOQ[dZo] L [dVvl”Z\Osh[jX\O4l”‘._HZBœk4’ Z [ ”  (-$ -  "'-$ "* F\!# GH%()+'-4D0P4 r u L Zo[dk\O^ek!Pc[dlfO•mu”_j [ L OQ[dl„[EVml–^eX\kM]_EPcufseO4PO4VHXok’TšX!‘M^`sesJ˜vO(Z\O4umZ\O4k\O4VHX\O4l–˜H™–[iEZo[du L ‘M^eX L [cZ\_œ_dXM[dVml”_EVmOsh[d˜@OseseO4l“O4lfiEO£ZY_HP X L OZ\_œ_dX!XY_ OQ[d] L Vf_EVvZY_œ_dX!Vf_flfOE’ KML OZ\_œ_dX.Z\O4uvZYOQkYOQVEXokX L O L O4[El–_d X L O(u L Z\[EkYOEgEX L O_dX L O4Z$Vf_flfO4k!Z\O4umZ\O4k\O4VHX^eX\kM]_EPcufseO4PO4VHXok¢f[dVvl X L O(O4lmidOse[E˜vO4sekMZ\O4uvZYOQkYOQVEX!iHZ\[EPPc[dXY^h][js&£•mVv]XY^e_EVvk’

Références

Documents relatifs

To test whether the vesicular pool of Atat1 promotes the acetyl- ation of -tubulin in MTs, we isolated subcellular fractions from newborn mouse cortices and then assessed

Néanmoins, la dualité des acides (Lewis et Bronsted) est un système dispendieux, dont le recyclage est une opération complexe et par conséquent difficilement applicable à

Cette mutation familiale du gène MME est une substitution d’une base guanine par une base adenine sur le chromosome 3q25.2, ce qui induit un remplacement d’un acide aminé cystéine

En ouvrant cette page avec Netscape composer, vous verrez que le cadre prévu pour accueillir le panoramique a une taille déterminée, choisie par les concepteurs des hyperpaysages

Chaque séance durera deux heures, mais dans la seconde, seule la première heure sera consacrée à l'expérimentation décrite ici ; durant la seconde, les élèves travailleront sur

A time-varying respiratory elastance model is developed with a negative elastic component (E demand ), to describe the driving pressure generated during a patient initiated

The aim of this study was to assess, in three experimental fields representative of the various topoclimatological zones of Luxembourg, the impact of timing of fungicide

Attention to a relation ontology [...] refocuses security discourses to better reflect and appreciate three forms of interconnection that are not sufficiently attended to