Lexical encoding of multiword expressions in XMG
Texte intégral
Documents relatifs
At the same time, DueLME and Walenty can be char- acterized as fixed encoding formats in the sense that their encoding language (basically the set of property names and
Last but not least, we showed that LRs are relatively rare in Polish: the idiomaticity rate of VMWEs is equal to 0.978, and only 2.6% of all VMWE types exhibit literal readings in
Extract of the encoding of a sample MWE hablar paja (lit. ’to talk straw’) with 3 different meanings according to the dialect..
For example, ‘decomposable [verbal idioms] tend to be syntactically flexible, in a manner predicted by the nature of the semantic decomposition; non-decomposable
Appendix B shows an extract of another example where the MWE hablar paja [CO,CR,PE] (it. ’to talk straw’) has a different meaning in each of the 3 dialects (represented in Appendix
They used 3 resources: (i) Walenty, (ii) the named entity an- notation layer of the NKJP corpus, and (iii) SEJF, an electronic lexicon of Polish nominal, adjectival and adverbial
Separating the regular from the idiosyncratic: An object-oriented lexical encoding of MWEs using XMG.. Timm Lichte, Yannick Parmentier, Simon Petitjean, Agata Savary,
We have presented a novel LTAG parsing architecture in which parses potentially containing MWEs are given higher priorities so as to be achieved faster than the competing