Lexical encoding of multiword expressions in XMG
Texte intégral
Documents relatifs
They used 3 resources: (i) Walenty, (ii) the named entity an- notation layer of the NKJP corpus, and (iii) SEJF, an electronic lexicon of Polish nominal, adjectival and adverbial
Separating the regular from the idiosyncratic: An object-oriented lexical encoding of MWEs using XMG.. Timm Lichte, Yannick Parmentier, Simon Petitjean, Agata Savary,
We have presented a novel LTAG parsing architecture in which parses potentially containing MWEs are given higher priorities so as to be achieved faster than the competing
For example, ‘decomposable [verbal idioms] tend to be syntactically flexible, in a manner predicted by the nature of the semantic decomposition; non-decomposable
Appendix B shows an extract of another example where the MWE hablar paja [CO,CR,PE] (it. ’to talk straw’) has a different meaning in each of the 3 dialects (represented in Appendix
Last but not least, we showed that LRs are relatively rare in Polish: the idiomaticity rate of VMWEs is equal to 0.978, and only 2.6% of all VMWE types exhibit literal readings in
Extract of the encoding of a sample MWE hablar paja (lit. ’to talk straw’) with 3 different meanings according to the dialect..
At the same time, DueLME and Walenty can be char- acterized as fixed encoding formats in the sense that their encoding language (basically the set of property names and