Anthony Sigogne
6, rue des Berthauds Fixed Tel : 01.60.95.77.17
93110 Rosny Sous Bois Mobile Tel : 06.49.82.69.26
Nationality : French [email protected]
Born september 4, 1986
Training
2009- PhD in computational linguistics, Universit´e Paris-Est Marne-la-Vall´ee (77).
2007-2009 Master Informatique, sp´ecialit´e Recherche en Traitement Automatique des Langues, Universit´e Paris-Est Marne-la-Vall´ee (77).
Mention Bien.
2004-2007 Licence Math´ematiques et Informatique MIAS, sp´ecialit´e Informatique, Univer- sit´e Paris-Est Marne-la-Vall´ee (77).
Mention Assez Bien.
2004 Baccalaur´eat g´en´eral scientifique, sp´ecialit´e Sciences de l’Ing´enieur, Lyc´ee Gus- tave Eiffel de Gagny (93).
Mention Assez Bien
Informatics Skills
Languages of programing
C/C++ (on POSIX system), Java, Python, Ocaml.
Internet technologies
XHTML, CSS, PHP, JavaScript, XML, XSL, XUL, Django.
SGBDs SQL, MySQL, PostgreSQL, Oracle.
Tools of development
GNUMake, Ant, Eclipse, UML, SVN, Open Office, Microsoft Office, Latex
Systems GNU/Linux (Debian, Ubuntu, Arch) UNIX, Microsoft Windows.
Papers/Conferences
2011 5-8 October
Sigogne, Anthony, Constant, Matthieu and Laporte, ´Eric. ”Int´egration des donn´ees d’un lexique syntaxique dans un analyseur syntaxique probabi- liste.” 30th International Conference on Lexis and Grammar (LGC’11). Ni- cosie, Chypre. Note : To appear.
2011 6 October
Sigogne, Anthony, Constant, Matthieu and Laporte, ´Eric. ”French parsing enhanced with a word clustering method based on a syntactic lexicon”.
Second Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2011). Dublin, Ireland. Note : To appear.
2011
12-14 September
Sigogne, Anthony, Constant, Matthieu and Laporte, ´Eric. ”Integration of Data from a Syntactic Lexicon into a Generative and a Discriminative Pro- babilistic Parsers”. International conference on Recent Advances in Natural Language Processing (RANLP’11). Hissarya, Bulgaria.
2011 17-01 June
Constant, Matthieu, Tellier, Isabelle, Duchier, Denys, Dupont, Yoann, Si- gogne, Anthony and Billot, Sylvie. ”Int´egrer des connaissances linguistiques dans un CRF : application `a l’apprentissage d’un segmenteur-´etiqueteur du fran¸cais”. Conf´erence sur le traitement automatique des langues naturelles (TALN’11). Montpellier, France.
2011 19-24 June
Constant, Matthieu and Sigogne, Anthony. ”MWU-aware Part-of-Speech Tagging with a CRF model and lexical resources”. ACL Workshop on Multiword Expressions : from Parsing and Generation to the Real World (MWE’11). Portland, USA.
2010
13–15 October
Sigogne, Anthony. ”HybridTagger : un ´etiqueteur hybride pour le Fran¸cais”.
8`eme MAnifestation des JEunes Chercheurs en Sciences et Technologies de l’Information et de la Communication (MajecSTIC”10). Bordeaux, France.
Note : ”Best paper” of the conference.
2009
11–14 October
Sigogne, Anthony and Constant, Matthieu. ”Real-time unsupervised clas- sification of web documents”. Proceedings of the international conference Computational Linguistics and Applications CLA09. Mragowo, Poland. pp.
281–286
Theses
2009 Sigogne, Anthony. ”De l’´etiquetage morpho-syntaxique au super-chunking : Lev´ee d’ambigu¨ıt´es `a l’aide de m´ethodes hybrides et de ressources lexicales riches”. M´emoire de stage Master2 Recherche. Universit´e Paris-Est Marne- la-Vall´ee.
2008 Sigogne, Anthony. ”Classification incr´ementale et regroupement de docu- ments Web.”. M´emoire de stage Master1 Recherche. Universit´e Paris-Est Marne-la-Vall´ee.
Professional Experience
2009– PhD in computational linguistics
Universit´e Paris-Est Marne-la-Vall´ee (77)
Massive integration of linguisic resources in probabilistic parsing. Combination of a symbolic approach, based on grammars and lexicons manually created, and a statistical approach which trains grammar from arbored corpus and assigns probabilities on rules. The idea is to use specificities of each approach to obtain a parser with large coverage and pruning ambiguities. Linguistic resources used are lexicons of nouns and verbs available at LIGM. Furthermore, we will use lexicons of frozen expressions very present in the texts to encode attachments between syntactic constituents.
2009 April–
September
Research internship of Master2 Informatique Universit´e Paris-Est Marne-la-Vall´ee (77)
Realization of miscellaneous experiences on removing ambiguities into the morpho-syntactic analysis. Use of hybrid method combining both a symbolic approach based on manually created grammars (system ELAG) and statisti- cal approach based on hidden Markov models. We also take into account of compound words in the tagging process. Reuse of these into the partial parsing (at chunks level) in order to evaluate if chunker performances are best than a classic chunking.
2008–2009 November– April
Open-Ending contract (part time) Xeres Issy-les-Moulineaux (92)
Creation of a Thunderbird plugin into JavaScript/XUL in order to filter no relevant input emails thanks to parameterized queries. Managing a database emails.
2009 April–
September
Research internship of Master1 Informatique
Xeres Issy-les-Moulineaux (92) / Universit´e Paris-Est Marne-la-Vall´ee (77) Creation of a web monitoring tool in order to classify automatically web docu- ments, from news, in accordance with the topic. Use of linguistic and statistic knowledges to improve performances of the system for an intensive usage by the firm. Implementation of an algorithm which makes a ”single-pass” classification of a web document. Furthermore, the tool is highly parametrizable depending on granularity of classes.
Languages
English Technical
Spanish Basic knowledge
Leisures
Activities Movies, Concerts Sports Cycling, Bodybuilding