Limitations of the Present Study and Future Research

Dans le document Evaluation of Statistical Machine Translation Engines in the Context of International Organizations (Page 102-107)

Well-Formedness Results

VIII. Final Conclusions

8.4. Limitations of the Present Study and Future Research

The main limitations of this study were clear from the beginning: a small size sample (few participants, short samples for testing) and limited time and funding. On the one hand, it was difficult to find voluntary participants responding to the profile.

While reducing the requirements for participations might have improved the odds of finding willing participants (recruiting students, for example) this would have affected the results negatively: students might have graded the translations in a way teachers grade theirs, not from the point of view of experience as an institutional translator. On the other hand, since participants were volunteers, asking them to grade larger samples would have probably resulted in a high dropout rate. The size of the Association’s corpus is not considered a limitation of the research because it was a characteristic of the case study scenario. However, despite these limitations, the findings of this study let us generate two hypotheses that can be tested in future studies: (1) in terms of translation quality, customizable systems are more appropriate for institutional translation (see sections 7.1.1. Functionality and 8.2.1 Evaluation Results […]); (2) in terms of operability, open source systems are more compatible with institutional settings than commercial systems (see sections 7.2. Operational Evaluation and 8.2.1).

It is worth stressing that, since both candidates were SMT engines, no assumptions can be made of whether these type of engines are more suitable for international organisations than other types of engines, such as RBMT or hybrid systems.

In conclusion, it could be interesting to test; on the one hand, the replicability of the results by carrying out the same study (evaluation of MT systems in institutional settings) with a larger scope (more participants, larger samples, more candidate systems, etc.); and on the other hand, the transferability of the findings by undertaking a similar project in different international organizations, multilingual national institutions, or even multinational companies.

XIX. Bibliography

ALPAC. (1966). Language and Machines. Computers in Translation and Linguistics. Washington, D.


Berger A., Brown P., Della Pietra S., V Della Pietra, Lafferty, J. Printz, H. and Ures L. (1994). The Candide system for machine translation. In Proceedings of the ARPA Conference on Human Language Technology.

Berners-Lee, Tim; Cailliau, Robert (12 November 1990)."World Wide Web: Proposal for a hypertexts Project". Retrieved 15 March 2015 from Cao, D., & Zhao, X. (2008). Translation at the United Nations as Specialized Translation. The Journal

of Specialised Translation, (9), 39–54.

Cancedda, N., Dymetman, M., Foster, G, Goutte, C. (2009). A Statistical Machine Translation Primer in Learning Machine Translation (pp. 2-37). The MIT Press, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142.

Choudhury Rahzeb and McConnel Brian, (reviewers) Van der Meer Jaap and Lockwood Rose, TAUS, Translation Technology Landscape Report, April 2013. Funded by LT-Innovate COMISIÓN EUROPEA. DGT. (2009) Traducir para una comunidad multilingüe. Luxemburgo:

Oficina de Publicaciones Oficiales de las Comunidades Europeas.

COMISIÓN EUROPEA. DGT. (2009) Translation Tools and Workflow. Luxemburgo: Oficina de Publicaciones Oficiales de las Comunidades Europeas.

EAGLES: Evaluation of Natural Language Processing Systems. FINAL REPORT. Version of October

1996. Retrieved from Last accessed May 8, 2015.

Elizalde, C., Pouliquen, B., Mazenc C., and García-Verdugo, J. TAPTA4UN: Collaboration on machine translation between the World Intellectual Property Organization and the United Nations.

In: Languages and Translation. Machine Translation. N. 6, February 2013, pp. 22-23

Estrella, P. S., Popescu-Belis, A., & Underwood, N. (2005). Finding the system that suits you best:

Towards the normalization of MT evaluation. 27th international conference on translating and the computer (ASLIB) (pp. 23-34) Retrieved from

FEMTI - a Framework for the Evaluation of Machine Translation in ISLE [online]. (Last accessed: August 2015) Gerlach, J. (2009). Les Interlangues en TA: l’example de MedSLT. Université de Geneve.

Hutchins, W. J. (2000). Early Years in Machine Translation. Memoirs and biographies of pioneers.

Amsterdam studies in the theory and history of linguistic science. Series 3, Studies in the history of the language sciences, v. 97.

Hutchins, W. and Somers, H. An Introduction to Machine Translation, London: Academic Press, 1991

Hutchins, J. and Somers H.. 1992. An Introduction to Machine Translation. London; San Diego [etc.]:

Academic Press.

Introduction à LexShop – Lexique de Transfert - Tests et Actions. Master Class, Automatic Translation (2011); Master in Translation, Concentration in Translation Technologies.

Faculty of Translation and Interpreting, University of Geneva. Retrieved 10/08/15 (09:52).

ISSCO. MTEval II (FEMTI on-line application) - Quality requirements and metrics for the evaluation of MT: analysis and integration of expertise [online]. (Last accessed: August 2015)

[ISO 01] Iso/Iec, ISO/IEC 0126-1: Software Engineering-Product Quality-Part 1: Quality Mode/

International Organization for Standardization/International Electrotechnical Commission (2001).Retrieved from

Last accessed August 6, 2015 (09:16 )

Johnson, R. (1979). Contemporary Perspectives in Machine Translation by Rod Johnson , Centre for Computational Linguistics ,. In V. Hanon, Suzanne and Hjørnager Pedersen (Ed.), Human translation, machine translation. Papers from 10th annual conference on computational Linguistics,. Odense, Denmark.

Jurafsky, D. and Martin, J. H. (2000). Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. Upper Saddle River N.J.: Prentice Hall.

Koehn, P. (2010). Statistical Machine Translation. United Kingdom, Cambridge: University Press.

Koehn, P., Shen, W., Federico, M., Bertoldi, N., Callison-Burch, C., Cowan, B., … Moran, C. (2006).

Open Source Toolkit for Statistical Machine Translation, (June), 177–180.

L’Homme, Marie-Claude. Initiation à la traductique. Montréal : Lingatech, 2008. 2e édition. 317 p.

: ill.

McPhee, Robert D., Zaug, Pamela. The Communicative Constitution of Organizations A Framework for Explanation. In: The Electronic Journal of Communication [online], Volume 10, Numbers 1 and 2, 2000. Retrieved from (10/07/2015)

Mauser, A., Mauser, A., Hasan, S., Hasan, S., Ney, H., & Ney, H. (2008). Automatic Evaluation Measures for Statistical Machine Translation System Optimization. Proceedings of the Sixth International Language Resources and Evaluation (LREC’08), 1(1), 3089–3092. Retrieved from

MT@EC: European Commission Machine Translation for Public Administrations in the EU Member States. Updated: 26.09.2014. Retrieved April 1, 2015 (17:45) from

MUÑOZ MARTÍN, F. Javier y VALDIVIESO BLANCO, María (2002) «Traductores y especialistas en la Unión Europea. Hacia el binomio integrador», 410-427 en P. Hernúñez y L. González (coords.): Actas del I Congreso Internacional “El español, lengua de traducción”, Comisión Europea y Agencia EFE, Almagro

Nielsen, M. (2009). Introduction to Statistical Machine Translation. (Online Blog). (URL (Last accessed August 05, 2011).

Papineni, K., Roukos, S., Ward, T., & Zhu, W. (2002). BLEU: a method for automatic evaluation of machine translation. … of the 40Th Annual Meeting on …, (July), 311–318.

Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2001). BLEU: a Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting on Association for

Computational Linguistics - ACL ’02 (pp. 311–318).

Popescu-Belis, A., Estrella, P. S., King, M., & Underwood, N. L. (2006). A model for context-based evaluation of language processing systems and its application to machine translation evaluation. Proceedings of the fifth international conference on language resources and evaluation (LREC) (pp. 691-696) Retrieved from

Popescu-Belis, A. (2007). Le rôle des métriques d'évaluation dans le processus de recherche en TAL. TAL (traitement automatique de la langue), vol. 48, n. 1 (pp. 67-91) Retrieved from

Popovi, M., & Ney, H. (2007). Word Error Rates : Decomposition over POS Classes and Applications for Error Analysis. Computational Linguistics, (June), 48–55.

Pouliquen, B., & Mazenc, C. (2011). Coppa, CLIR and TAPTA: three tools to assist in overcoming the patent language barrier at WIPO, 24–30.

Pouliquen, B., Mazenc, C., & Iorio, A. (2011). Tapta: A user-driven translation system for patent documents based on domain-aware Statistical Machine Translation. Proceedings of Th 15th International Conference of the European Association for Machine Translation (EAMT), (May), 5–12.

Pouliquen, B., Mazenc, C., & Iorio, A. (2014). TAPTA : Translation Assistant for Patent Titles and Abstracts A . What is it ?, 1–6.

Pouliquen, B., Elizalde C., Junczys-Dowmunt, M., Mazenc C., & Garcia Verdugo, J. (2013), Large-scale multiple language translation accelerator at the United Nations. [MT Summit 2013], Proceedings of the XIV Machine Translation Summit, Nice September 2-6, pp. 345-352 Quah, Chiew Kin. 2006. Translation and Technology. Palgrave Textbooks in:

Translating and Interpreting. Basingstoke: Palgrave.

Rayner, M., Estella, P., & Bouillon, P. (2012). A Bootstrapped Interlingua-Based SMT Architecture.


Saldanha, G. & O’Brien, S. Research Methodologies in Translation Studies. 1st Ed. New York:

Routledge, 2014. 277 p.

SVOBODA, Tomáš (2013) Moving beyond CAT tools- The MT paradigm Shift from the Translators’

perspective. OPTIMALE, Rennes, June 6. Retrieved (n.d.) from http://www.ressources.univ-

Systems and software engineering -- Systems and software Quality Requirements and Evaluation (SQuaRE) -- Guide to SQuaRE. Retrieved from Last accessed May 3, 2015.

Tucker, A., "Current Strategies in Machine Translation Research and Development," in Nirenburg, S. (ed), Machine Translation: Theoretical and Methodological Issues, Cambridge University Press (1987), Chapter 2. Pp 22-41.

Toshihiro, K. (2008). Usability evaluation based on international standards for software quality evaluation. Nec Technical Journal, 3(2), 27–32. Retrieved from <Go to ISI>://WOS:000256954500004

Thurmair, Gr. Complex Lexical transfer in Metal, in: Proceedings of TMI-90, Austin (1990)

Wagner, E., Bench, S., and Martinez, J. M. (2002) Translating for the European Union Institutions.

Manchester: St. Jerome.

Weaver, W. (1955): Translation. In: Locke and Booth (1955), 15-23.

Wolf, P., Bernardi, U., Federmann, C. and Hunsicker, S. (2011). From Statistical Term Extraction to Hybrid Machine Translation. In Proceedings of the 15th Annual Conference of the European Association for Machine Translation. Leuven, Belgium.


Annex 1-FEMTI Report: Declarative Evaluation

Dans le document Evaluation of Statistical Machine Translation Engines in the Context of International Organizations (Page 102-107)