• Aucun résultat trouvé

COPECO: a Collaborative Post-Editing Corpus in Pedagogical Context

N/A
N/A
Protected

Academic year: 2021

Partager "COPECO: a Collaborative Post-Editing Corpus in Pedagogical Context"

Copied!
1
0
0

Texte intégral

(1)

COPECO: a Collaborative Post-Editing Corpus in Pedagogical Context

Jonathan Mutal1, Pierrette Bouillon1, Perrine Schumacher2, and Johanna Gerlach1 1FTI/TIM, University of Geneva, Switzerland

2CIRTI, Li`ege University, Belgium

{Jonathan.Mutal, Pierrette.Bouillon, Johanna.Gerlach}@unige.ch [email protected]

1 Motivation and objective

Learner corpora are important resources to help translation teachers understand student mistakes and improve their course content1.

COPECO is a joint project between Geneva University and Li`ege University, with three main objectives: 1) to collect post-edits produced by stu-dents and teacher corrections, 2) to build an open-source student post-editing corpus and 3) to help systematise the task of translation error annotation (O’Brien, 2011). It provides translation teachers with an online post-editing platform, designed to help them to annotate student post-editing tasks us-ing a shared or personalised annotation scheme.

2 COPECO platform

The platform2has six main features.

Annotation schemes: importing standardised annotation schemes (for example MQM3) and per-sonalising them to suit the teachers needs i.e. adding, removing or editing error annotation cat-egories; including explanations for students.

Post-editing tasks: defining tasks with list of students, source text, machine translated text, op-tional translation reference and metadata (text ref-erence, MT tool, translation date, etc.); selecting the type of post-editing task (monolingual vs bilin-gual; sentence by sentence vs in block; showing source or target text first); assigning on-line post-editing tasks to students.

Post-editing statistics: recording post-editing

c

2020 The authors. This article is licensed under a Creative Commons 3.0 licence, no derivative works, attribution, CC-BY-ND.

1

https://uclouvain.be/en/research-institutes/ilc/cecl/learner-corpus-bibliography.html

2

Link to the platform https://copeco.unige.ch:9996/

3

http://www.qt21.eu/mqm-definition

activity data such as time, keystrokes and percent-age of MT content edited.

Annotation: annotating errors in the text by ap-pending tags from the selected annotation scheme and optionally adding comments for the stu-dents; comparing post-editing results with refer-ence translations (if available) by highlighting dif-ferences.

Reports: displaying corrected texts with anno-tations, comments and error statistics; addition-ally, in the teacher view, displaying statistics for all post-editing tasks (number of keystrokes, time, annotations and percentage of MT content edited). Collaborative corpus: sharing the anonymised annotated texts, with different access mode de-pending on student authorisation; making the cor-pus available for download in different formats.

3 Demonstration

The demonstration will present the platform and the different functionalities, taking as example a specific learner corpus (Casas, 2020).

References

Casas, S. (2020). Int´egration de la post-´edition dans un cours de traduction et r´evision : ´Etude de cas d’´etudiant·e·s `a la facult´e de traduc-tion et d’interpr´etatraduc-tion. Master’s thesis. ID: unige:131300.

O’Brien, S. (2011). Towards a dynamic quality evaluation model for translation. In Journal of Specialized Translation, 17:1–2.

Références

Documents relatifs

an adm3a. Cursor stops at the left margin of the page. If cnt is given, these commands move that many spaces. Move down one line. Moving off the top of the screen

• If the definition is in the current buffer, it moves point there. Zmacs remembers the other definitions, and tells you about them with a message in the echo

Given a boxed address into a buffer, the buffer may be the subject of any TXDT operation. A buffer is designated as the current one using the functions

The infonnation-processing psychology literature, however, contains two books from which the general approach of this thesis has been derived: Newell and Simon's

To cope this discrepancy, different reinforcement learning methods such as REINFORCE (Ranzato et al. 2016) have been used to directly maximize these sequence-level metrics for

Most recent sentence simplification systems use basic machine translation models to learn lexical and syntactic paraphrases from a man- ually simplified parallel corpus. These meth-

Implementing Machine Translation and Post-Editing to the Translation of Wildlife Documentaries through Voice ‐ over and Off-screen Dubbing.. Carla

An MTPE app has to live with the constraints of mobile devices. 147) states, “serious usability problems are the result of a poor decision about a trade-off