• Aucun résultat trouvé

Literary criticism 2.0 : a digital analysis of the professional and community-driven evaluative talk of literature surrounding the Ingeborg Bachmann Prize

N/A
N/A
Protected

Academic year: 2022

Partager "Literary criticism 2.0 : a digital analysis of the professional and community-driven evaluative talk of literature surrounding the Ingeborg Bachmann Prize"

Copied!
1
0
0

Texte intégral

(1)

Semi-Supervised Learning with BERT: Preliminary Results

CLASSIFIER: Neural Net CLASSIFIER: Nearest Neighbour

Aspect-Category Precision Recall F1-score Support

References 0,50 0,33 0,40 3

Meta 0,85 0,88 0,87 66

Reading 0,67 0,29 0,40 7

Contender 0,86 0,86 0,86 21

Text 0,82 0,84 0,83 32

Jury 0,63 0,79 0,70 24

Onsite Audience 0,88 0,58 0,70 12

Accuracy 0,80 165

Macro avg 0,74 0,65 0,68 165

Weighted avg 0,80 0,80 0,79 165

Polarity Detection Precision Recall F1-score Support

0 0,61 0,85 0,71 13

1 0,71 0,42 0,53 12

Accuracy 0,64 25

Macro avg 0,66 0,63 0,62 25

Weighted avg 0,66 0,64 0,62 25

CLASSIFIER: AdaBoost

Polarity Detection Precision Recall F1-score Support

0 0,93 0,87 0,90 15

1 0,82 0,90 0,86 10

Accuracy 0,88 25

Macro avg 0,87 0,88 0,88 25

Weighted avg 0,88 0,88 0,88 25

Outlook:

Preliminary results:

• ABSA is able to handle the sandwich-type evaluative talk:

“mediocre text by a terrific author”

• BERT + embeddings + polarity triggers + spans Pathways for future research:

• Multimodal sentiment analysis

o parallel corpus of spoken and written discourse o diarisation

• Gender/ethnicity and negotiation of cultural prestige

• Shifts in platforms: newspapers vs. social media as gatekeepers

→ Blurs the border between professional and community-driven literary criticism

DEPARTMENT OF LITERARY STUDIES (IN COLLABORATION WITH LT³ - LANGUAGE AND TRANSLATION TECHNOLOGY TEAM)

Gunther Martens, Lore De Greve and Pranaydeep Singh

LITERARY CRITICISM 2.0: A DIGITAL ANALYSIS OF THE PROFESSIONAL AND

COMMUNITY-DRIVEN EVALUATIVE TALK SURROUNDING THE INGEBORG BACHMANN PRIZE

Annotation method:

Aspect-Based Sentiment Analysis (ABSA)

• Aspects or Feature Expressions (FE): text, jury, contender…

• Named Entities (NE)

• Polarity or Sentiment Expressions (SE) Semi-Supervised Learning:

• Small amount of labeled (annotated) data

• Large amount of unlabeled data

Polarity of the Main Aspect Categories Corpus (2019):

• Tweets and Instagram-posts created during TDDL and using the official

#tddl-hashtag

• Goodreads reviews

• Official description of the jury discussion

Text-Subcategories: Evaluative Criteria

Research goals

1. Predict sentiment with regard to literary texts in three languages (German, English, Dutch)

2. Analysis of implied sentiment: evaluative talk requires domain- specific adaptations

• Named Entities (NE): “Kafka” is good

• Sarcasm and irony

3. Construction of literary value through evaluative diction Advantages and relevance:

• Enables big data analysis

• Possibility of detecting multiple and conflicting sentiments and topics

Interpretation:

The nominated and prize-winning texts are being discussed on both social media platforms, but the focus differs:

• Twitter-discourse is more negative and focuses mostly on the text in general, followed by the content/plot, the language/style and quotes.

• Goodreads-discourse contains predominantly positive evaluations and pays comparatively more attention to the fictional characters and the form of the text.

Contact:

Gunther Martens

Gunther.Martens@UGent.be

Professor of Modern German Literature German Literature

Contact:

Lore De Greve

Lore.DeGreve@UGent.be PhD Candidate

German Literature

Contact:

Pranaydeep Singh

Pranaydeep.Singh@UGent.be PhD Candidate

Language and Translation Technology Team

Références

Documents relatifs

For the Russian language, we find that word-based indexing us- ing our light stemming procedure results in better retrieval effectiveness than does the 4-gram indexing strategy

To perform these experiments, two external resources have been used: an available training corpus containing Nor- wegian reviews from major newspaper sources (NoRec) [23], and a

Next, we used the convolutional N-gram BiL- STM word embedding model by (Nguyen and Le Nguyen, 2018), which has state-of-the-art per- formance on SenTube, to label the data on the

We test three pre-trained statistical models for lemma- tization of Italian based on Universal De- pendencies, and we propose a simple al- ternative to lemmatizing the tweets

The first step involved the VADER lexicon. Since most emoticons and English abbreviations are also common in German social media texts, more than 800 words of this type have been

Previous studies have shown that agreement levels among annotators of sentiment annotation are rather low, since literary texts can be understood and interpreted in a number..

Logistic Regression, RandomForestClassifier, GradientBoostingClassifier as well as h ATF1, None, COSN i and h LOGN, ENPY, PUQN i variants for TF-IDF were chosen according to

The total web page results, based on queries that combine candidate words, seed words and domain keywords, are used in the AcroDef M I3 measure to get the correlation score for each