• Aucun résultat trouvé

On Politics and Argumentation

N/A
N/A
Protected

Academic year: 2021

Partager "On Politics and Argumentation"

Copied!
44
0
0

Texte intégral

(1)

HAL Id: hal-01666416

https://hal.archives-ouvertes.fr/hal-01666416

Submitted on 18 Dec 2017

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

On Politics and Argumentation

Maria Boritchev

To cite this version:

Maria Boritchev. On Politics and Argumentation. MALOTEC, Mar 2017, Nancy, France.

�hal-01666416�

(2)

on politics and argumentation

Maria Boritchev March 15th, 2017

(3)

about the field

Argument Mining

• Aim: identify argument structures that can be found within the

discourse

◦ Argumentative schemes

◦ Relationships between pairs of arguments

• Introducing automation ∼ 2000 • Applications:

◦ Stock market

◦ Computerized essay grading ◦ Computer-supported peer review

(4)

arg-tech

(5)
(6)

question

How to automatically mine intertextual

correspondences between U.S. candidates

argumentation during primaries debates and

Reddit users’ comments on those?

(7)

outline

Some theory

Some politics

A spoonful of reality

Politics, people and argumentation

What now ?

(8)
(9)

an introductory example

Example (A simple dialogue) Alice : Q.

Bob : Why Q ? Alice : P.

Modelisation (A simple dialogue)

P (P⇒ Q) Q

(10)

about the method

Inference Anchoring Theory

• Aim: link dialogical and argumentative structures • Combination of logic and argumentation theories • Reference: Budzynska and Reed, 2011

• Contribution to the Argument Web

◦ Mediation project ◦ Ethos mining

(11)

a concrete example

Example (Reddit comments)

Redditor1 : Every American should be a capitalist. Redditor2 : Why?

Redditor1 : Our country was built on capitalism.

(12)

iat diagram (loading...)

(13)

iat diagram (loading...)

(14)

iat diagram (loading...)

(15)

iat diagram (loading...)

(16)

iat diagram (loading...)

(17)

iat diagram (loaded !)

(18)
(19)

about the corpora

U.S. presidential debates

• The American Presidency Project

• Web archive: American presidency related documents • Transcripts of all presidential debates from 1960 to 2016 • 2016 elections:

◦ 9 Democratic debates ◦ 12 Republican debates ◦ 3 General debates

(20)

about the corpora

Reddit

• Social media and news aggregation, web content rating and

discussion English-speaking website

• subreddits: news, science, politics, gaming, movies, ... • In /r/politics, October 13 DNC Primary Debate

-During-debate Discussion Megathread

(21)

assembling the reddit comments corpus

1. For each debate, select the corresponding thread. 2. Sort the comments by time-stamp (oldest on top). 3. Remove all comments having no children.

4. Remove all comments trees beginning with irony or wordplay (rethoric structures are not handled by IAT).

5. Keep comments trees classified by excerpts (time-stamp identification), discard all others.

(22)
(23)

model

(24)

iat in practice

(25)

high dialogical activity

(26)

excerpt

Example

CHAFEE : Anderson, you’re looking at a block of granite when it comes to the issues. Whether it’s...[crosstalk]

COOPER : It seems like pretty soft granite. I mean, you’ve been a Republican, you’ve been an independent.

(27)

hda moments extraction

(28)
(29)
(30)

correspondence mining

(31)

stating the intertextual correspondence task

Input

R ={r1, . . . ,ri, . . . ,rn} set of Reddit comments segments.

D ={d1, . . . ,dj, . . . ,dm} set of presidential debates segments.

Output

C = (ci,j), matrix of the correspondence coefficients between riand

dj.

(32)

sketching the algorithm

Natural Language Processing tools :

Speakers similarity

Frequent wordsets similarity

Semantic similarity ci,j=w1·Sp(ri,dj)+ w2 length(ri)· ∑ u∈ri tf-idf(u, dj,D) + w3 length(ri)· length(dj)· ∑ u∈ri,v∈dj Semsim(u, v) 30 / 42

(33)

sketching the algorithm

Natural Language Processing tools :

Speakers similarity

Frequent wordsets similarity

Semantic similarity ci,j=w1·Sp(ri,dj)+ w2 length(ri)· ∑ u∈ri tf-idf(u, dj,D) + w3 length(ri)· length(dj)· ∑ u∈ri,v∈dj Semsim(u, v) 31 / 42

(34)

speakers similarity

Example

ri= “Hillary Clinton knows Bernie Sanders’ gun control record isn’t

his strong suit”

dj= “CLINTON thinks what Senator SANDERS is saying certainly

makes sense in the terms of the inequality that we have”

S(t) ={CLINTON, SANDERS}

(35)

sketching the algorithm

Natural Language Processing tools :

Speakers similarity

Frequent wordsets similarity

Semantic similarity ci,j =w1·Sp(ri,dj)+ w2 length(ri) ·∑ u∈ri tf-idf(u, dj,D) + w3 length(ri)· length(dj)· ∑ u∈ri,v∈dj Semsim(u, v) 33 / 42

(36)

tf-idf

Definition (Term frequency)

Let w be a word of a (non-empty) segment t in a corpus C. Term frequency of w in t is defined as

Tf(w, t) = |{v ∈ t, v = w}|

|{v ∈ t}| .

Definition (Inverse document frequency)

Inverse document frequency of w in C is defined as

Idf(w, C) =      0 if w /∈ t, log |s, s ∈ C| |s ∈ C, w ∈ s| otherwise. 34 / 42

(37)

tf-idf

Definition (Term frequency–inverse document frequency) Let w be a word of a segment t in a corpus C.

tf-idf of w in t in C is defined as

tf-idf(w, t, C) = Tf(w, t)· Idf(w, C).

(38)

sketching the algorithm

Natural Language Processing tools :

Speakers similarity

Frequent wordsets similarity

Semantic similarity ci,j=w1·Sp(ri,dj)+ w2 length(ri)· ∑ u∈ri tf-idf(u, dj,D) + w3 length(ri)· length(dj)· ∑ u∈ri,v∈dj Semsim(u, v) 36 / 42

(39)

wordnet hierarchy (part.)

(40)

semantic similarity

Definition

Semantic similarity of w1and w2is defined as

Semsim(w1,w2) =1

mini,j{length(path(w1#i, w2#j))}

maxv,k{length(path(entity, v#k))}.

(41)

sketching the algorithm

Natural Language Processing tools :

Speakers similarity

Frequent wordsets similarity

Semantic similarity ci,j=w1·Sp(ri,dj)+ w2 length(ri)· ∑ u∈ri tf-idf(u, dj,D) + w3 length(ri)· length(dj)· ∑ u∈ri,v∈dj Semsim(u, v) 39 / 42

(42)
(43)

conclusion and perspectives

• Stating theintertextual correspondence task

• Developing thecorpora

• Sketching theintertextual correspondence algorithm

• Future work

◦ Other corpora? ◦ Other methods?

(44)

Thank you for your attention

Références

Documents relatifs

To test whether the vesicular pool of Atat1 promotes the acetyl- ation of -tubulin in MTs, we isolated subcellular fractions from newborn mouse cortices and then assessed

Néanmoins, la dualité des acides (Lewis et Bronsted) est un système dispendieux, dont le recyclage est une opération complexe et par conséquent difficilement applicable à

Cette mutation familiale du gène MME est une substitution d’une base guanine par une base adenine sur le chromosome 3q25.2, ce qui induit un remplacement d’un acide aminé cystéine

En ouvrant cette page avec Netscape composer, vous verrez que le cadre prévu pour accueillir le panoramique a une taille déterminée, choisie par les concepteurs des hyperpaysages

Chaque séance durera deux heures, mais dans la seconde, seule la première heure sera consacrée à l'expérimentation décrite ici ; durant la seconde, les élèves travailleront sur

A time-varying respiratory elastance model is developed with a negative elastic component (E demand ), to describe the driving pressure generated during a patient initiated

The aim of this study was to assess, in three experimental fields representative of the various topoclimatological zones of Luxembourg, the impact of timing of fungicide

Attention to a relation ontology [...] refocuses security discourses to better reflect and appreciate three forms of interconnection that are not sufficiently attended to