Reconciling opposite neighborhood frequency effects in lexical decision: Evidence from a novel probabilistic model of visual word recognition

(1)

HAL Id: hal-02004264

https://hal.archives-ouvertes.fr/hal-02004264

Submitted on 1 Feb 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Reconciling opposite neighborhood frequency effects in lexical decision: Evidence from a novel probabilistic

model of visual word recognition

Thierry Phénix, Sylviane Valdois, Julien Diard

To cite this version:

Thierry Phénix, Sylviane Valdois, Julien Diard. Reconciling opposite neighborhood frequency effects

in lexical decision: Evidence from a novel probabilistic model of visual word recognition. Conference

of the Society of Scientific Studies of Reading (SSSR), Jul 2018, Brighton, United Kingdom. �hal-

02004264�

(2)

A

The general structure of the BRAID model relies on three levels of processing as featured in most previous word recogni:on models.

1. Thele#er sensory submodelimplements low-level visual mechanisms involved in le>er iden:ﬁca:on and le>er posi:on coding. Feature extrac:on is parallel over the input string, an acuity gradient is implemented symmetrically around ﬁxa:on and loca:on is distributed over adjacent le>er posi:ons, implemen:ng lateral interference between le>ers.

2. Thele#er perceptual submodelimplements how informa:on extracted from the sensory input accumulates gradually over :me to create a percept, i.e. an internal representa:on of the input le>er string.

3. Thelexical knowledge submodelimplements knowledge about the 40.481 English words of the Bri:sh Lexicon Project. The probability to recognize a word is modulated by its frequency. This level is enriched by a module, called thelexical membership submodel, that implements a mechanism to decide whether or not the input le>er-string is a known word. This submodel is cri:cal to simulate lexical decision.

One major originality of BRAID is to assume the existence of a fourth level:

4. Thevisual a#en8onal submodelimplements an a>en:onal ﬁltering mechanism between the le>er sensory submodel and the le>er perceptual submodel. Transfer of informa:on between these two submodels is modulated by the amount of a>en:on allocated to each le>er posi:on.

References

Ginestet, E., Phenix, T., Diard, J., & Valdois, S. (submi>ed). Modelling length eﬀect for words in lexical decision: the role of visual a>en:on.

Phénix, T., Valdois, S., & Diard, J. (submi>ed). Bayesian word recogni:on with a>en:on, interference and dynamics.

Phénix, T. (2018). Modélisa)on bayésienne algorithmique de la reconnaissance visuelle de mots et de l’a9en)on visuelle.Universite ́ Grenoble Alpes.

h>ps://tel.archives-ouvertes.fr/tel-01735606

Reconciling opposite neighborhood frequency effects in lexical decision:

Evidence from a novel probabilistic model of visual word recognition

Thierry Phénix, Sylviane Valdois and Julien Diard

Laboratoire de Psychologie et NeuroCognition

Univ. Grenoble Alpes, CNRS, LPNC UMR 5105, F-38000 Grenoble, France Contact: [email protected]

Neighborhood Frequency Effects in Lexical Decision Behavioral Task Simula:ons:

Probabilis:c Ques:ons

BRAID Bayesian Word Recogni:on with A>en:on Interference and Dynamics

Lexical Knowledge Submodel

Acuity Interference Final

Sensory Letter Submodel

C L O N E

Visual A#en8on Submodel

clone close alone

0 200 400 600 800 1000

0.0 0.2 0.4 0.6 0.8 1.0

Iteration

Recognitionprobability

Perceptual Letter Submodel 100 iterations 250 iterations

350 itera8ons 500 itera8ons

Q⁰PT

n=P(Pn^T |s^1:T1:N[L1:T

1:N= 1] [ P1:T

1:N= 1]µ^1:TA 1:T A g^1:T)

• Letter identification with lexical knowledge

QWT

=P(W^T|s^1:T1:N[ L1:T

1:N= 1] [ P1:T

1:N= 1]µ^1:TA 1:T A g^1:T)

• Word iden8ﬁca8on

QDT

=P(D^T|s^1:T1:N[ D1:T

1:N= 1] [ P1:T

1:N= 1]µ^1:TA 1:T A g^1:T)

• Lexical decision

Context

In the lexical decision task, participants have to indicate whether a printed stimulus is a known word (YES response) or not. Among the classical effects related to lexical decision, a poorly understood and highly controversial effect is the neighborhood frequency effect.

Latency on YES responses in lexical decision is modulated by the existence of higher frequency orthographic neighbors, namely the existence of at least one word of higher frequency that differs from the target word by a single letter (e.g., LIKE is a higher frequency neighbor of the target word BIKE).

Contradictory behavioral experiments Perea & Pollatsek (1998) Siakaluk, Sears, & Lupker (2002) Material and procedure:

The two experiments manipulate neighborhood frequency of English words, considering words with one (1HF) or zero (OHF) higher frequency neighbor. Experiment A used small density N and manipulated frequency whereas experiment B manipulated N for low frequency words. Ninety-two lowercase 5-to-6 letter words are used in Experiment A, 60 uppercase 4-to-5 letter words in Experiment B.

Results:

A significant inhibitory neighborhood frequency effect is reported in Experiment A but only for the low frequency words. In contrast, Experiment B reports a significant facilitatory neighborhood frequency effect for low-to- medium frequency words.

Lexical Membership

Submodel Word Is word?

Behavioral experiment:A standard finding is that high frequency words are processed faster than low frequency words and that identification time is proportional to log- frequency.

Simulation:To simulate this effect, we reproduce the simulation design first proposed by Norris (2006). The frequency of occurrence of each word rotates in a round- robin manner over all experimental conditions, so that only frequency affects identification time. The BRAID model reproduces the log-frequency effect.

Frequencies 1 2 5 10 20 50 100 200 500 1000 2000 5000 10 000

0 200 400 600 800 1000

0.0 0.2 0.4 0.6 0.8 1.0

Iteration

Recognitionprobability

1 10 100 1000 10 000

0 50 100 150 200 250 300

Log frequency

Iteration

Behavioral experiment:Johnston (1978, experiment 2) showed that le>ers are recognized signiﬁcantly more oken when they are presented in a word context rather than in isola:on and when they are presented in isola:on rather than in a non-word context.

Simula8on:BRAID reproduces behavioral results (e.g., aker 62 itera:ons). The mean :me course of le>er iden:fica:on shows that the word effect over isolated le>er and non-word is a strong effect and that the advantage of isolated le>ers over a non-word context, significant from itera:on 43, is durable.

Data BRAID

Mot Non-mot Lettre isolée

0.0 0.2 0.4 0.6 0.8 1.0

Contexte

Probabilitéd'identificationdelettre

Context Word Nonword Isolated letter

0 50 100 150 200 250

0.0 0.2 0.4 0.6 0.8 1.0

iteration

Letteridentificationprobability

Behavioral experiment:Andrews (1989, experiment 1 and 3) showed that high density neighborhood slows down le>er iden:fica:on significantly only for low frequency s:muli and has no effect on high frequency s:muli.

Simula8on:BRAID reproduces behavioral results with a threshold of YES response at 0.77. More interes:ngly, it shows that having a high density neighborhood is an advantage at the beginning of the iden:ﬁca:on process but becomes a drawback when the decision is based on a high level of precision.

Data Braid

LF/SD LF/LD HF/SD HF/LD

200 300 400 500 600 700 800

Freq./N

Latency(ms)/iteration

Freq./N LF/SD LF/LD HF/SD HF/LD

0 100 200 300 400

0.6 0.7 0.8 0.9 1.0

Iteration

Identificationprobability

Uniform Standard Reduced

C L O N E

0.0 0.2 0.4 0.6 0.8

Probability

C L O N E

0.0 0.2 0.4 0.6 0.8

Probability

C L O N E

0.0 0.2 0.4 0.6 0.8

Probability

Simula8on

Material and procedure:

The s:muli used in the simula:ons are the same as in the behavioral experiments. We ﬁrst simulate lexical decision in BRAID for 1,000 itera:ons. Words that did not reach 0.97 iden:ﬁca:on probability aker 1,000 itera:ons were removed, resul:ng in an error rate of 1%

and 2.5% for the s:muli used in Simula:on A and B, respec:vely.

Results:

The simulated data successfully capture both inhibitory and facilitatory eﬀects of human data, with the same set of parameters. The dynamic curves reveal an inversion of the neighborhood frequency eﬀect with :me. The existence of a higher frequency neighbor is facilitatory at the beginning of processing but turns inhibitory over :me. In simula:on A, the pa>ern inversion occurs earlier for the low frequency words (itera:on 188; threshold value=0.68) than for the medium frequency words (itera:on 213; threshold value=0.81), and, in simula:on B, earlier for the words that have a small vs. large neighborhood density (itera:on 217 vs. 239;

threshold value = 0.81 vs. 0.85).

Data Braid

LF/0HF LF/1HF MF/0HF MF/1HF SD/0HF SD/1HF LD/0HF LD/1HF 400

450 500 550 600 650 700

Condition

Latency(ms)/Timesteps

A: Perea and Pollatsek (1998)Condition B: Siakaluk et al. (2002)

Condition SD/0HF SD/1HF LD/0HF LD/1HF

0 100 200 300 400

0.6 0.7 0.8 0.9 1.0

Time steps

ProbabilityofYESresponse

Condition LF/0HF LF/1HF MF/0HF MF/1HF

0 100 200 300 400 500

0.6 0.7 0.8 0.9 1.0

Time steps

ProbabilityofYESresponse

A

B