• Aucun résultat trouvé

Average path length

N/A
N/A
Protected

Academic year: 2021

Partager "Average path length"

Copied!
33
0
0

Texte intégral

(1)

A unified computer model for internal and external constraints

in language evolution

5th International Conference on the Evolution of Language Leipzig, 1-4-2004

Christophe Coupé

Laboratoire Dynamique du Langage, CNRS - Université Lyon 2, France

(2)

Overview

Main questions:

What is language change?

How do internal and social constraints interact in language change?

How can we investigate the rate of language change or the linguistic diversity in a community of speakers?

(3)

What the model does, and what it doesn’t

• The model investigates the evolution of

language with general mechanisms (back to basics)…

• It pays equal attention to internal and social factors, and allows various degrees of

complexity for both …

• But is not a model of emergence (although the

framework may capture several existing models

of emergence)

(4)

Outline

• Underlying theoretical framework

• Description of the model

• First experiments

• Perspectives and work in progress

(5)

Outline

• Underlying theoretical framework

(6)

Language as a complex dynamical system

• The underlying paradigm:

• « A complex dynamical system… consists of a set of interacting elements where the behavior of the total is an indirect, non-

hierarchical consequence of the behavior of the different parts » [Steels, 1997]

• Applied to various levels:

• Words & lexicon [Steels, 1996],

• Phonemes & phonological system [Lindblom, 1984]

• Idiolects & language of a community

• Language is made of structures:

• A structure is a collection of interactions between elements building a set as a coherent whole and giving it its specific aspect

• Phonological system (significant oppositions), grammar, lexicon

(7)

Space of possible states

• The abstract space defined by all possible states of language

• a muti-dimensional space, with continuous (eg. formantic values) or discrete dimensions (eg. word-order)

• Divided in domains of « structural stability »:

• domains defined by a linguistic structure

• all states in a domain correspond to the same structure

• eg. allophonic variations for a phoneme

(8)

Language change

• Variation: evolution of the system inside a domain of structural stability

• Language change

• A structural change

• From one domain to another

• Language evolves in the space of possible states according to various constraints

• Internal versus external forces (eg. [Labov, 2001])

(9)

Internal constraints

• Cognitive constraints

• Minimize the cognitive load during encoding &

decoding (eg. working memory, access to data)

• Constraints in perception & production

• Ease of the production vs. ease of the perception (eg. hypo-/hyper-correction [Ohala, 1993])

• Efficient transmission of information

(10)

External constraints (1)

• Sociolinguistics:

• Language as a tool to assert one’s identity [Labov, 1972]

• Individuals interact in complex social networks

• role of social ties in actuation and implementation of changes

• Contacts between languages

• Contacts between dialects or

registers from [Milroy, 1992]

Æ Impact language change & evolution in the communities

(11)

External constraints (2)

• From social ties to linguistic evolution

• « from Bloomfield's principle of density, which argues that each communicative act is accompanied by a slight degree of

convergence of linguistic systems of speaker and interlocutor. » [Labov, 2002]

• « Anspach in The Why of Fashion (1967) argues that the

initiating spark is the need of people to be like others and yet to be distinct from others » [Labov, 2001]

Æ Positive links: convergence of linguistic systems

Æ Negative links: (relative) divergence of linguistic systems

(12)

From a theoretical framework to a computational model

Task:

Derive a computational model from the preceding abstract descriptions

Why?

1. Possibility to investigate the dynamics of language change 2. Possibility to measure various features of language change

given the social structure or the internal constraints 3. Few models integrate both internal & external constraints

How?

(13)

Outline

• Underlying theoretical framework

• Description of the model

(14)

Parameters of the model

• N agents; each agent = one linguistic system

• (x1, x2, …, xm): linguistic variables of the system

• A social network to tie the agents

• Links between -1.0 & 1.0 (0.0: no interaction)

• A function describing the natural constraints on each agent (for all variables)

• Assumption: same internal constraints for all

agents

(15)

A model based on

probabilistic distributions

• Constraints modelled by Gaussian probabilistic distributions:

A non-deterministic behavior…

which follows constraints on average

A distribution for each linguistic variable

• Description of the Gaussian:

• Mean = most likely variation

• Variance

• V = 0 Æ deterministic behavior

• V = +∞ Æ fully random behavior

(16)

Modelling internal constraints

A fitness landscape for two linguistic variables ([0,1]x[0,1])

• Notion of fitness landscape:

• from domains of structural stability to basins of attractions

• variation = evolution inside a basin of attraction

• change = jump from one attractor to another

Most likely variation

slope of the landscape for each variable

• local topography

ÅÆ local view of the landscape ÅÆ unplanned change

No need to know the whole landscape to compute local evolutions

(17)

Modelling external constraints

• For each agent:

• Compute the sum of attractions and repulsions with other agents

Æ local social norm

Æ most likely variation ∝ (local social norm – current position)

Current position

Local social norm for the agent

Most likely variation for the variable X Y

X

(18)

Integrating constraints

−200 −15 −10 −5 0 5 10 15 20

0.05 0.1 0.15 0.2 0.25

2nde distribution

Distribution finale 1ère distribution

−200 −15 −10 −5 0 5 10 15 20

0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2

1ère distribution

2nde distribution Final distribution

Mixture of Gaussians

• Constraints can either go in the same or opposite direction(s)

• social constraints can run counter to internal constraints

• link with linguistic diversity

(19)

Variables to be measured

• Rate of change:

• Number of transitions between attractors per time period (flat landscape: length of the trajectory)

• Evolution of the idiolects / evolution of the barycentre of the idiolects

• Diversity for N agents:

• Dispersion in various basins of attraction (flat

landscape: spatial entropy or, more easily, average distance to the barycentre of the agents)

(20)

Outline

• Underlying theoretical framework

• Description of the model

• First experiments

(21)

Testing a simple fitness landscape

V = 0 Æ stable state

V = +∞ Æ random walk

0 < V < +∞ Æ periods of stability in basins of attraction + changes

Æ Punctuated equilibriums

One agent

(22)

Simple social networks (1)

• Influence of the strength of the link

Average distance to the barycentre as a function of the strength of the links

0 0,1 0,2 0,3 0,4 0,5 0,6 0,7

-40 -30 -20 -10 0 10 20 30 40

Strength of the links Average distance to the barycentre

Social variance: 0.001 Social variance: 0.00001

Average path length as a function of the strength of links

0 0,002 0,004 0,006 0,008 0,01 0,012

-40 -20 0 20 40

Strength of links

Average path length Barycentre, social variance:

0.001

Barycentre, social variance:

0.00001

Agents, social variance: 0.001

Agents, social variance:

0.00001

(23)

Simple social networks (2)

Weak or strong variance of the Gaussian distributions

Average distance to the barycentre as a function of social variance

0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8

-8 -6 -4 -2 0 2

log(social variance)

Average distance to the barycentre

Links: 0.0 Links: 1.0 Links: 10.0 Links: -1.0 Links: -10.0

Average path length as a function of social variance

0 0,005 0,01 0,015 0,02 0,025 0,03 0,035 0,04

-8 -7 -6 -5 -4 -3 -2 -1 0 1 2

log(social variance)

Average path length Agents, links: 0.0

Agents, links: 1.0 Agents, links: 10.0 Agents, links: -1.0 Agents, links: -10.0 Barycentre, links: -1.0 Barycentre, links: 1.0 Barycentre, links: 10.0 Barycentre, links: -10.0 Barycentre, links: -1.0

(24)

Simple social networks (3)

• Influence of the number of agents

Average distance to the barycentre as a function of the number of agents

0 0,1 0,2 0,3 0,4 0,5 0,6

0 10 20 30 40 50 60 70

Number of agents

Average distance to the barycentre Links: -10,0

Links: -1,0 Links: 0,0 Links: 1,0 Links: 10,0

Similarity with Nettle’s results (Nettle, 1999)

Average path length as a function of the number of agents

0 0,002 0,004 0,006 0,008 0,01 0,012

0 10 20 30 40 50 60 70

Number of agents

Average path length Barycentre, links: -10,0

Barycentre, links: -1,0 Barycentre, links: 0,0 Barycentre, links: 1,0 Barycentre, links: 10,0 Agents, links: -10,0 Agents, links: -1,0 Agents, links: 0,0 Agents, links: 1,0 Agents, links: 10,0

(25)

Integrating external &

internal constraints

Same results as with separated constraints:

• punctuated equilibriums

• influence of the social structure

Time = 5,000 Time = 7,000 Time = 8,000

(26)

Outline

• Underlying theoretical framework

• Description of the model

• First experiments

• Perspectives and work in progress

(27)

Toward realistic networks

• Increasing number of studies on specific networks

smallworld networks

scale-free networks

Æ Impact on rate of change & diffusion of innovation?

Æ Solving the threshold problem?

Æ Evolution of languages during prehistory

• Take new parameters into account:

clustering coefficient

Average path length

See Ke & Gong’s talk « Language change and social networks »

negative links and diversity

(28)

A ‘real’ exemple of internal

constraints: phonological systems

• PSs as complex dynamical systems:

• Interactions between vowels [Lindblom, 1984] [de Boer, 2001]

• Extracting knowledge from a database

• The UPSID database : 451 PSs as a

representative sample of world’s langagues (Maddieson & Precoda 1989)

• Statistical approach and GA to extract the relevant fitness landscape from the data

(29)

Conclusions

A model integrating internal and external constraints in a single framework

• quite technical (takes time to describe)

• but captures a large number of situations (including previous models as naming games etc.)

Key-points:

• Investigate the impact of various social structures

• Deal with highly-dimensional fitness landscapes

• Possible additional features to the model: other descriptions of the linguistic systems

Perspectives:

• Integrate more data from real cases in the model (ex. UPSID)

• Investigate the prehistory of languages by studying the impact of our predecessors’ social structures

(30)

Thank you for your attention

(31)

Acknowledgements

• Members of the « MEL » team (CNRS OHLL program)

• Members of the Language Engineering

Laboratory, City Univ. of HK

(32)

Limitations

• No interactions between social and natural constraints

• only competition for the most likely variation

• Specific influence of social ties on linguistic structures

• Not a central phenomenon

• Independent computations for the linguistic variables Nya-rri ngithun thabu waangkur riwur.

1exc-du.HAR my elder.brother go:FUT east:FUT

« My elder brother and I will go east »

Lardil (Queensland, Australie) (Evans, 2003)

(33)

Prehistoric linguistic diversity and rate of evolution

With time, increasing contacts between

groups, slow increase of population density

• Human density and social networks

(Jacquesson, 2000)

• An oversimplified scenario for the

evolution of languages

Références

Documents relatifs

Studying the impact of internal and external forces minimization in a motion-based external forces and moments prediction method: application to

To this aim, we give asymptotics for the number σ (n) of collisions which occur in the n-coalescent until the end of the chosen external branch, and for the block counting

Abstract: This study aimed to compare the training load of a professional under-19 soccer team (U-19) to that of an elite adult team (EAT), from the same club, during the

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

Pour tous les patients de notre étude, le délai de deux heures ± 15 minutes pour la précision des dosages san- guins réalisés deux heures après la prise, de même que le maintien

The colours correspond to different approaches for calculating the potential : a classical polyhedral approach considering a uniform den- sity and a Mascon 8 gravity model assuming

Our aim is to figure out special features of the German language that preschool teachers can use as an opportunity to foster children’s linguistic as well

To determine the ultimate strength of a CMC, one needs to determine the average pull−out length as a function of the load level in the fiber. This Note is devoted to establishing