• Aucun résultat trouvé

Learning Anticipatory Motor Control

N/A
N/A
Protected

Academic year: 2022

Partager "Learning Anticipatory Motor Control"

Copied!
21
0
0

Texte intégral

(1)

Learning Anticipatory Motor Control

D. Bailly P. Andry P. Gaussier A Rengerve

ETIS UMR CNRS 8051

ENSEA - Univertité Cergy-Pontoise [email protected]

GDR Robotique et Neuroscience septembre 2012

(2)

ANR joint project INTERACT:

Understanding the link between low-level physical compliance and higher level behaviors : HR Cooperation

Development of a NN Model for motor Control : Cerebellum

Pluri-disciplinary : Understanding the process of decision making from motor theory :

“Decision making is based on the anticipation of the consequence of motor actions”

- disclaimer : this work is not mature (at all) -

Context

LISV-BIA

UVSQ Hydraulic Devices

ETIS

UCP ENSEA

Neural Network Models

URECA

LiLLE III Psychology

(3)

Context

(4)

Issues

Decision making is based on the anticipation of the consequence of motor actions”

= circuits, action processing, may be different according to the intention, the goal, the expected reward (i.e : the context)

Motor theory of social cognition : MNs, [Rizzolatti, Gallese, Decety, Wolpert...]

hyp: if processings are different, then motor trajectories may be different.

Can we read the “social motivation” from motor trajectories ?

(5)

Issues

[Jacob & Jeannerod 2005] :

NO

motor trajectory is the trace of motor intention.

we can’t read the agent’s social motivation (DR Jeckyl and Mr Hyde imaginary experiment).

Decision making is based on the anticipation of the consequence of motor actions”

= circuits, action processing, may be different according to the intention, the goal, the expected reward (i.e : the context)

Motor theory of social cognition : MNs, [Rizzolatti, Gallese, Decety, Wolpert...]

if processing are different, then motor trajectories may be different.

Can we read the “social motivation” from motor trajectories ?

(6)

Issues

BUT... [Becchio et al. 2008, Becchio et al. 2010, Ferri 2011] :

The “reach-to-grasp” has no direct social outcome, but is also affected (similar effect as the “place” action)

trajectory lenght trajectory height wrist max amplitude time to peak velocity

(7)

Our NN model for motor control : cortical areas, hippocampus, cerebellum, striatum

On going experiment (simulation) : can our model account for Becchio’s results ?

Overview (on going work)

(8)

NN model for motor control

References :

seminal and functional : VITE model [Bullock] : “via points”

developmental : [Gaussier, Andry, Quoy, Giovannangelli, Oudeyer] (imitation, navigation, intrinsic motivations)

Neuro-anatomy : [Grossberg, Shadmer&Krakauer, Doya, Guenter, Arleo]

Robotics : HRI [Billard, Fukuyori]

(9)

NN model for motor control

Sensori-motor categories

associative learning between different modalities (prop-vision)

building visuo-motor “attractors” to reach parts of the workspace [Fukuyori SAB08]

motor babling : learning is self-supervised - ART like NN - [Grossberg] - vigilance parameter

emulates a control of the muscle : stretch and force = position control by activation signal

(10)

NN model for motor control

Sensori-motor categories

Reaching a visual target

not accurate, but functional

basis for immediate imitation [Rengerve, Andry]

(11)

NN model for motor control

Transitions

Learning a sequence of attractors

hippocampus [Grossberg - Banquet - Gaussier]

Timing and step by step prediction of the future attractor

building elementary trajectories

reaching a visual target

not accurate

(12)

granular cells :

distal links : propriocetion (O) - US -

proximal links (mossy fibers) :

output : proprioception at t+1

NN model for motor control

Cerebellum

many projections [Doya : a simulator and internal models]

rebuilds signals from a modality to another

run faster (20 times than cortico-hippocampus loop)

smooth trajectories

predicts at t+1 the proprioceptive ( O) information

reaching a visual target

not accurate

(13)

Cerebellum

many projections [Doya : simulator + internal models]

rebuilds signals from a modality to another

run faster (20 times than cortico-hippocampus loop)

smooth trajectories

predicts at t+1 the proprioceptive ( O) information

reaching a visual target

not accurate

granular cells :

distal links : propriocetion (O)

proximal links (mossy fibers) :

output : proprioception at t+1

NN model for motor control

(14)

Our NN model for motor control : cortical areas, hippocampus, cerebellum, striatum

On going experiment (simulation) : toward becchio experiments -> global modulation

Overview (on going work)

(15)

How do we go higher (or lower ?)

the change in the kinematics is global

linked to an initial recognition of the task’s context

changing speed or neural time : do not affect the trajectory

Hyp :

the confidence in the task modulates the vigilance level = recognition level of the attractors

Becchio’s exp.

(16)

Becchio’s exp.

How do we go higher (or lower ?)

the change in the kinematics is global

linked to an initial recognition of the task’s context

changing speed or neural time : do not affect the trajectory

Hyp :

the confidence in the task modulates the vigilance level = recognition level of the attractors

(17)

How do we go higher (or lower ?)

the change in the kinematics is global

linked to an initial recognition of the task’s context

changing speed or neural time : do not affect the trajectory

Hypothesis :

the confidence in the task modulates the vigilance level = recognition level of the attractors

low vigilance induces early recognition of the visuo-motor state

early recognition induces a lower trajectory

Becchio’s exp.

(18)

Becchio’s exp.

Striatum

evaluates the situation, estimate the reward, and change the vigilance accordingly

preliminary results (simulation)

reaching a visual target

not accurate

(19)

Becchio’s exp.

With a developmental plausible explanation :

during the first learning trials : the system follows and learns accurately the example

vigilance is high, the “basin of recognition” of the visuo-motor state is small, the trajectory is accurate

during the life of the agent :

if trials with a lower vigilance earns reward (big objects, no obstacles, etc), then trajectory with a lower path should be learned : less energy needed to obtain the same reward

consistent with the experiment :

individual condition :

only object, easy, standard moves

= low vigilance, low trajectory

social condition

adds a social constrain

= higher vigilance, high trajectory

(20)

Future works

Striatum : could initiate exploration strategy

evaluates the situation and change the vigilance accordingly

estimate the reward according to the visual distance (visuo-motor states in the visual space)

reaching a visual target

not accurate





 

(21)

Conclusion

NN model for motor control

cortex - hippocampus - cerebellum - striatum

Interesting solution for Becchio’s experiments, developmentally plausible (hyp)

Next : robotic validation (electric - hydraulic)

hippocampus-cerebellum link is not very plausible

far from being optimal (cerebellum ?)

striatum-PFC

reaching a visual target

not accurate

Références

Documents relatifs

All TMS sites were chosen because processing of vestibular information or sensory processing for online motor control during reaching has been reported on either the site itself or

Governance of public expertise in SSE refers to decision-making, actions, processes, regulation, traditions, organizations and institutions that characterize the way

We defined a standard processing chain for motor imagery based on the parameters used for the Graz paradigm. We want to identify four commands corresponding to four motor

Action units describe muscles or muscle groups in the face, are set or unset Uncertainty Modelling in Knowledge Engineering and Decision Making Downloaded from

Future-oriented envisioning of the deliberate outcomes of teaching and action-based decisions about the actions to be undertaken are two different aspects of decision making..

 Autonomy in language learning depends on the development and exercise of a capacity for detachment, critical reflection, decision making and independent action;

According to the results reported by neuroimaging and experimental studies on motor learning, several cortical and subcortical structures, including the basal ganglia, cerebellum,

The link between motor memory, action selection, and programming can be illustrated as follows: during motor learning, assessed by a procedural learning task, a new motor routine