• Aucun résultat trouvé

A Model of Binocular Gaze Estimation

N/A
N/A
Protected

Academic year: 2021

Partager "A Model of Binocular Gaze Estimation"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: inria-00590233

https://hal.inria.fr/inria-00590233

Submitted on 3 May 2011

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

A Model of Binocular Gaze Estimation

Miles Hansard, Radu Horaud

To cite this version:

Miles Hansard, Radu Horaud. A Model of Binocular Gaze Estimation. COSYNE 2007 - 4th Com- putational and Systems Neuroscience Meeting, Feb 2007, Salt Lake City, United States. 2007. �inria- 00590233�

(2)

A Model of Binocular Gaze Estimation

Miles Hansard and Radu Horaud

INRIA Rhˆone-Alpes, 655 avenue de l’Europe, Montbonnot 38334, France.

Binocular image-pairs contain information about the three-dimensional structure of the visible scene, which can be recovered by the identification of corresponding points. However, the resulting disparity field also depends on the orientation of the eyes. If it is assumed that the exact eye-positions cannot be obtained from oculomotor feedback, then the gaze parameters must also be recovered from the images, in order to properly interpret the retinal disparity field.

Existing models of biological stereopsis have addressed this issue independently of the binocular- correspondence problem. It has been correctly assumed thatif the correspondence problem can be solved, then the disparity field can be decomposed into gaze and structure components, as described above. In this work we take a different approach; we emphasize that although the complete point-wise disparity field is sufficient for gaze estimation, it is not in factnecessary. We show that the gaze parameters can be recovered directly from the images, independently of the point-wise correspondences.

The relationship between binocular vergence and the resulting epipolar geometry is derived. Our algorithm is then based on the simultaneous representation of all epipolar geometries that are feasible with respect to a fixating oculomotor system. This is done in an essentially two-dimensional space, parameterized by azimuth and viewing-distance. We define a cost function that measures the compatibility of each geometry with respect to the observed images. The true gaze parameters are estimated by a simple voting-scheme, which runs in parallel over the parameter space. We describe an implementation of the algorithm, and show results obtained from real images.

Our algorithm requires binocular units with large receptive-fields, such as those found in area MT[1]. The model is also consistent with the finding that depth-judgments can be biased by microstimulation in MT[2];

if the artificial signal generates an ‘incorrect’ set of gaze parameters, then we would expect the subsequent interpretation of the disparity field to be biased. Our model could be tested using binocular stimuli based on thepatternsof disparity that we describe. We note that these patterns are geometrically analogous to parametric motion fields. It has already been shown that such flow-fields are effective stimuli for motion- sensitive cells in area MST[3]; we predict an analogous binocular ‘gaze-tuning’ in the extrastriate cortex.

Acknowledgments

This work is part of thePerception on Purposeproject, supported by EU grant 027268.

References

[1]Functional Properties of Neurons in Middle Temporal Visual Area of the Macaque Monkey. II. Binocular Interactions and Sensitivity to Binocular Disparity. J. H. R. Maunsell & D. C. Van Essen. J. Neurophys.

49(5), 1148–1167, 1983.

[2]Cortical Area MT and The Perception of Stereoscopic Depth. G. C. DeAngelis, B. G. Cumming & W. T.

Newsome.Nature394, 677–680, 1998.

[3] Integration of Direction Signals of Image Motion in the Superior Temporal Sulcus of the Macaque Monkey. H. Saito, M. Yukie, K. Tanaka, K. Hikosaka, Y. Fukada & E. Iwai. J. Neurosci. 6(1), 145–157, 1986.

56

Thursday evening, Poster I-49 Cosyne 2007

Références

Documents relatifs

Based on the bond-number equality concept an equation is derived for anion complexes of normal valence compounds with triangularly and/or tetrahedrally coordinated central atoms

(use the same hint as in the previous

University of Zurich (UZH) Fall 2012 MAT532 – Representation theory..

1 In the different policies that I consider, the changes in mortality risk faced by different age classes remain moderate, so that v i can be interpreted as the Value of

Dr Fortin thanks the patients and co-investigators of the Saguenay group associated with the Research Chair on Chronic Diseases in Primary Care.. Competing interests None

2 Until a refrigerator-stable vaccine becomes available, however, varicella vac- cine will not be incorporated into the recommend- ed immunization schedule in Canada, as most

Figure 2: Illustrating the interplay between top-down and bottom-up modeling As illustrated in Figure 3 process modelling with focus on a best practice top-down model, as well

It is of much interest to notice that this approach, although directed to a different immediate goal, by having faced problems recognised as very similar to those encountered in