• Aucun résultat trouvé

P. Monasse, M. Aubry, and R. Marlet November 29th, 2016

N/A
N/A
Protected

Academic year: 2022

Partager "P. Monasse, M. Aubry, and R. Marlet November 29th, 2016"

Copied!
5
0
0

Texte intégral

(1)

Vision 3D artificielle - Final exam (duration: 2h30)

P. Monasse, M. Aubry, and R. Marlet November 29th, 2016

You can choose to answer in French or English, at your convenience.

1 Fast PnP

The PnP (Perspective fromnPoints) problem aims at finding the position and orien- tation of a camera knowing the camera calibration matrixK, a family ofn3D points Xi(expressed in a world coordinate system) and their 2D projection in the cameraxi. The method proposed by Lepetit, Moreno-Noguer and Fua in 2008, called EPnP, is remarkable in the sense that it is simple to implement and that its complexity depends linearly onn.

1. Show that given four 3D pointsCwj (j=1, . . . ,4), we can write in general

∀i, Xi=

4

X

j=1

αi jCwj with

4

X

j=1

αi j=1.

What does the term “in general” mean precisely in the sentence above?

2. Suppose that the points Cwj are expressed in the camera coordinate frame as vectorsCj = RCwj +T, with RandT the (unknown) rotation and translation between the camera and world coordinate frames. Show that

∀i,∃λi,0, K−1xiiK−1









 ui vi 1









=

4

X

j=1

αi jCj.

3. Prove that the vectorx=

CT1 CT2 CT3 CT4T

satisfies a linear system of equa- tionsM x=0, the 2n×12 matrixMdepending on theui,vii jandK.

4. Deduce that xis in the kernel ofM and that we can write x = PN

k=1βkVk,Vk being the rightmostKcolumns ofV in the SVD ofM =UΣVT. What doesN mean?

(2)

5. Ifn ≥6, we have normallyN =1, but because of noise, it is better to test for different values ofN, 1 ≤ N ≤ 4, the one that minimizes the sum of square distancesP

id2(xi,KP

jαi jCj). Justify that this distance should ideally be null.

6. Show that (using Matlab notation) fori,j∈ {1, . . . ,4}

k

N

X

k=1

βk

Vk(3i−2 : 3i)−Vk(3j−2 : 3j)

k=kCiw−Cwjk.

7. IfN=1, show that we can write β1

P4

i,j=1kV1(3i−2 : 3i)−V1(3j−2 : 3j)k kCiw−Cwjk P4

i,j=1kV1(3i−2 : 3i)−V1(3j−2 : 3j)k2 .

8. IfN=2, show that we haveL









 β21 β1β2

β22









=ρwithLa 6×3 matrix andρa 6-vector depending on theCwi. Propose a solution to recover (β1, β2).

9. IfN=3, show that we have some similar sytem but withL6×6.

10. IfN =4, show that the vectorβcomposed of the 10 unknownsβi jiβj, with 1 ≤i ≤ j ≤ 4, could be determined up to an unknown vector in the kernel of a 6×10 matrixL. Propose additional equations in theβi j that would permit to recover this unknown vector.

11. RecoveringRandT from the data ofCwj andCj is a standard procedure. As- suming the pairs (Xi,xi) have some outliers, write in pseudo-code an algorithm of type RANSAC estimatingRandT.

2 Multiple view constraints with lines

We assumencameras in space, of calibration matrixKi, oriented with rotationRiand translationTiwith respect to some world coordinate frame.

1. Show that in general, the observation of a projection of a 3D line in a camera does notimpose any constraint on its projection in another view.

2. What is the exception?

3. By a simple geometric consideration, show that the observation of the projections of a 3D line intwoviews determines the projection in a third view.

4. Letlibe the 3-D vectors representing the projections of a same 3D line in differ- ent viewsi=1, . . . ,n. Show that then×4 matrix

=





lT1K1R1 lT1K1T1

.. ..





(3)

must have rank at most 2. Hint: consider the projections of two points on the 3D line.

5. Show that the 4×5 matrixDbelow has rank 4.

D= l1 [l1]× 0

0 0 1

!

6. We assumeR1=IdandT1=0. Show that the matrix

















lT1l1 0 0

lT2K2R2l1 lT2K2R2[l1]× lT2K2T2

... ... ...

lTnKnRnl1 lTnKnRn[l1]× lTnKnTn

















has rank at most 2.

7. Deduce that the following (n−1)×4 matrixMhas rank at most 1.

M =











lT2K2R2[l1]× lT2K2T2

... ...

lTnKnRn[l1]× lTnKnTn











 .

8. Show that for anyi,j∈ {2, . . . ,n}

lTi KiRi[l1]×RTjKTjlj=0.

(lTjKjTjlTiKiRi−lTiKiTilTjKjRj)[l1]×=0.

9. Show that any algebraic constraints betweenm>3 projectionsliare combina- tions of trilinear constraints involving 3 projections.

3 Feature detection and matching

1. Does the Harris Corner detector detect the same points in an image and its rotated version? Discuss and explain why.

2. Imagine you have a feature descriptor sensitive to rotations. Describe a simple strategy to build a rotation invariant descriptor.

3. One possible issue when trying to match images are repetitive elements, that can be found at multiple locations in the image and generate ambiguous matches.

Describe a simple strategy to remove these ambiguous matches.

(4)

4 Graph cuts for disparity map estimation

We consider the estimation of disparity maps using graph cuts, based on an energy with the following form:

E(f)=X

p∈P

Dp(fp)+ X

(p,q)∈N

Vp,q(fp,fq)

where

• f :P → Lis a labeling assigning a disparityd ∈ L={dmin, . . . ,dmax}to each pixelp∈ P,

• Dp : L → ’is a data term expressing the penalty for pixel p to be assigned disparity fp, and

• Vp,q:L × L →’is a regularization term expressing the penalty for neighboring pixelsp,qaccording toN to have potentially different disparities fp,fq. We want to study the rotation sensitivity of the regularization term.

Question 4.1. We consider the case whereVp,qp,qmin(K,|fp−fq|). CanE(f) be minimized efficiently and exactly? If so, with what method(s)? If not, what method(s) can you however suggest?

We now consider the scene in Figure 1 with two fronto-parallel planar objects: an axis-aligned square (i.e., whose sides are horizontal or vertical in the picture) and an identical square rotated 45(diamond).

Figure 1: Axis-aligned square (left) and identifcal square rotated 45(right).

We consider two types of pixel neighborhoods, pictured in Figure 2: N4where a pixel has 4 neighbors andN8where a pixel has 8 neighbors.

Figure 2: 4-pixel neighborhood (left) and 8-pixel neighborhood (right).

(5)

Question 4.2. We assume the difference of disparities is very large (larger thanK) at the boundary between the background and the objects (square and diamond). Ignoring the situation near the vertices of the objects, i.e., looking only at their sides (edges) as shown in Figure 3, can a value be found for (λp,q)(p,q)∈N4with a 4-pixel neighborhood N4so that the penalty of the square boundary is identical to the penalty of the diamond boundary?

Figure 3: axis-aligned boundary (left) and diagonal boundary (right).

Question 4.3. Same question as Question 4.2 with neighborhoodN=N8.

Question 4.4. Are the answers to questions 4.2 and 4.3 different when the square is rotated with angleθsuch that tanθ=12?

Question 4.5. Conclude. What is the impact in practice?

Références

Documents relatifs

Ce guide, résolument pratique s’adresse bien sûr aux parents, aux familles et à leur entourage, souvent très démunis eux aussi, aux professionnels, mais égale- ment au grand

L'ensemble de ces sections est muni d'une structure d'espace vectoriel sur C.. On dgsigne

Ce document ne revêt aucun caractère contractuel et ne constitue en aucun cas ni une sollicitation d’achat ou de vente, ni une recommandation d’achat ou de vente de valeurs

n In the linear mode the input common-mode voltage range includes ground and the output voltage can also swing to ground, even though operated from only a single power supply

Les institutions fondatrices de l’Espace Partagé de Santé Publique affirment leur volonté d’assumer cette responsabi- lité en structurant leur collaboration et

[r]

Le Capitaine Haddock jurait "Mille milliards de mille sabords !".. Son écriture décimale est 1 suivi

La diagonale AC traverse des carrés et délimite à l'intérieur de certains d'entre eux des petits triangles rectangles (voir un exemple supra) dont la somme des périmètres est un