1 Disparity map with cost volume filtering

(1)

Vision 3D artificielle - Final exam (duration: 2h30)

P. Monasse and R. Marlet November 5th, 2013

You can choose to answer in French or English, at your convenience.

1 Disparity map with cost volume filtering

The recent method presented in this exercise comparesadaptiveimage patches.

Given two imagesu_l andu_r from a stereo pair, we define the cost volume as a 3-D image:

C(i, j, d) =|ul(i+d, j)−ur(i, j)|

wheredspans the set of possible disparities. We apply a filter to the cost volume C, yielding a volume of same dimensions C⁰, then we select the disparity by a simple “Winner take all” (WTA) criterion:

disp(i, j) = arg min

d C⁰(i, j, d)

1. Why is it necessary in practice to not useC directly in WTA (C=C⁰)?

2. Interpret SAD on a windowω_i,j = [i−r, i+r]×[j−r, j+r] as a filter on the cost volume. In the following, we will build a more adaptive filter.

All windowsω_i,j being supposed of fixed size (2·r+ 1)², we note simply

|ω|(without index) this value.

3. Let (i₀, j₀) be a pixel ofu_l. We define the energy Ei₀,j₀,d(a, b) = X

(i,j)∈ωi0,j0

(a·ul(i, j) +b−C(i, j, d))²+|ω|a²

where >0 is a “small” real number whose only use is to avoid the risk of dividing by 0 below.

We look for realsaandbsuch as to approximateC(i, j, d) at a givend:

(a_i₀_,j₀_,d, b_i₀_,j₀_,d) = arg min

a,b E_i₀_,j₀_,d(a, b).

(2)

Figure 1: Top: images ul and ur. Bottom: disparity maps computed by SAD on square windows (left) and by cost volume filtering (right).

Compute the gradient of the above energy and prove that ai₀,j₀,d=

P

ωi0,j0ul(i, j)C(i, j, d)/|w| −µi₀,j₀·C¯i₀,j₀,d

σ_i²

0,j0+ bi₀,j₀,d= ¯Ci₀,j₀,d−ai₀,j₀,dµi₀,j₀

where µi₀,j₀ and σ²_i₀_,j₀ are the average and variance of ul onωi₀,j₀, and C¯i0,j0,d the average ofC onωi0,j0× {d}.

4. Each pixel (i, j) belonging to several windowsω_i⁰_,j⁰, this would yield several values ofaandbfor (i, j). We take the average:

¯

a_i,j,d= 1

|ω|

X a_i⁰_,j⁰_,d

(3)

Show that we can write C⁰(i, j, d) =X

i⁰,j⁰

Wi,j,i⁰,j⁰C(i⁰, j⁰, d) with

W_i,j,i⁰_,j⁰ = 1

|ω|²

X

(k,l)∈ωi,j∩ω_i0,j0

1 +(ul(i, j)−µk,l)(ul(i⁰, j⁰)−µk,l) σ_k,l² +

! .

5. Supposing u_l takes only two values on ω_i,j ∩ω_i⁰_,j⁰, comment the term under the sum.

6. See in Figure 1 the disparity maps computed from a simple search by minimization of SAD on square windows or after cost volume filtering.

Comment the better quality of the filtering method, in particular con- cerning a common drawback of local methods.

2 Multiple view constraints

The goal of this exercise is to exhibit the constraints arising from more than two views of a scene. Suppose we have a pointX in 3-D space viewed inn >2 images:

λixi=Ki Ri Ti

X, i= 1, . . . , n.

1. Recall the meaning of the different terms in such an equation.

2. Explain why we do not lose generality if we assumeR1=Id3 andT1= 0.

3. Write the above system of equations in the form a systemAY = 0 with Y = X λ1 · · · λn^T

.

4. Link the existence and uniqueness of the system to the rank of matrixA.

5. Show that the rank ofAis the rank of B plus 3, where:

B =







K2T2 K2R2x1 x2 0 0 · · · 0

K₃T₃ K₃R₃x₁ 0 x₃ 0 · · · 0

... ... ... . .. . .. . .. ...

K_n−1T_n−1 K_n−1R_n−1x₁ 0 · · · 0 x_n−1 0

K_nT_n K_nR_nx₁ 0 · · · 0 0 x_n







(4)

6. Show that the following matrixD is of maximal rank 3(n−1):

D=







x^T₂ 0 0 · · · 0

0 x^T₃ 0 · · · 0

... . .. . .. . .. ...

0 · · · 0 x^T_n−1 0

0 · · · 0 0 x^T_n

[x₂]_× 0 0 · · · 0

0 [x₃]_× 0 · · · 0

... . .. . .. . .. ...

0 · · · 0 [x_n−1]_× 0

0 · · · 0 0 [xn]_×





 .

7. Deduce thatDB andB have same rank.

8. Write the contents of matrixDB.

9. Using the fact thatxi6= 0 for all i, prove that the rank of B is the rank ofM plus (n−1), withM the 3(n−1)×2 matrix:

M =







[x2]×K2R2x1 [x2]×K2T2

... ...

[xn]×KnRnx1 [xn]×KnTn





.

10. Show that the rank of M is zero if and only if all optical centers and X are aligned.

11. Show that forM to be rank 1, it is necessary that (i) for allithe vectors [x_i]_×K_iT_i and [x_i]_×K_iR_ix_i be proportional, (ii) which amounts to the usual two-image epipolar constraint (“bilinear constraint”).

12. We assume known that ifa_i6= 0 andb_i 6= 0 are all vectors ofR³, the rank deficiency of the matrix





 a₁ b₁

... ... an vn







is equivalent to the condition: a_ib^T_j −b_ja^T_j = 0 for alli, j. Write “trilinear constraints” linking pointsx1,xi andxj.

13. Show that any possible “quadrilinear” or “multilinear” constraint is a combination of trilinear and bilinear constraints.

(5)

3.1 Multiple labels

The course presents a method based on graph cuts to compute an exact energy minimization for a label assignment problem in a case where there are multiple labels (more than two) and where the labels are linearly ordered (Boykov et al.

1998). Given a 4000×3000-pixel image and assuming we have no information on the possible range of the disparity,

1. What should be the size of the graph?

2. What would be the size of the graph if we were to use iteratedα-expansions?

3. Conclude.

3.2 Impact of pairwise potentials

1. Imagine you are facing stairs. Is it appropriate to use a Potts model as pairwise potentials? Why?

2. Imagine you are facing a pyramid. Is it appropriate to use a cap max value fonction as pairwise potentials? Why?