• Aucun résultat trouvé

Revealing the true achievable rates of scalar Costa scheme

N/A
N/A
Protected

Academic year: 2022

Partager "Revealing the true achievable rates of scalar Costa scheme"

Copied!
5
0
0

Texte intégral

(1)

Proceedings Chapter

Reference

Revealing the true achievable rates of scalar Costa scheme

PEREZ-FREIRE, Luis, PEREZ-GONZALEZ, Fernando, VOLOSHYNOVSKYY, Svyatoslav

PEREZ-FREIRE, Luis, PEREZ-GONZALEZ, Fernando, VOLOSHYNOVSKYY, Svyatoslav.

Revealing the true achievable rates of scalar Costa scheme. In: IEEE International Workshop on Multimedia Signal Processing (MMSP) . 2004.

Available at:

http://archive-ouverte.unige.ch/unige:47874

Disclaimer: layout of this document may differ from the published version.

1 / 1

(2)

Revealing the True Achievable Rates of Scalar Costa Scheme

Luis P´erez-Freire, Fernando P´erez-Gonz´alez Signal Theory and Communications Dept.

University of Vigo - 36200 Vigo, Spain E-mail:{lpfreire, fperez}@gts.tsc.uvigo.es

Sviatoslav Voloshynovskiy Department of Computer Science University of Geneva - Switzerland

E-mail: svolos@cui.unige.ch

Abstract— By abandoning the assumption of an infinite docu- ment to watermark ratio, we recompute the achievable rates for Eggers’s Scalar Costa Scheme (SCS, also known as Scalar Distor- tion Compensated Dither Modulation) and show, as opposed to the results reported by Eggers, that the achievable rates of SCS are always larger than those of spread spectrum (SS). Moreover, we show that for small Watermark to Noise Ratios, SCS becomes equivalent to a two-centroid problem, thus revealing interesting relations with SS and with Malvar’s Improved Spread Spectrum (ISS). We also show an interesting behavior for the optimal distortion compensation parameter. All these results aim at filling an existing gap in watermarking theory and have important consequences for the design of efficient decoders for data hiding problems.

I. INTRODUCTION

Scalar Costa Scheme (SCS) [1] is a popular method for information embedding that belongs to the family of quantization-based methods. Eggers calculated in [1] the achievable rate of SCS by resorting to the assumption of uniformity of the host signal inside each quantization bin, concluding that the achievable rate of SCS is smaller than that of spread spectrum (SS) methods for high noise levels, besides being independent on the host statistics and the document to watermark ratio, which is defined as λ = σ2x/Dw or DWR = 10 log10λ, with σ2x being the host variance and Dw the embedding distortion. The uniform assumption is equivalent to considering that DWR = ∞. We will show that the performance of SCS is actually never worse than that of SS in terms of achievable rate, and, in fact, it can benefit from low DWR’s. In general, for data hiding and watermarking applications, the variance of the host signal is considered to be much larger than that of the watermark, giving rise to the assumption of high DWR. However, in practical image processing applications, the host image can be modeled as a weighted mixture of zero-mean Gaussian pdf’s that capture local image statistics [2], with most of them presenting small variances. Therefore, it is very important to consider the performance of data-hiding techniques for relatively low DWR’s.

This work was partially funded by Xunta de Galicia under projects PGIDT02 PXIC32205PN and PGIDT04 PXIC32202PM; CYCIT project AMULET, reference TIC2001-3697-C03-01; FIS project G03/185, and Euro- pean Comission through the IST Programme under Contract IST-2002-507932 ECRYPT.

ECRYPT disclaimer: the information in this document reflects only the author’s views, is provided as is and no guarantee or warranty is given that the information is fit for any particular purpose. The user thereof uses the information at its sole risk and liability.

For our analysis we will consider the same scenario as Eggers in [1]: an equiprobable watermark message m, be- longing to the M-ary alphabet M = {0,1, . . .|M| −1}, is embedded into an independent and identically distributed (i.i.d.) host signal xyielding a watermarked signal y, which undergoes an additive channel, modeled by additive white Gaussian noise (AWGN), resulting in the received signal z.

In SCS, the watermarked signal is obtained by adding to the host signal a fraction of the quantization error:

y=x+α(Qi(x)−x), (1) where Qi(x) is the quantized value of x using a uniform scalar quantizer with step ∆, depending on the transmitted symbol mi, and α is the distortion compensation parame- ter. The embedding process is parameterized by α and the DWR defined above. Another parameter introduced for the performance analysis is the watermark to noise ratio, which is defined asξ=Dw/Dc or WNR= 10 log10ξ, beingDc the distortion introduced by the channel, which in our case is equal to the noise variance,σ2n. Zero-mean signals are considered in all cases and all embedding rates are expressed in bits.

II. COMPUTING THE TRUE ACHIEVABLE RATES

The achievable rates for SCS are calculated by maximizing over parameterαthe mutual information between the received signalZ and the transmitted message M:

R(λ, ξ) = max

α I(Z;M). (2)

Note that we make the achievable rate dependent on the DWR and the WNR, not only on the WNR, as Eggers made in [1].

The mutual informationI(Z;M)is given by I(Z;M) =h(Z)−X

i∈M

P r{M =mi}h(Z|M =mi), (3) whereh(Z)stands for the differential entropy of the random variable Z with a density fZ(z). Thus, to calculate the mutual informations we need to know the pdf of Z and of Z conditioned on the transmitted message. The following paragraphs are aimed at showing how these exact pdf’s can be obtained.

Scalar quantization with distortion compensation can be thought of as a random variable transformation Y = g(X), whose pdf can be easily computed by means of the funda- mental theorem for random variable transformations [3]. Such a transformation depends on the considered centroid. Let cki

(3)

be thek-th centroid in the quantizer associated to the message mi, the transformation is given by

yki=gki(x) = (x−cki)(1−α) +cki, (4) for

cki−∆

2 ≤x≤cki+∆

2, (5)

wherecki is the nearest centroid to x(in terms of Euclidean distance). The only root of (4) is

xki=gki1(yki) =yki−cki

1−α +cki,

so by the fundamental theorem we have the contribution of the centroidcki to the pdf of the watermarked signal

fYki(yki) = fX(xki)

|gki(xki)| = fX

y

ki−cki

1−α +cki

1−α . (6) The pdf of the watermarked signal conditioned on the trans- mitted messageM =mi is then given by

fYi(yi) =

X

k=−∞

fYki(yki), (7) so the pdf of the watermarked signal is

fY(y) = X

i∈M

P r{M =mi}fYi(yi) = 1 M

X

i∈M

X

k=−∞

fYki(yki).

(8) When the support of X is infinite, (7) and (8) must be approximated by truncating the host pdf. Finally, the addition of Gaussian noise can be accounted for by simple numerical convolution with an appropriate Gaussian pdf. As in [1], no closed form exists for the resulting pdf’s, so we must resort to numerical computation. The embedding distortion is given by

Dw= 1 M

X

i∈M

Z

(x−y)2fX(x)dx, (9) which can be easily calculated (again in a numerical manner) for an arbitrary host pdf. The quantization step is fixed without loss of generality at ∆ = 1, so the variance σx2 of the host signal is adjusted to fit a certain DWR for a given parameter α.

Having obtained the required pdf’s, computation of the mutual informations is straightforward.

A. Theoretical achievable rates for small WNR’s

From (1), it is easy to see that, by reducingα, the quanti- zation step∆ can be made larger while keeping constant the embedding distortionDw. Moreover, from the analysis made in [1], it is known that the optimum distortion compensation parameterαdecreases according to the value of WNR. With this two considerations in mind and the fact that we are dealing with finite DWR’s, one can conjecture that the optimum quantization step for small WNR’s is such that the whole pdf of the host signal can be confined inside one quantization bin, or equivalently, the ratio ∆/σx can be made very large. We could reduce then SCS to a problem with only two meaningful centroids. It is interesting to note the relation between this

two-centroid scheme and SS: when the host pdf is contained inside one quantization bin, the embedding process always moves the host signal towards the positive axis when the transmitted bit is 0, and towards the opposite direction when the transmitted bit is 1. Such an embedding process resembles SS-based watermarking, but a subtle difference between both schemes must be noted: whereas in the latter embedding is performed by the addition of a watermark with fixed amplitude to the host signal (we neglect here any issue concerning perceptual masking), in the former the watermark depends on the considered host sample. Because of its great similarity to SS, we will refer to this scheme in the sequel as DC-SS (Distortion Compensated - Spread Spectrum).

For the following analysis we consider a Gaussian host and binary signaling (M = {0,1}) with equiprobable symbols, and that the centroids corresponding to the symbols mi are located at−x0,x0, respectively (antipodal constellation), with x0 = ∆/4. Assuming we transmit the message M = 1, and particularizing (4) for this case, the following expression for the received signal is obtained

z=x+w+n= (1−α)x+αx0+n, (10) from which it follows that

fZ|M(z|M = 1)∼ N(αx0, σx2(1−α)2n2), (11) i.e. the received signal also follows a Gaussian distribution.

Moreover, sincey=x+w, it is easy to realize that

w=α(x0−x), Dw2(x20x2). (12) Recalling that λ= 10DW R10 andξ= 10W N R10 , we have then

λ= σx2

α2(x20x2), ξ=α2(x20x2)

σn2 , (13) so we can write σx2 andσn2 as functions ofλ,ξ andx0

σ2x= λα2x20

1−λα2 , σn2 = α2

x20+ α

2x20λ 1−α2λ

ξ . (14)

It can be analytically shown that the mutual information of DC-SS is a monotonically increasing function of the following signal to noise ratio

SN RDC−SS = x20α2

σ2x(1−α)22n , (15) which is nothing but the ratio between the mean squared value of the received signal conditioned on the transmitted message M = 1and its variance.

The optimum parameterαcan now be calculated by insert- ing (14) in (15) and maximizing over that parameter, obtaining

αDC−SS(λ, ξ) =1 +ξ+λξ−[(1 +ξ+λξ)2−4λξ2]12

2λξ ,

(16) which only depends on the WNR and the DWR. For small WNR’s, the following approximation for the achievable rate of DC-SS is valid

RDC−SS(λ, ξ, α)≃1

2log(1 +SN RDC−SS). (17)

(4)

Inserting (16) in (17) yields the following achievable rate RDC−SS(λ, ξ)≃

1 2log

1 + 1

2

hξ−λξ−1 + [(1 +ξ+λξ)2−4λξ2]12i .(18) In the next section, the validity of (16) and (18) to predict the performance of SCS for small WNR’s will be verified.

III. RESULTS AND DISCUSSION

From now on, all the results will stand for Gaussian hosts and binary signaling. In Fig. 1-a, the true achievable rates in SCS for negative WNR’s are represented. Two different DWR’s are considered: 10 and 20 dB. For comparison pur- poses, the capacity predicted by Costa,

CCosta = 1 2log2

1 + Dw

σ2n

, (19)

and the capacity for SS with Gaussian host, CSS = 1

2log2

1 + Dw

σ2x2n

, (20)

are also plotted.

It can be seen that the achievable rate depends, indeed, on the host statistics, and below a certain value of WNR (which is dependent on the DWR) the gain with respect to the uniform assumption is considerable, but more important is the fact that the true achievable rates of SCS are never below those of SS, contrarily to what was reported by Eggers. The optimum value forαis shown in Fig. 1-b, revealing another surprising result: the optimum αis discontinuous, and also depends on the DWR: below a certain WNR, it diverges from the value obtained by Eggers and gets closer to the one derived by Costa.

The reason for such a discontinuity is the existence of two local maxima in the curves of the mutual information: when the location of the global maximum changes sharply, so does the optimum α (if we would have resorted to the uniform approximation, there would exist only one maximum in those curves, as it occurs in [1]).

These results evidence the non-validity of the uniform assumption for small WNR’s: as long as the value of α is decreased, so does the ratioσx/∆in order to keep DWR con- stant. When the ratioσx/∆ is sufficiently small, the uniform assumption no longer holds, and even the absolute location of the centroids becomes relevant in the calculation ofI(Z;M).

In fact, to achieve the maximum embedding rate, they must be symmetrically located around the host mean. The decreasing in the ratio σx/∆ implies that the number of centroids with a non-negligible assignment probability is also decreasing, until the limiting case where only one centroid for each symbol is used. Costa had shown in [4] a similar behavior for the optimum number of codewords in his capacity-achieving scheme: for small WNR’s, αCosta tends to 0 and the number of required codewords per symbol tends to 1, thus confirming the conjecture made in Section II-A.

Now, we will verify the theoretical achievable rates and the optimumαthat were derived in that Section II-A for the DC- SS scheme. Equation (18) gives an excellent estimate for the

−20 −18 −16 −14 −12 −10 −8 −6 −4 −2 0

10−3 10−2 10−1 100

WNR (dB)

R (bits/element)

Costa DWR = 10 dB DWR = 20 dB DWR = ∞

SS

a

−200 −18 −16 −14 −12 −10 −8 −6 −4 −2 0

0.1 0.2 0.3 0.4 0.5

WNR (dB)

α

SCS, DWR = Costa SCS, DWR = 10 dB SCS, DWR = 15 dB SCS, DWR = 20 dB

b

Fig. 1. Binary SCS with Gaussian host: achievable rates (a) and optimum distortion compensation parameter (b)

achievable rate of SCS when DC-SS assumptions hold, as it can be readily seen in Fig. 2-a. Moreover, it is not difficult to prove that DC-SS always performs better than SS for DWR’s greater than 0 dB. We noted above the difference between the optimum parameter α derived by Eggers and the one we obtained for DWR <∞. The analytical expression (16) derived for DC-SS closely matches the optimum α in SCS when DC-SS assumptions hold, as can be seen in Fig. 2-b.

Furthermore, it can be easily shown that DWRlim→−∞

αDC−SS = lim

λ→0αDC−SSCosta, (21) whereαCosta= 1 +ξ11

stands for the optimum parame- terαderived by Costa in [4]. Fig. 1-b shows that the parameter αin SCS is approximately fitted by that derived by Costa for low WNR’s; the lower the DWR, the wider the range where such an approximation is valid. The result (21) makes sense because for DWR → −∞the variance of the host signal is negligible compared to ∆, and thus the DC-SS assumptions always hold.

IV. CONNECTIONS BETWEENSCSANDISS

The reduction of SCS to a two-centroid problem (DC- SS) resembles a recently proposed scheme by Malvar and Florˆencio in [5], the so-called ISS (Improved Spread Spec- trum), which is a generalized spread spectrum method that

(5)

−20 −19 −18 −17 −16 −15 −14 −13 −12 −11 −10 −9 10−2

WNR (dB)

R (bits/element)

SCS DC−SS (theoretical) SS

a

−200 −18 −16 −14 −12 −10 −8 −6

0.05 0.1 0.15 0.2 0.25

WNR (dB)

α

αSCS

* , DWR = 10 dB

αSCS

* , DWR = 15 dB

αDC−SS

* (theoretical), DWR = 10 dB

αDC−SS

* (theoretical), DWR = 15 dB

b

Fig. 2. Comparison between the achievable rates obtained numerically for SCS and theoretical ones for DC-SS and SS (a), and comparison between optimum parameterαin SCS and DC-SS (b). A Gaussian host was considered in both plots.

varies the amplitude of the watermark depending on the considered host sample, providing significant gains over tra- ditional SS. Although several versions of ISS are described in [5], we only consider here the linear one in order to clearly show the connections between DC-SS and ISS, revealing that the latter can be interpreted as a scheme with two virtual centroids, similarly to the former, being both approaches equivalent in terms of performance. For the analysis, the con- sidered scenario will be the same as that of SCS, introduced in Section I.

In traditional spread spectrum, the embedding function particularized for one sample is simply y = x+bσu, with b = ±1 depending on the to-be-transmitted bit, and σu the watermark amplitude. In the linear approximation of ISS, the embedding function can be written as

y=x+γbσu−νx= (1−ν)x+γbσu, (22) being γ andν two parameters in the range [0,1] that control the watermark amplitude and host rejection, respectively (note that spread spectrum is a particular case of (22) forγ= 1and ν= 0). It can be inferred from (22) that

Dw2σu22σ2x, (23)

fZ|B(z|b= 1)∼ N(γσu,(1−ν)2σx22n). (24) By comparing (11) and (24), it can be noted thatσu plays in (24) the role of the centroid x0 in (11). The main difference between ISS and DC-SS is the fact that ISS uses two parame- ters for embedding, namelyγandν, whereas DC-SS uses only one parameter. However, parameterγ in ISS is actually fixed to make the distortion (23) equal to that of spread spectrum, yielding γ=p

u2−ν2σ2x)/σu2, so, similarly to DC-SS, the achievable rate for ISS can be estimated by maximizing over ν the following expression

RISS(λ, ξ)≃ 1

2log(1 +SN RISS), (25) where

SN RISS = σu2−ν2σ2x (1−ν)2σ2x2n

. (26)

By some straightforward algebraic manipulations, it is easy to show that (26) is equal to (15), thus ν = αDC−SS and RISS = RDC−SS, and the results derived for DC-SS also apply for ISS. Since the achievable rate of ISS is equal to that of DC-SS, it is evident that the former is outperformed by SCS when the WNR increases. The drawback of ISS is that the number of centroids is not increased according to the WNR, as we pointed out in Section III.

V. CONCLUSIONS AND FURTHER WORK

We have analyzed the achievable rates of SCS by rejecting the uniform assumption, concluding that such an assumption leads to a significant underestimation of the true achievable rates for small watermark to noise ratios. As a matter of fact, the exact analysis has revealed an important result: the performance of SCS is dependent on the host statistics, and it is never worse than that of SS in terms of the achievable rate under AWGN attacks, hence there is no reason for using SS even when the watermarks must survive high noise levels.

By reducing SCS to a problem with only two meaningful centroids, we have obtained some novel theoretical expressions that characterize the performance of SCS for small watermark to noise ratios and allow to derive some interesting relations between SCS, SS and ISS.

The analysis carried out here can be made easily extensive to other host distributions besides the Gaussian, and it can be extended to the calculation of the probability of error in SCS-based schemes, in order to show the true performance of decoders operating at low-WNR regimes.

REFERENCES

[1] J. J. Eggers, R. B¨auml, R. Tzschoppe, and B. Girod, “Scalar Costa Scheme for information embedding,” IEEE Transactions on Signal Pro- cessing, vol. 51, no. 4, pp. 1003–1019, April 2003.

[2] A. Hjorungnes, J. Lervik, and T. Ramstad, “Entropy coding of composite sources modeled by infinite gaussian mixture distributions,” in IEEE Digital Signal Processing Workshop, 20-24 January 1996, pp. 235–238.

[3] A. Papoulis, Probability, random variables and stochastic processes, 3rd ed. McGraw-Hill International Editions, 1991.

[4] M. H. M. Costa, “Writing on dirty paper,” IEEE Transactions on Information Theory, vol. 29, no. 3, pp. 439–441, May 1983.

[5] H. S. Malvar and D. A. F. Florˆencio, “Improved Spread Spectrum: a new modulation technique for robust watermarking,” IEEE Transactions on Signal Processing, vol. 51, no. 4, pp. 898–905, April 2003.

Références

Documents relatifs

Let us now consider Stokes imaging systems, where 4 different analysis polarization states are used. Let us define the matrix W whose lines are the Stokes vectors of the

Numerical assessments on real UA channels with spreading factors (product between the delay and Doppler spread) around 10 − 2 show that reliable OFDM transmissions at 2 to 4

This itération bound is obtained by showing that in each itération the so-called primal-dual potential function decreases with (at least) some constant. In the dérivation of this

interference channels for which the interference signal in the i-th channel is a linear.. combination of the signals transmitted in channels j

[r]

In this paper, the achievable rates of reliable robust data-hiding in channels with geometrical transformations are investigated from an informationtheoretic point of view

Average rate loss analysis of on-average-optimal Costa coding versus worst case encoding strategy for uniformly varying noise variance and a Laplacian host when the host statistics

The world model is a representation of the world built from input data available to the robot (sensors, a priori information, communications,.. Since the planning relates to