• Aucun résultat trouvé

Strategic Successive Refinement Coding for Bayesian Persuasion with Two Decoders

N/A
N/A
Protected

Academic year: 2021

Partager "Strategic Successive Refinement Coding for Bayesian Persuasion with Two Decoders"

Copied!
7
0
0

Texte intégral

(1)

HAL Id: hal-03227554

https://hal.archives-ouvertes.fr/hal-03227554

Submitted on 17 May 2021

HAL

is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire

HAL, est

destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Strategic Successive Refinement Coding for Bayesian Persuasion with Two Decoders

Rony Rouphael, Maël Le Treust

To cite this version:

Rony Rouphael, Maël Le Treust. Strategic Successive Refinement Coding for Bayesian Persuasion

with Two Decoders. Information Theory Workshop 2021, Oct 2021, Kanazawa, Japan. �hal-03227554�

(2)

Strategic Successive Refinement Coding for Bayesian Persuasion with Two Decoders

Rony Bou Rouphael and Maël Le Treust

ETIS UMR 8051, CY Cergy-Paris Université, ENSEA, CNRS, 6, avenue du Ponceau, 95014 Cergy-Pontoise CEDEX, FRANCE

Email: {rony.bou-rouphael ; mael.le-treust}@ensea.fr

Abstract—We study the multi-user Bayesian persuasion game between one encoder and two decoders, where the first decoder is better informed than the second decoder. We consider two perfect links, one to the first decoder only, and the other to both decoders. We consider that the encoder and both decoders are endowed with distinct and arbitrary distortion functions.

We investigate the strategic source coding problem in which the encoder commits to an encoding while the decoders select the sequences of symbols that minimize their respective distortion functions. We characterize the optimal encoder distortion value by considering successive refinement coding with respect to a specific probability distribution which involves two auxiliary random variables, and captures the incentives constraints of both decoders.

A full version of this paper is accessible at: https://arxiv.

org/pdf/2105.06201.pdf

I. INTRODUCTION

The optimization of distinct and arbitrary distortion func- tions resulting from the communication between several au- tonomous devices with non-aligned objectives is under study.

This problem was originally formulated in the game the- ory literature and referred to as the sender-receiver game, where the amount of information transmitted is generally unrestricted. In the seminal paper [1], Crawford and Sobel investigate the Nash equilibrium solution of the cheap talk game, by considering the encoder and the decoder have distinct objectives and choose their coding strategies simultaneously.

In [2], Kamenica and formulate the Stackelberg version of the strategic communication game, in which the encoder is the Stackelberg leader and the decoder is the Stackelberg follower.

This setting, referred to as the Bayesian persuasion game, is the one under study in this paper by considering two decoders.

This problem is an attractive multi-disciplinary subject of study. The Nash equilibrium solution is investigated for multi- dimensional sources and quadratic distortion functions in [3], [4], whereas the Stackelberg solution is studied in [5]. The computational aspects of the persuasion game are considered in [6]. The strategic communication problem with a noisy channel is investigated in [7], [8], [9], [10], and four different

Maël Le Treust gratefully acknowledges financial support from INS2I CNRS, DIM-RFSI, SRV ENSEA, UFR-ST UCP, The Paris Seine Initiative and IEA Cergy-Pontoise. This research has been conducted as part of the project Labex MME-DII (ANR11-LBX-0023-01).

E

D1

D2 V2n V1n Un

M1

M2

Fig. 1: Successive Refinement Source Coding Setup.

scenarios of strategic communication are studied in [11]. The case where the decoder privately observes a signal correlated to the state, also referred to as the Wyner-Ziv setting [12], is studied in [13], [14] and [15]. Vora and Kulkarni investigate the achievable rates for the strategic communication problem in [16], [17] where the decoder is the Stackelberg leader.

In this paper, we study a multi-receiver Bayesian persuasion game in which the observation of the first decoder contains the observation of the second decoder, as in Fig. 1. More specif- ically, we assume that the encoder E selects and announces beforehand the compression scheme to be implemented. Upon receipt of the indexes, the decoders D1 andD2 update their Bayesian beliefs over the source sequence and select the output sequence that minimizes their respective distortion functions.

We characterize the optimal encoder distortion value obtained via the successive refinement coding with respect to the distribution that involves two auxiliary random variables, and that satisfies both decoders incentives constraints.

A. Preliminaries

1) Notations: Let E denote the encoder and Di denote the decoder i ∈ {1,2}. Notations Un and Vin denote the sequences of random variables of source information un = (u1, ..., un) ∈ Un, and decoder Di actions vni ∈ Vin respectively for i ∈ {1,2}. Calligraphic fonts U and Vi

denote the alphabets and lowercase letters u and vi denote the realizations. For a discrete random variableX,we denote by ∆(X) the probability simplex, i.e. the set of probability distributions over X, and by PX(x) the probability mass function P{X = x}. Notation X −−Y −−Z stands for the Markov chain PZ|XY = PZ|Y. The information source U follows the independent and identically distributed (i.i.d) probability distribution PU ∈∆(U).

(3)

II. SYSTEMMODEL

In this section, we aim at formulating the coding problem.

Definition 1. Let R1, R2 ∈R2+ = [0,+∞[2, and n∈ N? = N\{0}. The encoding σ and decoding τi strategies of the encoder E and decoders Di for i ∈ {1,2} respectively, are given by

σ:Un−→∆({1,2, ..2bnR1c} × {1,2, ..2bnR2c}), (1) τ1:{1,2, ..2bnR1c} × {1,2, ..2bnR2c} −→∆(V1n), (2) τ2:{1,2, ..2bnR2c} −→∆(V2n). (3) We denote byS(n, R1, R2)the set of coding triplets (σ, τ1, τ2).

The coding strategies (σ, τ1, τ2) ∈ S(n, R1, R2) are stochastic and induce a joint probability distributionPσ,τ12

∆(Un× {1,2, ..2bnR1c} × {1,2, ..2bnR2c} ×V1n×V2n)defined by

Pσ,τ12 = n

Y

t=1

PUt

PMσ

1M2|UnPVτ1n

1|M1M2PVτ2n

2|M2. (4) Definition 2. The single-letter distortion functions of the encoder de and both decoders di for i ∈ {1,2} are defined by

de:U × V1× V2−→R, (5) d1:U × V1−→R, (6) d2:U × V2−→R. (7) Definition 3. The long-run distortion functions are defined by

dne(σ, τ1, τ2) =Eσ,τ12

"

1 n

n

X

t=1

de(Ut, V1,t, V2,t)

#

= X

un,v1n,v2n

PUσ,τn1V1n2V2n(un, vn1, v2n

"

1 n

n

X

t=1

de(ut, v1,t, v2,t)

# ,

dn1(σ, τ1) =Eσ,τ1

"

1 n

n

X

t=1

d1(Ut, V1,t)

# ,

dn2(σ, τ2) = X

un,vn2

PUσ,τn2V2n(un, v2n

"

1 n

n

X

t=1

d2(ut, v2,t)

# . In the above equations, PUσ,τn1V1n2V2n, PUσ,τnV11n and PUσ,τnV22n

denote the marginal distributions of Pσ,τ12 over the n- sequences (Un, V1n, V2n), (Un, V1n), and (Un, V2n) respec- tively. We consider the strategic communication game in which the encoder select σ that minimizes dne(σ, τ1, τ2) and the decoders chooseτi,i∈ {1,2} in order to minimize the long- run distortion functions din

(σ, τi),i∈ {1,2}.

Definition 4. For any encoding strategyσ,the set of decoder i’s best reply strategies fori∈ {1,2} is defined as follows

BRi(σ) ={τi, din(σ, τi)≤din(σ,τ˜i),∀τ˜i}. (8) If several pairs of best-reply strategy (τ1, τ2)∈BR1(σ)× BR2(σ)are available, we assume that the decoders choose the worst pair from the encoder perspective. For (R1, R2)∈R2+

and n ∈ N?, the encoder has to solve the following coding problem.

Den(R1, R2) = inf

σ max

τ1BRd1(σ), τ2BRd2(σ)

dne(σ, τ1, τ2). (9) We now discuss now the operational significanceDne(R1, R2).

The encoderE chooses, announces the encoding σ.

The sequenceUn is drawn i.i.d with distributionPU.

The messages (M1, M2) are encoded according to PMσ

1M2|Un.

By knowing σ, the decoder D1 observes (M1, M2) and draws a sequence pair V1n according to the strategy τ1 ∈BRd1(σ). Similarly, the decoder D2 observes M2

and draws a sequence V2n according to the strategy τ2∈BRd2(σ).

The distortion values are given by dne(σ, τ1, τ2), dn1(σ, τ1),dn2(σ, τ2).

III. CHARACTERIZATION

Definition 5. We consider two auxiliary random variables W1∈ W1andW2∈ W2with|Wi|=|Vi|, fori∈ {1,2}. For (R1, R2)∈R2+, we define

Q0(R1, R2) =

QW1W2|U, R2≥I(U;W2), R1+R2≥I(U;W1, W2)

. (10) For every distributionQW1W2|U ∈∆(W1× W2)|U |, we define

Q1(QW1W2|U) = arg min

QV1|W1W2

EQW1W2|U QV1|W1W2

hd1(U, V1)i , (11) Q2(QW2|U) = arg min

QV2|W2

EQW2|U QV2|W2

hd2(U, V2)i

. (12)

Note thatQ1(QW1W2|U)∈∆(V1)|W1×W2|andQ2(QW2|U)∈

∆(V2)|W2|. The encoder’s optimal distortion is defined by D?e(R1, R2)

= inf

QW1W2|U

∈Q0 (R1,R2 )

max

QV1|W1W2∈Q1 (QW1W2|U) QV2|W2∈Q2 (QW2|U)

E h

de(U, V1, V2)i , (13) where the expectation in (13) is evaluated with respect to PUQW1W2|UQV1|W1W2QV2|W2.

Remark 1. The random variablesU, W1, W2, V1, V2 satisfy (U, V2)−−(W1, W2)−−V1, (U, W1, V1)−−W2−−V2. Definition 6. Given QW1W2|U, we denote by QU|W1W2

∆(U)|W1×W2| and QU|W2 ∈ ∆(U)|W2| the posterior beliefs of decodersD1 and D2 that are defined by

Q(u|w1, w2) = P(u)Q(w1, w2|u) P

u0P(u0)Q(w1, w2|u0), ∀u, w1, w2, (14) Q(u|w2) = P(u)P

w1Q(w1, w2|u) P

u0P(u0)Q(w2|u0) , ∀u, w2. (15)

(4)

Moreover, for (w1, w2) ∈ W1× W2, we introduce the no- tations QwU1w2 = QU|W1W2(.|w1, w2) ∈ ∆(U) and QwU2 = QU|W2(.|w2)∈∆(U).

Theorem 1. Let(R1, R2)∈R2+, we have

∀ε >0,∃ˆn∈N,∀n≥n, Dˆ en(R1, R2)≤D?e(R1, R2) +ε,

∀n∈N, Den(R1, R2)≥D?e(R1, R2).

Together with Fekete’s Lemma for the sub-additive se- quence nDne(R1, R2)

n∈N?, this result describes two features of the asymptotic behavior of the encoder’s long run distortion functionDne(R1, R2):

1) It converges to the optimal distortion D?e(R1, R2), 2) It does not go below D?e(R1, R2).

In other words,

n→∞lim Dne(R1, R2) = inf

n∈N?

Dne(R1, R2) =De?(R1, R2). (16)

IV. CONVERSEPROOF

Let (R1, R2)∈R2+ andn∈N?. We consider(σ, τ1, τ2)∈ S(R1, R2) and a random variable T uniformly distributed over {1,2, ..., n} and independent of(Un, M1, M2, V1n, V2n).

We introduce the auxiliary random variables W1 = (M1, T), W2 = (M2, T), (U, V1, V2) = (UT, V1,T, V2,T), distributed according toPU Wστ1τ2

1W2V1V2defined for all(u, w1, w2, v1, v2) = (ut, m1, m2, t, v1,t, v2,t)by

PU Wστ1τ2

1W2V1V2(u, w1, w2, v1, v2)

=PUστ1τ2

TM1M2T V1TV2T(ut, m1, m2, t, v1,t, v2,t)

=1 n

X

ut−1 unt+1

X

vt−1 1 ,vn1,t+1 vt−1

2 ,vn2,t+1

n Y

t=1

PU(ut)

PMσ

1M2|Un(m1, m2|un)

× PVτ1n

1|M1M2(vn1|m1, m2)PVτ2n

2|M2(vn2|m2).

Lemma 1. The distribution PU Wστ1τ2

1W2V1V2 has marginal on

∆(U)given byPU and satisfies the Markov chain properties (U, V2)−−(W1, W2)−−V1, (U, W1, V1)−−W2−−V2. Proof. [Lemma 1] The i.i.d. property of the source ensures that the marginal distribution is PU. By the definition of the decoding functionsτ1 andτ2 we have

(UT, V2,T)−−(M1, M2, T)−−V1,T, (UT, M1, V1,T)−−(M2, T)−−V2,T. ThereforePU Wστ1τ2

1W2V1V2=PUPWσ

1W2|UPVτ1

1|W1W2PVτ2

2|W2. Lemma 2. For allσ, the distributionPWσ

1W2|U ∈Q0.

Proof. [Lemma 2] We consider an encoding strategyσ, then nR2≥H(M2)≥I(M2;Un) (17)

=

n

X

t=1

I(Ut;M2|Ut−1) (18)

=nI(UT;M2|UT−1, T) (19)

=nI(UT;M2, UT−1, T) (20)

≥nI(UT;M2, T) (21)

=nI(U;W2). (22)

In fact, (19) follows from the introduction of the uniform random variable T ∈ {1, . . . , n}, (20) comes from the i.i.d.

property of the source and (22) follows from the identification of the auxiliary random variables(U, W2). Similarly,

n(R1+R2)≥H(M1, M2)≥I(Un;M1, M2) (23)

=

n

X

t=1

I(Ut;M1, M2|Ut−1) (24)

=nI(UT;M1, M2|UT−1, T) (25)

≥nI(UT;M1, M2, T) (26)

=nI(U;W1, W2). (27)

Lemma 3. For all(σ, τ1, τ2)and i∈ {1,2}, we have dne(σ, τ1, τ2) =E

de(U, V1, V2)

, (28)

dni(σ, τi) =E

di(U, Vi)

, (29)

evaluated with respect to PUPWσ

1W2|UPVτ1

1|W1W2PVτ2

2|W2. Moreover for allσ, we have

Q1(PWσ

1W2|U) =n

QV1|W1W2,

∃τ1∈BR1(σ), QV1|W1W2=PVτ1

1|W1W2

o , (30) Q2(PWσ2|U) =n

QV2|W2,

∃τ2∈BR2(σ), QV2|W2=PVτ2

2|W2

o

. (31)

Proof. [Lemma 3] By Definition 3 we have

dne(σ, τ1, τ2) = X

un ,m1,m2, vn1,vn2

n Y

t=1

PU(ut)

× PMσ

1M2|Un(m1, m2|un)PVτ1n

1|M1M2(vn1|m1, m2)

× PVτ2n

2|M2(vn2|m2

"

1 n

n

X

t=1

de(ut, v1,t, v2,t)

#

(32)

=

n

X

t=1

X

ut,m1,m2, v1,t ,v2,t

Pσ,τ12(ut, m1, m2, t, v1,t, v2,t)

×de(ut, v1,t, v2,t) =E

de(U, V1, V2)

. (33)

GivenQV1|W1W2 ∈Q1(PWσ

1W2|U), we considerτ1 such that PVτ1n

1|M1M2(v1n|m1, m2) =

n

Y

t=1

QV1|W1W2(v1,t|m1, m2, t).

(5)

Therefore

dn1(σ, τ1) =EPWσ

1W2|U QV1|W1W2

d1(U, V1)

(34)

= min

PV1|W1W2

EPσW1W2|U PV1|W1W2

d1(U, V1)

(35)

≤min

˜ τ1

EPWσ1W2|U

P˜τ1 V1|W1W2

d1(U, V1)

= min

˜ τ1

dn1(σ,τ˜1), (36)

hence τ1 ∈ BR1(σ). The other inclusion is direct and the same arguments imply (31).

For any strategy σ, we have maxτ12

dne(σ, τ1, τ2)

=maxτ12 E PσW1W2|U

Pτ1 V1|W1W2

Pτ2 V2|W2

h

de(U, V1, V2)i

(37)

= max

QV1|W1W2∈Q1 (Pσ W1W2|U) QV2|W2∈Q2 (Pσ

W2|U)

E PWσ1W2|U QV1|W1W2 QV2|W2

h

de(U, V1, V2)i (38)

≥ inf

QW1W2|U

∈Q0 (R1,R2 )

max

QV1|W1W2∈Q1 (QW1W2|U) QV2|W2∈Q2 (QW2|U)

E h

de(U, V1, V2)i

(39)

=De?(R1, R2). (40)

Equations (37) and (38) comes from Lemma 3, whereas (39) comes from Lemma 2. This concludes the converse proof of Theorem 1.

V. ACHIEVABILITY PROOF

A. Alternative Formulation

Definition 7. We denote by V1?(q1) and V2?(q2), the sets of optimal outputs of decoders D1 andD2 for any distributions q1∈∆(U)and q2∈∆(U).

V1?(q1) = arg min

v1∈V1

X

u

q1(u)d1(u, v1), (41) V2?(q2) = arg min

v2∈V2

X

u

q2(u)d2(u, v2). (42) Definition 8. Given a strategy QW1W2|U we denote by A(Q˜ W1W2|U, w1, w2)⊂ V1×V2the set of action pairs(v1, v2) that are optimal for the decoders and worst for the encoder.

This set is defined by A(Q˜ W1W2|U, w1, w2) =

arg max

(v1,v2 )∈V ?1(Qw1w2 U

V ?2(Qw2 U )

n X

u

QwU1,w2(u)de(u, v1, v2)o . (43)

Definition 9. Given(R1, R2)∈R2+, we define Q˜0(R1, R2) =n

QW1W2|U s.t. R2> I(U;W2), R1+R2> I(U;W1, W2),

wmax1,w2|A(Q˜ W1W2|U, w1, w2)|= 1o . (44)

Definition 10. Consider the following program:

e(R1, R2) = inf

QW1W2|UQ˜0(R1,R2)

max

QV1|W1W2∈Q1 (QW1W2|U) QV2|W2∈Q2 (QW2|U)

E h

de(U, V1, V2)i . (45) where the expectation in (45) is evaluated with respect to PUQW1W2|UQV1|W1W2QV2|W2.

Lemma 4. For(R1, R2)∈R2+, we have

D?e(R1, R2) = ˜De(R1, R2) (46) The proof of lemma 4 relies on showing that ˜

Q0(R1, R2) is dense inQ0(R1, R2). It is provided in the full version of the paper.

B. Special Cases

E

D1

D2

Un

M1

M2

V1n

V2n PV1|W1W2

PV2|W2 W1n, W2n

W2n

d2(U, V2) d1(U, V1)

Fig. 2: Achievability of Successive Refinement Source Coding Setup.

1) R1 = R2 = 0: The auxiliary random variables (W1, W2)are independent of U. Moreover, message sets are singletons, and the only possible encoding strategyσ0is given by σ0 : Un −→ {1} × {1}. The codebook consists of two sequencesW2n(1)andW1n(1,1)only. Therefore, the following result holds:

Lemma 5. D?e(0,0) =Den(0,0) ∀n∈N?.

2) R1 > 0 & R2 = 0: Random variables W2 and U are independent for R1 > 0 and R2 = 0, i.e.

QW1W2|U =QW2QW1|W2U. This means that decoderD2will repeatedly chose the actionv2,0∈V?(PU)that corresponds to its prior beliefPUand maximizes the encoder’s distortion. The persuasion game is thus reduced to the point-to-point problem with one decoderD1 as in [11].

3) R1 = 0 & R2 > 0 : The auxiliary random variable W1 is independent of U. Hence, the encoder transmits the same index to both decoders. Therefore, both decoders will have the same posterior belief QwU2 ∈∆(U),∀w2∈ W2. The output sequencesV1nandV2n are drawn i.i.d. according to the optimal QV1|W2 ∈ Q1(QW2|U) and QV2|W2 ∈ Q2(QW2|U) respectively. This case is also reduced to the point-to-point problem solved in [10], by considering a unique decoder that selects the optimal pair of outputs(v1, v2).

(6)

Definition 11. A family of pairs(λw2,QwU2)w2∈W2 ∈([0,1]×

∆(U))|W2| is a splitting ofPU if X

w2∈W2

λw2 = 1, (47)

X

w2∈W2

λw2QwU2 =PU. (48) In that case, the optimal distortion can be reformulated in terms of a convexification of its expected distortion as follows:

De?(0, R2) = inf

w2,QwU2)w2∈W2

X

w2∈W2

λw2Ψe(QwU2). (49)

whereΨe(q) = max

(v1,v2 ) V ?1(q)×V ?2(q)

Eq

de(U, v1, v2)

.

4) (R1, R2)∈]0,+∞[2: Fix a conditional probability dis- tribution QW1,W2|U. There existsη >0 such that

R2=I(U;W2) +η, (50) R1=I(U;W1|W2) +η. (51) Codebook generation: Randomly and independently gen- erate 2bnR2c sequences w2n(m2) for m2 ∈ [1 : 2bnR2c], according to the i.i.d distribution PW2n = Πnt=1PW2(w2t).

For each (m1, m2) ∈ [1 : 2bnR1c] × [1 : 2bnR2c] gen- erate a sequence w1n(m1, m2) randomly and conditionally independently according to the i.i.d conditional distribution PW1n|M1W2n= Πnt=1PW1|M1W2(w1t|m1, w2t(m2)).

Coding algorithm: Encoder E observes un and looks in the codebook for a pair (m1, m2) such that (un, w1n(m1, m2), wn2(m2)) ∈ Tδn(PUPW1W2|U), i.e. the sequences are jointly typical with tolerance parameter δ >0.

If such a jointly typical tuple doesn’t exist, the source encoder sets (m1, m2) to (1,1). Then, it sends m2 to decoder D2, and(m1, m2)to decoder D1.

Here comes the main difference with the successive refinement coding, which is due to the strategic nature of the problem. Instead of declaring w1n(m1, m2) and w2n(m2) and selecting V1n and V2n i.i.d. with respect to QV1|W1W2 ∈Q1(QW1W2|U) and QV2|W2 ∈Q2(QW2|U), the decoders 1 and 2 compute their Bayesian posterior beliefs PUσn|M1M2 and PUσn|M2 and select the output sequences V1n andV2n that minimize their long-run distortion functions.

Error Event: Given a tolerance δ > 0, the error event is given byE ={(Un, W2n(m2), W1n(m2, m1)∈ T/ δn}. We have by the union of events bound P(E)≤ P(E1) +P(E2(M2)∩ E1c), where

E1={(Un, W2n(m2))∈ T/ δn ∀m2}, (52) E2(m2) ={(Un, W2n(m2), W1n(m2, m1))∈ T/ δn ∀m1}. (53) By [18, Lemma 3.3, pp. 62], P(E1)tends to zero as n→ ∞ if

R2> I(U;W2) +η. (54)

P(E1c∩ E2(M2))goes to zero by [18, Lemma 3.3, pp. 62] if R1+R2> I(U;W1, W2) +η. (55) Since the expected error probability evaluated with respect to the random codebook is small, we have that for allε2>0, for all η > 0, there exists δ >¯ 0, for all δ ≤δ, there exists¯

¯

n∈Nsuch that for alln≥n¯ we have:

E P(E1)

≤ε2, (56)

E

P(E2(m2))

≤ε2. (57) 5) Control of Beliefs: We introduce the indicator of error eventsEδ1∈ {0,1}for decoder D1 defined as follows

Eδ1=

(1, if (un, w1n, w2n)∈ T/ δn(PUQW1W2|U).

0, otherwise. (58)

Assuming the distribution PU|W1W2 is fully supported, the beliefs of decoderD1 are controlled as follows

E h1

n

n

X

t=1

D(Ptm1,m2||PU|W1W2(·|W1t, W2t))

Eδ1= 0i (59)

= X

m1,m2, wn1,wn2

Pσ,τ12(m1, m2, w1n, wn2

E1δ = 0)

×1 n

n

X

t=1

X

u

Ptm1m2(u) log2 Ptm1m2(u)

PU|W1W2(u|w1t, w2t) (60)

= X

m1,m2, wn1,wn2

Pσ,τ12(m1, m2, w1n, wn2

E1δ = 0)

×1 n

n

X

t=1

X

u

Ptm1m2(u) log2 1

PU|W1W2(u|w1t, w2t)

− X

m1,m2, wn1,wn2

Pσ,τ12(m1, m2, wn1, wn2

Eδ1= 0)×

1 n

n

X

t=1

X

u

Ptm1m2(u) log2 1

Ptm1m2(u) (61)

≤1

nI(Un;M1, M2

Eδ1= 0)−I(U;W1, W2) +δ +1

n+ log2|U | · Pσ,τ12(E1δ = 1) (62)

≤η+δ+ 1

n+ log2|U | · Pσ,τ12(Eδ1= 1). (63) Similarly, we can control the beliefs of decoder D2. This completes the proof of achievability.

REFERENCES

[1] V. Crawford and J. Sobel, “Strategic information transmission,”Econo- metrica, vol. 50, no. 6, pp. 1431–51, 1982.

[2] E. Kamenica and M. Gentzkow, “Bayesian persuasion,”American Eco- nomic Review, vol. 101, pp. 2590 – 2615, 2011.

[3] S. Saritas, S. Yuksel, and S. Gezici, “Quadratic multi-dimensional signaling games and affine equilibria,”IEEE Transactions on Automatic Control, vol. 62, no. 2, p. 605–619, Feb 2017.

[4] S. Sarıta¸s, P. Furrer, S. Gezici, T. Linder, and S. Yüksel, “On the number of bins in equilibria for signaling games,” in2019 IEEE International Symposium on Information Theory (ISIT), 2019, pp. 972–976.

(7)

[5] S. Sarıta¸s, S. Yüksel, and S. Gezici, “Dynamic signaling games with quadratic criteria under nash and stackelberg equilibria,”Automatica, vol. 115, no. C, May 2020.

[6] S. Dughmi, D. Kempe, and R. Qiang, “Persuasion with limited commu- nication,” inProceedings of the 2016 ACM Conference on Economics and Computation, ser. EC ’16. New York, NY, USA: Association for Computing Machinery, 2016, p. 663–680.

[7] E. Akyol, C. Langbort, and T. Ba¸sar, “Strategic compression and transmission of information,” in IEEE Information Theory Workshop - Fall (ITW), Oct 2015, pp. 219–223.

[8] E. Akyol, C. Langbort, and T. Ba¸sar, “Information-theoretic approach to strategic communication as a hierarchical game,”Proceedings of the IEEE, vol. 105, no. 2, pp. 205–218, 2017.

[9] M. Le Treust and T. Tomala, “Information design for strategic coordi- nation of autonomous devices with non-aligned utilities,”IEEE Proc. of the 54th Allerton conference, Monticello, Illinois, pp. 233–242, 2016.

[10] ——, “Persuasion with limited communication capacity,” Journal of Economic Theory, vol. 184, p. 104940, 2019.

[11] ——, “Point-to-point strategic communication,”IEEE Information The- ory Workshop, 2020.

[12] A. D. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,”IEEE Transactions on Information Theory, vol. 22, no. 1, pp. 1–11, 1976.

[13] E. Akyol, C. Langbort, and T. Ba¸sar, “On the role of side information in strategic communication,” in IEEE International Symposium on Information Theory (ISIT), July 2016, pp. 1626–1630.

[14] R. Bou Rouphael and M. Le Treust, “Impact of private observation in bayesian persuasion,”International Conference on NETwork Games COntrol and OPtimization NetGCoop, Mar. 2020.

[15] M. Le Treust and T. Tomala, “Strategic communication with decoder side information,”Information Symposium on Information Theory (ISIT), 2021.

[16] A. S. Vora and A. A. Kulkarni, “Achievable rates for strategic communi- cation,” in2020 IEEE International Symposium on Information Theory (ISIT), 2020, pp. 1379–1384.

[17] ——, “Information extraction from a strategic sender: The zero error case,” 2020. [Online]. Available: https://arxiv.org/abs/2006.10641 [18] A. El Gamal and Y.-H. Kim,Network information theory. Cambridge

university press, 2011.

Références

Documents relatifs

We formulate a point-to-point source-channel coding problem with state information, in which the encoder and the decoder choose their respective encoding and decoding strategies

Gap based on the need for treatment for all HIV-positive pregnant women, serodiscordant couples, HIV-positive female sex workers, HIV-positive MSM and HIV-positive IDU’s (regardless

However, our results show that the profits of airlines can be higher if they invest in more efficient aircraft: eventhough the activity would increase, the emissions are lower

- a simple or minimalist version of strategic management accounting: management control is a tool to strategy formulation (Camillus & Grant, 1980; Simmonds, 1981; Lorange et

Th e Commission would like to thank its partners, particularly those in civil society and the private sector, whose tireless eff orts have made this tremendous task lighter. Th

If airline i is more efficient than the benchmark, which means that its fuel consumption is lower than the average consumption, airline i receives allowances which will be sold on

The proposed algorithm consists of the Context-Adaptive Binary Arithmetic Coding (CABAC) algorithm and the enumerative coding algorithm with a hierarchical approach.. The pro-

Other mathematical optimization based approaches that have been applied to simultaneously clustering gene expression and drug activity profiles are the Soft Topographic