• Aucun résultat trouvé

Background (I): matrix completion

N/A
N/A
Protected

Academic year: 2022

Partager "Background (I): matrix completion"

Copied!
19
0
0

Texte intégral

(1)

Decentralized Low-Rank Matrix Completion

Qing Ling1, Yangyang Xu2, Wotao Yin2, Zaiwen Wen3

1. Department of Automation, University of Science and Technology of China 2. Department of Computational and Applied Mathematics, Rice University

3. Department of Mathematics, Shanghai Jiaotong University

(2)

Outline

‰ Background: matrix completion

‰ centralized Æ decentralized

‰ Problem formulation

‰ nonconvex matrix factorization model + decentralized computing

‰ Algorithm design

‰ Gauss-Seidel + decentralized implementation (with the ADM)

‰ Simulation and conclusion

Keywords: matrix completion matrix factorization ADM/ADMM decentralized computing

(3)

Background (I): matrix completion

„ Matrix completion problem

z Knowing some entries of a matrix, to recover the others

z Important prior: the matrix is low-rank

„ Related applications

z Collaborative filtering

(4)

Background (II): decentralized matrix completion

„ Distributed data in distributed agents & no fusion center

z Privacy, cost of data collection, etc

z Decentralized computing with limited information exchange

(5)

Problem formulation (I): two models

„ A connected network with L distributed agents

z Agent i observes some entries of a data matrix

z The whole data matrix is with rank r<<min(N, M)

z Observation over a subset

„ Nonconvex matrix factorization model

Convex nuclear norm minimization model

(6)

Problem formulation (II): decentralized computing

„ Decentralized matrix completion with the nonconvex model

z Public matrix X: common to all agents

z Private matrix Yi: held by agent i

(7)

Problem formulation (III): nonconvex vs. convex

„ Nonconvex vs. convex in decentralized computing

z Nonconvex: efficient computation of X and Yi (and Zi)

z Convex: decentralized SVD as a subroutine

(8)

Algorithm design (I): Gauss-Seidel method

„ Centralized Gauss-Seidel method: LMaFit

projection

[WYZ10] Z. Wen, W. Yin, and Y. Zhang. Solving a low-rank factorization model for matrix completion by a non-linear successive over-relaxation algorithm. Mathematical Programming Computation, To Appear

(9)

Algorithm design (I): Gauss-Seidel method

„ Centralized Gauss-Seidel method: LMaFit

„ A simple but nontrivial revision: replace X(t+1) with

(10)

Algorithm design (II): decentralized implementation

„ If agent i knows X(i)=X, the updates of Yi and Zi are easy

Y-update

Z-update

„ How to update X(i)? Choose c=1/L:

X-update is average consensus

(11)

Algorithm design (III): average consensus

Zi(t)YiT(t) Zj(t)YjT(t)

How to let all agents have the average?

(12)

Algorithm design (IV): average consensus

„ Exactly solving the average consensus problem

z Randomized gossip, alternating direction method (ADM)

z Iterative algorithm: dividing each iteration into S slots

„ The ADM is a powerful tool for decentralized optimization

[B1999] D. Bertsekas. Numerical Optimization, Second Edition. Athena Scientific, 1999

[SRG2008] I. Schizas, A. Ribeiro, and G. Giannakis. Consensus in ad hoc WSNs with noisy links - Part I: Distributed estimation of deterministic signals. IEEE Transactions on Signal Processing, 2008

[LTYY2012] Q. Ling, M. Tao, W. Yin, and X. Yuan. A multi-block alternating direction method with parallel splitting for decentralized consensus optimization. Journal of Wireless Communications and Networking, Submitted

(13)

Algorithm design (IV): average consensus

„ Exactly solving the average consensus problem

z Randomized gossip, alternating direction method (ADM)

z Iterative algorithm: dividing each iteration into S slots

„ Exact average consensus iterates with the ADM

multiplier positive constant neighbors

(14)

Algorithm design (V): inexact average consensus

„ Exact average consensus

z Optimal when the network is connected

z The decentralized algorithm = the centralized algorithm

z Extra communication & coordination costs

„ Inexact average consensus

z Simply let S=1 in the ADM; no more iterates

z Different from the centralized algorithm

(15)

Algorithm design (V): optimization framework

„ Updating own private from own public

„ Updating own public from own private & neighboring public

(16)

Simulation (I): simulation settings

„ Network settings

z L=25 agents are randomly distributed in a 100×100 area

z Two agents are neighbors if their distance < 30

„ Data settings

z N=300 rows, M=500 columns, 20 columns per agent

z 100×p percent randomly chosen entries can be observed

z True rank K=4

„ Performance evaluation: relative error

(17)

Simulation (II): convergence

Estimated rank r=4=K Estimated rank r=6>K Knowing 80% data, p=0.8

(18)

Conclusion

„ Concluding remarks

z Discuss matrix completion in a decentralized manner

z Use a nonconvex matrix factorization model

z Main feature: private <-> public

z Gauss-Seidel for private, average consensus for public

„ Open questions

z When can we protect data privacy?

z Convergence? Nonconvex model + inexact average consensus

(19)

Thank you!

Références

Documents relatifs

Then, when the non-smooth part is also partly smooth relative to a smooth submanifold, we establish finite identification of the latter and provide sharp local linear

In this paper, the Scene Background Initialization (SBI) data set was chosen for the background initialization task. The data set contains seven image sequences and corresponding

Plus préci- sément, nous nous pencherons sur les molécules organiques semiconductrices de type n, potentiellement utilisables dans des transistors à eet de champ (OFET, organic

(a) HEC–TM cells were seeded at the same density, treated or not with 4OHT, transduced to express an RFP-coupled mitochondrial protein and a day later labelled by the calcium

New methods for simu- lation and studying of permafrost degradation in well pads areas which related with permafrost heating by various technical system are developed, taking

On the other hand, another splitting method called the parallel proximal algorithm (PPXA) was proposed in [21] to minimize a sum of convex functions in a possibly infinite

The essential idea underlying the proposed approach is to decompose the original nonconvex stochastic problem into a sequence of (simpler) deterministic subproblems. In this case,

Despite the widespread deployment of WiMAX (IEEE 802-16d) networks, there is no radio monitoring tool on the market to support network operators in the optimization task; hence