NP-hardness of $\ell$0 minimization problems: revision and extension to the non-negative setting

(1)

HAL Id: hal-02112180

https://hal.archives-ouvertes.fr/hal-02112180

Submitted on 26 Apr 2019

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of

sci-entific research documents, whether they are

pub-lished or not. The documents may come from

teaching and research institutions in France or

abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est

destinée au dépôt et à la diffusion de documents

scientifiques de niveau recherche, publiés ou non,

émanant des établissements d’enseignement et de

recherche français ou étrangers, des laboratoires

publics ou privés.

extension to the non-negative setting

Thi Thanh Nguyen, Charles Soussen, Jérôme Idier, El-Hadi Djermoune

To cite this version:

Thi Thanh Nguyen, Charles Soussen, Jérôme Idier, El-Hadi Djermoune. NP-hardness of ℓ0

minimiza-tion problems: revision and extension to the non-negative setting. 13th Internaminimiza-tional Conference on

Sampling Theory and Applications, SampTA 2019, Jul 2019, Bordeaux, France. �hal-02112180�

(2)

NP-hardness of

`

₀

minimization problems:

revision and extension to the non-negative setting

Thanh T. N

GUYEN1

, Charles S

OUSSEN2

, J´erˆome I

DIER3

, El-Hadi D

JERMOUNE1

1

_{CRAN, Universit´e de Lorraine, CNRS, F-54506 Vandœuvre-l`es-Nancy, France}

2

_{L2S, CentraleSupélec-CNRS-Université Paris-Sud, Université Paris-Saclay,}

F-91192 Gif-sur-Yvette, France

3

_{LS2N, UMR 6004, F-44321 Nantes, France}

{thi-thanh.nguyen,el-hadi.djermoune}@univ-lorraine.fr;

[email protected]; [email protected]

Abstract—Sparse approximation arises in many appli-cations and often leads to a constrained or penalized `0

minimization problem, which was proved to be NP-hard. This paper proposes a revision of existing analyses of NP-hardness of the penalized `0 problem and it introduces a

new proof adapted from Natarajan’s construction (1995). Moreover, we prove that `0minimization problems with

non-negativity constraints are also NP-hard.

I. INTRODUCTION

Sparse approximation appears in a wide range of appli-cations, especially in signal processing, image processing and compressed sensing [1]. Given a signal data y ∈ Rm and a dictionary A of size m×n, the aim is to find a signal x ∈ Rnthat gives the best approximation y ≈ Ax and has the fewest non-zero coefficients (i.e., sparsest solution). This task leads to solving one of the following constrained or penalized `0 minimization problems:

min ky−Axk2≤ kxk0 (`0C) min kxk0≤K ky − Axk2 2 (`0C0) min x ky − Axk 2 2+ λkxk0 (`0P )

in which , K and λ are positive quantities related to the noise standard deviation, the sparsity level and regulariza-tion strength, respectively. Letters C and P respectively indicate that the problem is constrained or penalized. Depending on application, the appropriate statement will be addressed. It is noteworthy that n and K often depend on m when one considers the size of problem. (`0C) and

(`0C0) are well known to be hard [2, 3]. The

NP-hardness of (`0P ) was claimed to be a particular case of

more general complexity analyses in [4, 5]. However, we point out that these complexity analyses do not rigorously apply to (`0P ) as claimed. In this paper, we justify the

complexity analyses in [4, 5] do not apply to problem (`0P ), and we provide a new proof for the NP-hardness

of (`0P ) adapted from Natarajan’s construction [2].

In several applications such as geoscience and remote sensing [6, 7], audio [8], chemometrics [9] and computed

This work was supported by the project ANR BECOSE (ANR-15-CE23-0021).

This work was carried out in part while T. T. Nguyen was visiting L2S during the academic years 2017-2019.

tomography [10], the signal or image of interest is non-negative. In such contexts, one often addresses a mini-mization problem with both sparsity and non-negativity constraints [10–12]. Adding non-negativity constraints to `0 minimization problems yields the following problems:

min x≥0,ky−Axk2≤ kxk0 (`0C+) min x≥0,kxk0≤K ky − Axk2 2 (`0C0+) min x≥0 ky − Axk 2 2+ λkxk0 (`0P +)

Several papers address non-negative `0minimization

prob-lems in the literature (see, e.g., [13–16]). However, to the best of our knowledge, the complexity of these problems has not been addressed yet, the question of their NP-hardness being still open. Here we show that these prob-lems are NP-hard and the proof can be derived from the NP-hardness of `0 minimization problems.

The rest of paper is organized as follows. In Section II, we discuss the issues related to NP-hardness of (`0P ) in

existing analyses and we present our proof. In Section III, we discuss about the NP-hardness of non-negative `0

minimization problems. We draw some conclusions in Section IV.

II. HARDNESS OF`0MINIMIZATION PROBLEMS

A. Background on constrained `0 minimization problems

Let us recall that an NP-complete problem is a prob-lem in NP to which any other probprob-lem in NP can be reduced in polynomial time. Thus NP-complete problems are identified as the hardest problems in NP. An NP-complete problem is strongly NP-NP-complete if it remains NP-complete when all of its numerical parameters are bounded by a polynomial in the length of the input. NP-hard problems are at least as hard as NP-complete problems. However, NP-hard problems do not need to be in NP and do not need to be decision problems. Formally, a problem is NP-hard (respectively, strongly hard) if a complete (respectively, strongly NP-complete) problem can be reduced in polynomial time to it. The reader is referred to [17, 18] for more information on this topic.

(3)

In the literature, problem (`0C), called SAS in [2],

is well known to be hard [2, Theorem 1]. The NP-hardness of (`0C) is a valuable extension of an earlier

result: the problem of minimum weight solution to linear equations (equivalent to (`0C) with = 0) is NP-hard [17,

p. 246]. Davis et al. proved that (`0C0), called M -optimal

approximation in [3], is NP-hard for any K < m [3, Theorem 2.1]. Both analyses of Natarajan and Davis were made by a polynomial time reduction from the “exact cover by 3-sets” problem1 _{which is known to be}

NP-complete [17, p. 221].

B. Existing analyses on penalized`0 minimization

In [4, 5], the NP-hardness of (`0P ) is deduced as

a particular case of more general complexity analyses. However, it turns out that the latter do not apply to (`0P ), as explained hereafter. Chen et al. [4] address the

unconstrained `q-`p minimization problem, defined by:

min x ky − Axk q q+ λkxk p p (`q-`p)

where λ > 0, q ≥ 1 and 0 ≤ p < 1. The authors showed that problem (`q-`p) is NP-hard with any λ > 0, q ≥ 1

and 0 ≤ p < 1 [4, Theorem 3]. Obviously, (`0P ) is the

case where q = 2 and p = 0. The proof was done by i) introducing an invertible transformation which scales any instance of problem (`q-`p) to the problem (`q-`p)

with λ = 1/2, and ii) establishing a polynomial time reduction from the partition problem which is known to be NP-complete [17] to the problem (`q-`p) with λ = 1/2.

In other words, they showed that problem (`q-`p) with

λ = 1/2 is NP-hard and, because there exists an invertible transformation from any problem (`q-`p) to the one with

λ = 1/2, every problem (`q-`p) is NP-hard. Similarly, they

showed that (`q-`p) is strongly NP-hard [4, Theorem 5] by

a reduction from the 3-partition problem which is known to be strongly NP-hard [17]. The invertible transform used in [4] is defined by:

˜

x = (2λ)1/px, A = (2λ)˜ −1/pA. (1) Unfortunately, (1) is not well-defined when p = 0. There-fore, [4, Theorems 3 and 5] do not apply to (`0P ) when

λ 6= 1/2.

Using a different approach, Huo and Chen’s paper [5] addresses the penalized least-squares problem defined by:

min x ky − Axk 2 2+ λ n X i=1 φ(|xi|), (PLS)

where φ is a penalty function mapping non-negative values to non-negative values. The authors showed that (PLS) is NP-hard if the penalty function φ satisfies the following four conditions [5, Theorem 3.1]:

C1. φ(0) = 0 and ∀ 0 ≤ τ1< τ2, φ(τ1) ≤ φ(τ2).

C2. There exists τ0> 0 and a constant d > 0 such that

φ(τ ) ≥ φ(τ0) − d(τ0− τ )2

1_{The latter problem, denoted by X3C in [2, 17], is stated as follows:}

Given a set S and a collection C of 3-element subsets of S (called triplets), is there a subcollection of disjoint triplets that exactly covers S?

for every 0 ≤ τ < τ0.

C3. For the aforementioned τ0, if τ1, τ2< τ0 then

φ(τ1) + φ(τ2) ≥ φ(τ1+ τ2).

C4. For every 0 ≤ τ < τ0,

φ(τ ) + φ(τ0− τ ) > φ(τ0). (2)

The proof of [5, Theorem 3.1] is by a reduction from the NP-complete problem X3C to the decision version of (PLS); this leads to the NP-completeness of the decision version of (PLS) and so the NP-hardness of (PLS) [5, Appendix 1]. The authors claimed that the `0 penalty

function satisfies conditions C1-C4 for τ0 = d = 1.

Therefore, the (PLS) problem with the `0penalty function

is NP-hard [5, Corollary 3.2]. Unfortunately, it turns out that the `0penalty does not fulfill condition C4 as claimed.

Indeed, for τ = 0 the strict inequality (2) becomes φ(0) > 0. Besides, in the proof [5, Appendix 1], the inputs of the decision problem are not guaranteed to have rational values. This might also violate the polynomiality of the reduction. Therefore, [5, Theorem 3.1] does not apply to (`0P ).

In [5], the authors also mention an alternate proof of NP-hardness of (`0P ) from Huo and Ni’s earlier paper

[19] as a special case of their results. In this proof [19, Appendix A.1], the relation between (`0P ) and (`0C) is

es-tablished using the principle of Lagrange multiplier. More precisely, the authors introduce an instance of (`0C) in

which is defined from the minimizer of (`0P ) and argue

that solving (`0P ) is equivalent to solving the mentioned

instance of (`0C), which is known to be NP-hard [2].

There are a number of issues in the NP-hardness proof in [19]. For instance, the proposed transformation between (`0P ) and (`0C) is not a polynomial time reduction.

Besides, it is well known that (`0P ) and (`0C) are not

equivalent [20].

C. New analysis on penalized `0 minimization problems

To prove that a problem T is NP-hard, one must estab-lish a polynomial time reduction (briefly called reduction hereafter) from some known NP-hard or NP-complete problem to T [18]. Roughly speaking, the reduction from a problem T1 to another problem T2 implies that T1 is not harder than T2. Therefore, if there exists a reduction from T1 to T2 and if T1 is NP-hard, T2 must be NP-hard too. The NP-hardness proofs in [2] and [3] use this principle. As an adaptation of Natarajan’s construction, we prove the NP-hardness of (`0P ) using the same principle as follows.

Theorem II.1. Problem (`0P ) is NP-hard for 0 < λ < 3.

The proof is by a reduction from the known NP-complete problem X3C to (`0P ). The proof contains three

steps: (1) Construct an instance of (`0P ) from a given

instance of X3C; (2) Construct a solution of (`0P ) from

a solution of X3C; (3) Construct a solution of X3C from a solution of (`0P ).

1) Construction of an instance of (`0P ) from a given

instance of X3C: Given an instance of X3C: S = {s1, s2, ..., sm} is a set of m elements. C is a collection

(4)

can assume that m is a multiple of 3 since otherwise there is trivially no exact cover so no solution of X3C.

We now construct an instance of (`0P ). Let y =

[1, 1, ..., 1]T ∈ Rm_{. Let A = (a}

ij)1≤i≤m,1≤j≤n where

aij = 1 if si ∈ cj and aij = 0 otherwise. Let λ ∈ Q,

0 < λ < 3. Let

F (x) := ky − Axk22+ λkxk0. (3)

2) Construction of a solution of (`0P ) from a

solu-tion of X3C: Assume that there is a subcollection of disjoint triplets ˆC which exactly covers S. Let x∗ = [x∗1, x∗2, ..., x∗n]T where x∗j = 1 if cj ∈ ˆC and x∗j = 0

otherwise. We will prove that x∗ is a solution of (`0P ).

Since ˆC exactly covers S, | ˆC| = m/3 and y = Ax∗. Thus, kx∗k0= m/3 and

F (x∗) = 0 + λm

3 = λ

m 3. Suppose that there exists ¯x such that

F (¯x) < F (x∗) = λm

3. (4)

Let us show that this leads to a contradiction.

Since F (¯x) ≥ λk¯xk0, from (4) we have k¯xk0< m/3.

Therefore, we can rewrite k¯xk0 = m/3 − q for some

q ∈ N, 1 ≤ q < m/3. Note that A¯x has m entries. Since the number of non-zero entries of A¯x identifies with the number of elements si recovered by the subcollection

corresponding to ¯x, this number cannot exceed 3k¯xk0=

m − 3q. As a result, the number of zero entries of A¯x must be between 3q and m. Since y is the all-one vector, y − A¯x has at least 3q entries valued 1, which implies

ky − A¯xk22≥ 3q. (5) Hence, F (¯x) ≥ 3q + λm 3 − q = λm 3 + (3 − λ)q > λ m 3, (6) which contradicts (4). Therefore, x∗is a solution of (`0P ).

3) Construction of a solution of X3C from a solution of (`0P ): Assume that x∗is a solution of (`0P ). We will

consider four cases as follows. a) Casekx∗_k

0> m/3: We deduce that X3C has no

solution. Indeed, assume that ˆC is an exact cover for S. Define x = [x1, x2, ..., xn]T where xj = 1 if cj∈ ˆC and

xi= 0 otherwise. Then we have

F (x) = λm 3 < λkx

∗_k

0≤ F (x∗)

which contradicts the fact that x∗ is a solution of (`0P ).

b) Case kx∗_k

0 < m/3: We deduce that X3C has

no solution. Indeed, assume that ˆC is an exact cover for S. Let x = [x1, x2, ..., xn]T where xj = 1 if cj ∈ ˆC

and xi = 0 otherwise. Then we have F (x) = λ

m 3. Since kx∗_k

0< m/3, we can write kx∗k0= m/3 − q for some

q ∈ N and 1 ≤ q < m/3. Similar to (6), we have F (x∗) > λm

3. Since F (x) = λ m

3, we obtain F (x

∗_{) > F (x) which}

contradicts the fact that x∗ is a solution of (`0P ).

c) Case where kx∗_k

0 = m/3 and y 6= Ax∗: We

deduce that X3C has no solution. Indeed, assume that ˆC is an exact cover for S. Define x = [x1, x2, ..., xn]T where

xj= 1 if cj∈ ˆC and xi= 0 otherwise. Then we have

F (x) = λm

3 < ky − Ax

∗_k2

2+ λkx∗k0= F (x∗)

which contradicts the fact that x∗ is a solution of (`0P ).

d) Case where kx∗_k

0= m/3 and y = Ax∗: Let ˆC

be the collection of triplets cj such that the jth entry of

x∗ is non-zero. Obviously, ˆC is an exact cover for S so a solution of X3C.

Thus Theorem II.1 is proved.

It is notable that the proof above is also valid when F (x) := ky − Axkp

p+ λkxk0 for any p ≥ 1. Indeed,

one only needs to check whether (5) still holds when the `2 norm is replaced by the `p norm with p ≥ 1. This

is the case since y − A¯x has at least 3q entries equal to 1. Therefore, we have the following generalization of Theorem II.1.

Theorem II.2. Problem minxky −Axkpp+ λkxk0is

NP-hard for p ≥ 1 and 0 < λ < 3.

III. HARDNESS OF NON-NEGATIVE`0MINIMIZATION PROBLEMS

The NP-hardness of non-negative `0minimization

prob-lems is a consequence of NP-hard proofs of (`0C) [2],

(`0C0) [3] and (`0P ) (Theorem II.1). Indeed, all these

proofs consist in a reduction from X3C and the solu-tion that established equivalence is binary. Therefore, the additional non-negativity constraints do not change the validity of these proofs. In other words, one can repeat the same proofs as in [2, 3] and that of Theorem II.1 for the corresponding non-negative `0minimization problems

(`0C+), (`0C0+) and (`0P +). Another way to prove the

NP-hardness of non-negative `0minimization problems is

by a reduction from the corresponding `0 minimization

problems which are known to be NP-hard. In this reduc-tion, the instance of non-negative problems is defined by

˜

y = y, A = [A, −A],˜ x =˜ x

+

x−

where x+ _{= max{x, 0}, x}− _{= max{−x, 0}. Naturally,}

by this construction, one gets ˜x ≥ 0, k˜xk0 = kxk0 and

˜

A˜x = Ax. The proofs (skipped for brevity) contain three steps similar to that of Theorem II.1.

Therefore, we can state the following theorem without proof.

Theorem III.1. (`0C+), (`0C0+) are NP-hard. The same

for (`0P +) with 0 < λ < 3.

In the same spirit and using the same argument as at the end of Section II-C one can directly extend Theorem II.2 to the non-negative setting.

Theorem III.2. Problem minx≥0 ky − Axkpp+ λkxk0 is

(5)

IV. CONCLUSION

NP-hardness of penalized `0 minimization problems

cannot be deduced from previous complexity analyses, as stated in [4, 5]. Here, we introduced a new proof of NP-hardness of penalized `0minimization problems when

the regularization parameter λ is smaller than 3, by an adaptation of Natarajan’s construction [2], while the case λ ≥ 3 is still open. Besides, we showed that the `0

minimization problems with non-negative constraints are also NP-hard.

This work can be extended in several directions. For instance, researchers interested in what makes NP-hard problems even harder might be interested in the strong NP-hardness of the aforementioned optimization problems. As it is widely believed that X3C is strongly NP-complete, one might easily deduce the strong NP-hardness of (`0C),

(`0C0) and other problems which are reduced from X3C.

However, to the best of our knowledge, X3C is only proved to be complete [17, pp. 53, 221] and the strong NP-completeness has not been rigorously shown yet. There-fore, we believe that the question of strong NP-hardness of (non-negative) `0 minimization problems is not trivial

and needs more work in future.

Besides, as (non-negative) `0minimization problems are

NP-hard, it would be interesting to know if the associated decision problems are in NP (so being NP-complete). Let us consider the decision problem associated with (`0C):

given y ∈ Qm, A ∈ Qm×n, a positive rational number and a positive integer K, does there exist x ∈ Rnsuch that ky − Axk2 ≤ and kxk0 ≤ K? This decision problem

should be in NP since if one can guess a rational solution x, it can be verified in polynomial time if ky − Axk2≤

and kxk0≤ K. Similarly, we conjecture that the decision

version of other optimization problems mentioned in the paper are in NP as well.

Another perspective is the approximability of aforemen-tioned NP-hard problems. The hardness of approximating (`0C) was discussed in [21, 22]. It was shown that

approxi-mating (`0C) to within a factor of (1−α) ln(n), 0 < α < 1

is NP-hard [22]. Examining whether similar results can be obtained on other NP-hard problems presented in the paper would require more involved theoretical analysis, which is left for future work.

V. ACKNOWLEDGMENT

The authors would like to thank M. Mongeau (ENAC, Toulouse) for helpful discussions and two anonymous reviewers for constructive and valuable comments.

REFERENCES

[1] M. Elad, Sparse and redundant representations: From theory to applications in signal and image processing, Springer, New York, Aug. 2010.

[2] B. K. Natarajan, “Sparse approximate solutions to linear systems”, SIAM J. Comput., vol. 24, no. 2, pp. 227–234, Apr. 1995. [3] G. Davis, S. Mallat, and M. Avellaneda, “Adaptive greedy

approx-imations”, Constructive Approximation, vol. 13, no. 1, pp. 57–98, Mar. 1997.

[4] X. Chen, D. Ge, Z. Wang, and Y. Ye, “Complexity of unconstrained l2-lp minimization”, Mathematical Programming, , no. 143, pp. 371–383, 2014.

[5] X. Huo and J. Chen, “Complexity of penalized likelihood estima-tion”, J. Statist. Comput. Simul., , no. 80:7, pp. 747–759, 2010.

[6] M.-D. Iordache, J. M. Bioucas-Dias, and A. Plaza, “Sparse unmixing of hyperspectral data”, IEEE Trans. Geosci. Remote Sensing, vol. 49, no. 6, pp. 2014–2039, June 2011.

[7] E. Esser, Y. Lou, and J. Xin, “A method for finding structured sparse solutions to nonnegative least squares problems with applications”, SIAM J. Imaging Sci., vol. 6, no. 4, pp. 2010–2046, Oct. 2013. [8] T. Virtanen, J. F. Gemmeke, and B. Raj, “Active-set Newton

algorithm for overcomplete non-negative representations of audio”, IEEE Trans. Audio, Speech, Language Process., vol. 21, no. 11, pp. 2277–2289, Nov. 2013.

[9] R. Bro and S. De Jond, “A fast non-negativity-constrained least squares algorithm”, J. Chemometrics, vol. 11, no. 5, pp. 393–401, 1997.

[10] S. Petra and C. Schn¨orr, “Average case recovery analysis of tomographic compressive sensing”, Linear Alg. Appl., vol. 441, pp. 168–198, 2014.

[11] P. O. Hoyer, “Non-negative matrix factorization with sparseness constraints”, J. Mach. Learning Res., vol. 5, pp. 1457–1469, 2004. [12] J. Rapin, J. Bobin, A. Larue, and J.-L. Starck, “Sparse and non-negative BSS for noisy data”, IEEE Trans. Signal Process., vol. 61, no. 22, pp. 5620–5632, Nov. 2013.

[13] A. M. Bruckstein, M. Elad, and M. Zibulevsky, “On the uniqueness of nonnegative sparse solutions to underdetermined systems of equation”, IEEE Trans. Inf. Theory, vol. 54, no. 11, pp. 4813– 4820, Nov. 2008.

[14] R. Peharz and F. Pernkopf, “Sparse nonnegative matrix factorization with l0 constraints”, Neurocomputing, vol. 80, pp. 38–46, 2012. [15] M. Yaghoobi, D. Wu, and M. E. Davies, “Fast non-negative

orthogonal matching pursuit”, IEEE Signal Process. Lett., vol. 22, no. 9, pp. 1229–1233, Sept. 2015.

[16] Z. Wang, R. Zhu, K. Fukui, and J. Xue, “Cone-based joint sparse modelling for hyperspectral image classification”, Signal Processing, vol. 144, pp. 417–429, 2018.

[17] M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, W. H. Freeman, New York, 1979.

[18] J. v. Leeuwen, Handbook of theoretical computer science (vol. A): algorithms and complexity, Elsevier Science Publishers, 1990. [19] X. Huo and X. Ni, “When do stepwise algorithms meet subset

selection criteria?”, Annals Statist., vol. 35, no. 2, pp. 870–887, 2007.

[20] M. Nikolova, “Description of the minimizers of least squares regularized with `0 norm. Uniqueness of the global minimizer”,

SIAM J. Imaging Sci., vol. 6, no. 2, pp. 904–937, May 2013. [21] E. Amaldi and V. Kann, “On the approximability of minimizing

nonzero variables or unsatisfied relations in linear systems”, The-oretical Computer Science, vol. 209, no. 1, pp. 237 – 260, 1998. [22] A. M. Tillmann, “On the computational intractability of exact and

approximate dictionary learning”, IEEE Signal Processing Letters, vol. 22, no. 1, pp. 45–49, Jan 2015.