Approximations Algorithms (for Database Researchers)

(1)

Approximations Algorithms

(for Database Researchers)

(2)

Optimization and computation problems

Letf be a functionf :X →R(often, but not always, a function that countssomething). We focus in this talk on:

Optimization problems.

Given a set of objectsS,findsome subsetX ⊆Ssuch thatf(X)isminimal(ormaximal) among allX satisfying some conditions.

Given a database D,findsome set of tuples X such that f(X)isminimalamong all X satisfying some conditions.

Computation problems.

Given a set of objectsS,computethe value off(S). Given a database D,computethe value of f(D).

(3)

Optimization and computation problems

(4)

Optimization and computation problems

(5)

Optimization and computation problems

Given a set of objectsS,computethe value off(S).

Given a database D,computethe value of f(D).

(6)

Optimization and computation problems

(7)

Examples of optimization problems

Examples

Maximum Matching. Given a set of tasks and a set of workers with preferences of workers on tasks,findan assignment for all tasks thatmaximizessatisfaction.

Set Cover. Given a set of people, each with fluency in various languages, finda group of people ofminimumsize who can speak all the languages.

Vertex Cover. In a city,finda set ofminimumsize of road intersections where to put street cameras such that all roads are covered.

Inconsistent Data Repair. Given a database inconsistent w.r.t. fixed integrity constraints,findtheminimumamount of tuples to add or remove to make it consistent.

Influence Maximization. Given a social network with influence probabilities on edges,findthe set of nodes to target tomaximizethe impact of a marketing campaign.

(8)

Examples of optimization problems

Examples

(9)

Examples of optimization problems

Examples

(10)

Examples of optimization problems

Examples

(11)

Examples of optimization problems

Examples

Influence Maximization. Given a social network with influence probabilities

(12)

Examples of Computation Problems

Examples

Coloring Counting. Computethe number of ways to color a graph with 3 colors.

SQL Match Counting. Computethe number of distinct matches to a fixed SQL query:

SELECT COUNT(DISTINCT *)

FROM R NATURAL JOIN S NATURAL JOIN T Navigational XPath Counting. Computethe number of matches of a

fixed simple XPath expression (no functions, no equality): count(//a[b/c]/d[e/f])

Probabilistic Query Evaluation. Computethe probability of a fixed SQL query over a database whose tuples are annotated with probabilities.

(13)

Examples of Computation Problems

Examples

FROM R NATURAL JOIN S NATURAL JOIN T

Navigational XPath Counting. Computethe number of matches of a fixed simple XPath expression (no functions, no equality):

count(//a[b/c]/d[e/f])

(14)

Examples of Computation Problems

Examples

fixed simple XPath expression (no functions, no equality):

(15)

Examples of Computation Problems

Examples

fixed simple XPath expression (no functions, no equality):

Probabilistic Query Evaluation. Computethe probability of a fixed SQL query over a database whose tuples are annotated with

(16)

Intractability

Most of these problems areintractable: unless P=NP, there is no polynomial-time algorithm to solve them, only algorithms

exponential in the size of the data!

Two classes of intractability discussed here: NP-hardnessfor optimization problems,#P-hardnessfor computation problems (latter implies former). See further.

Polynomial-time NP-hard #P-hard

Max. Matching Set Cover Coloring Counting

Nav. XPath Counting Vertex Cover SQL Match Counting Incons. Data Repair Prob. Query Evaluation Influence Maximization

(17)

Intractability

Nav. XPath Counting Vertex Cover SQL Match Counting Incons. Data Repair Prob. Query Evaluation Influence Maximization

(18)

Intractability

Nav. XPath Counting Vertex Cover SQL Match Counting Incons. Data Repair Prob. Query Evaluation

(19)

Is this the end of it?

Many real-world tasks require solving hard problems

Be persistent, do not stop because you have encountered an intractable problem!

Different strategies:

Find tractable subcases

Find heuristic algorithms that are good enough in practice, though without any guarantee

Finddeterministic algorithmsthat provide aguaranteed approximation

Findrandomized algorithmsthat provide aguaranteed approximation with high probability

(20)

Is this the end of it?

(21)

Is this the end of it?

(22)

Is this the end of it?

(23)

Is this the end of it?

(24)

Is this the end of it?

(25)

Is this the end of it?

(26)

Outline

Introduction

Intractable Classes

Deterministic Approximations Randomized Approximations Conclusion

(27)

NP-hardness

Adecision(i.e., yes/no) problem is inNPif there exists a

nondeterministic polynomial-timealgorithm (i.e., the algorithm is allowed to make a guess) that solves it

A problemX isNP-hardif it is at least as hard as any problem in NP: being able to solveX means you can solve any problem in NP with deterministic polynomial-time overhead

NP-complete: decision problemboth NP and NP-hard

To prove NP-hardness ofX, you show apolynomial-time reduction from an arbitrary problemY known to be NP-hard toX: you take an instance ofY and show that if you know how to solveX, then you can solveY with deterministic polynomial-time overhead

Technical note:most definitions of NP-hardness fordecision problemsare a bit stricter because they are based onKarp many-one reductions. Important if you want to distinguish, e.g., NP-hardness vs coNP-hardness. For optimization/computation problems, it is irrelevant, so I use simplerTuring reductions.

(28)

NP-hardness

(29)

NP-hardness

(30)

NP-hardness

(31)

NP-hardness

Technical note:most definitions of NP-hardness fordecision problemsare a bit stricter because they are based onKarp many-one reductions. Important if you want to

(32)

#P-hardness

Acountingproblem is in#Pif it can be solved by counting the number of ways anondeterministic polynomial-timealgorithm (i.e., the algorithm is allowed to make a guess, and you count the various ways to guess) can return “yes”

A problemX is#P-hardif it is at least as hard as any problem in

#P: being able to solveX means you can solve any problem in #P with deterministic polynomial-time overhead

To prove #P-hardness ofX, you show apolynomial-time reduction from an arbitrary problemY known to be #P-hard toX: you take an instance ofY and show that if you know how to solveX, then you can solveY with deterministic polynomial-time overhead

Technical note:most definitions of #P-hardness forcounting problemsare a bit stricter because they are based onKarp many-one reductions. I use simplerTuring

reductions, not much practical difference.

(33)

#P-hardness

(34)

#P-hardness

(35)

#P-hardness

Technical note:most definitions of #P-hardness forcounting problemsare a bit stricter

(36)

#P-hardness implies NP-hardness

Proof.

Take a #P-hard problemX. Let us show it is NP-hard as well.

Take an arbitraryNP-completeproblemY. There is a

non-deterministic polynomial-time algorithmAthat solvesY. Consider the problemZ that counts the number of waysAcan return “yes”. This is a #P problem.

ThusZ reduces toX.

ButY reduces toZ: useZ to count, and return “yes” iff the count is>0.

ThereforeY reduces toX, and thusX is NP-hard.

(37)

#P-hardness implies NP-hardness

Proof.

non-deterministic polynomial-time algorithmAthat solvesY.

Consider the problemZ that counts the number of waysAcan return “yes”. This is a #P problem.

ThusZ reduces toX.

(38)

#P-hardness implies NP-hardness

Proof.

ThusZ reduces toX.

(39)

#P-hardness implies NP-hardness

Proof.

ThusZ reduces toX.

(40)

#P-hardness implies NP-hardness

Proof.

ThusZ reduces toX.

(41)

#P-hardness implies NP-hardness

Proof.

ThusZ reduces toX.

(42)

Outline

Introduction

Intractable Classes

Deterministic Approximations Approximation Algorithms FPTAS

Randomized Approximations

(43)

Outline

Introduction

Intractable Classes

Randomized Approximations Conclusion

(44)

Additive (absolute) approximation

Letϕ:R→R⁺. Definition

AnoptimizationalgorithmAprovides anadditiveϕ-approximation for a problemP with optimal solutionX^∗ if the solutionX returned byA satisfies the condition ofPand is such that

|f(X)−f(X^∗)|6ϕ(f(X^∗))

Definition

AcomputationalgorithmAprovides anadditiveϕ-approximation for a problemPwith actual solutionv^∗if the valuev returned byAis such that

|v−v^∗|6ϕ(v^∗)

(45)

Additive (absolute) approximation

AnoptimizationalgorithmAprovides anadditiveϕ-approximation for a problemP with optimal solutionX^∗ if the solutionX returned byA satisfies the condition ofPand is such that

|f(X)−f(X^∗)|6ϕ(f(X^∗))

Definition

AcomputationalgorithmAprovides anadditiveϕ-approximation for a problemP with actual solutionv^∗if the valuev returned byAis such that

∗ ∗

(46)

Multiplicative (relative) approximation

AnoptimizationalgorithmAprovides amultiplicativeϕ-approximation for a problemP with optimal solutionX^∗ if the solutionX returned byA satisfies the condition ofPand is such that

|f(X)|6ϕ(f(X^∗))|f(X)^∗| ifP is a minimization problem

|f(X)|>ϕ(f(X^∗))|f(X)^∗| ifP is a maximization problem

Definition (attention, inconsistent notation!)

AcomputationalgorithmAprovides amultiplicativeϕ-approximation for a problemPwith actual solutionv^∗if the valuev returned byAis such that

(1−ϕ(v^∗))|v^∗|6|v|6(1+ϕ(v^∗))|v^∗|

(47)

Multiplicative (relative) approximation

AnoptimizationalgorithmAprovides amultiplicativeϕ-approximation for a problemP with optimal solutionX^∗ if the solutionX returned byA satisfies the condition ofPand is such that

|f(X)|6ϕ(f(X^∗))|f(X)^∗| ifP is a minimization problem

|f(X)|>ϕ(f(X^∗))|f(X)^∗| ifP is a maximization problem

Definition (attention, inconsistent notation!)

AcomputationalgorithmAprovides amultiplicativeϕ-approximation for a problemP with actual solutionv^∗if the valuev returned byAis such that

(48)

APX

We want the approximation algorithms to bepolynomial-time

Ideally,ϕisconstant

For a constantϕ,multiplicative approximation is betterthan additive approximation

Additive approximation is thus rarely used, “approximation algorithm” usually means multiplicative

APX:class ofoptimizationproblems that have a polynomial-time multiplicative approximation algorithm with constantϕ

(49)

APX

We want the approximation algorithms to bepolynomial-time Ideally,ϕisconstant

(50)

APX

(51)

APX

(52)

APX

(53)

Vertex Cover is in APX

a c

b

e

d

Optimal: {a,c}, size2

Approximated: {a,b,c,e}, size4

Approximation algorithm:

Choose an arbitrary edge not covered Add both end points to the cover Repeat until all edges are covered

Multiplicative 2-approximation! (twice as many nodes in the

approximated cover as edges chosen, each of this edge need to be covered)

(54)

Vertex Cover is in APX

a c

b

e

d

(55)

Vertex Cover is in APX

a c

b

e

d

(56)

Vertex Cover is in APX

a c

b

e

d

(57)

Vertex Cover is in APX

a c

b

e

d

(58)

Vertex Cover is in APX

a c

b

e

d

(59)

Vertex Cover is in APX

a c

b

e

d

approximated cover as edges chosen, each of this edge need to be

(60)

Other examples

Set Cover has a(lnn+O(1))-approximationalgorithm [Chv79]

but isnot in APX[LY94]

Inconsistent Data Repair is inAPX(but the constant depends on the dependencies) [KL09]

Influence Maximization is inAPX; it has a

(1−1/e−ε)-approximationalgorithm for anyε(slightly better than 63%) [KKT03]

(61)

Inapproximability results

It is also possible to show that some problem isnot ϕ-approximable (we assume P6=NP in this slide)

Vertex Cover isnot 1.3606-approximable[DS05] (!) Set Cover isnot(ln(n)−o(lnn))-approximable[DS14]

This kind of results is usuallymuch more difficultto obtain than approximation algorithms

(62)

Inapproximability results

It is also possible to show that some problem isnot ϕ-approximable (we assume P6=NP in this slide) Vertex Cover isnot 1.3606-approximable[DS05] (!)

Set Cover isnot(ln(n)−o(lnn))-approximable[DS14]

(63)

Inapproximability results

It is also possible to show that some problem isnot ϕ-approximable (we assume P6=NP in this slide) Vertex Cover isnot 1.3606-approximable[DS05] (!) Set Cover isnot(ln(n)−o(lnn))-approximable[DS14]

(64)

Inapproximability results

It is also possible to show that some problem isnot ϕ-approximable (we assume P6=NP in this slide) Vertex Cover isnot 1.3606-approximable[DS05] (!) Set Cover isnot(ln(n)−o(lnn))-approximable[DS14]

(65)

How to find an approximation algorithm?

From scratch, by exploiting the structure of the problem (as we did with Vertex Cover)

By exploitingapproximation-preserving reductionsbetween a problem and an approximable problem (in both directions); various notions of approximation-preserving, arbitrary reductions don’t work

(66)

How to find an approximation algorithm?

From scratch, by exploiting the structure of the problem (as we did with Vertex Cover)

By exploitingapproximation-preserving reductionsbetween a problem and an approximable problem (in both directions); various notions of approximation-preserving, arbitrary reductions don’t work

(67)

Outline

Introduction

Intractable Classes

Randomized Approximations Conclusion

(68)

From APX to FPTAS

APX:polynomial-timec-approximation forsomefixed constantc

Useful, but would be better if we could have ac arbitrarily close to 1

PTAS (Polynomial-Time Approximation Scheme): there exists a polynomial-time(1+ε)-approximation foranyε >0(for a minimization problem);(1−ε)-approximation for a maximization problem;ε-approximation for a computation problem

Great, but these approximations may becomemore and more difficult to findasεnears 0

FPTAS (Fully Polynomial-Time Approximation Scheme): PTAS whose overall complexity dependspolynomially in 1/ε

(69)

From APX to FPTAS

APX:polynomial-timec-approximation forsomefixed constantc Useful, but would be better if we could have ac arbitrarily close to 1

(70)

From APX to FPTAS

(71)

From APX to FPTAS

(72)

From APX to FPTAS

FPTAS (Fully Polynomial-Time Approximation Scheme): PTAS

(73)

Problems with a FPTAS?

Fairlyrare!

Neither Vertex Cover, nor Inconsistent Data Repair [KL09], nor Influence Maximization [KKT03] have an FPTAS (unless P=NP)

There are still some problems for which there are FPTAS:

Example

Knapsack. Given a collection of items, each with a weight and a volume,finda subset of items ofmaximumvolume whose total weight does not exceed some fixed limit

Knapsack is anNP-hardproblem, but there exists anFPTAS

(74)

Problems with a FPTAS?

Fairlyrare!

Example

(75)

Problems with a FPTAS?

Fairlyrare!

Example

(76)

Problems with a FPTAS?

Fairlyrare!

Example

Knapsack. Given a collection of items, each with a weight and a volume,finda subset of items ofmaximumvolume whose total weight does not exceed some fixed limit Knapsack is anNP-hardproblem, but there exists anFPTAS

(77)

Problems with a FPTAS?

Fairlyrare!

Example

(78)

Problems with a FPTAS?

Fairlyrare!

Example

(79)

Outline

Introduction

Intractable Classes

Deterministic Approximations Randomized Approximations

Generalities

Monte-Carlo Sampling FPRAS

Conclusion

(80)

Outline

Introduction

Intractable Classes

Generalities

(81)

Randomized approximations

To simplify, only talk about computation problems. ϕ:R→R⁺,δ >0.

Definition

AcomputationalgorithmAprovides arandomized additive

(ϕ, δ)-approximation for a problemP with actual solutionv^∗if the value v returned byAis such that

|v−v^∗|6ϕ(v^∗) with probability>1−δ

Definition

AcomputationalgorithmAprovides arandomized multiplicative

(1−ϕ(v^∗))|v^∗|6|v|6(1+ϕ(v^∗))|v^∗| with probability>1−δ

(82)

Randomized approximations

To simplify, only talk about computation problems. ϕ:R→R⁺,δ >0.

Definition

AcomputationalgorithmAprovides arandomized additive

|v−v^∗|6ϕ(v^∗) with probability>1−δ

Definition

AcomputationalgorithmAprovides arandomized multiplicative

(ϕ, δ)-approximation for a problemP with actual solutionv^∗if the value

(83)

Hoeffding’s Inequality

LetX₁, . . . ,X_nbenindependent random variables, each within the interval[a,b], andX¯ = ¹_nP

iX_i the empirical mean.

We have [Hoe63]: Pr

X¯ −E[ ¯X] >ε

62e

−2nε2 (b−a)2

In other words, we know that Pr

X¯ −E[ ¯X] >ε

6δas long as: n> (b−a)²

2ε² ln1 δ

Often too conservative!

(84)

Hoeffding’s Inequality

We have [Hoe63]:

Pr

X¯ −E[ ¯X] >ε

62e

−2nε2 (b−a)2

X¯ −E[ ¯X] >ε

6δas long as: n> (b−a)²

2ε² ln1 δ

(85)

Hoeffding’s Inequality

We have [Hoe63]:

Pr

X¯ −E[ ¯X] >ε

62e

−2nε2 (b−a)2

X¯ −E[ ¯X] >ε

6δ as long as:

n> (b−a)² 2ε² ln1

δ

(86)

Hoeffding’s Inequality

We have [Hoe63]:

Pr

X¯ −E[ ¯X] >ε

62e

−2nε2 (b−a)2

X¯ −E[ ¯X] >ε

6δ as long as:

n> (b−a)² 2ε² ln1

δ

(87)

Hoeffding’s Inequality

We have [Hoe63]:

Pr

X¯ −E[ ¯X] >ε

62e

−2nε2 (b−a)2

X¯ −E[ ¯X] >ε

6δ as long as:

n> (b−a)² 2ε² ln1

δ

(88)

Outline

Introduction

Intractable Classes

Generalities

(89)

Application to Polling

Pollofnpersons in a country ofminhabitants

Every personi is asked if they prefer politicianAor politicianB. We noteX_i =0 if they preferA, 1 otherwise

We are interested inpredicting the result of an electionbetweenA andB;E[ ¯X]is the expected proportion of votes forB

We want a margin of error ofε=2%, and a probabilistic guarantee of 1−δ =95%

So we just need by Hoeffding’s inequality: n> 1

2ε²ln1

δ >3745 This is completelyindependent ofm!

(90)

Application to Polling

Every personi is asked if they prefer politicianAor politicianB.

We noteX_i =0 if they preferA, 1 otherwise

2ε²ln1

(91)

Application to Polling

2ε²ln1

(92)

Application to Polling

2ε²ln1

(93)

Application to Polling

So we just need by Hoeffding’s inequality:

n> 1 2ε²ln1

δ >3745

This is completelyindependent ofm!

(94)

Application to Polling

So we just need by Hoeffding’s inequality:

n> 1 2ε²ln1

δ >3745

(95)

Monte-Carlo Sampling

Assumptions:

We cansample in polynomial-timefrom a population Given a sample, we canevaluate a certain quantityin polynomial-time

Then we can compute the expected mean of that quantity with a polynomial-time randomized additive(ε, δ)-approximation algorithm for arbitraryε >0,δ >0

Direct application of Hoeffding’s inequality, can be used to obtain the required number of samples

Examples

Polling

Computation ofπ

Probabilistic Query Evaluation

(96)

Monte-Carlo Sampling

Assumptions:

We cansample in polynomial-timefrom a population

Given a sample, we canevaluate a certain quantityin polynomial-time

Examples

Polling

Computation ofπ

(97)

Monte-Carlo Sampling

Assumptions:

Examples

Polling

Computation ofπ

(98)

Monte-Carlo Sampling

Assumptions:

Examples

Polling

Computation ofπ

(99)

Monte-Carlo Sampling

Assumptions:

Examples

Polling

Computation ofπ

(100)

Monte-Carlo Sampling

Assumptions:

Examples

Computation ofπ

(101)

Monte-Carlo Sampling

Assumptions:

Examples

Polling

Computation ofπ

(102)

Monte-Carlo Sampling

Assumptions:

Examples

(103)

Outline

Introduction

Intractable Classes

Generalities

Conclusion

(104)

FPRAS

PRAS (Polynomial-time Randomized Approximation Scheme):

there exists a polynomial-time randomized(ε,1/3)-approximation for anyε >0

1/3 is irrelevant here; from that, we can obtain an

(ε, δ)-approximation for the sameεand arbitraryδby simply repeating the algorithm

FPRAS (Fully Polynomial-time Randomized Approximation Scheme): PRAS whose overall complexity dependspolynomially in 1/ε

(105)

FPRAS

(106)

FPRAS

(107)

FPRAS

(108)

FPRAS for disjunctions [KLM89, KKS09]

E₁, . . . ,E_m sequence of events in a probability space

Assumptions: For eachE_i, we can efficiently (in polynomial-time):

Compute Pr(Ei)

Test whetherEi is true in a given random sample Sample from the subspace conditioned onEi

Thenthere exists a FPRAS to compute Pr Wm i=1E_i

Seehttp://webcourse.cs.technion.ac.il/236605/Spring2015/ho/ WCFiles/L9%20-%20QA%20in%20PDBs.pdffordetailed explanations

(109)

FPRAS for disjunctions [KLM89, KKS09]

E₁, . . . ,E_m sequence of events in a probability space

Assumptions: For eachE_i, we can efficiently (in polynomial-time):

Compute Pr(Ei)

Test whetherEi is true in a given random sample Sample from the subspace conditioned onEi

Thenthere exists a FPRAS to compute Pr Wm i=1E_i

Seehttp://webcourse.cs.technion.ac.il/236605/Spring2015/ho/ WCFiles/L9%20-%20QA%20in%20PDBs.pdffordetailed explanations