Disponible à / Available at permalink :

(1)

- - -

Dépôt Institutionnel de l’Université libre de Bruxelles / Université libre de Bruxelles Institutional Repository

Thèse de doctorat/ PhD Thesis Citation APA:

Maquet, N. (2011). New algorithms and data structures for the emptiness problem of alternating automata (Unpublished doctoral dissertation). Université libre de Bruxelles, Faculté des Sciences – Informatique, Bruxelles.

Disponible à / Available at permalink : https://dipot.ulb.ac.be/dspace/bitstream/2013/209961/4/7a7a86e7-a4b3-4306-9bc4-d0ba20ed5966.txt

(English version below)

Cette thèse de doctorat a été numérisée par l’Université libre de Bruxelles. L’auteur qui s’opposerait à sa mise en ligne dans DI-fusion est invité à prendre contact avec l’Université (di-fusion@ulb.ac.be).

Dans le cas où une version électronique native de la thèse existe, l’Université ne peut garantir que la présente version numérisée soit identique à la version électronique native, ni qu’elle soit la version officielle définitive de la thèse.

DI-fusion, le Dépôt Institutionnel de l’Université libre de Bruxelles, recueille la production scientifique de l’Université, mise à disposition en libre accès autant que possible. Les œuvres accessibles dans DI-fusion sont protégées par la législation belge relative aux droits d'auteur et aux droits voisins. Toute personne peut, sans avoir à demander l’autorisation de l’auteur ou de l’ayant-droit, à des fins d’usage privé ou à des fins d’illustration de l’enseignement ou de recherche scientifique, dans la mesure justifiée par le but non lucratif poursuivi, lire, télécharger ou reproduire sur papier ou sur tout autre support, les articles ou des fragments d’autres œuvres, disponibles dans DI-fusion, pour autant que :

Le nom des auteurs, le titre et la référence bibliographique complète soient cités;

L’identifiant unique attribué aux métadonnées dans DI-fusion (permalink) soit indiqué;

Le contenu ne soit pas modifié.

L’œuvre ne peut être stockée dans une autre base de données dans le but d’y donner accès ; l’identifiant unique (permalink) indiqué ci-dessus doit toujours être utilisé pour donner accès à l’œuvre. Toute autre utilisation non mentionnée ci-dessus nécessite l’autorisation de l’auteur de l’œuvre ou de l’ayant droit.

--- English Version ---

This Ph.D. thesis has been digitized by Université libre de Bruxelles. The author who would disagree on its online availability in DI-fusion is invited to contact the University (di-fusion@ulb.ac.be).

If a native electronic version of the thesis exists, the University can guarantee neither that the present digitized version is identical to the native electronic version, nor that it is the definitive official version of the thesis.

DI-fusion is the Institutional Repository of Université libre de Bruxelles; it collects the research output of the University, available on open access as much as possible. The works included in DI-fusion are protected by the Belgian legislation relating to authors’ rights and neighbouring rights.

Any user may, without prior permission from the authors or copyright owners, for private usage or for educational or scientific research purposes, to the extent justified by the non-profit activity, read, download or reproduce on paper or on any other media, the articles or fragments of other works, available in DI-fusion, provided:

The authors, title and full bibliographic details are credited in any copy;

The unique identifier (permalink) for the original metadata page in DI-fusion is indicated;

The content is not changed in any way.

It is not permitted to store the work in another database in order to provide access to it; the unique identifier (permalink) indicated above must always be used to provide access to the work. Any other use not mentioned above requires the authors’ or copyright owners’ permission.

(2)

D 03746

Université Libre de Bruxelles

New Algorithms and Data Structures for the Emptiness Problem of Alternating Automata

Nicolas Maquet

Thèse présentée en vue de l'obtention du grade de Docteur en Sciences

Directeur: Prof. Jean-François Raskin

Université Libre de Bruxelles

003-4-E92S0

beispo

(3)

New Algorithms and Data Structures for the Emptiness Problem of Alternating Automata

Nicolas Maquet

Thèse réalisée sous la direction du Professeur Jean-François Raskin et présentée en vue de l’obtention du grade de Doctein en Sciences. La défense publique de cette thèse a eu lieu le 3 mars 2011 à l’Université Libre de Bruxelles, devant le jury composé de:

• Parosh Aziz Abdulla (Uppsala University, Suède)

• Bernard Boigelot (Université de Liège, Belgique)

• Véronique Bruyère (UMons. Belgique)

• Jean Cardinal (Université Libre de Bruxelles, Belgique)

• Gilles Geeracrts (Université Libre de Bruxelles, Belgique)

• Jean-François Raskin (Université Libre de Bruxelles, Belgique)

(4)

I

(5)

V

Résumé

Ce travail traite de l’étude de nouveaux algorithmes et structures de données dont l’usage est destiné à la vérification de programmes. Les ordinateurs sont de plus en plus présents dans notre vie quotidienne et, de plus en plus souvent, ils se voient confiés des tâches de nature critique pour la sécurité. Ces systèmes sont caractérisés par le fait qu’une panne ou un bug (erreur en jargon informatique) peut avoir des ef

fets potentiellement désastreux, que ce soit en pertes humaines, dégâts environnemen

taux, ou économiques. Pour ces systèmes critiques, les concepteurs de systèmes indus

triels prônent de plus en plus l’usage de techniques permettant d’obtenir une assurance formelle de correction.

Une des techniques de vérification de programmes les plus utilisées est le model checking, avec laquelle les programmes sont typiquement abstraits par une machine a états finis. Après cette phase d’abstraction, des propriétés (typiquement sous la forme d’une formule de logique temporelle) peuvent êtres vérifiées sur l’abstraction à espace d’états fini, à l’aide d’outils de vérification automatisés. Les automates alter

nants jouent un rôle important dans ce contexte, principalement parce que plusieurs logiques temporelles peuvent êtres traduites efficacement vers ces automates. Cette car

actéristique des automates alternants permet de réduire le model checking des logiques temporelles à des questions sur les automates, ce qui est appelé l’approche par auto

mates du model checking.

Dans ce travail, nous étudions trois nouvelles approches pour l’analyse (le test du vide) des automates alternants sur mots finis et infinis. Premièrement, nous ap

pliquons l’approche par antichaînes (utilisée précédemment avec succès pour l’analyse d’automates) pour obtenir de nouveaux algorithmes pour les problèmes de satisfais- abilité et du model checking de la logique temporelle ünéaire, via les automates alter

nants. Ces algorithmes combinent l’approche par antichaînes avec l’usage des ROBDD, dans le but de gérer efficacement la combinatoire induite par la taille exponentielle des alphabets d’automates générés à partir de LTL. Deuxièmement, nous développons de nouveaux algorithmes d’abstraction et raffinement pour les automates alternants, combinant l’usage des antichaînes et de Vinterprétation abstmite, dans le but de pou

voir traiter efficacement des automates de grande taille. Enfin, nous définissons une nouvelle structure de données, appelée LVBDD\ qui permet un encodage efficace des fonctions de transition des automates alternants sur alphabets symboliques. Tous ces travaux ont fait l’objet d’implémentations et ont été validés expérimentalement.

^Lattice-Valued Binary Decision Diagrams

(6)

I I

►*.-^ = ^A.■ ■ ^f^JT^-aii.wai^■ r .^...»vLg-i-uf

'i'i:' I

Ml'

'I

^{II II}

I'

mil I 'i II

l'i ' I l II III I II I III I I I I I'

Ml I I II

(7)

Abstract

This Work studios new algorithms and data structures that are useful in the context of

program vérification.

As computers hâve become more and more ubiquitous in our modem societies, an increasingly large number of computer-based Systems are considered

safety-critical.

Such Systems are characterized by the fact that a failure or a

bug

(computer error in the computing jargon) could potentially cause large damage, whether in loss of life, environmental damage, or économie damage. For safety-critical Sys

tems, the industrial software engineering community increasingly calls for using techniques which provide some

formai assurance

that a certain piece of software is correct.

One of the most successful program vérification techniques is model checking, in which programs are typically abstracted by a finite-state ma

chine. After this abstraction step, properties (typically in the form of some temporal logic formula) can be checked against the finite-state abstraction, with the help of automated tools. Altemating automata play an important rôle in this context, since many temporal logics on words and trees can be efhciently translated into those automata. This property allows for the réduction of model checking to automata-theoretic questions and is called the automata-theoretic approach to model checking.

In this work, we provide three novel approaches for the analysis (emptiness checking) of altemating automata over finite and infinité words. First, we build on the successful framework of antichains to devise new algorithms for LTL satisfiability and model checking, using altemating automata. These algorithms combine antichains with reduced ordered binary decision diagrams in order to handle the exponentially large alphabets of the automata generated by the LTL translation. Second, we develop new abstraction and refinement algorithms for altemating automata, which combine the use of antichains with abstract interprétation, in order to handle ever larger in

stances of altemating automata. Finally, we define a new symbolic data structure, coined lattice-valued binary decision diagrams that is particularly well-suited for the encoding of transition fonctions of altemating automata over symbolic alphabets. Ail of these works are supported with experimen

tal évaluations that confirm the practicaJ usefulness of our approaches.

(8)

(9)

IX

Acknowledgements

First and foremost, I would like to show my gratitude to Jean-François Raskin, my thesis advisor. His technical excellence, his rigor, and his sheer enthusiasm are perhaps the qualities that Jean-François is best known for among his peers. As one of his doctoral students, I would like to testify also of his human qualities that hâve made these four years at his side not only instructive and fruitful, but also enjoyable and fulfilling. Most of ail, I thank him for his his patience, his availability, and his daily cheerfulness.

Next, I want to heartily thank my coauthors. These are, in no particular order, Laurent Doyen, Martin De Wulf, Gilles Geeraerts, Pierre Ganty, Gabriel Kalyon and Tristan Le Gall. The research material that went into tliis Ph.D. thesis is very much the fruit of a collaborative work, and Fm very thankful for their work and support.

I would like to thank ail my colleagues form the Computer Science De

partment and the Vérification Group, for making the depaxtment a lively and welcoming place. Spécial thanks to my faithful office neighbor, Frédéric Servais, for his biting humor and support, and to Emmanuel Fihot for being the most Belgian French I hâve ever met. A big thank you also to the three secretaries of the Department: Maryka Peetroons, Véronique Bastin and Pascaline Browaeys.

I sincerely thank ail the members of the thesis’ jury comittee: Parosh Aziz Abdulla, Bernard Boigelot, Véronique Bruyère, Jean Cardinal, Gilles Geeraerts and Jean-François Raskin, for their careful reading of this work, and for suggesting improvements.

Finally, 1 would like to express my deepest gratitude to ail of those who hâve ofîered me their love and friendship over the years. In the end, I am what they give me, more than anything else, for which 1 will be for- ever grateful. In particular, and most importantly, I would like to thank Amandine, for her continued love and support.

(10)

r

(11)

A mes frangin(e)^

^Comme d’autres, plus sages, l’ont dit avant moi: ça leur fera une belle jambe ...

Illustration: Caroll, Lewis: “Alice’s Adventures in Wonderland” (1865)

(12)

/

(13)

xiii

(14)

XIV

FDL

...154

6.6.1 The Birkhofî Antichain Lattice ... 154

6.6.2 RPC on the Birkhoff Antichain Lattice... 155

6.6.3 Proofs of RPC’s Algebraic Properties... 156

6.7 LVBDD Manipulation Algorithms...161

6.7.1 Memory Management and Memoization ... 162

6.7.2 Relative Pseudo-complement of LVBDD in

SNF . .

. 162

6.7.3 Meet of an LVBDD in

SNF

With a Constant...165

6.7.4 Meet of two LVBDD in

SNF

...169

6.7.5 Join of two LVBDD in

SNF

...175

6.7.6 Meet or Join of two LVBDD in UNF... 182

6.7.7 Existential and Universal Quantification of LVBDD . 184 6.8 Complexity of LVBDD Algorithms...185

6.9 Empirical Evaluation... 195

(16)

XVI

Chapter 1 Introduction

Today, computers are ubiquitous, and are a necessary tool of modem life.

It is quite amazing to realize how quickly and radically computers hâve changed the way we live. The computers of today do an incredibly vast range of tasks for us: they help us communicate with each other, they help us pay our bills, they help us brake more efficiently on wet roads.

But computers not only make what was possible easier, more importantly they also allow to perform tasks which, not too long ago, were considered impossible. Without computers, modem medicine would not be possible.

Without computers, modem physics would not be possible. The list goes on and on.

1.1 Context

In the last 40 years or so, our society has become increasingly dépendent on computers and their programs. Also, computers are more and more entrusted with safety-critical tasks such as guiding airplanes and rockets, controlling medical and industrial equipment, etc. When the cost of a potential software failure (whether human, environmental, or économie) becomes prohibitive, the question of software correetness becomes highly important.

There hâve been many well-publicized stories of catastrophic software fail- ures leading to the loss of lives and / or enormous économie damage. Tra- ditional software engineering rely on testing to remove bugs (programming

1

(18)

2 1.1. Context

mistakes, design flaws) from programs. However, testing can only detect the presence of errors in a computer program, but it cannot show that the program is correct in any formai sense. More and more, certain applications call for some formai assurance that a piece of software meets a spécification written in a formai language.

Since the early works of Robert Floyd and Tony Hoare in the late 1960’s, the field of program vérification has gathered much research effort. The ultimate goal of this discipline is to devise proofs for the correctness of pro

grams. However, it has become quickly apparent that finding such proofs is extremely hard, usually much more so than writing the program itself!

Moreover, program proofs are often very long and tedious, which makes them very difficult to compute by hand. For those reasons, a lot of research has been put into the development of automatic methods for software vér

ification. The idea is to use computer programs to find and / or validate proofs of correctness of other programs.

The field of computer-aided vérification is faced with a fundamental undecidability problem. In general, checking whether a computer program satisfies its spécification is imdecidable. In fact, the Rice theorem tells us that ail non trivial properties of computer programs are undecidable.

Computer-aided vérification techniques sidestep this imdecidability baxrier by exploiting sound abstractions which over-approximate the behavior of the original program. The idea is to substitute the concrète program with an abstract, more simple version, in such a way that any property that holds on the abstraction also holds on the concrète program. Note that this technique is sound but not complété in general, so there is no contradiction with the general imdecidability of program vérification.

A

particularly fruitful and popular abstraction of computer programs is the

finite state machine,

or

finite automaton.

Finite-state abstractions are useful for modeling programs and properties where the intricacies of the System lies more in the control-state combinatorial rather than in the ma

nipulation of complex data. Many Systems fall into this category, such as network protocols, distributed control Systems, embedded or reactive Sys

tems, and so on. The field of

model checking

encompasses the study of

(19)

Chapter 1. Introduction 3 efficient algorithms and data structures for the analysis of finite-state Sys

tems. In this context, a program is represented by a finite-state abstraction

A4

that describes a

language

of

possible behaviors,

denoted

L(A4).

On the other hand, a

property ip

is written down in some formai logic (temporal or otherwise) that describes a language of

correct behaviors,

denoted

L{(p).

For a System

A4

and a property

ip

the

model checking problem

asks whether

A4 \= if,

that is whether

L{M)

Ç

L{ip).

If ail the behaviors of a sound approximation of a program

V

are included in those defined by a property

(p,

then we say that

V

is

provably-correct

with respect to

ip.

The primary challenge faced by the model checking community is the

State space explosion problem.

Finite-state abstractions of real-world pro- grams typically hâve a very large number of States, and much research effort has been put into dealing with this challenge.

In this work, we develop new algorithms for the analysis of altemat- ing automata over finite and infinite-words. The concept of alternation for automata was first studied by Chandra and Stockmeyer in 1976. In their séminal work, the authors define the altemating Turing machine, as a generalization of non-deterministic Turing machines, whose set of states is partitioned between existential and universal. In an existential state, such a machine makes a non-deterministic guess to select the next state. In a universal state, however, the machine launches as many copies of itself as there are successor states, and accepts only if each of these “computation branches” are accepting. The motivation for studying altemating Turing machines originally came from research in computational complexity. Chan

dra and Stockmeyer show, among other results, that the class of problems that can be solved in polynomial space is the same than the class of prob

lems that can be solved in altemating polynomial time (i.e.,

PS

pace

= APT

ime

).

Similarly to altemating Turing machines, altemating automata gener- alize non-deterministic automata. Altemating automata hâve been defined on finite and infinite-words and on finite and infinité trees. The use of alternating automata is a key component of the automata-theoretic approach to model checking. In this approach, the property ip that need to be verified is

(20)

4 1.2. Contributions

translated into an automaton, which reduces the model checking problem to automata-theoretic questions. One of the key strengths of alternating automata is that several temporal logics can be translated to alternating automata in linear-time. Efficient decision procedures for those automata are thus very désirable, particularly in the field of model checking.

1.2 Contributions

The purpose of this thesis is to find new efficient algorithms for the analysis of alternating automata. Our aim is to develop algorithms which analyze alternating automata directly, without resorting to an explicit and costly translation to non-deterministic automata. We siunmarize here the main contributions of this work.

In recent research [DDHR06, DR07, DRIO], Raskin et al. hâve devel- oped new algorithms for the analysis of finite automata which are based on antichains. The idea of antichains is to exploit simulation preorders that exist by construction on the state spaces of exponential subset construc

tions. Many such simulation preorders can be found in automata-theoretic problems and can be used to facilitate the évaluation of fixed-point ex

pressions based on the pre and post operators. In this work, we apply the framework of antichains to the satisfiability and model checking problems of the linear temporal logic (LTL, for short) over symbolic Kripke structures^.

To perform the analysis of LTL, we use the linear-time translation to alter

nating automata to reduce these problems to language-emptiness questions on automata. However, the translation from LTL to alternating automata croates automata with alphabets of exponential size. To efficiently handle the combinatorial entailed by the exponential alphabets, we combine re- duced ordered binary decision diagrams [Bry86] (ROBDD, for short) with antichains in a new set of algorithms. Note that the use of antichains and ROBDD for LTL satisfiability is only useful in a heuristic sense. The prob

lem being PSPACE-COMPLETE, our algorithms cannot strictly improve on the existing theoretically optimal solutions. Therefore, in order to validate

^Symbolic Kripke structures are, in brief, symbolically-encoded finite-state machines.

(21)

Chapter 1. Introduction 5

our approach, we demonstrate its efïiciency with empirical évaluations on a number of meaningful benchmarks.

Our second contribution is the development and implémentation of an abstraction / refinement framework for alternating automata. The moti

vation for this Work was to investigate how^ the use of abstract interpréta

tion [CC77], together with the framework of antichains could yield more efficient algorithms for alternating automata. Our framework uses the set of partitions of the set of states of an alternating automaton as basis for the abstraction. This work has a number of interesting characteristics.

First, our refinement is not based on counter-examples as is usually the case [CGJ+03] but rather builds on recent research which uses fixed-point guided refinement instead [CGR07, RRT08]. Second, contrary to other refinement approaches, our abstract domain does not need to be strictly refined at each successive abstraction / refinement step. Rather, our algo- rithm always keeps the abstract domain as coarse as possible with respect to its current knowledge of the System. Finally, we hâve implemented oirr approach and compared its efficiency on a number of benchmark exam

ples. Our experimental évaluations reveal that, while the overhead of the abstraction refinement is not negligible, it can greatly outperform concrète antichain algorithms when the number of locations in the alternating au

tomaton grows very large.

Our final contribution is the définition and study of an new symbolic data structure, coined lattice-valued binary decision diagrams (LVBDD, for short) for the efficient représentation of functions of the form 2*’ i-t £ where P is a finite set of Boolean propositions and £ is a lattice. LVBDD are similar to previously-defined data structures in the literatme such as multi-terminal binary decision diagrams [FMY97], or algebraic decision diagrams [BFG"^97]

which can be used to encode functions of the form 2*" h-> D where D is an arbitrary domain. LVBDD differ from these existing structures by exploiting the fact that the co-domain of the represented functions is a structured set, to achieve more compact représentations. We argue in this work, and experimentally demonstrate, that LVBDD are particularly well-suited for the encoding of the transition functions of alternating automata over symbolic

(22)

6 1.3. Plan of the Thesis

alphabets. We define two distinct semantics for LVBDD and prove that they are incomparable in general. Also, we define two normal forms for LVBDD, and we define algorithms for computing the greatest lower bound {meet) and least upper boimd (join) of their représentations. Our algorithms are accompanied with complété and generic proofs of soundness which are ap

plicable to LVBDD over any (finite and distributive) lattice. We also provide a study of the worst-case complexity of LVBDD manipulation algorithms.

Finally, we hâve implemented our LVBDD data structure in the form of a C-|—|- template library and made it available to the research community.

Using this implémentation, we hâve experimentally compared the efficiency of LVBDD and ROBDD in the context of solving LTL satisfiability with alternating automata and antichains. Our experiments reveal that, in that context, LVBDD can outperform ROBDD by several orders of magnitude.

1.3 Plan of the Thesis

The remainder of this thesis is organized into the following chapters;

• In Chapter 2: Preliminaries we recall some basic notions about numbers, sets, languages and automata, and so on. This chapter fixes the formai notations used throughout the thesis.

• In Chapter 3: Antichain Approaches to Automata Emptiness we summarize (a part of) the framework of antichains for the anal

ysis of automata. Since this framework is used in ail the remaining chapters, it has been summarized in this chapter in order to avoid répétitions. Note that we hâve taken the liberty to slightly adapt the theory for the needs of this dissertation. •

• In Chapter 4: LTL Satisfiability and Model Checking Revis- ited we apply the framework of antichains to the particular case of LTL satisfiability and model checking. We study algorithms which com

bine ROBDD with antichains in the analysis of alternating automata, and discuss an empirical évaluation of these algorithms.

(23)

Chapter 1. Introduction 7

• In Chapter 5: Fixed Point-Guided Abstraction Refinement

we présent abstraction / refinement algorithms designed to handle alternating automata with very large numbers of States. The abstrac

tion is based on the set of partitions of the set of States, and uses a refinement strategy that is not based on counter-examples. An ex

perimental évaluation of the technique is discussed, comparing our approach to “classical” non-abstract antichain algorithms.

• In Chapter 6: Lattice-Valued Binary Decision Diagreims we study a new sjnnbolic data structure for the efficient encoding of ftmc- tions of the form 2• ** £, where P is a finite set of Boolean proposi

tions and £ is a lattice. We define algorithms for manipulating LVBDD and prove their soundness and complexity properties. Also, we discuss an empirical comparison of LVBDD and ROBDD in the context of the analysis of alternating automata over symbolic alphabets.

• In Chapter 7: Conclusions we draw some conclusions and provide some thoughts on potential future work.

1.4 Chronology &; Crédits

Here are some chronological and crédit comments on the contents of each chapters.

• The contents of Chapter 3 is not my original work, and is due to Doyen and Raskin [DR07, DR09, DRIO].

• The work of Chapter 4 has been published in the proceedings of TACAS 2008 [DDMROSb] in collaboration with Doyen, De Wulf, and Raskin. A “tool-paper” describing the corresponding implémentation has been published in the proceedings of ATVA 2008 [DDMR08a] in collaboration with the same authors.

• The contents of Chapter 5 is the resuit of a collaboration with Ganty and Raskin, which has been published in the proceedings of CIAA 2009 [GMR09]. An extended version of this work was published in a

(24)

8 1.4. Chronology & Crédits

spécial issue of Theoretical Computer Science [GMRIO] in collabora

tion witti the same authors. •

• The Work presented in Ctiapter 6 bas been done in collaboration with Geeraerts, Kalyon, Le Gall, and Raskin, and bas been publisbed in the proceedings of ATVA 2010 [GKL+10].

(25)

Chapter 2 Preliminaries

In this chapter, we review the basic notions used throughout the rest of thc thesis.

2.1 Sets, Punctions, and Relations

We dénoté the set of natural numbers {0,1,2,...} by N and the set of positive natural numbers {1,2, 3,...} by Nq. We dénoté the set of integer numbers NU {—1, —2,...} by Z. The empty set is denoted 0. For any set S we dénoté by 2^ the powerset of S, i.e., the set of subsets of S. We dénoté by |5| G N U {+oo} the cardinality of a set S. For any subset X Ç S oî a set S, we dénoté by X S \ X the complément of X in S, when S is clear from the context. In what follows, we sometimes make use of Church’s lambda notation to anonymousiy define fonctions. For instance, \x ■ x + 1 is such an anonymous “lambda-style” définition. For a fonction f : A B, we call A the domain of /, and B the codomain of /, which are denoted dom(/) and codom(/), respectively. The image of a fonction f : A B is the set img(/) {/(a) | a e A}.

2.1.1 Binary Relations

A

binary relation

over a set

5

is a set of pairs

R Ç. S x S. A

binary relation

R

is

refleocive

if and only if for each

s Ç. S we

hâve that

{s, s)

€

R;

it is

9

(26)

10 2.1. Sets, Functions, and Relations

symmetric if and only if for each pair

(si,S2)

G R we hâve that {s2,si)

G

R', it is antisymmetric if and only if for each pair

(si,S2) G

R such that (s2, Si)

G

R it is also the case that Si = S2; it is total if and only for each pair

Si G

5,

S2 G

5 we either hâve that

(si,S2) G

R or

(s2,si) G

R] finally, it is transitive if and only if for each triple Si,S2,S3 such that (si,S2)

G

R and

(s2,S3) G

R, it is also the case that

(si,S3) G

R.

Binary relations are often denoted using non-alphabetic symbols and the infix notation. Let ~ Ç 5 x 5; the expression Si ~ S2 is équivalent tO

(Si,S2) G

and

Si

9^

S2

is équivalent to

(si,S2)

^ To make the notations more intuitive, we always use symmetric symbols for symmetric relations and vice-versa. Moreover, the infix notation allows to use symbol relations backwards, e.g., the expression si :< S2 is équivalent to S2 b Si for any binary relation

Let R Ç, S X S he a binary relation; we dénoté by R° the set of pairs {(s, s)

I

s

G

5

},

and for every

i G N

q we dénoté by R^ the relation:

{(^1)^3)

G S X S

|

3s2

G

i

S':

(si,S2)

G ^ A (s2,S3) G iî}

The transitive closure of R, denoted i2+, is the relation:

{

(si; •S2)

G S X S I 3 i

G

No

: (si, S2) G iî*

}

The reflexive and transitive closure of R, denoted R*, is the relation R°\JR'^.

A preorder is a reflexive and transitive binary relation. A preorder that is also symmetric is an équivalence. A preorder that is also antisymmetric is a partial order.

2.1,2 Partially Ordered Sets

A partially ordered set (or poset, for short) is a pair {S, :<) where :< Ç. S x S is a partial order. A set X Ç 5 is called a chain if and only if for every pair of éléments Xi G X,X2 G X we hâve that either Xi :< X2 or X2 :< x\.

Dually, a set X Ç 5 is called an antiehain if and only if for every pair of éléments Xi

G

X, X2

G

X we hâve that whenever X\ ■< X2 or X2 x\ then x\ — X2. We dénoté by Chains[S', -<] and Antichains[5, :<] the set of chains and set of antichains of the partially ordered set {S, ■<), respectively. The

(27)

Chapter 2. Preliminaries 11

height (respectively width) of a partially ordered set is the cardinality of its largest chain (respectively largest antichain).

Let {S, ■<) be a partially ordered set. The upward-closure of a set X Ç 5 is

={

s

G

5 |3a;GX:sbx}. Conversely, the downward-closure of a set X Ç 5 is 4--^

={

s

G

5

|

3x

GA’:

s

:<

x

}.

AsetXÇS'is upward- closed (respectively downward-closed) if and only if X = (respectively X = 4-X). We respectively dénoté by UCS[5, :<] and DCS[5, :<] the set of upward- and downward-closed subsets of the partially ordered set

For any set X Ç S, the set of minimal éléments of X is the antichain = {x Ç: X |Vx'gX:x':^x=J>x' = x}; similarly, the set of maximal élé

ments of X is the antichain [X] = {x E X \ V x' E X : x' ^ x => x' = x}.

Note that [XJ and [X] are always antichains since, by définition, they cannot contain two distinct and comparable éléments.

A useful property of upward and downward-closed sets is that upward and downward-closedness is preserved by union and intersection. This is formalized by the following lemma.

Lemma 2.1.1 (U and fl Préservé Closedness for Partial Orders). Let {S, -<) be a partially ordered set. For every pair of upward-closed sets f/i, f/2 Q S, the sets U1UU2 and f/i fl f/2 are both upward-closed. Likewise, for every pair of downward-closed sets Di, D2 Q S, the sets D\ U D2 and Di n D2 are both downward-closed.

The following theorem formalizes the notion that minimal and maximal antichains are canonical représentations of upward- and downward-closed sets, respectively.

Theorem 2.1.2 (Antichain Représentation of Closed Sets). For any par

tially ordered set {S, :<) and X Ç. S, we hâve that X is upward-closed if and only if X = |[XJ. Similarly, X is downward-closed if and only if X = l\X],

Let {S, :<) be a partially ordered set and X C S. An element s G 5 is an upper bound of X if and only if V x G X : s ^ x; dually, s is a lower bound of X if and only ifVxGX:s:<x. The set of upper bounds (respectively lower bounds) of X is denoted by UB[S', :^](X) (respectively LB[5, :<](X)). The least upper bound of X, denoted LUB[5, ::^](X), is the element s such that

(28)

12 2.1. Sets, Functions, and Relations

LUB[S, :<](X)J = {s}, and is undefined when | LUB[5, :<](X)J | 7^ 1. Dually, the greatest lower bound of X, denoted GLB[5, :^](X), is the element s such that [LB[5, :<](X)] = {s}, and is undefined when | [LB[5, :<](X)] | 7^ 1.

In the context of partially ordered sets, an isomorphism is a one-to-one correspondance (i.e., a bijection) between two posets that is order preserv- ing. Formally, the bij active function a : 5i i-)- S2 is an isomorphism between the posets {Si, :;<i) and {S2,1^2) if and only if Vs G Si, s' G 5i: (s :<i s') <=>

((t(s) :<2 o-(s')). We say that two partially ordered sets are isomorphic if and only if there exists an isomorphism between them.

2.1.3 Lattices

A join-semilattice

(respectively

meet-semilattice)

is a posât set

{S, ■<) such

that, for every pair of éléments Si G 5,^2 G

S, LUB[5,

:^]({si, S2}) (respec

tively

GLB[5,

:<]({si, S2})) is defined.

A lattice

is a posât that is both a join-semilattice and a meet-semilattice.

The least upper bound of two lattice éléments si, «2 is called the join of these éléments, and is denoted Si

U «2;

the greatest lower bound of si,

S2

is called the meet of these éléments, and is denoted Sj n S2-

A lattice {S, :<) is said to be complété if and only if both LUB[5, r^KA”) and GLB[5, :<](A) are defined for every set X Ç S. Notice that finite lattices are always complété. A lattice {S, :<) is bounded if and only if we hâve that both LUB[5, :<](5') and GLB[5, r^](5) are defined, in which case they are denoted T and ±, respectively. Again, finite lattices are trivially always bounded.

A

lattice

{S, -<)

is

distributive

if and only if the following two equalities hold for ail triples si,

82,83

G

S.

(si U S2) n S3 = (si n S3) U (s2 n S3) (si n S2) U S3 = (si U S3) n (s2 U S3)

An element s of a lattice {S, :<) is join-irreducible if and only if s 7^ ± and Vsj

G

S, 82

G

S: (s'^

U

S2 = s) (s'i = s

V

S2 = s). Dually, s is meet-irreducible if and only if s

7^

T and V

sj

£ S, 82 E S : (s(

n S2 =

s)

(s'i = s V «2 = s). In short, s ^ {i-,T} is join-irreducible (respectively

(29)

Chapter 2. Preliminaries 13

meet-irreducible) whenever it is not the join (respectively not the meet) of two distinct lattice éléments. The set of join-irreducible (respectively meet- irreducible) éléments of a lattice {S,:<) is denoted

JIR[S',

:<] (respectively

MIR[5,

:<]).

We call a lattice of sets any collection of sets that forms a lattice using subset inclusion as partial order. Simply put, any collection of sets that is closed under imion and intersection is a lattice of sets.

We can now recall Birkhoff’s Représentation Theorem for finite distribu

tive lattices. This important resuit in lattice theory States that every finite distributive lattice is isomorphic to a lattice of sets.

Theorem 2.1.3 (Birkhoff’s Représentation Theorem [Bir67]). Any finite distributive lattice {S, :^) is isomorphic to the lattice of downward-closed sets of its join-irreducible éléments, or formally, the lattice

(DCS[JIR[5,

-<

], :^]) Q)- The corresponding isomorphism is a{s) = {x E JIR[5, :<] | a: s}

with a~^{X) = LUB[5, :<](X). Conversely, any finite distributive lattice {S, :<) is isomorphic to the lattice of upward-closed sets of its meet-irreducible éléments, or formally, the lattice (UCS[MIR[5, Ç). In this case the corresponding isomorphism is the function a(s) = {x E MIR[5,:<] | x ^ s}

with a-i(X) = GLB[5, ^](X).

An illustration of Birkhofî’s theorem is depicted at Figure 2.1.

The set of upward-closed subsets of a finite join-semilattice is a finite and distributive lattice. Likewise, the set of downward-closed subsets of a finite meet-semilattice is a finite and distributive lattice.

Theorem 2.1.4 (Lattice ofUpward/Downward-Closed Sets). For any join- semilattice {Su, we hâve that

(UCS[S'u,

Ç) is a finite and distributive lattice. For any meet-semilattice

(5n,

i^) we hâve that

(DCS[S'u,

:<], Ç) is a finite and distributive lattice.

The lattice of upward-closed subsets of a finite join-semilattice is iso

morphic to a lattice of minimal antichains. Symmetrically, the lattice of downward-closed subsets of a finite meet-semilattice is isomorphic to a lat

tice of maximal antichains.

(30)

14 2.1. Sets, Functions, and Relations

Figure 2.1: Illustration of Birkhoff’s représentation theorem on the lattice of divisors of 60. The set of divisors of 60 is partially ordered by the divides relation. The join and meet are the least common multiple and greatest common divisor operations, respectively.

(31)

Chapter 2. Preliminaxies 15

Theorem 2.1.5 (Lattice of Minimal Antichains). Let {S,:<) be a finite join-semilattice, and for any X,Y E Antichains[S', let X ifî y x E X 3y E Y : X y. The pair (Antichains[5, is a finite and distributive lattice, such that for any X,Y E Antichains[5, :<];

Auy = [Aurj

Any =

[{xUy \ X E X,y eY}\

± = 0 T =

Moreover, the function [-J : UCS[5, :<] Antichains[5, :^] is a lattice iso- morphism between (UCS[5, Ç) and (Antichains[5,

Theorem 2.1.6 (Lattice of Maximal Antichains). Let (S,:<) be a finite meet-semilattice, and for any X,Y E Antichains[5, :^] let X ^Y iff \/ x E X 3y eY X -<y. The pair (Antichains[5, :^],E) is a finite and distributive lattice, such that for any X,Y E Antichains[5, :<];

Auy = [A U y]

Any =

\{xny \ X E X,y E

y}]

± = 0 T = [5]

Moreover, the function [•] ; DCS[5, :<] Antichains[5, :^] is a lattice iso- morphism between (DCS[S, Ç) and (Antichains[S,

2.1.4 Functions over Lattices and Fixed Points

We now recall some définitions regarding functions over lattices.

Let {S, :<) be a lattice; a function / : 5 i->- 5 is monotonie if and only if Vs G 5,s' e 5: (s s') => (/(s) :< /(«'))• Moreover, the fimction / is continuons if and only if either S is a finite set, or for every non-empty set X Ç S, we hâve that /(LUB[5, ^](A)) = LUB[5, ^]({/(a:) | x G A}).

Note that every continuons function is necessarily monotonie. In this thesis, we shall only manipulate functions over finite lattices, so the continuity requirement will always be trivially satisfied by monotone functions.

(32)

16 2.1. Sets, Functions, and Relations

A fixed point of a fonction f: S S over the lattice {S, :^) is an element s G S such that f{s) = s. We dénoté by FP[5, :i^](/) the set of fixed points of / over the lattice {S, :<).

We can now recall the well-known Knaster-Taxski theorem on continuons functions over complété lattices.

Theorem 2.1.7 (Knaster-Tarski Theorem). For every continuons function f : S S over a complété lattice {S, ■<), the set of fixed point FP[5, :<](/) forms a complété lattice with the partial order .

Since complété lattices are necessarily non-empty^, it follows from the Knaster-Tarski theorem that every continuons function over a complété lat

tice admits both a least fixed point and a greatest fixed point. We dénoté these fixed points by LFP[5, i^](/) = GLB[S', ^](FP[5, :^](/)) and GFP[S', :<

](/) = LUB[5, :^](FP[5, ^](/)), respectively.

Let / : 5 1-^ 5 be a continuons function over the complété lattice (5, d)- We define /° to be the identity function, i.e., Vs G 5: /°(s) = s, and for any f G Nq we define p recursively as /®(s) = f{P~^{s)).

Algorithm 2.1 depicts the practical itérative computation of fixed points.

In both algorithms, the least or greatest fixed point is computed by repeat- edly applying the function (beginning either with T or

T).

2.1.5 Galois Connections

In this thesis, we use Galois connections [CC77] to formalize the approxi

mation of alternating automata. A Galois connection is a pair of functions oc : C 1-^ A, y : A C connecting two domains A and C, which axe both complété lattices. The complété lattice (C, Çc) is called the concrète do

main, while {A, Ç_4) is called the abstract domain. Therefore, the function Y ; C is called the concretization function, and a : C i-)- ^ is called the

abstraction function.

Définition 2.1.8 (Galois Connection [CC77]). Let (.4, and (C, Ec) two complété lattices, and two functions oc : C A and y : A C. The

^At the very least, LUB(0) and GLB(0) must evaluate to a lattice element.

(33)

Chapter 2. Preliminaries 17

Algorithm 2.1; Computation of least and greatest fixed points, inputs : a complété lattice {S, :<) and a continuons function

f-.S^S output: LFP[5, :<](/) 1 begin computeLFP(5, :<,/)

s i— J_ 5 do

s' s ;

s •«- /(s) ;

while s ^ s' \ return s ; 8 end

g

10

11

12 13 14 15 16

inputs : a complété lattice (5, ■<) and a continuons function output: GFP[5, :^](/)

begin computeGFP(S, f) s ^— T J

do

s' <— s ] s

/(s) ;

while s ^ s' ; return s ; end

4-tuple

(a,

{A,

(C, E

c

))T)

forms a

Galois connection,

which is denoted {C, Ec) ^ ^ > (-4, E^), if and only if (both statements are équivalent):y

y c £ C Wa e. A: oi{c) a c Çc "/(«)

y CE C: cÇ.cy° «.(c) and Va € A: a o y (a) a

The lattice orderings in both domains represent the relative précision of domain values. We say that an abstract value a E A represents a concrète

(34)

18 2.1. Sets, Functions, and Relations

value c £ C whenever a(c) a, or equivalently whenever c Qc t(o)- Furthermore, we say that a concrète value c is exactly represented by a Galois connection whenever y ° a(c) = c, that is whenever c is represented without loss of précision.

Lemma 2.1.9 (Properties of Galois Connections). For every Galois con-

y

nection {C, ^ > {.4, the following properties hold:

• a and y are monotone functions ;

• for every ci, Cq G C we hâve that a(ci U C2) = a(ci) U a(c2) ;

• for every ai, 02 E A we hâve that y{a\ PI 02) = y(ci) H y(c2) ;

• aoyoa=a and y o a o y = y ;

• a = Ac : GLB({a | c Ce y(a)}) ;

• y = Aa : LUB({c ] a(c) a}) .

We now provide some intuitions on some of the properties above.

The properties c Ec y o a(c) and a o y(a) a show the soundness of the abstraction: a concrète value c should be at least as précisé than the concretization of the abstraction of c, and similarly, the abstraction of the concretization of an abstract value a should be at least as précisé as a.

The last two properties of the above lemma show that the abstraction fonction and the concretization fonction are both univocally determined by the other. What is more, these properties states that Galois connections are always such that a is maximally précisé, since it maps each concrète value c to the unique most précisé value a that represents c (that is, c Ec

y

A Galois connection {C, Ec) ^ ^ > {A, E>i) for which y o a is the identity fonction is called a Galois insertion.

The following theorem shows how it is possible to approximate the ab

straction of a concrète domain fixed point, by carrying out the fixed point computation in the abstraet domain. In general, this abstract fixed point computation is less précisé than the abstraction of the concrète fixed point.

In practice, it is often not possible to do better however, when the concrète fixed point carmot be computed directly for undecidability or intractability

reasons.

(35)

Chapter 2. Preliminaries 19

Theorem 2.1.10 (Abstraction of Fixed Points [CC77]). For any Galois connection (C, (.4., , and for any monotone function f : C ^ C we hâve that:

a(LFP[C, Ec](/)) LFP[.4,

Ç^](a o / o y)

a(GFP[C,Çc](/)) GFP[A

E^](ao/oy)

2.2 Boolean Logic

In this section, we quickly recall a number of définitions and notations rcgarding Boolean logic that are needed in the sequel.

2.2.1 Boolean Functions

We dénoté the set of Boolean truth values as B {true, false}. In general, a set of logical truth values can contain more than two éléments (e.g., to accommodate a third, uncertain possibility), and can even be infinité (e.g., the real numbers between 0 and 1); Boolean logic however, deals only with the binary world of true and false.

Let P be a finite set of éléments called propositions. A valuation over P is a set U Ç P which identifies a truth assignment of each proposition in P; by convention, the propositions in v are seen as assigned to true and the propositions in P \ u are seen as assigned to false. For any valuation U

6

2’’' and proposition p

G

P, we use the notations v|p=true v

U

{p} or

^|p=faise ’= v \ {p} interchangeably.

A Boolean function is a function / of type / : 2*” —> B for some finite set of propositions P. We refer to the cardinality |P| as the arity of a Boolean function. For any Boolean function / : 2®’ —>• B, with p

G

P and t

G

B, we define f\p=t to be the function /': 2^ B such that f'{v) = f{v\p=t) for every valuation v G 2^. We dénoté the set of Boolean functions over P by BF(P).

The efficient représentation and manipulation of Boolean functions is a central problem in both computer science and engineering. The most ex-

(36)

20 2.2. Boolean Logic

plicit, and the simplest représentation of a Boolean function is the truth table, which contains one row for each of the possible 2l*’l inputs. The remainder of this section reviews two more interesting représentations of Boolean functions, namely propositional Boolean formulas and reduced or- dered binary decision diagrams.

2.2.2 Propositional Boolean Formulas

Let us first formally define the syntax of propositional Boolean formulas.

Définition 2.2.1 (Syntax of Propositional Boolean Formulas). Let

P

be a finite set of Boolean propositions, and let p E F. The set of syntacti- cally correct propositional Boolean formulas is defined recursively with the following grammar:

false | true | p \ -k/? | </3i V | y’i A </?2

We dénoté the set of propositional Boolean formulas over

P

by

BL(P).

The above définition introduces basic logical connectives. The symbol -■

is the unary négation,

V

is the disjunction connective and

A

is the conjunc- tion connective. Additionally, we define the following syntactic shorthands:

{ip => ip) (-lyj V ip) is the logical implication coimective (it reads “p im

plies xp"), and {p xp) {p => xp) A {xp ^ p) is the logical équivalence coimective (it reads “<^ if and only if xp” )

.

As stated previously, propositional Boolean formulas are syntactic rep

résentations of Boolean functions. The represented Boolean function is obtained by induction on the structure of the formula, as formalized in the following définition.

Définition 2.2.2 (Semantics of Propositional Boolean Formulas). Let

P

be a finite set of Boolean propositions, and p be a propositional Boolean formula over F. The semantics of the formula p, denoted |[(^|, is the Boolean function / : 2** B such that, for every valuation v E 2^ :

• |true](u) = true;

• [false] (w) = false;

(37)

Chapter 2. Preliminaries 21

• IpI (^) = iffp £ V (assuming p & Ÿ);

• = true iff = false;

• \ip\ V = true iff |¥?i](f) = true or = true or both;

• \ipi

A

(Pq\{v) = true iff both [y’iKu) = true and |</:’2]1(^) — true.

We see from the above définition that propositional Boolean formulas associate to each valuation of their propositions the value true or false. When 1*^1 ('T’) = true we say that the valuation v satisfies the formula <p, which we dénoté more compactly hy v ip. Conversely, when |<^l(f) = false then v does not satisfy ip which is simply written v ^ p). A formula p is said to be satisfiable iff it is satisfied by at least by one valuation, or equivalently iff [(p] 7^ Au • false. Dually, a formula p is valid iff it is satisfied by every valuation, or equivalently iff |(^J = Au • true. Deciding whether a given propositional Boolean formula is satisfiable or valid are fondamental NP- COMPLETE and CO-NP-Complete problems, respectively [Coo71]. We dénoté by Sat((/?) {u G 2’’’ \ v\= p^ the set of valuations which satisfy a formula p.

2.2.3 Positive Propositional Boolean Formulas

In the remainder of this thesis. we will often manipulate positive proposi

tional Boolean formulas, i.e., formulas which do not contain négation symbols. We dénoté by BL'^(P) the set of positive propositional Boolean for

mulas over P.

The interesting property of positive propositional Boolean formulas is that the set of valuations that makes the formula true is always upward- closed. This is formalized by the following Lemma.

Lemma 2.2.3 (BL+ Formulas Hâve Upward-Closed Valuation Sets). Let P be a finite set of Boolean propositions and let p G BL'*'(P). The set of valuations Sat((/?) is upward-closed for Ç.

Proof We proceed by induction over the structure of the formula p. For the base case, we can hâve p = true and Sat(</?) = 2^, ox p — false and Sat(</j) = 0, which are both upward-closed for Ç. The final base case is y? = p, with p G P, for which Sat(<p) = {uÇP |pGu}is also upward-closed. For the

(38)

22 2.2. Boolean Logic

inductive case, we either hâve that (p = if\\/ and Sat((^) = Sat((/?i) U Sat((/?2), ox tp = ip\ h <p>2 and Sat((^) = Sat((/?i) fl Sat((/?2)- By the induction hypothesis, we know that both Sat(</Ji) and Sat((/?2) are upward-closed, and since miion and intersection preserve upward-closedness by Lemma 2.1.1, we hâve that Sat(i^) is upward-closed for Ç in both cases. □

2.2.4 Reduced Ordered Binary Decision Diagrams

Formulas offer a simple and compact représentation of Boolean functions but lack the crucial property of canonicity, which is of great practical impor

tance, as it allows for example to easily test for équivalence (more on that later). Truth tables are canonical but are always exponential in |P|, no mat- ter the represented frmction, so they cannot be used for practical pmposes.

Reduced ordered binary decision diagrams [Bry86] (ROBDD, for short) pro

vide a canonical, graph-based représentation of Boolean functions that has proven to be extremely efficient in a mnnber of applications, particularly in the field of symbolic model checking [BCM+90].

Définition 2.2.4 (Ordered Binary Decision Diagrams). Let¥ = {pi,... ,pk}

be a totally ordered set of Boolean propositions. An ordered binary de

cision diagram (OBDD, for short) over P is (i) either a terminal OBDD (index(n), val(n)) where index(n) = A: -I- 1 and val(n) G {true, false}; or (U) a non-terminal OBDD (index(n), lo(n), hi(n)), where 1 < index(n) <

k and lo(n) and hi(n) are (terminal or non-terminal) OBDD such that index(hi(n)) > index(n) and index(lo(n)) > index(n). We dénoté the set of syntactically correct OBDD overF by OBDD(P).

Additionally, for every n € OBDD({pi,... ,Pk}) with index(n) = i, we de- note by prop(n) the proposition p,.

It is easy to see that the S3mtactic requirements enforced on OBDD by the définition above are such that ail OBDD are essentially rooted directed acyclic graphs, with non-leaf nodes having an out-degree of 2 and leaf nodes being labeled by a truth value.

In the sequel, we refer to OBDD also as “OBDD node”, or simply

“node”. This abuse of language is justified by the fact that OBDDs are

(39)

Chapter 2. Preliminaries 23

uniquely identified by their root node. For any non-terminal node n, we call hi(n) and lo(n) the high-child and low-child of n, respectively. Ad- ditionally, the set nodes(n) of an OBDD n is defined recursively as follows. If n is terminal, then nodes(n) = {n}. Otherwise nodes(n) = {n} U nodes(lo(n)) U nodes(hi(n)). The number of (unique) nodes of an OBDD is denoted by |n|.

The semantics of ordered binary decision diagrams is naturally defined in terms of the if-then-else function, which we dénoté ite. For any p G IP, /, P : 2“’ B we hâve that |ite(p, /, p)] = Xv - if p e v then f{v) else g{v).

Définition 2.2.5 (Semantics of OBDD). Let P = {pi,... ,pfc} be a totally ordered set of Boolean propositions and let d G OBDD(P). The semantics of the ordered binary decision diagram d, denoted [d], is a Boolean function f : 2^ B defined recursively over the structure of d:

• if d is a terminal node then |d] = Jval(d)]|;

• otherwise, let i = index(d), [d]] = ite(pj, |lo(d)]|, |[hi(d)]).

We can see from the above définitions that OBDD are non-canonical.

Reduced OBDD (or ROBDD for short) provide additional syntactic require- ments to enforce canonicity. The définition of ROBDD relies on the notion of isomorphism between OBDD, which we now define formally.

Définition 2.2.6 (Isomorphism Between OBDD). Let d\,d2 G OBDD(P) be two OBDD over P. An isomorphism between d\ and d2 is a bijection a : nodes(di) i-> nodes(d2) such that for every ni G nodes(di) we hâve that:

• index(rii) = index(cr(ni));

• CT(lo(ni)) = lo(o-(ni));

• f^(hi(ni)) = hi((j(ni));

• val(ni) = val(a(ni)) if ni is terminal.

Two OBDDs are isomorphic iff there exists an isomorphism between them.

We can now formally define reduced OBDD.

Définition 2.2.7 (Reduced OBDD). LetB = {pi,... ,pfc} andd G OBDD(P).

We say that the OBDD d is reduced iff it satisfies the following constraints:

(40)

24 2.2. Boolean Logic

• d does not contain two distinct isomorphic OBDDs;

• for every n € nodes(d) we hâve that lo(n) ^ hi(n).

l^e dénoté the set of syntactically correct ROBDD overŸ by ROBDD(P).

Bryant [Bry86] showed that ROBDDs are a canonical data structure for Boolean fonctions, assuming a total ordering of the Boolean propositions.

This is formalized by the following theorem.

Theorem 2.2.8 (Canonicity of ROBDD [Bry86]). Let P = {pi,... ,Pk} be a totally ordered set of Boolean propositions, and let f : 2^ be a Boolean function over P. There exists a ROBDD d G ROBDD(P) such that |[d] = /

and for every d'G ROBDD(P) if [d'| = / then d and d' are isomorphic.

For every Boolean formula tp G BL(P), we dénoté by BuildROBDD((p) the ROBDD d G ROBDD(P) such that {dj = |<p].

The practical importance of canonicity is very large. Efficient ROBDD implémentations very often enforce structural equality between isomorphic ROBDD, so that two ROBDD are équivalent if and only if they hâve the same address. One can therefore check for satisfiability or validity of an ROBDD simply by inspecting its root node. More importantly, memory- level canonicity means that ROBDD can be implemented as hashable values;

this allows to use caching techniques and dynamic programming algorithms.

As expected, we dénoté the disjunction and conjunction of two ROBDD di, d2 respectively by d\ V ^2 and d\ A d2, and we dénoté the négation of an ROBDD d by -id (the formai définitions of these operations are obvions and omitted here). Likewise, for an ROBDD d

G

ROBDD(P) and p

G

P, we use the syntax d|p=true and d|p=faise in the expected way. An additional ROBDD operation that we use throughout this work is existential quantification, defined below.

Définition 2.2.9 (ROBDD existential quantification). Let P = {pi,... ,pk}

be a totally ordered set of Boolean propositions, and let d G ROBDD(P) be an ROBDD over P. The existential quantification of d with respect to a set 5 Ç P, denoted 3S ■ d, is defined by 3 S ■ d'^= ^pes (^lp=faise V d|p=true)-

(41)

Chapter 2. Preliminaries 25

2.3 Finite Automata

In this section, we review the basic formalisms needed to represent formai languages and encode computation Systems.

2.3.1 Alphabets, Words, and Languages

Let E be a finite non-empty set that we call alphabet and the éléments of which we call letters. A finite word is a finite sequence w = ao,..., <7„_i of letters from E. The word of length zéro is denoted e. We dénoté by E* the set of ail finite words over E. A finite-word language is a finite or infinité set of finite words, i.e., a set L Ç E*. We also call E* the finite-word universal language as it contains ail words of finite length, and we call the empty set of words 0 the empty language.

These notions are extended to infinité words in the following natural way. An infinité word is an infinité sequence w = ao, ai,... oi letters from E. We dénoté by E" the set of ail infinité words over E. An infinite-word language is a (finite or infinité) set of infinité words, i.e., a set L Ç E". We call E" the infinite-word universal language.

The length of a word w is denoted |u;|; the length of a infinité word is simply +00. The position of a letter in a finite or infinité word is a natural nmnber, the first letter being at position zéro. We dénoté the letter of position i in a finite or infinité word w by u;[f]. For any word w such that

|îs| > i, we dénoté by rt;[f...] the suffix of the word w, staxting at and including position i. Note that a suffix has always a strictly positive length.

In this thesis we do not consider languages that contain both finite and infinité words.

2.3.2 Finite State Machines and Automata

On of the most fundamental tools to represent formai languages is the finite State machine (FSM for short). An FSM is essentially a graph whose vertices are called states, and whose edges are called transitions, and which identifies a set of initial states along with a set of final states (non-initial and non-final States are simply called states). Additionally, the transitions of an FSM are

(42)

26 2.3. Finite Automata

labeled by symbols taken from a finite alphabet.

Définition 2.3.1 (Finite State Machine). A finite state machine is a tuple {Q, E, /, —F) where:

• Q = {qi,..., q„} is a finite set of states;

• S = {(Tl,... ,ak} is a finite alphabet;

• I Ç Q is a finite set of initial states;

• -^Ç:Qy.T,xQis a labeled transition relation;

• F Ç. Q is a finite set of final states.

The transition relation —>Ç QxExQof FSM is inherently non deterministic, in the sense that there can be several edges labeled with the same alphabet symbol which go from a single state to several dis

tinct States. Contrarily, deterministic FSM are such that Vg G Q,'icr G

I

^{{9^}

I

^{(9) 9O}

^ } I ^

1- Moreover, deterministic FSM must hâve exactly one initial state, i.e., |/| = 1.

An FSM is complété iff Vç G G S: | {(?' | (9, cr, g') G ^} | > 1.

In other words, a complété FSM contains no deadlocks, i.e., each state has always at least one outgoing transition for every alphabet symbol. Without loss of generality we shall assume in the remainder of this thesis that ail FSM that we manipulate are always complété, unless otherwise stated.

Finite state machines admit either finite or infinité executions called runs. A rim of an FSM is a finite or infinité path inside the machine which follows the transition relation. Runs which originate from the initial state are called initial. Moreover, every nm of a finite state machine is classified as either accepting or rejecting (non-accepting). In the case of finite runs, the acceptance criterion is simple: a finite run is accepting iff it ends in a final state. For infinité nms, we consider the socalled Büchi acceptance condition. A run is accepting for Büchi iff it visits final states infinitely often.

Many other acceptance conditions hâve been studied in automata theory, such as Streett, Rabin, Parity, generalized Büchi, coBüchi and others. In this thesis, we will be mainly concerned with the Büchi acceptance condition for infinité words. We can now formally define the notion of runs of an FSM.

Définition 2.3.2 (Run of an FSM). Let M = {Q,T,, I,^,F) be an FSM and let n; G S* U S“ be a word. A nm of the finite state machine M over

Disponible à / Available at permalink :

D 03746

New Algorithms and Data Structures for the Emptiness Problem of Alternating Automata

Nicolas Maquet

beispo

New Algorithms and Data Structures for the Emptiness Problem of Alternating Automata

Nicolas Maquet

Résumé

'i'i:' I

'I

I'

Abstract

This Work studios new algorithms and data structures that are useful in the context of

As computers hâve become more and more ubiquitous in our modem societies, an increasingly large number of computer-based Systems are considered

Such Systems are characterized by the fact that a failure or a

(computer error in the computing jargon) could potentially cause large damage, whether in loss of life, environmental damage, or économie damage. For safety-critical Sys­

tems, the industrial software engineering community increasingly calls for using techniques which provide some

that a certain piece of software is correct.

Acknowledgements

r

Contents

xiii

Contents

Contents

FDL

SNF . .

SNF

SNF

SNF

Contents

Chapter 1 Introduction

1.1 Context

1

2 1.1. Context

particularly fruitful and popular abstraction of computer programs is the

or

Finite-state abstractions are useful for modeling programs and properties where the intricacies of the System lies more in the control-state combinatorial rather than in the ma­

nipulation of complex data. Many Systems fall into this category, such as network protocols, distributed control Systems, embedded or reactive Sys­

tems, and so on. The field of

encompasses the study of

Chapter 1. Introduction 3 efficient algorithms and data structures for the analysis of finite-state Sys­

tems. In this context, a program is represented by a finite-state abstraction

that describes a

of

denoted

On the other hand, a

is written down in some formai logic (temporal or otherwise) that describes a language of

denoted

For a System

and a property

the

asks whether

that is whether

Ç

If ail the behaviors of a sound approximation of a program

are included in those defined by a property

then we say that

is

with respect to

The primary challenge faced by the model checking community is the

Finite-state abstractions of real-world pro- grams typically hâve a very large number of States, and much research effort has been put into dealing with this challenge.

PS

= APT

).

4 1.2. Contributions

1.2 Contributions

Chapter 1. Introduction 5

6 1.3. Plan of the Thesis

1.3 Plan of the Thesis

Chapter 1. Introduction 7

we présent abstraction / refinement algorithms designed to handle alternating automata with very large numbers of States. The abstrac­

tion is based on the set of partitions of the set of States, and uses a refinement strategy that is not based on counter-examples. An ex­

perimental évaluation of the technique is discussed, comparing our approach to “classical” non-abstract antichain algorithms.

1.4 Chronology &; Crédits

8 1.4. Chronology & Crédits

Chapter 2 Preliminaries

2.1 Sets, Punctions, and Relations

2.1.1 Binary Relations

A

over a set

(computer error in the computing jargon) could potentially cause large damage, whether in loss of life, environmental damage, or économie damage. For safety-critical Sys

Finite-state abstractions are useful for modeling programs and properties where the intricacies of the System lies more in the control-state combinatorial rather than in the ma

nipulation of complex data. Many Systems fall into this category, such as network protocols, distributed control Systems, embedded or reactive Sys

Chapter 1. Introduction 3 efficient algorithms and data structures for the analysis of finite-state Sys

we présent abstraction / refinement algorithms designed to handle alternating automata with very large numbers of States. The abstrac

tion is based on the set of partitions of the set of States, and uses a refinement strategy that is not based on counter-examples. An ex

:^]({si, S2}) (respec