A brief Introduction to Automated Theorem Proving - PowerPoint PPT Presentation

1 / 33
About This Presentation
Title:

A brief Introduction to Automated Theorem Proving

Description:

ATP: Vampire, Otter, Spass, E-SETHEO, Darwin, Epilog, SNARK, Gandalf ... No 'external' functions symbols have to be considered! ... – PowerPoint PPT presentation

Number of Views:238
Avg rating:3.0/5.0
Slides: 34
Provided by: stiinn
Category:

less

Transcript and Presenter's Notes

Title: A brief Introduction to Automated Theorem Proving


1
A brief Introduction to Automated Theorem Proving
  • Theoretical Foundations, History and the
    Resolution Calculus for classical First-order
    Logic
  • Uwe Keller

2
Content
  • Intoduction
  • Motivation History
  • Theorem Proving, ATP and Calculi
  • Foundations
  • FOL, Normalforms Preprocessing, Metaresults
  • Resolution
  • Basic calculus, Unification
  • Refinements, Redundancy
  • Decision procedures
  • Chain Resolution
  • A Variant of Resolution for the Semantic Web
  • Demo

3
Part IIntroduction
  • Motivation History
  • Theorem Proving, ATP and Calculi

4
Logic and Theorem Proving
Real-world description in natural
language. Mathematical Problems Program
Specification
Formalization
Syntax (formal language). First-order Logic,
Dynamic Logic,
Semantics (truth function)
Calculus (derivation / proof)
Correctness
Valid Formulae
Provable Formulae
Completeness
5
How did it start
  • Results from first-half of the 20th century in
    mathematical logic showed
  • we can do logical reasoning with a limited set of
    simple (computable) rules in restricted formal
    languages like First-order Logic (FOL)
  • That means computers can do reasoning!
  • Implementation of ATP
  • First Computers where needed - )
  • AI as a prominent field Reasoning as a basic
    skill!
  • Mid 1950s first attempts to implement an ATP
  • Today
  • (A)TP is no longer only a part of main stream AI
  • Central shared problem How to represent and
    search extremely large search spaces!

6
A rough timeline in ATP
  • before 1950 Proof-theoretic Work by Skolem,
    Herbrand, Gentzen and Schütte
  • 1954 First machine-generated Proof (Davis)
  • 1955ff Semantic Tableaus (Beth, Hinitkka)
  • 1957 First machine-generated Proof in Logic
    Calculus (Newell Simon)
  • 1957 Lazy substitution by free (dummy) Vars
    (Kanger, Prawitz)
  • 1958 First prover for Predicate Logic (Prawitz)
  • 1959 More provers (Gilmore, Wang)
  • 1960 Davis-Putnam Procedure (Davis, Putnam,
    Longman)
  • 1963 Unification (J.A. Robinson)
  • 1963ff Resolution (J.A. Robinson) Inverse
    Method (Maslov)
  • 1963ff Modern Tableau Method (Smullyan, Lis)
    without Unification
  • 1968 Modelelimination (Loveland), with
    Unification
  • 1970ff PROLOG (Colmerauer, Kowalski),
    Refinements of Resolution
  • 1971 Connection Method (Bibel), Matings
    (Andrews) with Unification
  • 1985 ATP in non-classical logics, Renaissance of
    Tableaux Methods
  • 1987 Tableaus with Unification
  • 1993ff Renewed interest in Instance-based
    Methods DPLL, Modelevolution

7
Theorem Proving
  • Given
  • a formal language (or logic) L
  • a calculus C for this language ( set of rules)
  • a conjecture S and a set of assumptions or axioms
    A in the language L
  • Determine
  • Can we construct a proof for S (from A) in
    calculus C?
  • Logic Syntax Semantics Calculus
  • TP Proof-search in C (Huge search problem)
  • Correctness and completeness of Calculi essential
    properties
  • Calculus Non-deterministic Algorithm
  • Central problem in ATP How to implement a
    non-deterministic algorithm efficiently on a
    deterministic machine - )

8
Theorem Proving (II)
  • Research areas
  • Interactive / tactic TP vs. Automated TP
  • Classical Logic vs. Non-classical logics
  • Calculi for
  • ATP - General principle Refutation
  • Resolution, Tableau, Inverse Method,
    Instance-based Methods
  • ITP General principle Proof situation /
    context
  • Sequent Calculi
  • others General principle Generation from
    Axioms
  • Hilbert-style Calculi
  • Central difference
  • What are the elements in a proof what is a
    proof?

9
Main TP Applications
  • Main Applications
  • Software Hardware Verification
  • Theorem proving in Mathematics
  • Query answering in rich knowledge bases
    (Ontologies)
  • Verification of cryptographic protocols
  • Retrieval of Software Components
  • Reasoning in non-classical Logics
  • Program synthesis
  • many systems implemented
  • ATP Vampire, Otter, Spass, E-SETHEO, Darwin,
    Epilog, SNARK, Gandalf
  • ITP Isabelle/HOL, Coq, Theorema, KeY-Prover

10
Why is FOL of special interest in the ATP
community ?
  • There are less more expressive logics than FOL
  • Classical Propositional Logic, Modal
    Propositional Logic, Description Logics, Temporal
    Propositional Logic
  • Higher-order Predicate Logics, Dynamic Predicate
    Logics, Type Theory
  • Research in ATP mainly focused on FOL
  • FOL is very expressive, many real-world problems
    can be formalized in FOL
  • FOL turned out to be the most expressive logic
    that one can adequately approach with ATP
    techniques

11
Example
  • Theorem in (elementary) Calculus
  • Nullstellensatz Every function which is
    continous over a closed interval Ia,b must
    take the value 0 somewhere in I if f(a) lt 0 and
    f(b) gt 0
  • Proof idea Consider the Supremum l of set M
    x f(x) lt 0, altxltb and show that f(l) 0

12
Example (II)
  • Formalization
  • Compact (only LEQ)
  • Redundancy-free
  • Specific definitions
  • Continous functions
  • Main idea of proofis already encoded
  • Use Supremum
  • Can be done by anATP system
  • but without properFormalization ?!?
  • ATP better than humanprover? Robbins Problem in
    Algebra
  • Intelligent Proving vs.Combinatorical proving

13
Part IIFoundations
  • FOL, Normalforms Preprocessing, Metaresults

14
Classical First-order Logic (FOL)
  • Syntax
  • Signature
  • Function Symbols, Predicate Symbols, Arity,
    logical Connectives, Quantors
  • Terms (over ), Atomic Formulae (over ),
    Formluae (over )
  • Definition relative to the signature of the
    predicate logic
  • Semantics
  • First-order structure / interpretation S (U,I)
  • Universe U Signature-Interpretation I
  • Constants I(c) element of U
  • Functionsymbols I(f) total functions on U
  • Relationsymbols I(R) relation on U
  • Logical connectives and quantors in the usual way
  • Definition relative to the signature of the
    predicate logic

15
Classical FOL (II)
  • Model of a statement
  • An interpretation S (U,I) is called a model of
    a statement s iff valS(s) t
  • What does it mean to infer a statement from given
    premisses?
  • Informally Whenever our premisses P hold it is
    the case that the statement holds as well
  • Formally Logical Entailment
  • For every interpretation S which is a model of P
    it holds that S is a model of S as well
  • Special case Validity Set of premisses is
    empty
  • Logical entailment in a logic L is the (semantic)
    relation that a calculus C aims at formalizing
    syntactically (by means of a derivability
    relation)!
  • Logical entailment considers semantics
    (Interpretations) relative to a set of premisses
    or axioms!

16
Normal Forms
  • What is a normal form?
  • Why are they interesting?
  • Relation to ATP?
  • Conversion of input to a specifc NF my be
    required by a calculus (e.g. Resolution) )
    Preprocessing step
  • ATP in a sense can be seen as a conversion in a
    NF itself, borderline is fuzzy in a sense
  • Normalforms in FOL
  • Negation Normal Form
  • Standard Form
  • Prenex Normal Form
  • Clause Normal Form (in a sense a logic free
    form)
  • There are logics where certain NF do not exist,
    like CNF in a Dynamic First-order Logic
  • Certain calculi then can not be applied in these
    logics!

17
Negation Normal Form
  • A formula is in Negation NF (NNF) iff. it
    contains no implication and no bi-implication
    symbols and all negation symbols occur only as
    part of a literal (directly in front of atomic
    formulae)
  • How to achieve this NF ?
  • Replace implication and bi-implication by their
    definition (in terms of Æ and Ç)
  • Move negation symbols inside to atomic formulae
  • De Morgan laws
  • Dualize quantifiers when moving negation symbols
    over a quantor
  • Eliminate multiple negations
  • All these syntactical transformations generate
    semantically equivalent formulae
  • Example

18
Standard Form
  • A formula A is in Standard Form if no variable x
    in A occurs both bound and free and no bound
    variable is used as a quantor variable for
    multiple subformulae
  • How to generate this NF?
  • Bounded renaming of quantor variables and the
    respective occurrences
  • Transformed formulae is semantically equivalent
    to original one
  • Example
  • (8 x P(x) Æ Q(z)) ! (9 x R(x) Ç 9 z (P(z) Æ
    Q(z)))

19
Prenex Normal Form
  • A formula A is in Prenex NF iff. it is of the
    form A Q1x1 Qnxn B where Qk is a universal
    or existential quantor and B contains no
    quantors. B is called the Matrix of A
  • How to construct this NF?
  • Transform A in NNF and Standard Form
  • Move iteratively outermost quantor to the outside
    until it reaches another quantor. Quantors may
    not cross quantors of different sort (in-scope
    relation between quantor occurrences may not be
    changed)
  • This transformation generates a formulae which is
    logically equivalent to the original one.
  • Example

20
Clause Normal Form
  • A formula A is in Clause NF iff. it is in PNF,
    closed, the prefix only contains universal
    quantors and the Matrix is on conjunctive normal
    form.
  • In other words A 8 x1 8 xn ( (L1,1 Ç Ç
    L1,m1) Æ Æ (Lk,1 Ç Ç Lk,mk)) where Li,j is a
    literal (negated or positive atomic formula)
  • How to construct this NF?
  • Transform A in NNF and Standard Form
  • Transform result in PNF
  • Remove existential quantors by Skolemization
    (Function terms)
  • Apply Distributivity laws to convert Matrix of
    the result in conjuntive normal form (conjunction
    of discjunction of literals)
  • This transformation results in a formula which is
    not logically equivalent, but it is
    satisfiability-preserving (which is enough for
    the ATP methods later)
  • Example

21
Clause Normal Form (II)
  • A formula A is in Clause NF can be written as A
    8 x1 8 xn ( (L1,1 Ç Ç L1,m1) Æ Æ (Lk,1 Ç
    Ç Lk,mk)) where Li,j is a literal (negated or
    positive atomic formula)
  • Since every formula can be transformed into CNF,
    the CNF can be seen as logic free
    representation of a formulae
  • All quantors are universal, no free variables are
    allowed -gt drop quantors
  • Matrix is in CNF Conjunction of Disjunction of
    Literals -gt Model as a Set of Sets of Literals
  • Example
  • The sketched transformation to CNF is not optimal
  • Exponential blowup possible (already for NNF)
  • Syntactical structure of the original formula
    gets lost
  • Skolemsymbols have unnecessarily many parameters
  • Unnecessarily many new skolem systems are
    introduced
  • One can improve all these aspects of a
    transformation to CNF!
  • Skolemization before PNF transformation,
    Definitorial CNF for Matrix, Reuse of Skolem
    functions

22
Metaresults
  • Metaresult Property of a Logic L
  • Here some metaresults for FOL which form the
    theoretical foundation of ATP (carry over to many
    other logics as well)
  • Deduction Theorem
  • If M s ² s then M ² s ! s
  • Logical entailment can be reduced to validity
  • Proof by contradiction
  • If M is a set of closed formulae thenM ² s iff.
    M s is unsatisfiable (i.e. has no model)
  • Logical entailment can be reduced to
    unsatisfiability checking
  • Refutation can be used as a universal principle
    for inference in FOL

23
Metaresults (II)
  • Complexity of logical entailment, validity and
    satisfiability
  • Propositional Logic
  • Logical entailment (²-relation) is decidable,
    Satisfiability too
  • Set of valid formulae is co-NP-complete
  • Set of satisfiable formulae is NP-complete
  • First-order Predicate Logic
  • Logical entailment / validity / satisfiability is
    undecidable
  • Set of valid formulae is semi-decidable
    (recursively enumerable)
  • Set of satisfiable formulae is not recursively
    enumerable

24
Metaresults (III)
  • Term Interpretations and Herbrand Theorem
  • S (U,I) is term-interpretation if U Term0?
  • Let Term0? be non-empty. An interpretation S
    (U,I) is called Herbrand-Interpretation if
  • S is term-interpretation and
  • I(f)(t1,,tn) f(t1,,tn) for all n-ary function
    symbols f 2 ? and ground terms t1,,tn
  • Herbrand-Modell of s is Herbrand-Intp. I with I ²
    s
  • Herbrand-Interpretations are special because they
    have a simple universe (syntactical) and Terms
    are basically uninterpreted. Quantifiers then
    have ground terms as their range!
  • Computers can deal with such special
    (syntactical) interpretations, but not with
    interpretations in general!

25
Metaresults (IV)
  • Term Interpretations and Herbrand Theorem
  • Let M be a set of closed formulae s in
    Prenex-Normalform that contain no existential
    quantors (for instance s in CNF)
  • Let T be a set of terms (over signature ?)
  • T(M) set of T-instances of M, i.e. replace
    every occurence of a (universal) variable in any
    formulae in M with any term in T
  • Herbrand Theorem
  • Let Term0? be non-empty and M a set of formulae
    in Prenex-NF without existential quantors.
  • Then the following statements are equivalent
  • M has a model
  • M has a Herbrand-model
  • Term0?(M) has a model
  • The last set is a set of formulae in
    propositional logic

26
Metaresults (V)
  • Compactness of FOL
  • A (possibly infinite) set M of formulae has a
    model iff every finite subset M ½ M has a model
    (i.e. is satisfiable)
  • Combining Compactness with Herbrands Theorem
  • Let Term0? be non-empty and M a set of formulae
    in Prenex-NF without existential quantors.
  • Then M is unsatisfiable iff. T(M) is
    unsatisfiable for a finite set of ground terms T
    ½ Term0?
  • Note that T is a finite set of ground terms over
    the signature ? of the formula set M
  • No external functions symbols have to be
    considered!
  • Allows for using guided substitutions
    (Unification!)

27
Metaresults (VI)
  • That means logical entailment / validity can be
    checked
  • by reduction to unsatisfiabiliy of a set of
    formulae M
  • which can done by finding suitable finite
    (counter)-examples for the quantfied variables
    such that a contradiction arises
  • One can only use the Signature ? of the given set
    M to find the counterexamples
  • Basically this is what all ATP procedures do
    Find a finite set of counterexamples (objects)
    such that a the instance of the orginial set is
    determined as being
  • The theorem immediately gives an algorithm for
    ATP!
  • Problem How to construct / find T in the theorem
    in a clever way?

28
Part IIIThe Resolution Calculus
  • Pre-resolution phase
  • Gilmores Methods, Davis-Putnam Procedure
  • Unification
  • Basic Resolution Calculus
  • Refinements, Redundancy
  • Decision procedures

29
Pre-Resolution period Gilmores Method
30
Pre-Resolution periodDavis-Putnam Procedure
31
A Revolution in ATP Robinsons Resolution
Principle
32
Part IVChain Resolution
  • A Variant of Resolution for the Semantic Web

33
Part IVDemo
  • assisted by a Resolution-based ATP System
    VAMPIRE
Write a Comment
User Comments (0)
About PowerShow.com