Using FOL to Describe the Properties of Systems

View

Semantics

An interpretation of a first-order language assigns a denotation to each non-logical symbol (predicate symbol, function symbol, or constant symbol) in that language. It also determines a domain of discourse that specifies the range of the quantifiers. The result is that each term is assigned an object that it represents, each predicate is assigned a property of objects, and each sentence is assigned a truth value. In this way, an interpretation provides semantic meaning to the terms, predicates, and formulas of the language. The study of the interpretations of formal languages is called formal semantics. What follows is a description of the standard or Tarskian semantics for first-order logic. (It is also possible to define game semantics for first-order logic, but aside from requiring the axiom of choice, game semantics agree with Tarskian semantics for first-order logic, so game semantics will not be elaborated herein).


First-order structures

The most common way of specifying an interpretation (especially in mathematics) is to specify a structure (also called a model; see below). The structure consists of a domain of discourse D and an interpretation function I mapping non-logical symbols to predicates, functions, and constants.

The domain of discourse D is a nonempty set of "objects" of some kind. Intuitively, given an interpretation, a first-order formula becomes a statement about these objects; for example, \exists xP(x) states the existence of some object in D for which the predicate P is true (or, more precisely, for which the predicate assigned to the predicate symbol P by the interpretation is true). For example, one can take D to be the set of integers.

Non-logical symbols are interpreted as follows:

  • The interpretation of an n-ary function symbol is a function from Dn to D. For example, if the domain of discourse is the set of integers, a function symbol f of arity 2 can be interpreted as the function that gives the sum of its arguments. In other words, the symbol f is associated with the function I(f) which, in this interpretation, is addition.
  • The interpretation of a constant symbol (a function symbol of arity 0) is a function from D0 (a set whose only member is the empty tuple) to D, which can be simply identified with an object in D. For example, an interpretation may assign the value I(c)=10 to the constant symbol c.
  • The interpretation of an n-ary predicate symbol is a set of n-tuples of elements of D, giving the arguments for which the predicate is true. For example, an interpretation I(P) of a binary predicate symbol P may be the set of pairs of integers such that the first one is less than the second. According to this interpretation, the predicate P would be true if its first argument is less than its second argument. Equivalently, predicate symbols may be assigned Boolean-valued functions from Dn to \{\mathrm {true,false} \}.


Evaluation of truth values

A formula evaluates to true or false given an interpretation and a variable assignment μ that associates an element of the domain of discourse with each variable. The reason that a variable assignment is required is to give meanings to formulas with free variables, such as y=x. The truth value of this formula changes depending on the values that x and y denote.

First, the variable assignment μ can be extended to all terms of the language, with the result that each term maps to a single element of the domain of discourse. The following rules are used to make this assignment:

  • Variables. Each variable x evaluates to μ(x)
  • Functions. Given terms t_{1},\ldots ,t_{n} that have been evaluated to elements d_{1},\ldots ,d_{n} of the domain of discourse, and a n-ary function symbol f, the term f(t_{1},\ldots ,t_{n}) evaluates to(I(f))(d_{1},\ldots ,d_{n}).

Next, each formula is assigned a truth value. The inductive definition used to make this assignment is called the T-schema.

  • Atomic formulas (1). A formula P(t_{1},\ldots ,t_{n}) is associated the value true or false depending on whether \langle v_{1},\ldots ,v_{n}\rangle \in I(P), where v_{1},\ldots ,v_{n} are the evaluation of the terms t_{1},\ldots ,t_{n} and I(P) is the interpretation of P, which by assumption is a subset of D^{n}.
  • Atomic formulas (2). A formula t_{1}=t_{2} is assigned true if t_{1} and t_{2} evaluate to the same object of the domain of discourse (see the section on equality below).
  • Logical connectives. A formula in the form \neg \varphi , \varphi \rightarrow \psi , etc. is evaluated according to the truth table for the connective in question, as in propositional logic.
  • Existential quantifiers. A formula \exists x\varphi (x) is true according to M and \mu if there exists an evaluation \mu ' of the variables that differs from \mu at most regarding the evaluation of x and such that φ is true according to the interpretation M and the variable assignment \mu '. This formal definition captures the idea that \exists x\varphi (x) is true if and only if there is a way to choose a value for x such that φ(x) is satisfied.
  • Universal quantifiers. A formula \forall x\varphi (x) is true according to M and \mu if φ(x) is true for every pair composed by the interpretation M and some variable assignment \mu ' that differs from \mu at most on the value of x. This captures the idea that \forall x\varphi (x) is true if every possible choice of a value for x causes φ(x) to be true.

If a formula does not contain free variables, and so is a sentence, then the initial variable assignment does not affect its truth value. In other words, a sentence is true according to M and \mu if and only if it is true according to M and every other variable assignment \mu '.

There is a second common approach to defining truth values that does not rely on variable assignment functions. Instead, given an interpretation M, one first adds to the signature a collection of constant symbols, one for each element of the domain of discourse in M; say that for each d in the domain the constant symbol cd is fixed. The interpretation is extended so that each new constant symbol is assigned to its corresponding element of the domain. One now defines truth for quantified formulas syntactically, as follows:

  • Existential quantifiers (alternate). A formula \exists x\varphi (x) is true according to M if there is some d in the domain of discourse such that \varphi (c_{d}) holds. Here \varphi (c_{d}) is the result of substituting cd for every free occurrence of x in φ.
  • Universal quantifiers (alternate). A formula \forall x\varphi (x) is true according to M if, for every d in the domain of discourse, \varphi (c_{d}) is true according to M.

This alternate approach gives exactly the same truth values to all sentences as the approach via variable assignments.


Validity, satisfiability, and logical consequence

If a sentence φ evaluates to true under a given interpretation M, one says that M satisfies φ; this is denoted M\vDash \varphi . A sentence is satisfiable if there is some interpretation under which it is true. This is a bit different from the symbol \vDash from model theory, where M\vDash \phi denotes satisfiability in a model, i.e. "there is a suitable assignment of values in M's domain to variable symbols of \phi ".

Satisfiability of formulas with free variables is more complicated, because an interpretation on its own does not determine the truth value of such a formula. The most common convention is that a formula φ with free variables x_{1}, ..., x_{n} is said to be satisfied by an interpretation if the formula φ remains true regardless which individuals from the domain of discourse are assigned to its free variables x_{1}, ..., x_{n}. This has the same effect as saying that a formula φ is satisfied if and only if its universal closure \forall x_{1}\dots \forall x_{n}\phi (x_{1},\dots ,x_{n}) is satisfied.

A formula is logically valid (or simply valid) if it is true in every interpretation. These formulas play a role similar to tautologies in propositional logic.

A formula φ is a logical consequence of a formula ψ if every interpretation that makes ψ true also makes φ true. In this case one says that φ is logically implied by ψ.


Algebraizations

An alternate approach to the semantics of first-order logic proceeds via abstract algebra. This approach generalizes the Lindenbaum–Tarski algebras of propositional logic. There are three ways of eliminating quantified variables from first-order logic that do not involve replacing quantifiers with other variable binding term operators:

  • Cylindric algebra, by Alfred Tarski and colleagues;
  • Polyadic algebra, by Paul Halmos;
  • Predicate functor logic, mainly due to Willard Quine.

These algebras are all lattices that properly extend the two-element Boolean algebra.

Tarski and Givant (1987) showed that the fragment of first-order logic that has no atomic sentence lying in the scope of more than three quantifiers has the same expressive power as relation algebra.  This fragment is of great interest because it suffices for Peano arithmetic and most axiomatic set theory, including the canonical ZFC. They also prove that first-order logic with a primitive ordered pair is equivalent to a relation algebra with two ordered pair projection functions.


First-order theories, models, and elementary classes

A first-order theory of a particular signature is a set of axioms, which are sentences consisting of symbols from that signature. The set of axioms is often finite or recursively enumerable, in which case the theory is called effective. Some authors require theories to also include all logical consequences of the axioms. The axioms are considered to hold within the theory and from them other sentences that hold within the theory can be derived.

A first-order structure that satisfies all sentences in a given theory is said to be a model of the theory. An elementary class is the set of all structures satisfying a particular theory. These classes are a main subject of study in model theory.

Many theories have an intended interpretation, a certain model that is kept in mind when studying the theory. For example, the intended interpretation of Peano arithmetic consists of the usual natural numbers with their usual operations. However, the Löwenheim–Skolem theorem shows that most first-order theories will also have other, nonstandard models.

A theory is consistent if it is not possible to prove a contradiction from the axioms of the theory. A theory is complete if, for every formula in its signature, either that formula or its negation is a logical consequence of the axioms of the theory. Gödel's incompleteness theorem shows that effective first-order theories that include a sufficient portion of the theory of the natural numbers can never be both consistent and complete.


Empty domains

The definition above requires that the domain of discourse of any interpretation must be nonempty. There are settings, such as inclusive logic, where empty domains are permitted. Moreover, if a class of algebraic structures includes an empty structure (for example, there is an empty poset), that class can only be an elementary class in first-order logic if empty domains are permitted or the empty structure is removed from the class.

There are several difficulties with empty domains, however:

  • common rules of inference are valid only when the domain of discourse is required to be nonempty. One example is the rule stating that \varphi \lor \exists x\psi implies \exists x(\varphi \lor \psi ) when x is not a free variable in \varphi. This rule, which is used to put formulas into prenex normal form, is sound in nonempty domains, but unsound if the empty domain is permitted.
  • The definition of truth in an interpretation that uses a variable assignment function cannot work with empty domains, because there are no variable assignment functions whose range is empty. (Similarly, one cannot assign interpretations to constant symbols). This truth definition requires that one must select a variable assignment function (μ above) before truth values for even atomic formulas can be defined. Then the truth value of a sentence is defined to be its truth value under any variable assignment, and it is proved that this truth value does not depend on which assignment is chosen. This technique does not work if there are no assignment functions at all; it must be changed to accommodate empty domains.

Thus, when the empty domain is permitted, it must often be treated as a special case. Most authors, however, simply exclude the empty domain by definition.