Two Views on Building Conceptual Hierarchies Rainer Osswald - - PowerPoint PPT Presentation
Two Views on Building Conceptual Hierarchies Rainer Osswald - - PowerPoint PPT Presentation
Two Views on Building Conceptual Hierarchies Rainer Osswald Intelligente Informations- and Kommunikationssysteme Fachbereich Informatik FernUniversit at in Hagen http://pi7.fernuni-hagen.de/osswald Building Conceptual Lattices Version
Building Conceptual Lattices – Version I
Formal Concept Analysis (FCA) [Ganter & Wille 1999] ◮ Formal context K = U,Σ,K U set of objects, Σ set of attributes, K ⊆ U ×Σ satisfaction relation V ⊲ = {p ∈ Σ| x K p for all x ∈ V} intent of V ⊆ U X⊳ = {x ∈ U | x K p for all p ∈ X} extent of X ⊆ Σ ◮ Formal concept: V,X with V ⊲ = X, X⊳ = V ◮ Concept lattice: ordered set of formal concepts V,X ≤ W,Y iff V ⊆ W, X ⊇ Y ◮ Example
human biped featherless John x x x Mary x x x Fido x Alex x
{J,M,F,A},∅ {J,M,A},{b} {J,M,F},{f} {J,M},{h, f,b}
Complete implicational theory of context: human → biped ∧ featherless, biped ∧ featherless → human
◮ Basic Theorem of FCA Every concept lattice is a complete lattice and every complete lattice is representable as the concept lattice of a formal context.
– 1 – Rainer Osswald
Building Conceptual Lattices – Version IIa
ONTOLOG [Oldager 2000, Fischer-Nilsson 2001] ◮ Logico-algebraic language for specifying taxonomies/ontologies p ∈ Σ (set of basic concepts), null, univ, φ +ψ (conceptual sum), φ ×ψ (conceptual product), φ ≤ ψ (subsumption), φ = ψ (equality) ◮ Distributive lattice of concepts via equational specification Example human = biped×featherless ◮ Attributive descriptions (not taken into account here) r : φ (Peirce Product) ◮ Relation to FCA?
univ featherless + biped featherless biped human null
“In some sense this approach can be compared to Formal Concept Analysis, since both are mathematical methods for generating concept lattices. However, our approach is fundamentally different, because ONTOLOG is a language for specifying taxonomies while the Formal Concept Analysis is a mathematical theory for analysis of conceptual data (the relations between objects and their attributes).” [Oldager 2000]
– 2 – Rainer Osswald
Building Conceptual Lattices – Version IIb
Oles (2000): An Application of Lattice Theory to Knowledge Representation ◮ Knowledge base (KB): set of terminological axioms newconcept I (I primitive identifier), subconcept C1C2 (Ci built by ∧, ∨ from primitives, ⊤, ⊥) ◮ Example
newconcept human ; newconcept biped ; newconcept featherless ; subconcept human (biped∧featherless) ; subconcept (biped∧featherless) human ;
◮ Lattice of semantic concepts:
⊤ b∨ f b f h ⊥
distributive lattice presented by generators and relations given by KB “Birkhoff implementation” by ordered set of ∨-irreducibles ◮ Relation to FCA?
“Right from the start, let us say what this paper is not about. It is not about the formal concept analysis of Wille, which deals with the derivation, from tables of attributes for individuals, of hierarchies of what are also called concepts. Wille’s hierarchies are lattices, but they are not necessarily distributive. There are no clear connections between Wille’s work and the contents
- f this paper.” [Oles 2000]
– 3 – Rainer Osswald
Overview
◮ A Simple Propositional Framework ◮ Theories and Information Domains ◮ Formal Concept Analysis Reanalyzed ◮ The Lindenbaum Algebra of a Theory ◮ The Predicational Viewpoint ◮ Perspectives
– 4 – Rainer Osswald
A Simple Propositional Framework
Working hypothesis: concepts are subsets of a set Σ of attributes ◮ Regard p ∈ Σ as atomic proposition (propositional variable) ◮ Boolean formulas B[Σ] over Σ: inductively built by ∧, ∨, ¬, ⊤, ⊥ φ → ψ for ¬φ ∨ψ, φ ↔ ψ for φ → ψ ∧ ψ → φ Positive/affirmative formula: no ¬ Implication: P → Q, P,Q ⊆ Σ finite (∅ = ⊤, ∅ = ⊥) Conditional (biconditional) form: φ → ψ (φ ↔ ψ), φ,ψ positive Conditional normal form: P → Q ◮ Theory over Σ: set of Boolean formulas over Σ ◮ Interpretation: function m : Σ → {0,1}, canonical extension to B[Σ] equivalent: subset X ⊆ Σ, via characteristic function χX X satisfies φ iff χX(φ) = 1, notation: X φ (X p iff p ∈ X, X ¬φ iff X φ, . . . ) ◮ X ⊆ Σ model of theory Γ iff X φ for all φ ∈ Γ
– 5 – Rainer Osswald
Theories and Information Domains (I)
◮ Information domain C(Γ) of theory Γ: models of Γ ordered by set inclusion C(Γ) = {X ⊆ Σ| X φ for all φ ∈ Γ} ◮ Examples Σ = {h,b, f} Γ = {h ↔ b∧ f}
∅ {b} { f} {h, f,b}
Σ = {a,b,c,d} Γ = {⊤ → a∨b, c∧d → ⊥, a∧b ↔ c∨d}
{a,b,c} {a,b,d} {a} {b}
Σ = {a1,a2,...} Γ =
n≥1{an → a1 ∨an+1, an+1 → an}
Σ X2 X1 X0 Y Xn = {am | m ≤ n}, Y = {an | n > 1}
– 6 – Rainer Osswald
Theories and Information Domains (II)
Classification of information domains
Class of formulas of Γ Closure properties of C(Γ) C(Γ) as ordered set
P → Q
local membership
P → q, P → ⊥
(Horn) nonempty intersection, directed union Scott domain
P → q
(implication) intersection, directed union complete algebraic lattice p → q (simple implication) intersection, union complete coprime-algebraic lattice
P → ⊥
(contradiction) subsets, finitely bounded union bounded-complete atomic dcpo with completely coprime atoms
– 7 – Rainer Osswald
Theories and Information Domains (III)
◮ Canonical C -theory ThC (U ) associated with subset system U ⊆℘(Σ) ThC (U ) = {φ ∈ C | X φ for all X ∈ U } C ∈ {Boolean formulas, implications, ...} ◮ ThC ,C is Galois connection between℘(℘(Σ)) and℘(C ) induced by Consequence: C ◦ThC is closure operator on ℘(Σ) ◮ Γ is complete C -theory of U iff Γ ⊆ ThC (U ) and Γ ⊢ ThC (U ) ◮ Example
U
{a,b,c,d} {a,b,e} {a,c,d} {a,c} {a} {b}
C(Thimpl(U ))
{a,b,c,d,e} {a,b,c,d} {a,b,e} {a,c,d} {a,c} {a,b} {a} {b} ∅
Complete implicational theory of U : d → c, c → a, e → a∧b, b∧c → d
– 8 – Rainer Osswald
Formal Concept Analysis Reanalyzed
◮ Formal context K = U,Σ,K V ⊲ = {p ∈ Σ| x K p for all x ∈ V} = {x⊲ | x ∈ V} (with x⊲ for {x}⊲) ◮ Complete implicational theory of K: complete implicational theory of object intents UK = {x⊲ | x ∈ U} ◮ Theorem For finite attribute sets, there is an order-reversing one-to-one corre- spondence between the concept lattice of a formal context and the informa- tion domain of any complete implicational theory of that context. ◮ Example
K a b c d e x1 x x x x2 x x3 x x4 x x x x x5 x x x6 x x x x x7 x x x UK x2 {a} x3 {b} x5 {a,c} x7 {a,c,d} x4,x6 {a,b,c,d} x1 {a,b,e}
– 9 – Rainer Osswald
The Lindenbaum Algebra of a Theory
◮ Lindenbaum algebra L(Γ) of theory Γ L(Γ) = A[Σ]/≃Γ (with φ ≃Γ ψ iff Γ ⊢ φ ↔ ψ) L(Γ) is distributive lattice with zero and unit ◮ Theorem (Variant of Birkhoff’s Representation Theorem) Suppose Σ is finite. (i) There is an order-reversing one-to-one correspondence between C(Γ) and the ordered set of ∨-irreducible elements of L(Γ). (ii) There is a one-to-one correspondence between L(Γ) and the antichains in C(Γ), where an antichain S corresponds to the equivalence class of
{X | X ∈ S }.
◮ Example
C(Γ)
{a,b,c,d} {a,b,e} {a,c,d} {a,c} {a} {b}
L(Γ)
[a∨b] [b∨c] [a] [b∨d] [c∨e] [d ∨e] [c] [b] [a∧b] [d] [b∧d] [e] [⊥]
Γ = {d → c, c → a, e → a∧b, b∧c → d c∧e → ⊥, a∧b → c∨e, ⊤ → a∨b}
– 10 – Rainer Osswald
The Predicational Viewpoint
◮ Natural formulation within monadic predicate logic attributes are monadic predicates statements are universally quantified (e.g. ∀x(human x → biped x)) ◮ Formalization Σ set of (atomic) monadic predicates Boolean predicates by ∧, ∨, ¬ via abstraction (i.e. (φ ∧ψ)x is φx∧ψx, etc) theory = set of universal statements ∀φ (for ∀x(φx)) ◮ Interpretation of Σ: universe U, function M : Σ →℘(U) Model of theory Γ: interpretation of Σ such that Γ is true ◮ Information domain of theory Γ: C(Γ) = {X ⊆ Σ| X φ for all ∀φ ∈ Γ} Canonical model of Γ with universe C(Γ): p ∈ Σ → {X ∈ C(Γ)| p ∈ X} ◮ Theorem The canonical model of Γ is universal in the sense that it embeds every other model of Γ satisfying identity of indiscernibles.
– 11 – Rainer Osswald
Summary
◮ Take Home Message I The lattices of FCA are the information domains of the complete implicational theories of the formal contexts (with finite attribute set). Their “latticehood” is due to the restriction to implicational theories. ◮ Take Home Message II The information domain of a theory (of universally quantified Boolean predicates) is the (ordered) universe of a universal model of that theory. In terms of propositional logic, the information domain is the set of all models of the theory. ◮ Take Home Message III The Lindenbaum algebra (of affirmative terms) of a theory is a distributive lattice with zero and unit. The Lindenbaum algebra and the information domain of a theory are dual to each other in the sense of Birkhoff’s Representation Theorem.
– 12 – Rainer Osswald
Perspectives
◮ Infinite set of atomic propositions/predicates Lindenbaum algebra not uniquely determined by information domain ◮ Attribute-value theories Attributive descriptions π : φ (for λx∃y(πxy∧φy)) ◮ Default statements Default extensions as elements of information domains ◮ Categorical issues Category of theories, functors C and L
– 13 – Rainer Osswald