Two Views on Building Conceptual Hierarchies Rainer Osswald - - PowerPoint PPT Presentation

two views on building conceptual hierarchies
SMART_READER_LITE
LIVE PREVIEW

Two Views on Building Conceptual Hierarchies Rainer Osswald - - PowerPoint PPT Presentation

Two Views on Building Conceptual Hierarchies Rainer Osswald Intelligente Informations- and Kommunikationssysteme Fachbereich Informatik FernUniversit at in Hagen http://pi7.fernuni-hagen.de/osswald Building Conceptual Lattices Version


slide-1
SLIDE 1

Two Views on Building Conceptual Hierarchies

Rainer Osswald Intelligente Informations- and Kommunikationssysteme Fachbereich Informatik FernUniversit¨ at in Hagen http://pi7.fernuni-hagen.de/osswald

slide-2
SLIDE 2

Building Conceptual Lattices – Version I

Formal Concept Analysis (FCA) [Ganter & Wille 1999] ◮ Formal context K = U,Σ,K U set of objects, Σ set of attributes, K ⊆ U ×Σ satisfaction relation V ⊲ = {p ∈ Σ| x K p for all x ∈ V} intent of V ⊆ U X⊳ = {x ∈ U | x K p for all p ∈ X} extent of X ⊆ Σ ◮ Formal concept: V,X with V ⊲ = X, X⊳ = V ◮ Concept lattice: ordered set of formal concepts V,X ≤ W,Y iff V ⊆ W, X ⊇ Y ◮ Example

human biped featherless John x x x Mary x x x Fido x Alex x

{J,M,F,A},∅ {J,M,A},{b} {J,M,F},{f} {J,M},{h, f,b}

Complete implicational theory of context: human → biped ∧ featherless, biped ∧ featherless → human

◮ Basic Theorem of FCA Every concept lattice is a complete lattice and every complete lattice is representable as the concept lattice of a formal context.

– 1 – Rainer Osswald

slide-3
SLIDE 3

Building Conceptual Lattices – Version IIa

ONTOLOG [Oldager 2000, Fischer-Nilsson 2001] ◮ Logico-algebraic language for specifying taxonomies/ontologies p ∈ Σ (set of basic concepts), null, univ, φ +ψ (conceptual sum), φ ×ψ (conceptual product), φ ≤ ψ (subsumption), φ = ψ (equality) ◮ Distributive lattice of concepts via equational specification Example human = biped×featherless ◮ Attributive descriptions (not taken into account here) r : φ (Peirce Product) ◮ Relation to FCA?

univ featherless + biped featherless biped human null

“In some sense this approach can be compared to Formal Concept Analysis, since both are mathematical methods for generating concept lattices. However, our approach is fundamentally different, because ONTOLOG is a language for specifying taxonomies while the Formal Concept Analysis is a mathematical theory for analysis of conceptual data (the relations between objects and their attributes).” [Oldager 2000]

– 2 – Rainer Osswald

slide-4
SLIDE 4

Building Conceptual Lattices – Version IIb

Oles (2000): An Application of Lattice Theory to Knowledge Representation ◮ Knowledge base (KB): set of terminological axioms newconcept I (I primitive identifier), subconcept C1C2 (Ci built by ∧, ∨ from primitives, ⊤, ⊥) ◮ Example

newconcept human ; newconcept biped ; newconcept featherless ; subconcept human (biped∧featherless) ; subconcept (biped∧featherless) human ;

◮ Lattice of semantic concepts:

⊤ b∨ f b f h ⊥

distributive lattice presented by generators and relations given by KB “Birkhoff implementation” by ordered set of ∨-irreducibles ◮ Relation to FCA?

“Right from the start, let us say what this paper is not about. It is not about the formal concept analysis of Wille, which deals with the derivation, from tables of attributes for individuals, of hierarchies of what are also called concepts. Wille’s hierarchies are lattices, but they are not necessarily distributive. There are no clear connections between Wille’s work and the contents

  • f this paper.” [Oles 2000]

– 3 – Rainer Osswald

slide-5
SLIDE 5

Overview

◮ A Simple Propositional Framework ◮ Theories and Information Domains ◮ Formal Concept Analysis Reanalyzed ◮ The Lindenbaum Algebra of a Theory ◮ The Predicational Viewpoint ◮ Perspectives

– 4 – Rainer Osswald

slide-6
SLIDE 6

A Simple Propositional Framework

Working hypothesis: concepts are subsets of a set Σ of attributes ◮ Regard p ∈ Σ as atomic proposition (propositional variable) ◮ Boolean formulas B[Σ] over Σ: inductively built by ∧, ∨, ¬, ⊤, ⊥ φ → ψ for ¬φ ∨ψ, φ ↔ ψ for φ → ψ ∧ ψ → φ Positive/affirmative formula: no ¬ Implication: P → Q, P,Q ⊆ Σ finite (∅ = ⊤, ∅ = ⊥) Conditional (biconditional) form: φ → ψ (φ ↔ ψ), φ,ψ positive Conditional normal form: P → Q ◮ Theory over Σ: set of Boolean formulas over Σ ◮ Interpretation: function m : Σ → {0,1}, canonical extension to B[Σ] equivalent: subset X ⊆ Σ, via characteristic function χX X satisfies φ iff χX(φ) = 1, notation: X φ (X p iff p ∈ X, X ¬φ iff X φ, . . . ) ◮ X ⊆ Σ model of theory Γ iff X φ for all φ ∈ Γ

– 5 – Rainer Osswald

slide-7
SLIDE 7

Theories and Information Domains (I)

◮ Information domain C(Γ) of theory Γ: models of Γ ordered by set inclusion C(Γ) = {X ⊆ Σ| X φ for all φ ∈ Γ} ◮ Examples Σ = {h,b, f} Γ = {h ↔ b∧ f}

∅ {b} { f} {h, f,b}

Σ = {a,b,c,d} Γ = {⊤ → a∨b, c∧d → ⊥, a∧b ↔ c∨d}

{a,b,c} {a,b,d} {a} {b}

Σ = {a1,a2,...} Γ =

n≥1{an → a1 ∨an+1, an+1 → an}

Σ X2 X1 X0 Y Xn = {am | m ≤ n}, Y = {an | n > 1}

– 6 – Rainer Osswald

slide-8
SLIDE 8

Theories and Information Domains (II)

Classification of information domains

Class of formulas of Γ Closure properties of C(Γ) C(Γ) as ordered set

P → Q

local membership

P → q, P → ⊥

(Horn) nonempty intersection, directed union Scott domain

P → q

(implication) intersection, directed union complete algebraic lattice p → q (simple implication) intersection, union complete coprime-algebraic lattice

P → ⊥

(contradiction) subsets, finitely bounded union bounded-complete atomic dcpo with completely coprime atoms

– 7 – Rainer Osswald

slide-9
SLIDE 9

Theories and Information Domains (III)

◮ Canonical C -theory ThC (U ) associated with subset system U ⊆℘(Σ) ThC (U ) = {φ ∈ C | X φ for all X ∈ U } C ∈ {Boolean formulas, implications, ...} ◮ ThC ,C is Galois connection between℘(℘(Σ)) and℘(C ) induced by Consequence: C ◦ThC is closure operator on ℘(Σ) ◮ Γ is complete C -theory of U iff Γ ⊆ ThC (U ) and Γ ⊢ ThC (U ) ◮ Example

U

{a,b,c,d} {a,b,e} {a,c,d} {a,c} {a} {b}

C(Thimpl(U ))

{a,b,c,d,e} {a,b,c,d} {a,b,e} {a,c,d} {a,c} {a,b} {a} {b} ∅

Complete implicational theory of U : d → c, c → a, e → a∧b, b∧c → d

– 8 – Rainer Osswald

slide-10
SLIDE 10

Formal Concept Analysis Reanalyzed

◮ Formal context K = U,Σ,K V ⊲ = {p ∈ Σ| x K p for all x ∈ V} = {x⊲ | x ∈ V} (with x⊲ for {x}⊲) ◮ Complete implicational theory of K: complete implicational theory of object intents UK = {x⊲ | x ∈ U} ◮ Theorem For finite attribute sets, there is an order-reversing one-to-one corre- spondence between the concept lattice of a formal context and the informa- tion domain of any complete implicational theory of that context. ◮ Example

K a b c d e x1 x x x x2 x x3 x x4 x x x x x5 x x x6 x x x x x7 x x x UK x2 {a} x3 {b} x5 {a,c} x7 {a,c,d} x4,x6 {a,b,c,d} x1 {a,b,e}

– 9 – Rainer Osswald

slide-11
SLIDE 11

The Lindenbaum Algebra of a Theory

◮ Lindenbaum algebra L(Γ) of theory Γ L(Γ) = A[Σ]/≃Γ (with φ ≃Γ ψ iff Γ ⊢ φ ↔ ψ) L(Γ) is distributive lattice with zero and unit ◮ Theorem (Variant of Birkhoff’s Representation Theorem) Suppose Σ is finite. (i) There is an order-reversing one-to-one correspondence between C(Γ) and the ordered set of ∨-irreducible elements of L(Γ). (ii) There is a one-to-one correspondence between L(Γ) and the antichains in C(Γ), where an antichain S corresponds to the equivalence class of

{X | X ∈ S }.

◮ Example

C(Γ)

{a,b,c,d} {a,b,e} {a,c,d} {a,c} {a} {b}

L(Γ)

[a∨b] [b∨c] [a] [b∨d] [c∨e] [d ∨e] [c] [b] [a∧b] [d] [b∧d] [e] [⊥]

Γ = {d → c, c → a, e → a∧b, b∧c → d c∧e → ⊥, a∧b → c∨e, ⊤ → a∨b}

– 10 – Rainer Osswald

slide-12
SLIDE 12

The Predicational Viewpoint

◮ Natural formulation within monadic predicate logic attributes are monadic predicates statements are universally quantified (e.g. ∀x(human x → biped x)) ◮ Formalization Σ set of (atomic) monadic predicates Boolean predicates by ∧, ∨, ¬ via abstraction (i.e. (φ ∧ψ)x is φx∧ψx, etc) theory = set of universal statements ∀φ (for ∀x(φx)) ◮ Interpretation of Σ: universe U, function M : Σ →℘(U) Model of theory Γ: interpretation of Σ such that Γ is true ◮ Information domain of theory Γ: C(Γ) = {X ⊆ Σ| X φ for all ∀φ ∈ Γ} Canonical model of Γ with universe C(Γ): p ∈ Σ → {X ∈ C(Γ)| p ∈ X} ◮ Theorem The canonical model of Γ is universal in the sense that it embeds every other model of Γ satisfying identity of indiscernibles.

– 11 – Rainer Osswald

slide-13
SLIDE 13

Summary

◮ Take Home Message I The lattices of FCA are the information domains of the complete implicational theories of the formal contexts (with finite attribute set). Their “latticehood” is due to the restriction to implicational theories. ◮ Take Home Message II The information domain of a theory (of universally quantified Boolean predicates) is the (ordered) universe of a universal model of that theory. In terms of propositional logic, the information domain is the set of all models of the theory. ◮ Take Home Message III The Lindenbaum algebra (of affirmative terms) of a theory is a distributive lattice with zero and unit. The Lindenbaum algebra and the information domain of a theory are dual to each other in the sense of Birkhoff’s Representation Theorem.

– 12 – Rainer Osswald

slide-14
SLIDE 14

Perspectives

◮ Infinite set of atomic propositions/predicates Lindenbaum algebra not uniquely determined by information domain ◮ Attribute-value theories Attributive descriptions π : φ (for λx∃y(πxy∧φy)) ◮ Default statements Default extensions as elements of information domains ◮ Categorical issues Category of theories, functors C and L

– 13 – Rainer Osswald