Inductive Learning and Ockhams Razor Konstantin Genin Kevin T. - - PowerPoint PPT Presentation

inductive learning and ockham s razor
SMART_READER_LITE
LIVE PREVIEW

Inductive Learning and Ockhams Razor Konstantin Genin Kevin T. - - PowerPoint PPT Presentation

Inductive Learning and Ockhams Razor Konstantin Genin Kevin T. Kelly Carnegie Mellon University kgenin@andrew.cmu.edu Konstantin Genin (CMU) Inductive Learning and Ockhams Razor October 23, 2014 1 / 64 Justifying Inductive Methods


slide-1
SLIDE 1

Inductive Learning and Ockham’s Razor

Konstantin Genin Kevin T. Kelly

Carnegie Mellon University kgenin@andrew.cmu.edu

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 1 / 64

slide-2
SLIDE 2

Justifying Inductive Methods

Figure : Rudolf Carnap, 1891-1970

Our system of inductive logic ... is intended as a rational reconstruction ... of inductive thinking as customarily applied in everyday life and science. ... An entirely different question is the problem of the validity of our or any other proposed system of inductive logic ... This is the genuinely philosophical problem of induction (Carnap, 1945).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 2 / 64

slide-3
SLIDE 3

Justifying Inductive Methods

A justification of an inductive procedure

1 must refer to its success in some sense; 2 must not require that the truth of its predictions be guaranteed in

the short-run.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 3 / 64

slide-4
SLIDE 4

Justifying Inductive Methods

Reichenbach is right ... that any procedure, which does not [converge in the limit] is inferior to his rule of induction. However, his rule ... is far from being the only one possessing that characteristic. The same holds for an infinite number of

  • ther rules of induction. ... Therefore we need a more

general and stronger method for examining and comparing any two given rules of induction ... (Carnap, 1945)

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 4 / 64

slide-5
SLIDE 5

Justifying Inductive Methods

Is there something in between?

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 5 / 64

slide-6
SLIDE 6

Ockham’s Razor

Figure : William of Ockham, 1287-1347

All things being equal, one ought to prefer simpler theories.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 6 / 64

slide-7
SLIDE 7

Simplicity and Ockham’s Razor

Two fundamental questions:

1 How is simplicity to be defined? Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 7 / 64

slide-8
SLIDE 8

Simplicity and Ockham’s Razor

Two fundamental questions:

1 How is simplicity to be defined? 2 Given simplicity, what is Ockham’s Razor? Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 7 / 64

slide-9
SLIDE 9

Simplicity and Ockham’s Razor

Two fundamental questions:

1 How is simplicity to be defined? 2 Given simplicity, what is Ockham’s Razor? 3 How does Ockham’s Razor help you find the truth? Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 7 / 64

slide-10
SLIDE 10

Simplicity: Epistemic or Methodological?

“Justifying an epistemic principle requires answering an epistemic question: why are parsimonious theories more likely to be true? Justifying a methodological principle requires answering a pragmatic question: why does it make practical sense for theorists to adopt parsimonious theories?” (Baker, SEP).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 8 / 64

slide-11
SLIDE 11

Justifying Inductive Methods

Ockham’s razor cannot provide a short-run guarantee. A justification

  • f Ockham’s Razor is tied up with what could be in between these two

extremes.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 9 / 64

slide-12
SLIDE 12

Section 2 Topology as Epistemology

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 10 / 64

slide-13
SLIDE 13

Topological as Epistemology

Related Approaches:

1 Vickers (1996) 2 Kelly (1996) 3 Luo and Schulte (2006) 4 Yamamoto and de Brecht (2010) 5 Baltag, Gierasimczuk, and Smets (2014) Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 11 / 64

slide-14
SLIDE 14

Propositions and Possible Worlds

Let W be a set of possible worlds. A proposition is a set P ⊆ W. The contradictory proposition is ∅ and the necessary proposition is W. P ∧ Q = P ∩ Q, P ∨ Q = P ∪ Q, ¬P = W \ P and P entails Q iff P ⊆ Q.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 12 / 64

slide-15
SLIDE 15

Observable Propositions

Let O ⊆ P(W) be the set of observable propositions. Then the set of all propositions observable in world w is: O(w) = {O ∈ O : w ∈ O}. O is a topological basis iff the following are both satisfied:

  • O1. O = W;
  • O2. If A,B ∈ O(w) then there is C ∈ O(w) such that C ⊆ A ∩ B.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 13 / 64

slide-16
SLIDE 16

Verifiable Propositions

Say that a proposition P is verifiable iff for every world w ∈ P there is some observation O ∈ O(w) such that O entails P. The following four thesis about verifiability follow from this definition:

  • V1. The contradictory proposition ∅ is verifiable.
  • V2. The trivial proposition W is verifiable.
  • V3. The verifiable propositions are closed under finite conjunction.
  • V4. The verifiable propositions are closed under arbitrary disjunction.

The possible worlds and verifiable propositions (W,V) form a topology.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 14 / 64

slide-17
SLIDE 17

Verifiable Propositions

You can verify finitely many sunrises, But not that it will rise every morning.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 15 / 64

slide-18
SLIDE 18

Falsifiable Propositions

Say that a proposition P is falsifiable iff ¬P is verifiable. The following four thesis about falsifiability follow from this definition:

  • F1. The contradictory proposition ∅ is falsifiable.
  • F2. The trivial proposition W is falsifiable.
  • F3. The falsifiable propositions are closed under finite disjunction.
  • F4. The verifiable propositions are closed under arbitrary

conjunction.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 16 / 64

slide-19
SLIDE 19

A Translation Key

To translate between topology and epistemology:

1 basic open set ≡ observable proposition. 2 open set ≡ verifiable proposition. 3 closed set ≡ falsifiable proposition. 4 clopen set ≡ decidable proposition. 5 locally closed set ≡ conditionally refutable proposition. Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 17 / 64

slide-20
SLIDE 20

The Topology of the Problem of Induction

The bread, which I formerly ate, nourished me ... but does it follow, that other bread must also nourish me at another time, and that like sensible qualities must always be attended with like secret powers? The consequence seems nowise necessary (Enquiry Concerning Human Understanding).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 18 / 64

slide-21
SLIDE 21

Sierpinski Space

Suppose we have two worlds.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 19 / 64

slide-22
SLIDE 22

Sierpinski Space

Suppose we have two worlds.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 20 / 64

slide-23
SLIDE 23

Sierpinski Space

If bread always nourishes, we can never rule out that one day it will stop nourishing.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 21 / 64

slide-24
SLIDE 24

Sierpinski Space

If someday bread will cease to nourish, this will be verified.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 22 / 64

slide-25
SLIDE 25

Sierpinski Space

This simple structure defines the Sierpinski space, a simple topological space.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 23 / 64

slide-26
SLIDE 26

Sierpinski Space

Note that all information compatible with the bottom world is compatible with the top world, but the converse is not true.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 24 / 64

slide-27
SLIDE 27

The specialization order

Let w v iff O(w) ⊆ O(v) i.e. all information consistent with w is consistent with v.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 25 / 64

slide-28
SLIDE 28

The specialization order

Let w ≺ v if w v but v w.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 26 / 64

slide-29
SLIDE 29

The specialization order

This defines the specialization order over points in the space.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 27 / 64

slide-30
SLIDE 30

The specialization order

A topology is T0 if for all w,v, if w v then O(w) O(v) i.e. if two worlds are distinct, then there is some observational difference between them. The T0 axiom rules out “metaphysical” distinctions between worlds. For T0 spaces, the specialization order is a partial order.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 28 / 64

slide-31
SLIDE 31

The specialization order

Figure : A “metaphysical” distinction.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 29 / 64

slide-32
SLIDE 32

The specialization order

A topology is Td (Aull and Thron, 1962) iff for all w, if {v : v ≺ w} is closed.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 30 / 64

slide-33
SLIDE 33

Section 3 Empirical Simplicity

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 31 / 64

slide-34
SLIDE 34

Topological Closure

The closure of a proposition A is the set of all worlds where A is never refuted: A = {w : Every O ∈ O(w) is consistent with A}. Furthermore, {w} = {v : v w}.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 32 / 64

slide-35
SLIDE 35

Popper and Simplicity

The epistemological questions which arise in connection with the concept of simplicity can all be answered if we equate this concept with degree of falsifiability (Popper, 1959).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 33 / 64

slide-36
SLIDE 36

Popper and Simplicity

A proposition P is more falsifiable than Q if and only if every

  • bservation that falsifies Q falsifies P.

Equivalently, every observation consistent with P is consistent with Q.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 34 / 64

slide-37
SLIDE 37

Popper and Simplicity

A proposition P is more falsifiable than Q if and only if every

  • bservation that falsifies Q falsifies P.

Equivalently, every observation consistent with P is consistent with Q. So if P is true, Q will never be refuted. Therefore P ⊆ Q.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 35 / 64

slide-38
SLIDE 38

Empirical Simplicity

Definition (The Simplicity Order)

P is simpler than Q, written P Q,

1 iff P ⊆ Q, 2 iff P entails Q will never be refuted, 3 iff P has a problem of induction with Q, 4 iff P is more falsifiable than Q. Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 36 / 64

slide-39
SLIDE 39

Section 4 Learning

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 37 / 64

slide-40
SLIDE 40

Empirical Problem Contexts

An empirical problem context is a triple P = (W,O,Q). W is the set of possible worlds. O is a countable set of observables. Q is a question that partitions W into countably many answers. Let Q(w) be the answer true at w.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 38 / 64

slide-41
SLIDE 41

Learning

An empirical method is a function λ : O → P(W). Say that λ is solves P in the limit iff for all w ∈ W, there is E ∈ O(w) such that for all F ∈ O(w), w ∈ λ(E ∩ F) ⊆ Q(w). Say that P is solvable in the limit iff there exists λ that solves it in the limit.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 39 / 64

slide-42
SLIDE 42

Learning

Proposition (Yamamoto and de Brecht (2010))

If |W| ≤ ω then P = (W,O,Q) is solvable in the limit iff (W,O) is Td.

Proposition (Baltag, Gierasimczuk, and Smets (2014))

P = (W,O,Q) is solvable in the limit iff each Q ∈ Q is a countable union of locally closed sets.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 40 / 64

slide-43
SLIDE 43

A Special Case

For simplicity we restrict our attention to the case where |W| ≤ ω and Q = Q⊥.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 41 / 64

slide-44
SLIDE 44

Section 5 Efficient Convergence

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 42 / 64

slide-45
SLIDE 45

Refining Convergence

Pursuit of truth ought to be as direct as possible.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 43 / 64

slide-46
SLIDE 46

Refining Convergence

Needless cycles and reversals in opinion ought to be avoided.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 44 / 64

slide-47
SLIDE 47

Cycles and Reversals

Definition (Reversals)

(λ(E),λ(F)) is a reversal iff F ⊂ E and λ(F) ⊆ λ(E)c.

Definition (Cycles)

(λ(E),λ(F),λ(G)) is a cycle iff (λ(E),λ(F)) and (λ(F),λ(G)) are reversals and λ(G) ⊆ λ(E).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 45 / 64

slide-48
SLIDE 48

Monotonicity Principles

A learner λ is

1 rationally monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when F meets λ(E). Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 46 / 64

slide-49
SLIDE 49

Monotonicity Principles

A learner λ is

1 rationally monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when F meets λ(E). 2 cautiously monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when λ(E) ⊆ F. Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 46 / 64

slide-50
SLIDE 50

Monotonicity Principles

A learner λ is

1 rationally monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when F meets λ(E). 2 cautiously monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when λ(E) ⊆ F. 3 reversal monotone if λ(E ∩F) meets λ(E)∩F when F meets λ(E). Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 46 / 64

slide-51
SLIDE 51

Monotonicity Principles

A learner λ is

1 rationally monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when F meets λ(E). 2 cautiously monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when λ(E) ⊆ F. 3 reversal monotone if λ(E ∩F) meets λ(E)∩F when F meets λ(E). 4 weakly monotone if λ(E ∩ F) meets λ(E) ∩ F when λ(E) ⊆ F. Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 46 / 64

slide-52
SLIDE 52

Monotonicity Principles

A learner λ is

1 rationally monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when F meets λ(E). 2 cautiously monotone if λ(E ∩ F) ⊆ λ(E) ∩ F when λ(E) ⊆ F. 3 reversal monotone if λ(E ∩F) meets λ(E)∩F when F meets λ(E). 4 weakly monotone if λ(E ∩ F) meets λ(E) ∩ F when λ(E) ⊆ F. Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 47 / 64

slide-53
SLIDE 53

Monotonicity Principles

Proposition

A learner λ is reversal monotone iff there are no E,F,G ∈ O such that (λ(E),λ(F),λ(G)) is a cycle.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 48 / 64

slide-54
SLIDE 54

Section 6 Ockham’s Razor

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 49 / 64

slide-55
SLIDE 55

Ockham’s Razor

Definition (The Vertical Razor)

Say that a learner λ is vertical Ockham iff

1 iff for all w ∈ E, {w} λ(E) ⇒ w ∈ λ(E); 2 iff λ(E) is closed in E; 3 iff λ(E) is downward-closed in . Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 50 / 64

slide-56
SLIDE 56

Vertical Ockham and Monotonicity

Proposition

A learner λ is reversal monotone only if λ is vertical Ockham.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 51 / 64

slide-57
SLIDE 57

Vertical Ockham and Monotonicity

Proposition

A learner λ is reversal monotone only if λ is vertical Ockham.

Sketch.

Suppose you violate the vertical razor.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 52 / 64

slide-58
SLIDE 58

Vertical Ockham and Monotonicity

Proposition

A learner λ is reversal monotone only if λ is vertical Ockham.

Sketch.

You reverse on further information, though your first conjecture is not refuted.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 53 / 64

slide-59
SLIDE 59

Vertical Ockham and Monotonicity

Proposition

A learner λ is reversal monotone only if λ is vertical Ockham.

Sketch.

On even further information, you are forced into a cycle.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 54 / 64

slide-60
SLIDE 60

Vertical Ockham and Monotonicity

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 55 / 64

slide-61
SLIDE 61

Ockham’s Razor

Definition (The Weak Razor)

Say that a learner λ is weak Ockham iff for all E,        

  • v∈λ(E)

{v}         ∩ E ⊆ λ(E).

Proposition

A learner λ is weakly monotone only if λ is weak Ockham.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 56 / 64

slide-62
SLIDE 62

Ockham and Monotonicity

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 57 / 64

slide-63
SLIDE 63

Ockham’s Razor

Definition (The Horizontal Razor)

Say that a learner λ is horizontal Ockham iff for all E, λ(E) is co-initial in .

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 58 / 64

slide-64
SLIDE 64

Reversal Optimality

Definition

Say that λ is reversal-optimal iff for every reversal sequence (λ(E),λ(F)) and learner λ′ there is a reversal sequence (λ′(G),λ′(H)) such that λ′(G) ⊆ λ(E) and λ′(F) ⊆ λ(H).

Proposition

A learner λ is reversal-optimal iff λ is horizontal Ockham.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 59 / 64

slide-65
SLIDE 65

Retractions

Definition (Retractions)

(λ(E),λ(F)) is a retraction iff F ⊂ E and λ(F) λ(E).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 60 / 64

slide-66
SLIDE 66

Retraction Optimality

Definition

Say that λ is retraction-optimal iff for every retraction sequence (λ(E),λ(F)) and learner λ′ there is a retraction sequence (λ′(G),λ′(H)) such that λ′(G) ⊆ λ(E) and λ′(F) ⊆ λ(H).

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 61 / 64

slide-67
SLIDE 67

Ockham and Monotonicity

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 62 / 64

slide-68
SLIDE 68

Justification of Ockham’s Razor

Thank you!

Supported by a grant from the John Templeton Foundation. Manuscript: http://www.andrew.cmu.edu/user/kk3n/simplicity/bulletin-12.pdf

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 63 / 64

slide-69
SLIDE 69

Works Cited

CE Aull and WJ Thron. Separation axioms between t0 and t1. Indag. Math, 24:26–37, 1962. Alexandru Baltag, Nina Gierasimczuk, and Sonja Smets. Epistemic topology (to appear). ILLC Technical Report, 2014. Rudolf Carnap. On inductive logic. Philosophy of Science, 12(2):72, 1945. Kevin Kelly. The Logic of Reliable Inquiry. 1996. Wei Luo and Oliver Schulte. Mind change efficient learning. Information and Computation, 204(6):989–1011, 2006. Karl R. Popper. The Logic of Scientific Discovery. London: Hutchinson, 1959. Steven Vickers. Topology Via Logic. Cambridge University Press, 1996. Akihiro Yamamoto and Matthew de Brecht. Topological properties of concept spaces (full version). Information and Computation, 208(4):327–340, 2010.

Konstantin Genin (CMU) Inductive Learning and Ockham’s Razor October 23, 2014 64 / 64