Feature Extraction with Description Logics Functional Subsumption


SLIDE 1

Feature Extraction with Description Logics Functional Subsumption

Rodrigo de Salvo Braz Dan Roth University of Illinois at Urbana-Champaign

SLIDE 2

A conflict

• Most machine learning algorithms use feature vectors as inputs.
• Most data is best represented as structured data.
• Feature extraction is the conversion from one to the other (and may be most of the work).

SLIDE 3

Structured data – I

Mohammed Atta met with an Iraqi intelligence agent in Prague in April 2001.

[Figure: the sentence above drawn as a graph. Attributes (node labels): meeting, person, name(“Mohammed Atta”), gender(male), organization, city, name(Prague), country, name(“Czech Republic”), name(Iraq), date, month(April), year(2001), word(an), tag(DT), word(Iraqi), tag(JJ), word(intelligence), tag(NN), ... Roles (edge labels): participant, location, time, affiliation, nationality, before, after, begin, end.]

SLIDE 4

Structured data – II

SLIDE 5

Feature Extraction

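[Figure: a structured example is turned into a feature vector by feature extraction (FE), driven by human-written feature types. The structured example is a graph around a node A, with attributes male, female, tall, age(40), name(john), name(mary), name(jenny), name(jill), name(peter), name(margot) and roles spouse, friend, student, child. The feature types are small graphs over labels such as male, female, child, spouse, friend and name(jenny). The feature vector shows entries equal to 1.]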

SLIDE 6

Feature Extraction

• Typically done in ad hoc fashion:
  • Prevents general analysis;
  • Prevents a unified analysis of feature extraction and learning (e.g. kernels).
• Using a language is tricky:
  • Type of inference.
  • May be intractable if not careful.

SLIDE 7

A language for declaring which features to generate

[Figure: feature type specifications by directed trees (labels: meeting, participant, nationality, country, name(Iraq), time, year(2001)), next to an example segment (the meeting graph with roles participant, location, time, affiliation, nationality and attributes person, organization, city, name(Prague), country, name(Iraq), date, month(April), year(2001)).]

SLIDE 8

Generating feature vectors

[Figure: the example (graph around node A with attributes male, female, tall, age(40), name(...) and roles spouse, friend, student, child) next to the feature types, drawn as small trees over the labels spouse, child, male, female and tall. The feature vector entries are still undetermined (? ? ? ?).]

SLIDE 9

Generating feature vectors

[Figure: animation frame. Numbered markers 1, 2, 3 pair the nodes of a feature type with spouse and child edges with nodes of the example; a 1 appears in the feature vector.]

SLIDE 10

Generating feature vectors

[Figure: animation frame. The markers 1, 2, 3 show another pairing of the same feature type (spouse and child edges) with nodes of the example.]

SLIDE 11

Generating feature vectors

[Figure: animation frame. The markers 1, 2, 3 show a further pairing of the same feature type (spouse and child edges) with nodes of the example.]

SLIDE 12

Generating feature vectors

[Figure: the example next to a feature type over the labels spouse, child, male, tall.]

Nothing like this in the example!

SLIDE 13

Generating feature vectors

[Figure: animation frame. Numbered markers 1, 2, 3 pair the nodes of a feature type over the labels female, male, child, child with nodes of the example; a 1 appears in the feature vector.]

SLIDE 14

Generating feature vectors

[Figure: animation frame. A feature type over the labels female, tall, child, child is paired with nodes of the example; one example node carries both markers 2 and 3, and a 1 appears in the feature vector.]

SLIDE 15

Generating feature vectors

[Figure: animation frame. Another pairing of the same feature type (female, tall, child, child) with nodes of the example; again one example node carries both markers 2 and 3.]

SLIDE 16

Generating feature vectors

[Figure: the example next to all feature types, with the resulting feature vector showing 1 1 1 for the feature types that matched.]

SLIDE 17

Feature Description Logics

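(AND (SOME spouse ANY) (SOME child (AND male tall)))
(SOME spouse (SOME friend female))

[Figure: the two descriptions drawn as trees, with roles spouse, child and friend on the edges and attributes male, tall and female on the nodes.]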
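A minimal sketch of how such descriptions could be held in a program. The nested-tuple encoding and the helper name `show` are assumptions made for illustration, not something the slides define.

```python
# Hypothetical encoding (not from the slides):
#   attribute          -> str, e.g. "male"; "ANY" places no constraint
#   (SOME role d)      -> ("SOME", role, d)
#   (AND d1 ... dn)    -> ("AND", d1, ..., dn)

def show(desc):
    """Render a nested-tuple description in the parenthesized slide syntax."""
    if isinstance(desc, str):
        return desc
    if desc[0] == "SOME":
        _, role, filler = desc
        return f"(SOME {role} {show(filler)})"
    return "(AND " + " ".join(show(part) for part in desc[1:]) + ")"

# The two descriptions shown on this slide.
first = ("AND",
         ("SOME", "spouse", "ANY"),
         ("SOME", "child", ("AND", "male", "tall")))
second = ("SOME", "spouse", ("SOME", "friend", "female"))

print(show(first))
print(show(second))
```

Each tuple mirrors one node of the tree drawing: attributes sit on the node, and every SOME adds an outgoing edge labeled with a role.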

SLIDE 18

Subsumption

• A description C subsumes (⊇) a description D if every individual in D must be in C, no matter the interpretation.
• Subsumption is tractable.

Example (C subsumes D):

C = (AND (SOME spouse ANY) (SOME child male))
D = (AND (SOME spouse (SOME student ANY)) (SOME child (AND tall male)) (SOME child female))

SLIDE 19

Feature extraction as subsumption

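Feature type: (SOME child female)

[Figure: example graph. An unnamed node has a friend edge to name(carol) and a child edge to name(john); name(carol) has a child edge to name(kelly), who is female. The unnamed node is selected.]

Description of node: (AND (SOME friend (AND name(carol) (SOME child (AND name(kelly) female)))) (SOME child name(john)))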

SLIDE 20

Feature extraction as subsumption

Feature type: (SOME child female)

[Figure: example graph with nodes name(carol), name(kelly) female and name(john), linked by friend and child edges; the name(john) node is selected.]

Description of node: name(john)

SLIDE 21

Feature extraction as subsumption

Feature type: (SOME child female)

[Figure: example graph with nodes name(carol), name(kelly) female and name(john), linked by friend and child edges; the name(kelly) node is selected.]

Description of node: (AND name(kelly) female)

SLIDE 22

Feature extraction as subsumption

Feature type: (SOME child female)

[Figure: example graph with nodes name(carol), name(kelly) female and name(john), linked by friend and child edges; the name(carol) node is selected.]

Description of node: (AND name(carol) (SOME child (AND name(kelly) female)))

Active feature!

SLIDE 23

A problem in practice

[Figure: a graph "buy" with subject dentist and object car, next to a graph "purchase" with subject (dentist, name(patricia)) and object (car, model(accord)).]

Subsumption would be natural in this case but does not occur.

SLIDE 24

A problem in practice

[Figure: two "kill" graphs, each with subject and object roles, over the attributes name(JFK), name(castro) and name(kennedy).]

SLIDE 25

A problem in practice

[Figure: graphs with job edges: name(schwarzenegger) with job governor, and name(schwarzneger) with job actor.]

SLIDE 26

Make comparison more flexible

• At the core of the subsumption algorithm is the comparison of attributes:
  ... if (attr1 == attr2) ...
• We simply make that a function call:
  ... if (f (attr1, attr2) == 1) ...

SLIDE 27

Is this just a hack?

What about the nice DL semantics?

SLIDE 28

Is this just a hack?

What about the nice DL semantics?
In fact, equivalent to “shallow OR” (tractable).

SLIDE 29

Is this just a hack?

What about the nice DL semantics?
In fact, equivalent to “shallow OR” (tractable).
Replace any attr by (OR a1 a2 ... an) where f(attr, ai) = 1.

(AND kill (SOME object JFK))
(AND (OR kill murder assassinate) (SOME object (OR JFK kennedy “John F. Kennedy” ...)))

SLIDE 30

Why not just use shallow OR then?

• Function is an implicit representation.
• We may incorporate procedural knowledge:
  • Typos;
  • Similar sounding words;
  • Context-sensitive knowledge.
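As one illustration of procedural knowledge, a comparison function could tolerate typos by using string similarity. This is a sketch only: the use of Python's `difflib`, the name `f_typo` and the 0.8 threshold are assumptions, not the authors' choice.

```python
from difflib import SequenceMatcher

def f_typo(attr1, attr2, threshold=0.8):
    """Return 1 if the two attribute strings are nearly identical, else 0.

    The similarity is difflib's ratio in [0, 1]; the threshold is an
    arbitrary value picked for this sketch and would need tuning.
    """
    ratio = SequenceMatcher(None, attr1.lower(), attr2.lower()).ratio()
    return 1 if ratio >= threshold else 0

# The misspelled name from the earlier slide is accepted,
# while unrelated attributes are still rejected.
print(f_typo("name(schwarzenegger)", "name(schwarzneger)"))  # 1
print(f_typo("governor", "actor"))                           # 0
```

Enumerating every possible misspelling as an explicit OR would be impractical, which is the point of keeping the representation implicit in a function.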

SLIDE 31

Take home message

• Feature Description Logics provides an expressive way to deal with structured examples.
• Syntax choices render it tractable.
• Allows for FE-learning integrated approaches like kernels (Cumby & Roth 2003).
• Can be made even more expressive with little extra cost by functional subsumption.

SLIDE 32

The End