Generic Ontology Learners on Application Domains Francesca Fallucchi - PowerPoint PPT Presentation

Generic Ontology Learners on Application Domains Francesca Fallucchi 1 Maria Teresa Pazienza 1 Fabio Massimo Zanzotto 1 1 DISP University of Rome Tor Vergata Rome, Italy {fallucchi,pazienza,zanzotto}@info.uniroma2.it LREC 2010, Malta, May 2010

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Motivation Learning methods require large general corpora and knowledge repositories In specific domains ontologies are extremely poor Manually building ontologies is a very time consuming and expensive task Automatically creating or extending ontologies needs large corpora and existing structured knowledge to achieve rea- sonable performance

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Motivation Problems Scarcity of domains covered by existing ontologies Not relevant existing ontologies to expand for target domain

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Motivation Problems Scarcity of domains covered by existing ontologies Not relevant existing ontologies to expand for target domain ⇓ Solution We propose a model that can be used in different specific knowledge domains with a small effort for its adaptation Our model is learned from a generic domain that can be exploited to extract new informations in a specific domain

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Motivations 1 Probabilistic Ontology Learning 2 Corpus Analysis A Probabilistic Model Logistic Regression Experimental Evaluation 3 Experimental Set-Up Agreement Results Conclusions and Future Works 4

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Our Learner Model Model exploits the information learned in a background domain for extracting information in an adaptation domain Model is based on the probabilistic formulation Model takes into consideration corpus-extracted evidences over a list of training pairs Model is used to estimate the probabilities of the new instances computing a new feature space

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis corpus instance ✛ ( dog , animal )

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context ... “dog” , as “animal” ... corpus instance ✛ ( dog , animal )

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context ... “dog” , as “animal” ... corpus instance ✛ ( dog , animal ) , features as , as

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context ... “dog” , as “animal” ... corpus instance ✛ ( dog , animal ) , 1 features as 1 ◗ ❦ ◗ , as 1 ◗ ◗ feature space

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis corpus

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context X 1 f 1 f 2 Y 1 corpus

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context X 1 f 1 f 2 Y 1 corpus ( X 1 , Y 1 ) • f 1 • f 2

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context X 1 f 1 f 2 Y 1 corpus ( X 1 , Y 1 ) ( X 2 , Y 2 ) • f 1 • • f 2 • f 3

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Corpus Analysis context X 1 f 1 f 2 Y 1 corpus ( X 1 , Y 1 ) ( X 2 , Y 2 ) ... ... ( X n , Y n ) • • • • • f 1 • • • • f 2 • • • • f 3 • • • • • • • • • • • • . . . • • • . . • • • • . • • • • • • • • • • • • • f m

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Instances Matrix context X 1 f 1 f 2 Y 1 ( X 1 , Y 1 ) ( X 2 , Y 2 ) ... ... ( X n , Y n ) Corpus • • • • • f 1 • • • • f 2 f 3 • • • • • • • • • • • • • • • . . • • • . . . . • • • • • • • • • • • • f m • • • • •

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Instances Matrix Evidences Matrix context E = ( − → e 1 ... − → X 1 f 1 f 2 Y 1 e n ) ( X 1 , Y 1 ) ( X 2 , Y 2 ) ... ... ( X n , Y n ) Corpus • • • • • f 1 • • • • f 2 f 3 • • • • • • • • • • • • • • • . . • • • . . . . • • • • • • • • • • • • f m • • • • •

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions A Probabilistic Model Probabilistic model for learning ontologies form corpora Ontology is seen as a set O of relations R over pairs R i , j If R i , j is in O , i is a concept and j is one of its generalization Goal: Estimate Posterior Probability P ( R i , j ∈ O | E ) where E is a set of evidences extracted from corpus

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Logistic Regression Logit Given two variables Y and X , the probability p of Y to be 1 given that X = x is: p = P ( Y = 1 | X = x ) and Y ∼ Bernoulli ( p ) � � p logit ( p ) = ln 1 − p

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Logistic Regression Logit Given two variables Y and X , the probability p of Y to be 1 given that X = x is: p = P ( Y = 1 | X = x ) and Y ∼ Bernoulli ( p ) � � p logit ( p ) = ln 1 − p logit ( p ) = β 0 + β 1 x 1 + ... + β k x k

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Logistic Regression Logit Given two variables Y and X , the probability p of Y to be 1 given that X = x is: p = P ( Y = 1 | X = x ) and Y ∼ Bernoulli ( p ) � � p logit ( p ) = ln 1 − p logit ( p ) = β 0 + β 1 x 1 + ... + β k x k Given regression coefficients the probability is exp ( β 0 + β 1 x 1 + ... + β k x k ) p ( x ) = 1 + exp ( β 0 + β 1 x 1 + ... + β k x k )

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Estimating Regression Coefficients We estimate the regressors β 0 , β 1 ,..., β k of x 1 ,..., x k with maximal likelihood estimation logit ( p ) = β 0 + β 1 x 1 + ... + β k x k solving a linear problem

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Estimating Regression Coefficients We estimate the regressors β 0 , β 1 ,..., β k of x 1 ,..., x k with maximal likelihood estimation logit ( p ) = β 0 + β 1 x 1 + ... + β k x k solving a linear problem − − − − → logit ( p ) = E β where   1 e 11 e 12 ··· e 1 n  ···  1 e 21 e 22 e 2 n   E =  . . . .  ... . . . .   . . . . ··· 1 e m 1 e m 2 e mn

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Background Ontology Learner Using a logistic regressor based on the Moore-Penrose pseudo-inverse matrix (Fallucchi and Zanzotto, RANLP 2009) β = X + � C B l where: X + C B is the pseudo-inverse matrix of the evidences matrix X C B obtained from a generic corpus C B l is the logit vector ( − − − − → logit ( p ) )

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Estimator for Application Domain The logit of the testing pairs l ′ = α X C A � β where: α is a parameter used to adapt the model by the β vector to the new domain X C A is the inverse evidence matrix obtained from an adaptation domain corpus C A � β is the regressors vector

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Estimator for Application Domain The logit of the testing pairs l ′ = α X C A � β where: α is a parameter used to adapt the model by the β vector to the new domain X C A is the inverse evidence matrix obtained from an adaptation domain corpus C A � β is the regressors vector Then, step by step testing pairs probability exp ( l i ) p i = 1 + exp ( l i )

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Motivations 1 Probabilistic Ontology Learning 2 Corpus Analysis A Probabilistic Model Logistic Regression Experimental Evaluation 3 Experimental Set-Up Agreement Results Conclusions and Future Works 4

Motivations Probabilistic Ontology Learning Experimental Evaluation Conclusions Experimental Set-Up Target Ontologies 1 Training: pairs that are in hyperonym relation in WordNet ==> about 600000 pairs of words Testing: pairs in Earth Observation Domain ==> about 404 pairs of words Corpus 2 Training: English Web as Corpus , ukWaC (Ferraresi,2008) ==> about 2700000 web pages Testing: corpus related to Earth Observation Domain ==> about 8300 web pages Feature Spaces 3 bag-of-words and n-grams windows: length 3 tokens ==> about 280000 features

Generic Ontology Learners on Application Domains Francesca Fallucchi - PowerPoint PPT Presentation

Generic Ontology Learners on Application Domains Francesca Fallucchi 1 Maria Teresa Pazienza 1 Fabio Massimo Zanzotto 1 1 DISP University of Rome Tor Vergata Rome, Italy {fallucchi,pazienza,zanzotto}@info.uniroma2.it LREC 2010, Malta, May 2010

What are Generics? e.g. Generics, Generic Programming, Generic Types, Generic Methods 6

Data driven Ontology Alignment Data driven Ontology Alignment Nigam Shah nigam@stanford.edu

Generic Programming in a Dependently Typed Language Generic proofs for generic programs Peter

Generic Methods 36 What are Generic Methods? Generic methods = methods that introduce type

1 Definition of a simple generic class Why generic programming (cont.) class Pair <T> {

Planning and Optimization C14. Merge-and-Shrink Abstractions: Generic Algorithm Malte Helmert and

Generic classes Declaration Use Annotations 54 Generic classes Declaration add

Some (more) Burning Issues for Ontology Initiatives Background: Current Ontology Work in Bremen

Ontology Development 101: A Guide to Creating Your First Ontology Natalya F. Noy and Deborah L.

Systematic Annotation Mark Voorhies 4/5/2011 The Gene Ontology Three directed acyclic graphs

Combining XML querying Combining XML querying with ontology reasoning: with ontology reasoning:

Ontology Languages for the Semantic Web Ontology Languages Wide variety of languages for

Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA 1 Outline S O P

Ontology Engineering Lecture 7: Top-down (and middle-out) Ontology Development II Maria Keet

ODPReco - A Tool to Recommend Ontology Design Patterns Maleeha Arif Yasvi, Raghava Mutharaju

Bi-Continuous Domains and Some Old Problems in Domain Theory Talk at Domains IX Klaus Keimel

Iterative Convex Regularization Lorenzo Rosasco Universita di Genova Universita di Genova

Linear Regression II, SGD, Perceptron Milan Straka October 14, 2019 Charles University in

Graph Theoretic Approaches to Atom ic Vibrations in Fullerenes ERNESTO ESTRADA Department of

t r

Interference Alignment Approaches: Delayed CSIT and Alignment Matrix Jhanak Parajuli Jacobs

Geographic Data Science - Lecture V Space, formally Dani Arribas-Bel Today The need to

Creating smoothed maps with the help of the command Nick Deschacht

On the development of a Cognitive Radio Network Simulator based on OMNeT++/MiXiM Giuseppe

Generic Ontology Learners on Application Domains Francesca Fallucchi - PowerPoint PPT Presentation

Generic Ontology Learners on Application Domains Francesca Fallucchi 1 Maria Teresa Pazienza 1 Fabio Massimo Zanzotto 1 1 DISP University of Rome Tor Vergata Rome, Italy {fallucchi,pazienza,zanzotto}@info.uniroma2.it LREC 2010, Malta, May 2010

What are Generics? e.g. Generics, Generic Programming, Generic Types, Generic Methods 6

Data driven Ontology Alignment Data driven Ontology Alignment Nigam Shah nigam@stanford.edu

Generic Programming in a Dependently Typed Language Generic proofs for generic programs Peter

Generic Methods 36 What are Generic Methods? Generic methods = methods that introduce type

1 Definition of a simple generic class Why generic programming (cont.) class Pair &lt;T&gt; {

Planning and Optimization C14. Merge-and-Shrink Abstractions: Generic Algorithm Malte Helmert and

Generic classes Declaration Use Annotations 54 Generic classes Declaration add

Some (more) Burning Issues for Ontology Initiatives Background: Current Ontology Work in Bremen

Ontology Development 101: A Guide to Creating Your First Ontology Natalya F. Noy and Deborah L.

Systematic Annotation Mark Voorhies 4/5/2011 The Gene Ontology Three directed acyclic graphs

Combining XML querying Combining XML querying with ontology reasoning: with ontology reasoning:

Ontology Languages for the Semantic Web Ontology Languages Wide variety of languages for

Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA 1 Outline S O P

Ontology Engineering Lecture 7: Top-down (and middle-out) Ontology Development II Maria Keet

ODPReco - A Tool to Recommend Ontology Design Patterns Maleeha Arif Yasvi, Raghava Mutharaju

Bi-Continuous Domains and Some Old Problems in Domain Theory Talk at Domains IX Klaus Keimel

Iterative Convex Regularization Lorenzo Rosasco Universita di Genova Universita di Genova

Linear Regression II, SGD, Perceptron Milan Straka October 14, 2019 Charles University in

Graph Theoretic Approaches to Atom ic Vibrations in Fullerenes ERNESTO ESTRADA Department of

t r

Interference Alignment Approaches: Delayed CSIT and Alignment Matrix Jhanak Parajuli Jacobs

Geographic Data Science - Lecture V Space, formally Dani Arribas-Bel Today The need to

Creating smoothed maps with the help of the command Nick Deschacht

On the development of a Cognitive Radio Network Simulator based on OMNeT++/MiXiM Giuseppe

1 Definition of a simple generic class Why generic programming (cont.) class Pair <T> {