SLIDE 1

A Family of Fuzzy Orthogonal Projection Models for Monolingual and Cross-lingual Hypernymy Prediction

Chengyu Wang¹, Yan Fan¹, Xiaofeng He¹*, Aoying Zhou²

¹School of Computer Science and Software Engineering, ²School of Data Science and Engineering,

East China Normal University, Shanghai, China

SLIDE 2

Outline

  • Introduction
  • Related Work
  • Monolingual Model

– Multi-Wahba Projection (MWP)

  • Cross-lingual Models

– Transfer MWP (TMWP)
– Iterative Transfer MWP (ITMWP)

  • Experiments

– Monolingual Experiments
– Cross-lingual Experiments

  • Conclusion and Future Work

SLIDE 3

Introduction (1)

  • Hypernymy (“is-a”) relations are important for NLP and Web applications.

– Semantic resource construction: semantic hierarchies, taxonomies, knowledge graphs, etc.
– Web-based applications: query understanding, post-search navigation, personalized recommendation, etc.

[Figure: a simple example of a taxonomy, with Entity → Person (Political Leader, Scientist) and Entity → Country (Developed Country)]
SLIDE 4

Introduction (2)

  • Research challenges for predicting hypernymy relations between words:

– Monolingual hypernymy prediction

  • Pattern-based approaches have low recall
  • Distributional classifiers suffer from the “lexical memorization” problem

– Cross-lingual hypernymy prediction

  • Training sets for lower-resourced languages are small
  • This area has not been sufficiently researched

SLIDE 5

[Figure-only slide]

SLIDE 6

Related Work (1)

  • Monolingual hypernymy prediction

– Pattern-based approaches:

  • Handcrafted patterns: high accuracy, low coverage

– Hearst patterns: “NP1 such as NP2” (see the sketch after this list)

  • Automatically generated patterns: higher coverage, lower accuracy
  • High language dependency

– Distributional approaches:

  • Unsupervised distributional measures: relatively low precision
  • Supervised distributional classifiers: suffer from the “lexical memorization” problem
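As a concrete illustration of a handcrafted Hearst pattern, here is a minimal, hypothetical Python sketch (not the paper's code; real extractors match noun phrases rather than single words):

```python
import re

# A minimal, illustrative Hearst-pattern matcher: "NP1 such as NP2"
# yields the candidate pair (NP2 is-a NP1). Real systems use NP
# chunking; the single-word \w+ groups are a simplification of ours.
HEARST_SUCH_AS = re.compile(r"(\w+) such as (\w+)", re.IGNORECASE)

def extract_pairs(sentence):
    """Return candidate (hyponym, hypernym) pairs from one sentence."""
    return [(m.group(2), m.group(1)) for m in HEARST_SUCH_AS.finditer(sentence)]

print(extract_pairs("He studied countries such as France and Italy."))
# -> [('France', 'countries')]
```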

SLIDE 7

Related Work (2)

  • Cross-lingual hypernymy prediction

– Learning multi-lingual taxonomies based on existing knowledge sources

  • YAGO3: Multi-lingual Wikipedia + WordNet
  • More precise, but limited in scope by the underlying knowledge sources

– This task has not been extensively studied for lower-resourced languages.

SLIDE 8

Monolingual Model (1)

  • Basic Notations

– Hypernymy training set $E^{(+)} = \{(y_i, z_i^{(+)})\}$
– Non-hypernymy training set $E^{(-)} = \{(y_i, z_i^{(-)})\}$

  • Orthogonal Projection Model for Hypernymy Relations

– Objective function: learn an orthogonal projection mapping hyponym embeddings to hypernym embeddings
– Limitation: it does not consider the complicated linguistic regularities of hypernymy relations


Normalized embeddings: adding orthogonal constraints guarantees that projected embeddings stay normalized!
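The objective function on this slide is an image in the original deck; a plausible reconstruction under the stated setup (a single orthogonal projection $M^{(+)}$ from hyponym to hypernym embeddings; the exact form is our assumption) is:

```latex
\min_{M^{(+)}} \sum_{(y_i,\, z_i^{(+)}) \in E^{(+)}}
  \bigl\| M^{(+)} \vec{y}_i - \vec{z}_i^{(+)} \bigr\|_2^2
\qquad \text{s.t. } M^{(+)\top} M^{(+)} = I
```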

SLIDE 9

Monolingual Model (2)

  • Fuzzy Orthogonal Projection Model for Hypernymy Relations

– Apply K-means to $E^{(+)}$ over the features $\vec{y}_i - \vec{z}_i^{(+)}$, with cluster centroids $\vec{d}_1^{(+)}, \vec{d}_2^{(+)}, \cdots, \vec{d}_K^{(+)}$.
– Compute the weight of each $(y_i, z_i^{(+)}) \in E^{(+)}$ w.r.t. the $k$th cluster.

– Objective function


This objective is termed the Multi-Wahba Projection (MWP).
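A minimal sketch of the clustering-and-weighting step (the softmax-over-distances membership function and the `temperature` parameter are our assumptions; the paper defines its own weighting):

```python
import numpy as np
from sklearn.cluster import KMeans

# Each hypernymy pair (y_i, z_i) is represented by its offset vector,
# clustered with K-means, then given soft memberships per cluster.
def cluster_and_weight(Y, Z, n_clusters=4, temperature=1.0):
    """Y, Z: (n, d) arrays of hyponym / hypernym embeddings.
    Returns cluster centroids and a soft (n, K) membership matrix."""
    offsets = Y - Z                                   # features y_i - z_i
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(offsets)
    centroids = km.cluster_centers_                   # d_1 ... d_K
    # Soft weights: softmax over negative distances to each centroid
    # (an assumed fuzzy-membership form, not necessarily the paper's).
    dists = np.linalg.norm(offsets[:, None, :] - centroids[None], axis=2)
    logits = -dists / temperature
    weights = np.exp(logits - logits.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return centroids, weights
```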

SLIDE 10

Some Observations

  • Objective Function

– The optimizations of the different projection matrices are independent of each other!


The Multi-Wahba Projection (MWP) reduces to an extended Wahba's problem.

SLIDE 11

Monolingual Model (3)

  • Solving the MWP Problem

– Consider the $k$th cluster only.
– An SVD-based closed-form solution exists (see the sketch below).


Refer to the paper for the proof of correctness.
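For concreteness, here is a sketch of the classical SVD-based closed-form solution to a weighted Wahba / orthogonal Procrustes problem for a single cluster (the paper's exact formulation may differ; function and variable names are ours):

```python
import numpy as np

# Solve min_M sum_i w_i ||M y_i - z_i||^2 over orthogonal M:
# maximize tr(M^T B) with B = sum_i w_i z_i y_i^T, via SVD of B.
def solve_wahba(Y, Z, w):
    """Y, Z: (n, d) embeddings; w: (n,) per-pair cluster weights.
    Returns the orthogonal matrix M minimizing the weighted loss."""
    B = (Z * w[:, None]).T @ Y        # d x d weighted correlation matrix
    U, _, Vt = np.linalg.svd(B)
    # Force det(M) = +1 so that M is a proper rotation.
    d = np.sign(np.linalg.det(U @ Vt))
    S = np.diag([1.0] * (B.shape[0] - 1) + [d])
    return U @ S @ Vt
```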

SLIDE 12

Monolingual Model (4)

  • Overall Procedure

– Learning hypernymy projections
– Learning non-hypernymy projections

SLIDE 13

Monolingual Model (5)

  • Overall Procedure

– Training the projection-based neural network
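The network details are in the paper; as one hedged realization, the learned per-cluster projections can supply residual features to a simple binary classifier (logistic regression here stands in for the actual projection-based neural network):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Turn learned hypernymy / non-hypernymy projections into features:
# for each projection M, use the residual norm ||M y - z|| as one
# feature dimension. This feature design is our simplification.
def residual_features(Y, Z, proj_pos, proj_neg):
    """proj_pos / proj_neg: lists of (d, d) orthogonal matrices."""
    feats = []
    for M in proj_pos + proj_neg:
        feats.append(np.linalg.norm(Y @ M.T - Z, axis=1))
    return np.stack(feats, axis=1)                    # (n, 2K)

def train_classifier(Y, Z, labels, proj_pos, proj_neg):
    X = residual_features(Y, Z, proj_pos, proj_neg)
    return LogisticRegression(max_iter=1000).fit(X, labels)
```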

SLIDE 14

Cross-lingual Models (1)

  • Basic Notations

– Hypernymy training sets

  • Source language: $E_S^{(+)}$
  • Target language: $E_T^{(+)}$

– Non-hypernymy training sets

  • Source language: $E_S^{(-)}$
  • Target language: $E_T^{(-)}$

– Unlabeled set of the target language: $V_T = \{(y_i, z_i)\}$

Note: $|E_S^{(+)}| \gg |E_T^{(+)}|$ and $|E_S^{(-)}| \gg |E_T^{(-)}|$ (the source language has far more training data than the target language).

SLIDE 15

Cross-lingual Models (2)

  • Transfer MWP Model (TMWP)

– Learning hypernymy projections (see the objective sketch below)
– $\gamma$: controls the relative importance of the source- and target-language training sets
– $\delta_i^{(+)}$: controls the individual weight of each training instance of the source language

$T$: a matrix mapping embeddings of the source language into the target language's vector space, obtained via Bilingual Lexicon Induction
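The TMWP objective shown on this slide is an image; a plausible reconstruction consistent with the stated roles of $\gamma$, $\delta_i^{(+)}$, and $T$ (the precise combination is our assumption) weights target-language pairs against translated source-language pairs:

```latex
\min_{M_k^{(+)}}\;
\gamma \sum_{(y_i,\, z_i^{(+)}) \in E_T^{(+)}} w_{i,k}
  \bigl\| M_k^{(+)} \vec{y}_i - \vec{z}_i^{(+)} \bigr\|_2^2
\;+\;
(1-\gamma) \sum_{(y_j,\, z_j^{(+)}) \in E_S^{(+)}} \delta_j^{(+)}\, w_{j,k}
  \bigl\| M_k^{(+)} T \vec{y}_j - T \vec{z}_j^{(+)} \bigr\|_2^2
\qquad \text{s.t. } M_k^{(+)\top} M_k^{(+)} = I
```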

SLIDE 16

Cross-lingual Models (3)

  • Transfer MWP Model (TMWP)

– Hypernymy projections in TMWP can also be converted into a high-dimensional Wahba's problem.
– An SVD-based closed-form solution applies, analogous to the monolingual case.

SLIDE 17

Cross-lingual Models (4)

  • Transfer MWP Model (TMWP)

– Learning non-hypernymy projections
– Training the projection-based neural network

SLIDE 18

Cross-lingual Models (5)

  • Iterative Transfer MWP Model (ITMWP)

– Employ semi-supervised learning for training set augmentation (see the sketch below)
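A hedged sketch of the augmentation loop (the confidence threshold `tau`, the round count, and the `fit_tmwp`/`predict_proba` interface are our assumptions, not the paper's specification):

```python
# Self-training for ITMWP: repeatedly train TMWP, score the target
# language's unlabeled pairs, and move high-confidence predictions
# into the labeled training sets.
def itmwp(train_pos, train_neg, unlabeled, fit_tmwp, n_rounds=5, tau=0.9):
    """fit_tmwp: callable returning a model with predict_proba(pair)."""
    for _ in range(n_rounds):
        model = fit_tmwp(train_pos, train_neg)
        remaining = []
        for pair in unlabeled:
            p_hyper = model.predict_proba(pair)   # P(hypernymy)
            if p_hyper >= tau:
                train_pos.append(pair)            # augment positives
            elif p_hyper <= 1 - tau:
                train_neg.append(pair)            # augment negatives
            else:
                remaining.append(pair)            # keep for next round
        unlabeled = remaining
    return fit_tmwp(train_pos, train_neg)
```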

SLIDE 19

Monolingual Experiments (1)

  • Task 1: Supervised hypernymy detection

– MWP outperforms the state of the art on two benchmark datasets (BLESS and ENTAILMENT)

SLIDE 20

Monolingual Experiments (2)

  • Task 1: Supervised hypernymy detection

– MWP outperforms the state of the art on three domain-specific datasets derived from existing domain-specific taxonomies

SLIDE 21

Monolingual Experiments (3)

  • Task 2: Unsupervised hypernymy classification

– Hypernymy measure: $\tilde{t}(y_i, z_i) = \mathcal{F}^{(-)}(\vec{y}_i, \vec{z}_i)^2 - \mathcal{F}^{(+)}(\vec{y}_i, \vec{z}_i)^2$ (see the sketch below)

Evaluation settings: hypernymy vs. reverse-hypernymy, and hypernymy vs. other relations
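A small sketch of how such a measure could be computed from the learned projections (this realization, scoring a pair by its best residual under each projection family, is our assumption rather than the paper's exact definition of $\mathcal{F}^{(\pm)}$):

```python
import numpy as np

# Score one word pair: small residual under the hypernymy projections
# and large residual under the non-hypernymy projections yields a
# high (positive) hypernymy score.
def hypernymy_score(y, z, proj_pos, proj_neg):
    f_pos = min(np.linalg.norm(M @ y - z) for M in proj_pos)
    f_neg = min(np.linalg.norm(M @ y - z) for M in proj_neg)
    return f_neg**2 - f_pos**2   # > 0: the pair looks like hypernymy
```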

SLIDE 22

Cross-lingual Experiments (1)

  • Dataset Construction

– English dataset (source-language training set): combines five human-labeled datasets

  • 17,394 hypernymy relations
  • 67,930 non-hypernymy relations

– Other languages: derived from the Open Multilingual Wordnet project

  • 20% for training, 20% for development, and 60% for testing

Target languages: French, Chinese, Japanese, Italian, Thai, Finnish, Greek

SLIDE 23

Cross-lingual Experiments (2)

  • Task 1: Cross-lingual hypernymy direction classification

– hypernymy vs. reverse-hypernymy

SLIDE 24

Cross-lingual Experiments (3)

  • Task 2: Cross-lingual hypernymy detection

– hypernymy vs. non-hypernymy

SLIDE 25

Conclusion

  • Models

– Monolingual hypernymy prediction: MWP
– Cross-lingual hypernymy prediction: TMWP & ITMWP

  • Results

– State-of-the-art performance in monolingual experiments
– Highly effective in cross-lingual experiments

  • Future Work

– Predicting multiple types of semantic relations across multiple languages
– Improving cross-lingual hypernymy prediction via multi-lingual embeddings

SLIDE 26

Thank You!

Questions & Answers