Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from - PowerPoint PPT Presentation

Apr 06, 2024 •300 likes •415 views

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, Joydeep Ghosh Presented by Joseph Schlecht Problem Description Modern data-mining techniques discover large

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, Joydeep Ghosh Presented by Joseph Schlecht
Problem Description • Modern data-mining techniques discover large number of relationships (rules) – Antecedent ‡ Consequent • Few may actually be of interest – CS job hunting: SQL ‡ database • How do we find rules that are interesting and novel ? • Notice this is subjective
Problem Formalization • Authors consider text mining – Rules consist of words in natural language • Use WordNet and define semantic distance between two words • Novelty is defined w.r.t the semantic distance between words in the antecedent and consequent of a rule
Semantic Distance Given words w i and w j , d ( w i , w j ) = Dist ( P ( w i , w j )) + K * Dir ( P ( w i , w j )) • Dist ( p ) is the distance along path p – Weighted by relation type (15 in WordNet) • Dir ( p ) is the number of directional changes on p – Defined 3 directions according to relation type • K is a chosen constant
Weight and Direction Info Relation Weight Direction Synonym, Attribute, Pertainym, 0.5 Horizontal Similar Antonym 2.5 Horizontal Hypernym, (Member|Part|Substance), 1.5 Up Meronym Hyponym, (Member|Part|Substance) 1.5 Down Holonym, Cause, Entailment
Novelty • For each rule, a score of novelty is generated • Let A = {set of antecedent words} and C = {set of consequent words} in a given rule • For each word w i in A and w j in C – Score( w i , w j ) fl d ( w i , w j ) • Score of rule = average of all ( w i , w j ) scores
Experiment • Measure success by comparing the heuristic’s results of novelty scoring to humans’ • Used rules generated by DiscoTEX from 9000 Amazon.com book descriptions • Four random samples of 25 rules were made • Four groups of humans scored each sample – 0.0 (least interesting) to 10.0 (most interesting) • One set was used as training for the heuristic (to find K ), the other three were used for experiments
Results Human-Human Heuristic-Human Correlation Correlation Raw Rank Raw Rank Group1 0.350 0.338 0.187 0.137 Group2 0.412 0.393 0.386 0.363 Group3 0.337 0.339 0.339 0.338 Raw = Pearson’s Raw Score Rank = Spearman’s Ranks Score
Results (cont) Example of rules scored by the heuristic • High Score (9.5) romance love heart ‡ midnight • Medium Score (5.8) author romance ‡ characters love • Low Score (1.9) Astronomy science ‡ space
Discussion • Humans rarely agreed with each other • Correlation between heuristic and human was similar to human-human correlation – Success, but not too meaningful • Provided statistical evidence that correlation is unlikely due to random chance • Future tests would use dataset that had higher human-human correlation

Recommend

#@&*$% The Power of Novelty Novelty is experiencing the familiar in a new light A Recipe for

THE #@&*$% The Power of Novelty Novelty is experiencing the familiar in a new light A Recipe for Action B ehavior = M otivation +A bility +T rigger BJ Fogg Model Soil Your Undies Novelty and the Peak-End Rule Novelty and the

438 views • 8 slides

Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical

A Robust Approach to Aligning Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical Resource Why combine resources? e.g., named entities, new senses dozens of new languages

1.18k views • 64 slides

5. Novelty & Diversity Outline 5.1. Why Novelty & Diversity? 5.2. Probability Ranking

5. Novelty & Diversity Outline 5.1. Why Novelty & Diversity? 5.2. Probability Ranking Principled Revisited 5.3. Implicit Diversification 5.4. Explicit Diversification 5.5. Evaluating Novelty & Diversity Advanced Topics in

504 views • 38 slides

Proof of Novelty A distributed consensus mechanism for securing content novelty Daniel Severo

Motivation Trustworthiness Proof of Novelty Take-away points Open Questions and Future Work Proof of Novelty A distributed consensus mechanism for securing content novelty Daniel Severo Independent Scientist Virtual Design Challenge The

720 views • 36 slides

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction B. Lexical hierarchies C. Lexical motivation D. Syntagmatic axis E. Outlook Koch, Lexical typology, 2010-8-24 1. The problem of the tertium

847 views • 69 slides

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3.

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3. Semantic Analysis 4. Optimization 5. Code Generation Alex Aiken Lexical Analysis if (i == j) Z = 0; else Z = 1; \tif (i == j)\n\t\tz =

231 views • 9 slides

2020-07-29_SHPWG_Issue1-Themes Address Calibrate, dynamics of Review the Evaluate Evaluate

2020-07-29_SHPWG_Issue1-Themes Address Calibrate, dynamics of Review the Evaluate Evaluate the Evaluate Evaluate evaluate, and habitat, outcomes of sediments Consider Evaluate flow path and COIs for the temperatures validate the

665 views • 43 slides

Novel Is Not Always Better: On the Relation between Novelty and Dominance Pruning Joschka Gro,

Classical Planning Novelty Dominance Relation Novelty Heuristics Conclusions Novel Is Not Always Better: On the Relation between Novelty and Dominance Pruning Joschka Gro, Alvaro Torralba, Maximilian Fickert Classical Planning

1.13k views • 68 slides

Seek Novelty Personality Environment Predictable Unpredictable Seek Stability Seek Novelty

Seek Novelty Personality Environment Predictable Unpredictable Seek Stability Seek Novelty Predictable Unpredictable Seek Stability ? X ? ? 1. Establish 1RM (performance) 2. Plan the training block/phase 3. Rinse and repeat How to

728 views • 60 slides

Patent Law Prof. Roger Ford September 28, 2016 Class 7 Novelty: (AIA) 102(a)(1) prior

Patent Law Prof. Roger Ford September 28, 2016 Class 7 Novelty: (AIA) 102(a)(1) prior art Recap Recap Novelty: introduction Anticipation: the basics Accidental anticipation Todays agenda Todays agenda Novelty

605 views • 28 slides

LEXICAL TYPOLOGY LEXICAL TYPOLOGY Peter Koch (Part II) Department of Romance Studies, Tbingen

LEXICAL TYPOLOGY LEXICAL TYPOLOGY Peter Koch (Part II) Department of Romance Studies, Tbingen University peter.koch@uni-tuebingen.de http://homepages.unituebingen.de/peter.koch/index.htm Koch, Lexical typology, 2010825 1 6. Lexical

765 views • 54 slides

LEXICAL SEMANTICS LEXICAL SEMANTICS CS 224N 2011 Gerald Penn Slides largely adapted from

LEXICAL SEMANTICS LEXICAL SEMANTICS CS 224N 2011 Gerald Penn Slides largely adapted from ones by Christopher Manning, Massimo Poesio, Ted Pedersen, Dan Jurafsky, and Jim Martin 1 Lexical information and NL applications Lexical

732 views • 72 slides

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program (a sequence of characters) into a sequence of tokens . get token lexical source parse parser program tree analyzer token Lexical

601 views • 33 slides

Lexical analysis Lexical analysis Lexical analysis checks the correctness of program words and

Lexical analysis Lexical analysis Lexical analysis checks the correctness of program words and transforms a program to the stream of tokens: removes empty symbols and commentaries; identifies keywords, indentifiers and literal

402 views • 37 slides

Introduction to Lexical Analysis Outline Informal sketch of lexical analysis

Introduction to Lexical Analysis Outline Informal sketch of lexical analysis Identifies tokens in input string Issues in lexical analysis Lookahead Ambiguities Specifying lexical analyzers (lexers) Regular

700 views • 49 slides

Patent Law Prof. Roger Ford February 29, 2016 Class 7 Novelty: public knowledge, use,

Patent Law Prof. Roger Ford February 29, 2016 Class 7 Novelty: public knowledge, use, and publication Recap Recap Novelty: introduction Anticipation: the basics Accidental anticipation Todays agenda Todays agenda

793 views • 32 slides

How to Give an Effective Presentation Jannette Collins, MD, MEd, FCCP University of Wisconsin

How to Give an Effective Presentation Jannette Collins, MD, MEd, FCCP University of Wisconsin Hospital and Clinics Introduction When asked for a definition of CME (continuing medical education), many physicians will describe a short course with

195 views • 4 slides

Pipeline Safety Administrative Rules March 2011 The South Dakota legal landscape SD STATUTES

Pipeline Safety Administrative Rules March 2011 The South Dakota legal landscape SD STATUTES ADMINISTRATIVE RULES Created and changed 49-34A-4: The Commission through legislative action may write rules that pertain to: only

338 views • 17 slides

Year 8 Parental Information Evening Preparing for the Year 8 Exams This evenings presenters

Year 8 Parental Information Evening Preparing for the Year 8 Exams This evenings presenters Peter Morris How to Revise Effectively Victoria Phelps Year 8 Exams in English Mike Penhale Year 8 Exams in Maths

650 views • 52 slides

3 WORDS PACKED WITH MEANING NEW WORSHIPING COMMUNITY SEEKING TO MAKE AND FORM NEW DISCIPLES OF

3 WORDS PACKED WITH MEANING NEW WORSHIPING COMMUNITY SEEKING TO MAKE AND FORM NEW DISCIPLES OF GATHERED BY THE SPIRIT TO MEET JESUS CHRIST IN PRACTICING MUTUAL CARE AND ACCOUNTABILITY JESUS CHRIST WORD AND SACRAMENT DEVELOPING

359 views • 7 slides

Date: 14 April 2013 Maserati Trofeo World Series - Championship Presentation The ever-more

Date: 14 April 2013 Maserati Trofeo World Series - Championship Presentation The ever-more international Maserati Trofeo MC World Series The fourth season in the Maserati Trofeo MC World Series is soon to get underway. This will be the second

243 views • 3 slides

Environmental Health and your business SENIOR ENVIRONMENTAL HEALTH OFFICER How does our role

Environmental Health and your business SENIOR ENVIRONMENTAL HEALTH OFFICER How does our role affect your business ? If you sell food then you need to be registered with us. This is free to do . This allows us to know that you are

461 views • 18 slides

Adaptation in a Sea of Uncertainty Sea-Level Rise Planning at the Local Level Jason M. Evans,

Adaptation in a Sea of Uncertainty Sea-Level Rise Planning at the Local Level Jason M. Evans, Ph.D. Assistant Professor of Environmental Science Stetson University November 16, 2016 Florida Water and Climate Alliance Arcadia, FL Stormwater

610 views • 45 slides

1 2 3 4 5 6 7 8 ELIZABETH VAN CLIEF A Professional Corporation 401 B Street, Suite 2400

1 2 3 4 5 6 7 8 ELIZABETH VAN CLIEF A Professional Corporation 401 B Street, Suite 2400 San Diego, California 92101 (619) 239-1211 evanclief@hplawsd.com AREAS OF PRACTICE State and Federal Taxation of Pension Plans Qualified

453 views • 9 slides

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from - PowerPoint PPT Presentation

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, Joydeep Ghosh Presented by Joseph Schlecht Problem Description Modern data-mining techniques discover large

#@&*$% The Power of Novelty Novelty is experiencing the familiar in a new light A Recipe for

Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical

5. Novelty & Diversity Outline 5.1. Why Novelty & Diversity? 5.2. Probability Ranking

Proof of Novelty A distributed consensus mechanism for securing content novelty Daniel Severo

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3.

2020-07-29_SHPWG_Issue1-Themes Address Calibrate, dynamics of Review the Evaluate Evaluate

Novel Is Not Always Better: On the Relation between Novelty and Dominance Pruning Joschka Gro,

Seek Novelty Personality Environment Predictable Unpredictable Seek Stability Seek Novelty

Patent Law Prof. Roger Ford September 28, 2016 Class 7 Novelty: (AIA) 102(a)(1) prior

LEXICAL TYPOLOGY LEXICAL TYPOLOGY Peter Koch (Part II) Department of Romance Studies, Tbingen

LEXICAL SEMANTICS LEXICAL SEMANTICS CS 224N 2011 Gerald Penn Slides largely adapted from

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program

Lexical analysis Lexical analysis Lexical analysis checks the correctness of program words and

Introduction to Lexical Analysis Outline Informal sketch of lexical analysis

Patent Law Prof. Roger Ford February 29, 2016 Class 7 Novelty: public knowledge, use,

How to Give an Effective Presentation Jannette Collins, MD, MEd, FCCP University of Wisconsin

Pipeline Safety Administrative Rules March 2011 The South Dakota legal landscape SD STATUTES

Year 8 Parental Information Evening Preparing for the Year 8 Exams This evenings presenters

3 WORDS PACKED WITH MEANING NEW WORSHIPING COMMUNITY SEEKING TO MAKE AND FORM NEW DISCIPLES OF

Date: 14 April 2013 Maserati Trofeo World Series - Championship Presentation The ever-more

Environmental Health and your business SENIOR ENVIRONMENTAL HEALTH OFFICER How does our role

Adaptation in a Sea of Uncertainty Sea-Level Rise Planning at the Local Level Jason M. Evans,

1 2 3 4 5 6 7 8 ELIZABETH VAN CLIEF A Professional Corporation 401 B Street, Suite 2400

Sambuz

Useful Links

Newsletter

Mail Us

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from - PowerPoint PPT Presentation

Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text Sugato Basu, Raymond J. Mooney, Krupakar V. Pasupuleti, Joydeep Ghosh Presented by Joseph Schlecht Problem Description Modern data-mining techniques discover large

#@&amp;*$% The Power of Novelty Novelty is experiencing the familiar in a new light A Recipe for

Heterogeneous Lexical Resources MultiJEDI ERC 259234 Lexical Resource Lexical Resource Lexical

5. Novelty &amp; Diversity Outline 5.1. Why Novelty &amp; Diversity? 5.2. Probability Ranking

Proof of Novelty A distributed consensus mechanism for securing content novelty Daniel Severo

LEXICAL TYPOLOGY Peter Koch (Part I) Koch, Lexical typology, 2010-8-24 A. General introduction

Compilers Lexical Analysis Alex Aiken Lexical Analysis 1. Lexical Analysis 2. Parsing 3.

2020-07-29_SHPWG_Issue1-Themes Address Calibrate, dynamics of Review the Evaluate Evaluate

Novel Is Not Always Better: On the Relation between Novelty and Dominance Pruning Joschka Gro,

Seek Novelty Personality Environment Predictable Unpredictable Seek Stability Seek Novelty

Patent Law Prof. Roger Ford September 28, 2016 Class 7 Novelty: (AIA) 102(a)(1) prior

LEXICAL TYPOLOGY LEXICAL TYPOLOGY Peter Koch (Part II) Department of Romance Studies, Tbingen

LEXICAL SEMANTICS LEXICAL SEMANTICS CS 224N 2011 Gerald Penn Slides largely adapted from

Lesson 2 Lexical Analysis CS 226/326 Spring 2003 Lexical Analysis Transform source program

Lexical analysis Lexical analysis Lexical analysis checks the correctness of program words and

Introduction to Lexical Analysis Outline Informal sketch of lexical analysis

Patent Law Prof. Roger Ford February 29, 2016 Class 7 Novelty: public knowledge, use,

How to Give an Effective Presentation Jannette Collins, MD, MEd, FCCP University of Wisconsin

Pipeline Safety Administrative Rules March 2011 The South Dakota legal landscape SD STATUTES

Year 8 Parental Information Evening Preparing for the Year 8 Exams This evenings presenters

3 WORDS PACKED WITH MEANING NEW WORSHIPING COMMUNITY SEEKING TO MAKE AND FORM NEW DISCIPLES OF

Date: 14 April 2013 Maserati Trofeo World Series - Championship Presentation The ever-more

Environmental Health and your business SENIOR ENVIRONMENTAL HEALTH OFFICER How does our role

Adaptation in a Sea of Uncertainty Sea-Level Rise Planning at the Local Level Jason M. Evans,

1 2 3 4 5 6 7 8 ELIZABETH VAN CLIEF A Professional Corporation 401 B Street, Suite 2400

Sambuz

Useful Links

Newsletter

Mail Us

#@&*$% The Power of Novelty Novelty is experiencing the familiar in a new light A Recipe for

5. Novelty & Diversity Outline 5.1. Why Novelty & Diversity? 5.2. Probability Ranking