Story Generation From Knowledge Graphs Patrick Saad Referee: Prof. - - PowerPoint PPT Presentation

story generation from knowledge graphs
SMART_READER_LITE
LIVE PREVIEW

Story Generation From Knowledge Graphs Patrick Saad Referee: Prof. - - PowerPoint PPT Presentation

Story Generation From Knowledge Graphs Patrick Saad Referee: Prof. Dr. Benno Stein Referee: Prof. Dr. Norbert Siegmund Master Thesis | SoSe19 | Bauhaus-Universitt Weimar The Research Problem Knowledge Graph Document Collection MATCH


slide-1
SLIDE 1

Story Generation From Knowledge Graphs

Patrick Saad Referee: Prof. Dr. Benno Stein Referee: Prof. Dr. Norbert Siegmund

Master Thesis | SoSe19 | Bauhaus-Universität Weimar

slide-2
SLIDE 2

Query Language (Cypher, SPARQL)

MATCH (a:Author)-[r1:AUTHOR_IN]->(p1:Paper)-[r2:CITED_BY]->(p2:Paper) WHERE p1.year = 2019 WITH a, r1, p1, r2, p2 RETURN a.name AS author, count(r2) AS total ORDER BY total DESC

The Research Problem

1

Keyword query to graph query

Maybe Unavailable Coherent Text Subjective Intuitive Objective Available Raw Results Hard

Google Search google.com Story Generation

Knowledge Graph Document Collection

Making search knowledge graphs like searching the web Query Results

slide-3
SLIDE 3

Provide users with a visual method to formulating queries using facets

Query Results User

Related Work | Faceted Search Interfaces

2

Query Languages Knowledge Graph

Faceted Search

Facets

slide-4
SLIDE 4

Semantic Scholar semanticscholar.com

Faceted search interfaces provides query simplification using facets Complex queries are still hard to formulate (Author + Year + ”Top”) Filtered results contain implicit insights

Related Work | Faceted Search Interfaces

3

slide-5
SLIDE 5

Social Network Analysis | Centrality, Louvain Algorithm, etc.. Wolfram Alpha - wolframalpha.com Distant Reading | Influential Authors In Literature Illustration by Joon Mo Kang, Stanford Literary Lab

Find relationship patterns, influential entities, outliers

Related Work | Social Network Analysis, Distant Reading

4

slide-6
SLIDE 6

Automatically generate stories from data

Valtteri, the Finnish Municipal Election Bot vaalibotti.fi

Problems ➔ News reporting without in-depth analysis ➔ Insights are still implicit (influential entities?) ➔ Natural Language Processing ➔ Natural Language Generation ➔ Story Templates

750 000 articles

Related Work | Automated Journalism

Facets such as Location, Candidate,

  • r Party

5

slide-7
SLIDE 7

2 Neo4j https://neo4j.com 3 Cypher https://neo4j.com/developer/cypher-query-language

Semantic Scholar Open Research Corpus 45 million papers (Computer Science, Neuroscience, Biomedical)

Subset from our knowledge graph built using Neo4j 2 and Cypher 3

Story Generation Framework | Use Case

Knowledge Graph Setup

(1) Select all papers with a specific author A

A

(2) Recursively get incoming/outgoing citations

A 549,066 Papers, 8124 Authors and 632 Journals

Our graph model

6

slide-8
SLIDE 8

1 https://neo4j.com/developer/graph-algorithms

Construct graph queries that compute social performance and influence metrics Neo4j’s graph algorithms library 1 Betweenness Centrality, PageRank, etc..

Story Generation Framework | Use Case

Insight Discovery

Discovering insights from social relationships

Total Direct Relationships Paper Citations, Author Collaborations, etc.. Statistics from facets of directly connected nodes Total/Min/Max/Avg Author h-index, Paper Citations, etc.. Total Indirect Relationships Nested Paper Citations, Nested Author Collaborations, etc..

7

slide-9
SLIDE 9

Paper Author Journal Total Numerical facet analysis 8 5 9 22 Time-filtered numerical facet analysis 448 488 Numerical facet correlation analysis 28 10 36 74 Weaver performance analysis 1 1 1 3 Total 485 16 46 547

Total stories by story type for different entity types

Story Generation Framework | Use Case

Story Generation

Story Templates 2 templates Story Content Introduction Data overview using statistics Top performing entities Plot graphs Story Types 4 different story types based on the available facets Story Types Automatically generate stories to communicate the insights

8

slide-10
SLIDE 10

Example Story Template | Search Results and Knowledge Box

Weaver User Interface | Search

Knowledge Box provides additional graph insights

13

slide-11
SLIDE 11

Example Story Template | Search Results - Knowledge Box and all facet ranks

Weaver User Interface | Knowledge Box

Top Connected Entities Separate entity ranking for every social metric

14

slide-12
SLIDE 12

Different insights can reveal different kinds of social influence

Weaver User Interface | Knowledge Box

Community impact from several aspects

15

slide-13
SLIDE 13

Weaver User Interface | Story Templates

9

weaver.webis.de

slide-14
SLIDE 14

Weaver User Interface | Story Templates

Title and Introduction sections

Title Introduction (Dataset info, Metric description)

10

slide-15
SLIDE 15

Data Overview section

Weaver User Interface | Story Templates

Statistical Overview

11

slide-16
SLIDE 16

Top performing entities section

Weaver User Interface | Story Templates

Entities ranked by their facet performance Interconnected Stories, Entities, and Search Results via hyperlinks

12

slide-17
SLIDE 17

Story Generation Framework | Use Case

Evaluation using CSUQ

Question Category Mean Standard Deviation System Use (questions 1-8) 1.28 0.40 Information Quality (questions 9-15) 0.72 0.33 Interface Quality (questions 16-18) 1.07 0.22 Overall (questions 1 and 19) 1.70 0.04

  • 3
  • 2

1 1 2 3 Strongly disagree Strongly agree 5 participants (expert users)

16

slide-18
SLIDE 18

Bigger knowledge graph using the cluster (more resources, framework modifications) Generate additional insights (social network analysis, graph theory, etc..) Improve story titles and content (natural language generation, interactive storytelling, ) Better search results ranking

Story Generation From Knowledge Graphs

Future Work

Improve the search interface (keyword query to graph query, iterative usability testing)

17

slide-19
SLIDE 19

Story Generation from Knowledge Graphs

Patrick Saad Referee: Prof. Dr. Benno Stein Referee: Prof. Dr. Norbert Siegmund

Master Thesis | SoSe19 | Bauhaus-Universität Weimar