Bringing Your Content to the User, not the User to Your Content A - PowerPoint PPT Presentation

Bringing Your Content to the User, not the User to Your Content – A lightweight approach towards integrating external content via the EEXCESS framework Martin Höffernig, Werner Bailer JOANNEUM RESEARCH SWIB 2015, Hamburg, 2015-11-23

Outline (1) • Introduction to EEXCESS • Tools for content injection – Install & try Chrome plugin • Integrating a new data provider – Introduction to the data model – PartnerWizard – Integrate data provider with a web-based tool 2

Outline (2) • Refining data mapping – Introduction to mapping tool – Review and update mappings – Test and check mappings • Metadata quality assessment – Checking input and mapping quality 3

Logistics • Wifi – SSID: SWIB* – Password: berners-lee • Coffee break 15.30-16.00 • Short breaks in each of the blocks before & after (flexible timing) Seite 4

Materials Links, examples etc. http://eexcess-dev.joanneum.at/swib15.html Accounts: see handout Slides: will be made available on EEXCESS website Seite 5

EEXCESS - Enhancing Europe’s eXchange in Cultural Educational and Scientific resourceS • EU FP7 project (Feb. 2013-Jul. 2016) • 10 partners – technical partners – scientific partners – cultural institutions 6

Overview

Motivation • Vast amounts of digital cultural and scientific resources available • Still memory organisations (i.e. library, museums, archives) face challenges in disseminating their content • Two reasons, addressed by EEXCESS: – Todays content dissemination processes are optimised for mainstream content – Long tail content needs contextualisation Seite 2

Motivation • Content provider strategies – Dedicated portals – Search engine optimisation – Social network marketing • User strategies – Use major search engines – Use Wikipedia 3

The Long Tail Content • Few sites get a large share of visits 250.000.000 • Large number of sites get a low share of visits • A big, short “head”, but a (very) long tail Avg. Monthly Visitors (USA, 2014) 200.000.000 150.000.000 Challenges of the Long Tail 100.000.000 • High specialisation • Low contextualisation 50.000.000 • Most items are unrelated • Not easy to consume 0 • Low # of users per item 1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 Rank of the Web site Seite 4

The value of long tail content Alumni of Programming Trinity College The “first” computer Lord Byron Economics Language Cambridge invented Ada Charles named Lovelace Alumni of Babbage after daughter of worked with The “Babbage Principle” Value of Long Tail Content Scholarly content Cultural Heritage content Discover new knowledge • • Discourse • Multimedia Artefacts Verify information • Validated facts • • Original Material • Additional explanations • Explanations Enrich other content • 5

Long Tail content dissemination Challenges of today‘s methods 250.000.000 Challenges • Competition with mainstream content • Avg. Monthly Visitors (USA, 2014) Highly commercialised 200.000.000 • Unawareness of existing portals • Content is not contextualised 150.000.000 • User triggered 100.000.000 50.000.000 0 1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 Rank of the Web site Search Engine Optimization Social Media Marketing etc. Seite 6

EEXCESS Vision Unfold the treasure of cultural heritage and scholarly long-tail content for • discovering new knowledge, • triggering serendipitous effects, • verifying consumed information, • enriching new content b y “bringing the content to the user, not the user to the content” 7

Approach Idea „Bring the content to the user, not the user to the content“ • Inject cultural and scientific content into existing web channels – Websites (Wikipedia, etc.) 250.000.000 – CMS/LMS Avg. Monthly Visitors (USA, 2014) – Social media channels 200.000.000 (Twitter, etc.) – Support “head - channels” 150.000.000 as well as tail-channels • Contextualise Long Tail content 100.000.000 – Context of the web channel 50.000.000 – User Context – User Task 0 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 Rank of the Web site • Gather user and usage feedback such that memory organisations can optimise their resource distribution

Approach Overview Involved in Involved in Content Consumption Content Creation context (e.g. Browsing, SNA) (e.g. Writing Blogs, Editors) content content Recommendation content ZBW Mendeley AMBL CT Open Europeana Content Content Content Content Access

Approach Test Beds 3 User Groups as Test Beds • Educational Support - Cultural/scientific resources injected to LMS - Pupils, teachers • Scholarly Communication - Interconnecting cultural and scientific resources - Students, lecturers, researchers • General Public Education – Disseminate cultural/scientific content to the general public – Regionally interested users, culturally interested users, media consumers Seite 10

Objectives • Adaptive Augmentation User Interfaces • Personalized Recommendation • Integration and Enrichment • User and Usage Mining • Privacy Preservation Seite 11

Architecture • Distributed data storage – Data remains with data providers – No central index • Partner Recommender – Interface between data provider’s API and EEXCESS system • Federated Recommender – Aggregates and ranks results Seite 12

Architecture Seite 13

Recommendation flow 14

Recommendation flow • Implications from architecture – transformation and enrichment must work on the fly – configuration can be checked and revised manually, but transformation results cannot – no issues due to enrichment with resources that are no longer available 15

Querying partner sites • Two step process – Speed up retrieving initial results – Reduce load on partner sites • Initial query – Get basic metadata of entries • Detail query – Additional metadata – Images 16

Metadata Enrichment • Enriching textual information with named entities • Type of metadata field is used to constrain entity type (e.g. persons) – search for entities with appropriate type • Classify if words are entities in DBpedia • Add synonyms using WordNet • Add connected geographic terms using GeoNames 17

Content Injection – Chrome Browser Extension Content Consumption • A sidebar for recommending cultural/scientific content while browsing Seite 18

Content Injection – Content Management Plugin (Wordpress) Content Creation • Inject cultural heritage and scholarly content into social media creation process • Multiplier effect in the Blogging Community by providing a Wordpress Plugin Seite 19

Content Injection – Google Docs App Content Creation • Inject cultural heritage and scholarly content into collaborative word processing • Support writing reports, grant requests, homeworks • Google Apps Market for Google Documents as high-potential dissemination platform Seite 20

Content Injection – Collection Management System 21

Content Injection – Collection Management System 22

Content Injection – Learn Management Systems Content Creation for Educational Support • Inject cultural heritage content into Learn Management Systems • Moodle and BitMedia‘s SITOS LMS Seite 23 �

Privacy vs. Personalisation trade-off? Privacy Personalisation/Quality 24

Privacy vs. Personalisation trade-off? Privacy Personalisation/Quality 25

Privacy vs. Personalisation trade-off? User Awareness (and Transparency) User Empowerment User Privacy Protection (Privacy Proxy) 26

PEAS: Unlinkability Protocol • PEAS: Private, Efficient, and Accurate web Search • Hypothesis – only the user’s device is trusted • Split the Privacy Proxy into two pieces – Receiver: knows the user, but not the content of the query – Issuer: knows the content of the query, but not the user – Both are supposed “honest but curious” and do not collude Page 27

PEAS: Unlinkability Protocol (simplified) Privacy Proxy a a ’ u :User Receiver Issuer FR b=generateKey() q’=encrypt a ( q +b) q ’ q ’ q+b=decrypt a’ (q’) q R R’=encrypt b (R) R’ R’ R =decrypt b (R’) 28

PEAS: Indistinguishability Protocol (simplified) • Protocol divided into two parts – Obfuscation (done at the user’s side): add fake queries • to mislead attackers, fake queries have the same structure as the original one, are built other users’ queries, but are semantically different from the original query – Filtering: remove irrelevant results Page 29

PEAS: Indistinguishability Protocol (simplified) Privacy Proxy User FR q+ = obfuscation( q ) q+ q+ R+ R+ R =filtering(R+) Page 30

PEAS: Combination of Protocols User q+ = obfuscation(q) R+ = unlinkability(q+) R = filtering(R+) Page 31

Privacy Settings • Transparent to user • Choice which information to expose • Choice to switch on/off different privacy features 32

Data Model

Bringing Your Content to the User, not the User to Your Content A - PowerPoint PPT Presentation

Bringing Your Content to the User, not the User to Your Content A lightweight approach towards integrating external content via the EEXCESS framework Martin Hffernig, Werner Bailer JOANNEUM RESEARCH SWIB 2015, Hamburg, 2015-11-23 Outline

PanMedia Bringing it all together FIRST Vancouver 2008 PanMedia Bringing it all Together

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian

Bringing Our Children Home Act and Five Nations Template Laws on Children and Families Bringing

Peering and CDNs Arturo Servin Google Imagine youre a Content Provider Content Provider

Summary User-centric Social Social Multimedia Multimedia Computing From Users: user-perceptive

voice Kate Howland End-user programming? End-user programming? End-user programming?

NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR

User Pays User Committee User Pays User Committee 8 th August 2011 1 2 Agenda

Gillian Dick - Glasgow City Council Bringing cities to life, Bringing life into cities.

Bringing Shrek to Life: Software Bringing Shrek to Life: Software Testing at DreamWorks Testing

Bringing Riak to the Mobile Platform Kresten Krab Thorup Hacker @drkrab Bringing Riak to the

SecureCDN: Providing End-to-End Security in Content Delivery Networks Stephen Herwig University

Why Im NOT Why Im NOT Why Im NOT Why Im NOT a Hindu Why Im NOT a Hindu

Screen 1 Go to www.myenroll.com < Click Request User ID and Password> Acquire USER ID and

User Datagram Datagram Protocol (UDP) Protocol (UDP) User Srinidhi Varadarajan UDP: The User

UX/UI What is UX and UI? UX Process User Research User Research Creating User

.NET Remoting Module Subtitle Building distributed applications and implementing

Chapter: 16 p 16 LTE Time-Frequency structure Downlink L1/L2 control signaling/processing

Secure and Usable Out-Of-Band Channels for Ad hoc Mobile Device Interactions Ronald Kainda, Ivan

CDW CORPORATION Second Quarter 2014 Webcast Conference Call July 31, 2014 www.cdw.com |

Introduction to Constrained Estimation Graham C. Goodwin September 2004 Centre for Complex

Channel Assignment and Channel Hopping in IEEE 802.11 Operating Channels for 802.11b Europe

Critique #2 M. Sha, G. Hackmann and C. Lu, Real-world Empirical Studies on Multi- Channel

Rob Havelt Black Hat Europe, 2009 Greetings Black Hat Rob Havelt rhavelt@trustwave.com

Sambuz

Useful Links

Newsletter

Mail Us

Bringing Your Content to the User, not the User to Your Content A - PowerPoint PPT Presentation

Bringing Your Content to the User, not the User to Your Content A lightweight approach towards integrating external content via the EEXCESS framework Martin Hffernig, Werner Bailer JOANNEUM RESEARCH SWIB 2015, Hamburg, 2015-11-23 Outline

PanMedia Bringing it all together FIRST Vancouver 2008 PanMedia Bringing it all Together

RUN groupadd -r user &amp;&amp; useradd -r -g user user USER user $ docker run --read-only debian

Bringing Our Children Home Act and Five Nations Template Laws on Children and Families Bringing

Peering and CDNs Arturo Servin Google Imagine youre a Content Provider Content Provider

Summary User-centric Social Social Multimedia Multimedia Computing From Users: user-perceptive

voice Kate Howland End-user programming? End-user programming? End-user programming?

NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR REPRODUCTION NOT FOR

User Pays User Committee User Pays User Committee 8 th August 2011 1 2 Agenda

Gillian Dick - Glasgow City Council Bringing cities to life, Bringing life into cities.

Bringing Shrek to Life: Software Bringing Shrek to Life: Software Testing at DreamWorks Testing

Bringing Riak to the Mobile Platform Kresten Krab Thorup Hacker @drkrab Bringing Riak to the

SecureCDN: Providing End-to-End Security in Content Delivery Networks Stephen Herwig University

Why Im NOT Why Im NOT Why Im NOT Why Im NOT a Hindu Why Im NOT a Hindu

Screen 1 Go to www.myenroll.com &lt; Click Request User ID and Password&gt; Acquire USER ID and

User Datagram Datagram Protocol (UDP) Protocol (UDP) User Srinidhi Varadarajan UDP: The User

UX/UI What is UX and UI? UX Process User Research User Research Creating User

.NET Remoting Module Subtitle Building distributed applications and implementing

Chapter: 16 p 16 LTE Time-Frequency structure Downlink L1/L2 control signaling/processing

Secure and Usable Out-Of-Band Channels for Ad hoc Mobile Device Interactions Ronald Kainda, Ivan

CDW CORPORATION Second Quarter 2014 Webcast Conference Call July 31, 2014 www.cdw.com |

Introduction to Constrained Estimation Graham C. Goodwin September 2004 Centre for Complex

Channel Assignment and Channel Hopping in IEEE 802.11 Operating Channels for 802.11b Europe

Critique #2 M. Sha, G. Hackmann and C. Lu, Real-world Empirical Studies on Multi- Channel

Rob Havelt Black Hat Europe, 2009 Greetings Black Hat Rob Havelt rhavelt@trustwave.com

Sambuz

Useful Links

Newsletter

Mail Us

RUN groupadd -r user && useradd -r -g user user USER user $ docker run --read-only debian

Screen 1 Go to www.myenroll.com < Click Request User ID and Password> Acquire USER ID and