COMP6037 Semi-structured Data and the Web

COMP6037 Semi-structured Data and the Web
Uniqueness in Trees, Repercussion on interesting problems, and finite Graphs 5.2
Uli Sattler
University of Manchester

Clarification: a grammar, its language, and their types
• We know
  – when a grammar is local: i.e., if none of its non-terminal symbols compete…
  – given a grammar G, what the language (set of trees) L(G) of G is:
    L(G) := { t | t is a tree accepted by G }
  – what it means for a language (set of trees) L to be local: i.e., if we can find a local grammar G such that L = L(G)
• hence to find out whether L is local (and perhaps L is given through a grammar G, i.e., L = L(G)), you need to determine whether we can find/construct a local grammar F such that L = L(F)
• ...the above works analogously if “local” is replaced with “single-type”

Clarification: a grammar, its language, and their types
• Remember: we saw
  – G is not single-type
    G = (N, Σ, S, P) with
    N = {Book, Author, Editor, Affilia, Paper, F, L}
    Σ = {B, P, Name, F, L, A}
    S = {Book, Paper}
    P = { Book → B Editor|Author, Paper → P Author,
          Editor → Name F,L, Author → Name L,Affilia,
          F → F ε, L → L ε, Affilia → A ε }
  – G’ is single-type: Author and BA still compete, but don’t occur together in a rule!
    G’ = (N’, Σ’, S’, P’) with
    N’ = {Book, BA, Author, Editor, Affilia, Paper, F, L}
    Σ’ = {B, P, Name, F, L, A}
    S’ = {Book, Paper}
    P’ = { Book → B BA, Paper → P Author,
           BA → Name (F,L)|(L,Affilia), Author → Name L,Affilia,
           F → F ε, L → L ε, Affilia → A ε }
  – L(G’) = L(G)
  – hence L(G) is single-type!

Things done so far
• [structures] semi-structured data, XML, datamodels, trees
• [description mechanisms] schema languages
  – of different styles, strengths, purposes
  – validation, validate-as, PSVIs
  – a useful abstraction: tree grammars
• [‘difficult’ extensibility mechanism] namespaces, schemas
• [interaction mechanisms] query languages, parsers,
  – possibly schema aware
  – namespace aware
• error handling
• [modelling] attributes vs elements, deep vs flat, ...
• ...today:
  – we go back to [structures]: beyond trees, and
  – other ‘tasks’ around schemas
  – more modelling, human factors
  – exam preview
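The local vs. single-type distinction from the Clarification slides can be checked mechanically. The following is a minimal Python sketch, not part of the slides. It assumes each non-terminal has exactly one rule (true for G and G’ here), stores a rule as a (terminal, content model) pair, and reads the non-terminals of a content model with a simple regular expression. The function names are hypothetical, and the extra requirement that start symbols must not compete is taken from the usual definition of single-type grammars rather than from the slides.

```python
import re
from itertools import combinations

# A rule "Book -> B Editor|Author" is stored as Book: ("B", "Editor|Author").
# Assumption: one rule per non-terminal, as in the grammars G and G' above.
G = {
    "Book": ("B", "Editor|Author"),
    "Paper": ("P", "Author"),
    "Editor": ("Name", "F,L"),
    "Author": ("Name", "L,Affilia"),
    "F": ("F", ""),
    "L": ("L", ""),
    "Affilia": ("A", ""),
}
G_PRIME = {
    "Book": ("B", "BA"),
    "Paper": ("P", "Author"),
    "BA": ("Name", "(F,L)|(L,Affilia)"),
    "Author": ("Name", "L,Affilia"),
    "F": ("F", ""),
    "L": ("L", ""),
    "Affilia": ("A", ""),
}
START = {"Book", "Paper"}


def competing_pairs(rules):
    """Pairs of distinct non-terminals whose rules share the same terminal."""
    return {
        frozenset((a, b))
        for a, b in combinations(rules, 2)
        if rules[a][0] == rules[b][0]
    }


def is_local(rules):
    """Local: no two non-terminals compete."""
    return not competing_pairs(rules)


def is_single_type(rules, start):
    """Single-type: competing non-terminals never share a content model,
    and (assumed, standard definition) start symbols do not compete."""
    pairs = competing_pairs(rules)
    for _terminal, content in rules.values():
        used = set(re.findall(r"\w+", content))
        if any(pair <= used for pair in pairs):
            return False
    return not any(pair <= set(start) for pair in pairs)


if __name__ == "__main__":
    print("G  local:", is_local(G), " single-type:", is_single_type(G, START))
    print("G' local:", is_local(G_PRIME), " single-type:", is_single_type(G_PRIME, START))
```

Note that this only classifies a given grammar. Deciding whether a language L is local or single-type still requires finding some grammar of that type for L, as the slides point out.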

So far, there were trees everywhere
• trees in semi-structured data
  – apart from when object identifiers are used
• parse trees from XML documents
• DOM trees
• infosets
• XPath datamodel tree
• trees that tree grammars run on
  – and that Relax NG and Schematron work on
• ...but is everything really a tree?
  – e.g., you, your friends and family, and the relationships between them?

Trees and families: family trees!
• Assume you want to work with/display/search/combine/... family trees
  – you are interested in genealogy
  – you work with a solicitor who handles inheritance cases
  – you study genetics
  – ....
• easy:
  – information is patchy & varied, thus use XML
  – let’s build a DTD for this

    <?xml version="1.0" encoding="UTF-8"?>
    <!ELEMENT family-tree (person | family)*>
    <!ELEMENT person (name, birth?, death?, father?, mother?, note?)>
    <!ELEMENT family (husband?, wife?, child*, marriage*, divorce*, note*)>
    ...
    <!ELEMENT father (name, birth?, death?, father?, mother?, note?)>
    <!ELEMENT mother (name, birth?, death?, father?, mother?, note?)>
    ...
    <!ELEMENT name (firstname?, middle?, lastname)>
    <!ELEMENT middle (#PCDATA)>
    <!ELEMENT firstname (#PCDATA)>
    <!ELEMENT given (#PCDATA)>
    ....

example taken from & modified: http://penguin.dcs.bbk.ac.uk/academic/xml/family/index.php

Trees and families: family trees!
• in order to ensure
  – integrity: a person’s DoB should be the same regardless of where they occur in our tree
  – maintainability: when we change a person’s data (e.g., add DoD), we should only have to do it once
→ we can make use of IDs & IDREFs

    <?xml version="1.0" encoding="UTF-8"?>
    <!ENTITY % reference "person IDREF #REQUIRED">
    <!ELEMENT family-tree (person | family)*>
    <!ELEMENT person (name, birth?, death?, father?, mother?, note?)>
    <!ELEMENT family (husband?, wife?, child*, marriage*, divorce*, note*)>
    <!ELEMENT name (firstname?, middle?, lastname)>
    <!ELEMENT middle (#PCDATA)>
    <!ELEMENT firstname (#PCDATA)>
    <!ELEMENT given (#PCDATA)>
    <!ATTLIST person id ID #REQUIRED
                     sex (m | f) #IMPLIED>
    <!ELEMENT father EMPTY>
    <!ATTLIST father %reference;>
    <!ELEMENT mother EMPTY>
    <!ATTLIST mother %reference;>
    <!ELEMENT wife EMPTY>
    <!ATTLIST wife %reference;>
    ....

before:

    <!ELEMENT father (name, birth?, death?, father?, mother?, note?)>
    <!ELEMENT mother (name, birth?, death?, father?, mother?, note?)>

Trees and families: family trees!
• things work nicely:
• e.g., to retrieve all pairs of persons and their fathers, we can use a simple XQuery:

    let $d := doc("family.xml")
    for $p in $d//person
    return
      <childAndParents>
        <child>{ $p/name }</child>
        (: id() is given $d explicitly, so the lookup does not rely on a context item :)
        { if ($p/father/@father != "")
          then <father>{ id($p/father/@father, $d)/name }</father>
          else <fatherUnknown/> }
        { if ($p/mother/@mother != "")
          then <mother>{ id($p/mother/@mother, $d)/name }</mother>
          else <motherUnknown/> }
      </childAndParents>

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE family-tree SYSTEM "family.dtd">
    <family-tree>
      <person id="p5" sex="m">
        <name>
          <firstname>Alfred Ernest</firstname>
          <lastname>Farmer</lastname>
        </name>
        <death>
          <place>Finsbury Park, London</place>
          <date>8 January, 1964</date>
        </death>
      </person>
      <person id="p6" sex="m">
        <name>
          <firstname>Ronald Alfred</firstname>
          <lastname>Farmer</lastname>
        </name>
        <birth>
          <place>London</place>
          <date>27 April, 1922</date>
        </birth>
        <death>
          <place>Hill House Nursing Home, Kenley, Surrey</place>
          <date>23 November, 2003</date>
        </death>
        <father father="p5"/>
      </person>
      <person id="p7" sex="f">
        <name>
          <firstname>Daisy May</firstname>
          <lastname>Farmer</lastname>
        </name>
        <death>
        ...
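To see what the ID/IDREF lookup in the XQuery above amounts to, here is a minimal Python sketch using the standard-library `xml.etree.ElementTree`. It is an illustration under assumptions, not part of the slides: ElementTree does not read the DTD, so the sketch builds its own `id` index by hand instead of relying on declared ID attributes. The sample document is a cut-down version of the p5/p6 records shown above, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Cut-down sample based on the p5/p6 records from the slides (no DTD attached).
FAMILY_XML = """<family-tree>
  <person id="p5" sex="m">
    <name><firstname>Alfred Ernest</firstname><lastname>Farmer</lastname></name>
  </person>
  <person id="p6" sex="m">
    <name><firstname>Ronald Alfred</firstname><lastname>Farmer</lastname></name>
    <father father="p5"/>
  </person>
</family-tree>"""


def full_name(person):
    """Join whichever of firstname, middle, lastname are present."""
    name = person.find("name")
    parts = [name.findtext(tag) for tag in ("firstname", "middle", "lastname")]
    return " ".join(part for part in parts if part)


def children_and_fathers(xml_text):
    """Return (child name, father name or None) for every person."""
    root = ET.fromstring(xml_text)
    # Hand-built index standing in for DTD-declared ID attributes.
    by_id = {p.get("id"): p for p in root.iter("person")}
    pairs = []
    for person in root.iter("person"):
        ref = person.find("father")
        father = by_id.get(ref.get("father")) if ref is not None else None
        pairs.append(
            (full_name(person), full_name(father) if father is not None else None)
        )
    return pairs


if __name__ == "__main__":
    for child, father in children_and_fathers(FAMILY_XML):
        print(child, "<-", father or "father unknown")
```

Because each person is stored once and referenced by id, a change to a person's data is made in one place only, which is the maintainability point made on the slides.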
