Structured Prediction Introduction What is structured prediction? - PowerPoint PPT Presentation

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Our goal today To define a Structure and Structured Prediction 1

What are structures? 2

Examples of structured data? 3

Examples of structured data? • Database tables and spreadsheets • HTML documents • JSON objects • Wikipedia info-boxes • Computer programs … we will see more examples 4

Examples of unstructured data? 5

Examples of unstructured data? • Images • Videos • Text documents What makes these unstructured? • PDF files How are they different from the previous list? • Books • Music recordings • Speech 6

Structured representations are useful • We know how to process them – Algorithms for managing symbolic data – Computational complexity well understood • They abstract away unnecessary complexities – Why deal with text, images, etc when you can process a database with the same information? – (Is this argument always valid?) 7

Example: Reading comprehension is hard! Water is split, providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . What can the splitting of water lead to? A: Light absorption B: Transfer of ions 8

Reading comprehension is hard! Water is split, providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . What can the splitting of water lead to? A: Light absorption B: Transfer of ions 9

Reading comprehension is hard! Water is split , providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . What can the splitting of water lead to? A: Light absorption B: Transfer of ions 10

Reading comprehension is hard! Enable Water is split , providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . What can the splitting of water lead to? A: Light absorption B: Transfer of ions 11

Reading comprehension is hard! Enable Water is split , providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . Cause What can the splitting of water lead to? A: Light absorption B: Transfer of ions 12

Reading comprehension is hard! Enable Water is split , providing a source of electrons and protons (hydrogen ions, H + ) and giving off O 2 as a by- product. Light absorbed by chlorophyll drives a transfer of the electrons and hydrogen ions from water to an acceptor called NADP + . Cause What can the splitting of water lead to? A: Light absorption If we had a representation like this, we might be B: Transfer of ions able to answer complex questions 13

Machine learning to the rescue • Techniques from statistical learning can help build these representations • In fact, machine learning is necessary to scale up and generalize this process 15

A detour: Classification 16

Classification We know how to train classifiers – Given an email, spam or not spam? – Is a review positive or negative? – Automatically place emails into a folder – “Predict if a car purchased at an auction is a lemon” 17

Standard classification setting • Notation – X: Inputs, or a feature representation of inputs – Y: One of a set of labels ( spam , not-spam ) • The goal: To learn a function X ! Y that maps an input to a label • The standard recipe 1. Collect labeled examples {(x 1 ,y 1 ), (x 2 , y 2 ), ! } 2. Train a function f: X ! Y that a. Is consistent with the observed examples, and b. Can hopefully be correct on new, previously unseen examples 18

Classification is generally well understood Theory: generalization bounds • We know how many examples one needs to see to guarantee good behavior on unseen – examples Algorithms and software • – Good learning algorithms for linear representations, efficient and can deal with high dimensionality (millions of features) – Loss minimization idea applies to neural networks too Open questions • – What is a good feature representation? – Learning protocols: how to minimize supervision, efficient semi-supervised learning, active learning Is this sufficient for solving problems like the reading comprehension one? 19

Classification is generally well understood Theory: generalization bounds • We know how many examples one needs to see to guarantee good behavior on unseen – examples Algorithms and software • – Good learning algorithms for linear representations, efficient and can deal with high dimensionality (millions of features) – Loss minimization idea applies to neural networks too Open questions • – What is a good feature representation? – Learning protocols: how to minimize supervision, efficient semi-supervised learning, active learning Is this sufficient for solving problems like the reading comprehension one? No! 20

Back to “What are structures?” 21

Semantic Role Labeling Input : John saw the dog chasing the ball. Output: Predicate see Predicate Chase Viewer John Chaser The dog Thing viewed The dog chasing the ball Thing chased the ball Or equivalently, predicate-argument representations See ( Viewer : John, Chase ( Chaser : the dog, Viewed : the dog chasing the ball) Chased : the ball) 22

Semantic Parsing X: “A python function that takes a name and prints the string Hello followed by the name and exits.” Y: X: “Find the largest state in the US.” SELECT name Y: FROM us_states WHERE size = (SELECT MAX(size) FROM us_states) In all these cases, the output Y is a structure 23

What is a structure? One definition By … linguistic structure, we refer to symbolic representations of language posited by some theory of language. From the book Linguistic Structure Prediction, by Noah Smith, 2011. 24

What is in this picture? 25 Photo by Andrew Dressel - Own work. Licensed under Creative Commons Attribution-Share Alike 3.0

Object detection Right facing bicycle 26 Photo by Andrew Dressel - Own work. Licensed under Creative Commons Attribution-Share Alike 3.0

Object detection Right facing bicycle handle bar saddle/seat right wheel left wheel 27 Photo by Andrew Dressel - Own work. Licensed under Creative Commons Attribution-Share Alike 3.0

The output: A schematic showing the parts and their relative layout Right facing bicycle handle bar saddle/seat right wheel left wheel Once again, a structure 28

A working definition of a structure From the book Analysing Sentences: An Introduction to English Syntax by Noel Burton-Roberts, 1986. A structure is a concept that can be applied to any complex thing, whether it be a bicycle, a commercial company, or a carbon molecule. By complex , we mean: 1. It is divisible into parts, 2. There are different kinds of parts, 3. The parts are arranged in a specifiable way, and, 4. Each part has a specifiable function in the structure of the thing as a whole 29

What is structured prediction? 30

Simple classifiers are not designed to predict structures X: “Find the largest state in the US.” Y: SELECT name FROM us_states WHERE size = (SELECT MAX(size) FROM us_states ) Classification is about making one decision Spam or not spam, or label a picture – We need to make multiple decisions – Each part needs a label • Should “ US ” be mapped to us_states or utah_counties ? • Should “ Find” be mapped to SELECT or FROM or WHERE? – The decisions interact with each other • We need valid SQL queries • If the outer FROM clause talks about the table us_states , then the inner FROM clause should not talk about utah_counties – How to compose the fragments together to create the whole structure? • Should the output consist of a WHERE clause? What should go in it? 31

Structured prediction Machine learning of interdependent variables • Unlike simple classification problems, many problems have – Multiple interdependent output variables – Both local and global decisions to be made • Mutual dependencies may necessitate a joint assignment to all the output variables – Joint inference or Global inference or simply Inference – Presents algorithmic issues • These problems are called structured output problems 32

Computational issues Model definition What are the parts of the output? What are the inter-dependencies? Data annotation difficulty Background How to train the knowledge about How to do inference ? model? domain Semi- supervised/indirectly supervised? 33

Structured Prediction Introduction What is structured prediction? - PowerPoint PPT Presentation

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction Our goal today To define a Structure and Structured Prediction 1 What are structures? 2 Examples of structured data? 3 Examples of structured

L101: Introduction to Structured Prediction Ryan Cotterell What is structured prediction?

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Training Strategies CS 6355: Structured Prediction 1 So far we saw What is structured output

CSCE 496/896 Lecture 11: Structured Prediction and Structured Prediction and Probabilistic

Course Information CS 6355: Structured Prediction Building up structured output prediction

L101: Incremental structured prediction Structured prediction reminder Given an input x (e.g. a

Structured Prediction Final words CS 6355: Structured Prediction 1 A look back What is a

Complex Prediction Problems A novel approach to multiple Structured Output Prediction Yasemin

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

CSCE 970 Lecture 8: Prediction Stephen Scott Structured Prediction and Vinod Variyam

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

Introduction to SparkSQL Structured Data Processing in Spark 1 Structured Data Processing A

Structured Electronic Design Structured Electronic Design ET 8016 5 ECTS credits 1

Semi-structured data Data is not just text, but is not as well- Semi-structured data

Variational Inference for Tutorial Outline Structured NLP Models 1. Structured Models and Factor

NATIONAL EMISSIONS VERIFICATION BY MERGING EARTH SYSTEM MEASUREMENTS, GLOBAL SOCIAL DATA and

WELCOME to the Falls Lake Regulatory Forum Hosted by Presenters Forrest Westall UNRBA

Linear predictive functional model on environmental data: case of chlorophyll-a oceanographic

(Chapra, L24) David Reckhow CEE 577 #20 1 General Model Kinetics Atmosphere (K a ) K 2 K 4

Some EU and US experiences in Some EU and US experiences in eutrophication assessment for

satin: a R package for extracting and visualizing satellite data for oceanographic applications

A stochastic model for phytoplankton dynamics in the Tyrrhenian Sea a Davide Valenti a Davide

Experiment 6 Absorption and Emision Spectra Chlorophyll a is essential for most photosynthetic