Structured Probability Spaces - Guy Van den Broeck - PowerPoint PPT Presentation



SLIDE 1

Tractable Learning in Structured Probability Spaces

Guy Van den Broeck

Southern California Machine Learning Symposium

Nov 18, 2016

SLIDE 2

Structured probability spaces?

SLIDE 3

Courses:

  • Logic (L)
  • Knowledge Representation (K)
  • Probability (P)
  • Artificial Intelligence (A)

Data

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.

Constraints

Running Example

SLIDE 4

[Truth table over L, K, P, A listing all 16 instantiations - figure]

Probability Space (unstructured)

SLIDE 5

[Two truth tables over L, K, P, A: the unstructured space (16 rows) and the structured space with the impossible rows removed - figure]

Structured Probability Space

7 out of 16 instantiations are impossible

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.
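The claim that 7 of the 16 instantiations are impossible can be checked by brute-force enumeration; a minimal sketch (the `valid` helper and variable ordering are mine):

```python
from itertools import product

# Course variables: L (Logic), K (Knowledge Representation),
# P (Probability), A (Artificial Intelligence).
def valid(L, K, P, A):
    c1 = P or L            # at least one of Probability or Logic
    c2 = (not A) or P      # Probability is a prerequisite for AI
    c3 = (not K) or A or L # the prerequisite for KR is AI or Logic
    return c1 and c2 and c3

space = list(product([False, True], repeat=4))
structured = [bits for bits in space if valid(*bits)]
print(len(space), len(structured))  # 16 total, 9 possible (7 impossible)
```

Enumeration confirms exactly 9 of the 16 assignments satisfy all three constraints.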

SLIDE 6

Learning with Constraints

Learn a statistical model that assigns zero probability to instantiations that violate the constraints.

SLIDE 7

Example: Video

[Lu, W. L., Ting, J. A., Little, J. J., & Murphy, K. P. (2013). Learning to track and identify players from broadcast sports videos.]



SLIDE 12

Example: Language

  • Non-local dependencies:

At least one verb in each sentence

  • Sentence compression

If a modifier is kept, its subject is also kept

  • Information extraction

Semantic role labeling

  • … and many more!

[Chang, M., Ratinov, L., & Roth, D. (2008). Constraints as prior knowledge],…, [Chang, M. W., Ratinov, L., & Roth, D. (2012). Structured learning with constrained conditional models.], [https://en.wikipedia.org/wiki/Constrained_conditional_model]

SLIDE 13

Example: Deep Learning

[Graves, A., Wayne, G., Reynolds, M., Harley, T., Danihelka, I., Grabska-Barwińska, A., et al.. (2016). Hybrid computing using a neural network with dynamic external memory. Nature, 538(7626), 471-476.]


SLIDE 18

What are people doing now?

  • Ignore
  • Hack your way around
  • Handcraft into models
  • Use specialized distributions
  • Find non-structured encoding
  • Try to learn constraints

Each option comes at a cost: accuracy? specialized skill? impossible? intractable inference? intractable learning? wasted parameters? risk of predicting out of space? you are on your own.

SLIDE 21

Structured Probability Spaces

  • Everywhere in ML!

  – Configuration problems, video, text, deep learning
  – Planning and diagnosis (physics)
  – Cooking scenarios (interpreting videos)
  – Combinatorial objects: parse trees, rankings, directed acyclic graphs, trees, simple paths, game traces, etc.

  • Representations: constrained conditional models, mixed networks, probabilistic logics.

No ML boxes out there that take constraints as input! 

SLIDE 22

The Problem / The ML Box

Data + Constraints → Learning → Probabilistic Model (Distribution)

Goal: Constraints as important as data! General purpose!

SLIDE 23

Specification Language: Logic

SLIDE 24

[Two truth tables over L, K, P, A: unstructured vs. structured space - figure]

Structured Probability Space

7 out of 16 instantiations are impossible

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.

SLIDE 25

[Two truth tables over L, K, P, A: unstructured vs. structured space - figure]

Boolean Constraints

7 out of 16 instantiations are impossible

SLIDE 26

Combinatorial Objects: Rankings

10 items: 3,628,800 rankings

ranking 1: fatty tuna > sea urchin > salmon roe > shrimp > tuna > squid > tuna roll > sea eel > egg > cucumber roll
ranking 2: shrimp > sea urchin > salmon roe > fatty tuna > tuna > squid > tuna roll > sea eel > egg > cucumber roll

20 items: 2,432,902,008,176,640,000 rankings

SLIDE 27

Combinatorial Objects: Rankings

ranking 1: fatty tuna > sea urchin > salmon roe > shrimp > tuna > squid > tuna roll > sea eel > egg > cucumber roll
ranking 2: shrimp > sea urchin > salmon roe > fatty tuna > tuna > squid > tuna roll > sea eel > egg > cucumber roll

A_ij : item i at position j (n items require n² Boolean variables)

SLIDE 28

Combinatorial Objects: Rankings

ranking 1: fatty tuna > sea urchin > salmon roe > shrimp > tuna > squid > tuna roll > sea eel > egg > cucumber roll
ranking 2: shrimp > sea urchin > salmon roe > fatty tuna > tuna > squid > tuna roll > sea eel > egg > cucumber roll

A_ij : item i at position j (n items require n² Boolean variables)

Without further constraints, an item may be assigned to more than one position, and a position may contain more than one item.

SLIDE 31

Encoding Rankings in Logic

A_ij : item i at position j

        pos 1  pos 2  pos 3  pos 4
item 1   A11    A12    A13    A14
item 2   A21    A22    A23    A24
item 3   A31    A32    A33    A34
item 4   A41    A42    A43    A44

constraint: each item i is assigned a unique position (n constraints)
constraint: each position j is assigned a unique item (n constraints)

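A quick sanity check of this encoding (my own sketch, not from the talk): over the n² Boolean variables, the two families of uniqueness constraints leave exactly n! satisfying assignments, the permutation matrices.

```python
from itertools import product

n = 3  # items and positions; A[i][j] means "item i at position j"

def is_ranking(A):
    # each item i is assigned a unique position (n constraints)
    rows_ok = all(sum(A[i]) == 1 for i in range(n))
    # each position j is assigned a unique item (n constraints)
    cols_ok = all(sum(A[i][j] for i in range(n)) == 1 for j in range(n))
    return rows_ok and cols_ok

count = sum(
    1
    for flat in product([0, 1], repeat=n * n)
    if is_ranking([list(flat[i * n:(i + 1) * n]) for i in range(n)])
)
print(count)  # 3! = 6 of the 2**9 = 512 assignments encode rankings
```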

SLIDE 37

Structured Space for Paths

Good variable assignments (represent routes): 184
Bad variable assignments (do not represent routes): 16,777,032

Unstructured probability space: 184 + 16,777,032 = 2^24

Space easily encoded in logical constraints
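The arithmetic checks out (one Boolean variable per edge, 24 edges), and the notion of a "good" assignment, a simple path, can be recounted on a tiny grid by depth-first search; a sketch under my own graph encoding:

```python
# Good assignments (routes): 184; bad assignments: 16,777,032.
good, bad = 184, 16_777_032
assert good + bad == 2 ** 24  # 24 edges, one Boolean variable each

def count_simple_paths(edges, s, t):
    """Count simple (non-revisiting) s-t paths by DFS with a visited set."""
    def dfs(u, visited):
        if u == t:
            return 1
        return sum(dfs(v, visited | {v}) for v in edges[u] if v not in visited)
    return dfs(s, {s})

# 2 x 2 grid graph: nodes 0..3, edges 0-1, 0-2, 1-3, 2-3
grid = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2]}
print(count_simple_paths(grid, 0, 3))  # 2 corner-to-corner simple paths
```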

SLIDE 38

[Figure: parse trees for example sentences ("the cat sleeps", "the dog saw the cat")]

Parse Trees: Undirected Graphs → (Unstructured) Trees → Labeled Trees

Acyclicity Constraints; Label Constraints (CFG Production Rules)

SLIDE 39

“Deep Architecture”

Logic + Probability

SLIDE 40

[Figure: logical circuit over variables L, K, P, A (AND/OR gates and literals; operator symbols lost in extraction)]

Logical Circuits

SLIDE 41

[Figure: the same logical circuit over L, K, P, A]

Property: Decomposability

SLIDE 43

[Figure: the same logical circuit, evaluated on an input assignment to L, K, P, A]

Property: Determinism

SLIDE 44

[Figure: the same circuit as a Sentential Decision Diagram]

Sentential Decision Diagram (SDD)

SLIDE 47

Tractable for Logical Inference

  • Is structured space empty? (SAT)
  • Count size of structured space (#SAT)
  • Check equivalence of spaces
  • Algorithms linear in circuit size (pass up, pass down, similar to backprop)
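Why these queries become linear in circuit size: with decomposability, the model counts of conjuncts multiply; with determinism, the counts of disjuncts add. A minimal sketch with a hypothetical tuple encoding (my own, not an actual SDD implementation):

```python
def count(node):
    """Return (model_count, variables) for a circuit node, bottom-up."""
    kind = node[0]
    if kind == "lit":                       # ("lit", name, polarity)
        return 1, frozenset([node[1]])
    if kind == "and":                       # decomposable conjunction
        c1, v1 = count(node[1])
        c2, v2 = count(node[2])
        assert not (v1 & v2), "decomposability: disjoint variables"
        return c1 * c2, v1 | v2
    if kind == "or":                        # deterministic disjunction
        parts = [count(child) for child in node[1]]
        all_vars = frozenset().union(*(v for _, v in parts))
        # determinism: children are mutually exclusive, so counts add;
        # pad each child for the variables it does not mention
        total = sum(c * 2 ** (len(all_vars) - len(v)) for c, v in parts)
        return total, all_vars

# (A and B) or (not A and C): the branches disagree on A, so the OR is
# deterministic; the ANDs split {A, B} vs {A, C} disjointly per branch.
circuit = ("or", [
    ("and", ("lit", "A", True), ("lit", "B", True)),
    ("and", ("lit", "A", False), ("lit", "C", True)),
])
models, variables = count(circuit)
print(models)  # 4 satisfying assignments over {A, B, C}
```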



SLIDE 51

[Figure: PSDD - the SDD circuit with a local parameter on each decision, e.g. 0.6/0.4, 0.8/0.2, 0.25/0.75, 0.9/0.1, 0.1/0.6/0.3]

Input: L, K, P, A

Pr(L,K,P,A) = 0.3 × 1.0 × 0.8 × 0.4 × 0.25 = 0.024

PSDD: Probabilistic SDD
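The probability of a complete instantiation is just the product of the parameters on its branch through the circuit; a sketch of the slide's computation (the parameter list is read off the slide):

```python
# Parameters collected along the branch for the slide's input over
# L, K, P, A (which literals are negated is not recoverable here).
branch_params = [0.3, 1.0, 0.8, 0.4, 0.25]

p = 1.0
for theta in branch_params:
    p *= theta
print(round(p, 6))  # 0.024
```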

SLIDE 52

[Figure: the same PSDD]

Can read independences off the circuit structure

PSDD nodes induce a normalized distribution!

SLIDE 53

Tractable for Probabilistic Inference

  • MAP inference: Find most-likely assignment

(otherwise NP-complete)

  • Computing conditional probabilities Pr(x|y)

(otherwise PP-complete)

  • Sample from Pr(x|y)
  • Algorithms linear in circuit size (pass up, pass down, similar to backprop)

SLIDE 55

[Figure: a Bayesian network (BN) compiled to an arithmetic circuit (AC)]

Known in the ML literature as SPNs (UAI 2011, NIPS 2012 best paper awards)

PSDDs are Arithmetic Circuits (ACs)

[Darwiche, JACM 2003] [ICML 2014] (SPNs equivalent to ACs)

SLIDE 56

Learning PSDDs

Logic + Probability + ML

SLIDE 60

[Figure: the PSDD with individual parameters highlighted]

Parameters are Interpretable

  • Student takes course L
  • Student takes course P
  • Probability of P given L

Explainable AI DARPA Program

SLIDE 63

Learning Algorithms

  • Parameter learning:
    – Closed-form maximum likelihood from complete data
    – One pass over the data to estimate Pr(x|y)
    – Not a lot to say: very easy!

  • Structure learning:
    – Compile constraints to SDD, using SAT-solver technology (naive? see later)
    – Search for structure to fit data (ongoing work)
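The closed-form estimate really is a one-pass ratio of counts; a minimal sketch (the `estimate` helper and toy rows are mine, with each PSDD parameter playing the role of a conditional probability Pr(x|y)):

```python
def estimate(data, x, y):
    """MLE of Pr(x | y) from complete data: x, y are predicates on a row."""
    n_y = sum(1 for row in data if y(row))
    n_xy = sum(1 for row in data if y(row) and x(row))
    return n_xy / n_y if n_y else 0.0

# rows are dicts of course indicators, as in the running example
data = [
    {"L": 1, "P": 1}, {"L": 1, "P": 0},
    {"L": 1, "P": 1}, {"L": 0, "P": 1},
]
theta = estimate(data, x=lambda r: r["P"] == 1, y=lambda r: r["L"] == 1)
print(theta)  # 2 of the 3 rows with L also have P
```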

SLIDE 65

Learning Preference Distributions

Special-purpose distribution: Mixture-of-Mallows
  – number of components from 1 to 20
  – EM with 10 random seeds
  – implementation of Lu & Boutilier

PSDD: this is the naive approach, without real structure learning!

SLIDE 66

What happens if you ignore constraints?

SLIDE 67

[Figure: example tic-tac-toe game boards]

Class: optimal, heuristic, random

Attribute with 362,880 values (possible game traces)

Structured Naïve Bayes Classifier

[Figure: naïve Bayes structure over class C and attributes X1, X2, …, Xn]

SLIDE 68

[Figure: s-t routes in a grid; naïve Bayes structure over class C and attributes X1, X2, …, Xn]

Class: normal, abnormal

Attribute with 789,360,053,252 values (routes in an 8 × 8 grid)

Structured Naïve Bayes Classifier

SLIDE 69
  • Uber GPS data in SF
  • Project GPS coordinates onto a graph, then learn distributions over routes
  • Applications:
    – Detect anomalies
    – Given a partial route, predict its most likely completion

Learning Route Distributions (ongoing)

SLIDE 71

Parameter Estimation

a classical complete dataset:

id  X   Y   Z
1   x1  y2  z1
2   x2  y1  z2
3   x2  y1  z2
4   x1  y1  z1
5   x1  y2  z2

→ closed-form (maximum-likelihood estimates are unique)

a classical incomplete dataset:

id  X   Y   Z
1   x1  y2  ?
2   x2  y1  ?
3   ?   ?   z2
4   ?   y1  z1
5   x1  y2  z2

→ EM algorithm

a new type of incomplete dataset, missed in the ML literature:

[Table: each row is a logical constraint over X, Y, Z, e.g. "x2 and (y2 or z2)", "x1 and y2 and z2"; some operator symbols were lost in extraction]

SLIDE 72

a classical complete dataset (e.g., total rankings):

id  1st sushi    2nd sushi    3rd sushi
1   fatty tuna   sea urchin   salmon roe
2   fatty tuna   tuna         shrimp
3   tuna         tuna roll    sea eel
4   fatty tuna   salmon roe   tuna
5   egg          squid        shrimp

a classical incomplete dataset (e.g., top-k rankings):

id  1st sushi    2nd sushi    3rd sushi
1   fatty tuna   sea urchin   ?
2   fatty tuna   ?            ?
3   tuna         tuna roll    ?
4   fatty tuna   salmon roe   ?
5   egg          ?            ?

Structured Datasets

SLIDE 73

a classical complete dataset (e.g., total rankings) - as above

a new type of incomplete dataset (e.g., partial rankings): each row is a constraint on the possible total rankings

id  observed preferences
1   (fatty tuna > sea urchin) and (tuna > sea eel)
2   (fatty tuna is 1st) and (salmon roe > egg)
3   tuna > squid
4   egg is last
5   egg > squid > shrimp

Structured Datasets

SLIDE 74

Learning from Incomplete Data

  • MovieLens dataset:
    – 3,900 movies, 6,040 users, 1M ratings
    – take ratings from the 64 most-rated movies
    – ratings 1-5 converted to pairwise preferences

  • PSDD for partial rankings:
    – 4 tiers
    – 18,711 parameters

movies by expected tier:

rank  movie
1     The Godfather
2     The Usual Suspects
3     Casablanca
4     The Shawshank Redemption
5     Schindler’s List
6     One Flew Over the Cuckoo’s Nest
7     The Godfather: Part II
8     Monty Python and the Holy Grail
9     Raiders of the Lost Ark
10    Star Wars IV: A New Hope
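The ratings-to-preferences preprocessing step can be sketched as follows (a hedged sketch; the function, data, and tie-handling convention are my own assumptions, not the paper's exact pipeline):

```python
from itertools import combinations

def pairwise_prefs(ratings):
    """ratings: dict movie -> 1..5 stars; returns set of (preferred, other).
    A movie is preferred when rated strictly higher; ties yield nothing."""
    prefs = set()
    for a, b in combinations(ratings, 2):
        if ratings[a] > ratings[b]:
            prefs.add((a, b))
        elif ratings[b] > ratings[a]:
            prefs.add((b, a))
    return prefs

user = {"The Godfather": 5, "Casablanca": 4, "American Beauty": 4}
print(sorted(pairwise_prefs(user)))
# [('The Godfather', 'American Beauty'), ('The Godfather', 'Casablanca')]
```

Each resulting pair is exactly the kind of partial-ranking constraint shown on the previous slides.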

SLIDE 75

PSDD Sizes

SLIDE 76

Structured Queries

rank  movie
1     Star Wars V: The Empire Strikes Back
2     Star Wars IV: A New Hope
3     The Godfather
4     The Shawshank Redemption
5     The Usual Suspects

SLIDE 79

Structured Queries

rank  movie
1     Star Wars V: The Empire Strikes Back
2     Star Wars IV: A New Hope
3     The Godfather
4     The Shawshank Redemption
5     The Usual Suspects

  • no other Star Wars movie in top-5
  • at least one comedy in top-5

rank  movie
1     Star Wars V: The Empire Strikes Back
2     American Beauty
3     The Godfather
4     The Usual Suspects
5     The Shawshank Redemption

diversified recommendations via logical constraints

SLIDE 80

Conclusions

  • Structured spaces are everywhere
  • Roles of Boolean constraints in ML:
    – Domain constraints and combinatorial objects (structured probability spaces)
    – Incomplete examples (structured datasets)
    – Questions and evidence (structured queries)
  • Learn distributions over combinatorial objects
  • Strong properties for inference and learning
SLIDE 81

Conclusions

[Figure: PSDD at the intersection of Statistical ML (“Probability”), Symbolic AI (“Logic”), and Connectionism (“Deep”)]

SLIDE 82

References

  • Probabilistic Sentential Decision Diagrams. Doga Kisa, Guy Van den Broeck, Arthur Choi and Adnan Darwiche. KR 2014.
  • Learning with Massive Logical Constraints. Doga Kisa, Guy Van den Broeck, Arthur Choi and Adnan Darwiche. ICML 2014 workshop.
  • Tractable Learning for Structured Probability Spaces. Arthur Choi, Guy Van den Broeck and Adnan Darwiche. IJCAI 2015.
  • Tractable Learning for Complex Probability Queries. Jessa Bekker, Jesse Davis, Arthur Choi, Adnan Darwiche and Guy Van den Broeck. NIPS 2015.
  • Structured Features in Naive Bayes Classifiers. Arthur Choi, Nazgol Tavabi and Adnan Darwiche. AAAI 2016.
  • Tractable Operations on Arithmetic Circuits. Jason Shen, Arthur Choi and Adnan Darwiche. NIPS 2016.

Upcoming NIPS oral presentation: “PSDDs can be multiplied efficiently”

SLIDE 83

Questions?

PSDD with 15,000 nodes