Tractable Learning in Structured Probability Spaces
Guy Van den Broeck, UCLA Stats Seminar, Jan 17, 2017


slide-1
SLIDE 1

Tractable Learning in Structured Probability Spaces

Guy Van den Broeck

UCLA Stats Seminar

Jan 17, 2017

slide-2
SLIDE 2

Outline

  • 1. Structured probability spaces?
  • 2. Specification language (Logic)
  • 3. “Deep architecture” (Logic + Probability)
  • 4. Learning PSDDs (Logic + Probability + Machine Learning)
  • 5. Conclusions
slide-3
SLIDE 3

Probabilistic Sentential Decision Diagrams
Doga Kisa, Guy Van den Broeck, Arthur Choi and Adnan Darwiche. KR, 2014.

Learning with Massive Logical Constraints
Doga Kisa, Guy Van den Broeck, Arthur Choi and Adnan Darwiche. ICML 2014 workshop.

Tractable Learning for Structured Probability Spaces
Arthur Choi, Guy Van den Broeck and Adnan Darwiche. IJCAI, 2015.

Tractable Learning for Complex Probability Queries
Jessa Bekker, Jesse Davis, Arthur Choi, Adnan Darwiche, Guy Van den Broeck. NIPS, 2015.

Structured Features in Naive Bayes Classifiers
Arthur Choi, Nazgol Tavabi and Adnan Darwiche. AAAI, 2016.

Tractable Operations on Arithmetic Circuits
Jason Shen, Arthur Choi and Adnan Darwiche. NIPS, 2016.

References

slide-4
SLIDE 4

Structured probability spaces?

slide-5
SLIDE 5

Courses:

  • Logic (L)
  • Knowledge Representation (K)
  • Probability (P)
  • Artificial Intelligence (A)

Data

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.

Constraints

Running Example

slide-6
SLIDE 6

[Table: all 16 instantiations of L, K, P, A (the unstructured space)]

Probability Space

slide-7
SLIDE 7

[Tables: the unstructured space of all 16 instantiations of L, K, P, A, and the structured space with the 7 impossible rows removed]

Structured Probability Space

7 out of 16 instantiations are impossible

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.
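A quick brute-force check of the three constraints (a sketch with one Boolean per course) confirms that only 9 of the 16 instantiations survive, i.e., 7 are impossible:

```python
from itertools import product

def valid(l, k, p, a):
    c1 = p or l                # must take at least one of Probability or Logic
    c2 = (not a) or p          # Probability is a prerequisite for AI
    c3 = (not k) or (a or l)   # the prerequisite for KR is either AI or Logic
    return c1 and c2 and c3

models = [bits for bits in product([False, True], repeat=4) if valid(*bits)]
print(len(models))  # 9 of the 16 instantiations remain possible
```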

slide-8
SLIDE 8

Learning with Constraints

Learn a statistical model that assigns zero probability to instantiations that violate the constraints.

Data Constraints

(Background Knowledge) (Physics)

Statistical Model

(Distribution)

Learn

slide-9
SLIDE 9

Example: Video

[Lu, W. L., Ting, J. A., Little, J. J., & Murphy, K. P. (2013). Learning to track and identify players from broadcast sports videos.]

slide-14
SLIDE 14

Example: Language

  • Non-local dependencies:

At least one verb in each sentence

Sentence compression

If a modifier is kept, its subject is also kept

Information extraction

  • Semantic role labeling
  • … and many more!

[Chang, M., Ratinov, L., & Roth, D. (2008). Constraints as prior knowledge],…, [Chang, M. W., Ratinov, L., & Roth, D. (2012). Structured learning with constrained conditional models.], [https://en.wikipedia.org/wiki/Constrained_conditional_model]

slide-15
SLIDE 15

Bayesian network synthesized from specs of a power system (NASA Ames): has many constraints (0/1 parameters) due to domain “physics”

slide-16
SLIDE 16

Example: Deep Learning

[Graves, A., Wayne, G., Reynolds, M., Harley, T., Danihelka, I., Grabska-Barwińska, A., et al.. (2016). Hybrid computing using a neural network with dynamic external memory. Nature, 538(7626), 471-476.]

slide-21
SLIDE 21

What are people doing now?

  • Ignore constraints
  • Handcraft into models
  • Use specialized distributions
  • Find non-structured encoding
  • Try to learn constraints
  • Hack your way around

Each has a cost: accuracy, specialized skill, intractable inference, intractable learning, wasted parameters, risk of predicting out of the space, or you are on your own.

slide-25
SLIDE 25

Structured Probability Spaces

  • Everywhere in ML!
    – Configuration problems, inventory, video, text, deep learning
    – Planning and diagnosis (physics)
    – Causal models: cooking scenarios (interpreting videos)
    – Combinatorial objects: parse trees, rankings, directed acyclic graphs, trees, simple paths, game traces, etc.
  • Some representations: constrained conditional models, mixed networks, probabilistic logics.

No ML boxes out there that take constraints as input!

Goal: Constraints as important as data! General purpose!

slide-26
SLIDE 26

Specification Language: Logic

slide-27
SLIDE 27

[Tables: the unstructured space of all 16 instantiations of L, K, P, A, and the structured space with the 7 impossible rows removed]

Structured Probability Space

7 out of 16 instantiations are impossible

  • Must take at least one of Probability or Logic.
  • Probability is a prerequisite for AI.
  • The prerequisite for KR is either AI or Logic.

slide-28
SLIDE 28

[Tables: the unstructured space of all 16 instantiations of L, K, P, A, and the structured space with the 7 impossible rows removed]

Boolean Constraints

7 out of 16 instantiations are impossible

slide-29
SLIDE 29

Combinatorial Objects: Rankings

10 items: 3,628,800 rankings

[Table: two example rankings of 10 sushi items (fatty tuna, sea urchin, salmon roe, shrimp, tuna, squid, tuna roll, sea eel, egg, cucumber roll), differing only in the positions of fatty tuna and shrimp]

20 items: 2,432,902,008,176,640,000 rankings
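The counts above are simply factorials; a one-line sanity check:

```python
from math import factorial

print(factorial(10))  # 3,628,800 rankings of 10 items
print(factorial(20))  # 2,432,902,008,176,640,000 rankings of 20 items
```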

slide-31
SLIDE 31

Combinatorial Objects: Rankings

[Table: the two example sushi rankings]

Aij: item i at position j (n items require n² Boolean variables)

Without constraints, an item may be assigned to more than one position, and a position may contain more than one item.

slide-34
SLIDE 34

Encoding Rankings in Logic

Aij : item i at position j

        pos 1  pos 2  pos 3  pos 4
item 1  A11    A12    A13    A14
item 2  A21    A22    A23    A24
item 3  A31    A32    A33    A34
item 4  A41    A42    A43    A44

constraint: each item i assigned to a unique position (n constraints)
constraint: each position j assigned a unique item (n constraints)
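The row and column constraints above can be checked by brute force for small n; with n = 3, exactly n! = 6 of the 2^(n²) = 512 assignments survive:

```python
from itertools import product
from math import factorial

n = 3
count = 0
for bits in product([0, 1], repeat=n * n):
    A = [bits[i * n:(i + 1) * n] for i in range(n)]  # A[i][j]: item i at position j
    rows_ok = all(sum(row) == 1 for row in A)        # each item gets exactly one position
    cols_ok = all(sum(A[i][j] for i in range(n)) == 1 for j in range(n))  # each position gets exactly one item
    count += rows_ok and cols_ok

print(count, factorial(n))  # the surviving assignments are exactly the 6 permutations
```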

slide-40
SLIDE 40

Unstructured probability space: 184 + 16,777,032 = 16,777,216 = 2^24

Structured Space for Paths

Good variable assignments (represent routes): 184
Bad variable assignments (do not represent routes): 16,777,032

Space easily encoded in logical constraints
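The numbers above match a 4 × 4 grid graph (24 edges, hence 2^24 edge assignments), assuming corner-to-corner routes; a brute-force sketch recovers both counts:

```python
def count_paths(n=4):
    """Count simple (self-avoiding) paths between opposite corners of an n x n grid graph."""
    def neighbors(v):
        r, c = v
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            if 0 <= r + dr < n and 0 <= c + dc < n:
                yield (r + dr, c + dc)

    target = (n - 1, n - 1)

    def dfs(v, visited):
        if v == target:
            return 1
        return sum(dfs(w, visited | {w}) for w in neighbors(v) if w not in visited)

    return dfs((0, 0), {(0, 0)})

good = count_paths(4)
edges = 2 * 4 * 3            # 24 edges in a 4x4 grid graph, one Boolean each
print(good)                  # 184 good assignments
print(2 ** edges - good)     # 16,777,032 bad assignments
```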

slide-41
SLIDE 41

Parse Trees / Undirected Graphs / (Unstructured) Trees / Labeled Trees

[Figure: example parse trees, e.g. for “the cat sleeps” and “the dog saw the cat”]

Acyclicity Constraints / Label Constraints (CFG Production Rules)

slide-42
SLIDE 42

“Deep Architecture”

Logic + Probability

slide-43
SLIDE 43
[Figure: a logical circuit over variables L, K, P, A, built from AND/OR gates and literals]

Logical Circuits

slide-44
SLIDE 44
[Figure: the same logical circuit; each AND gate splits its variables between its children]

Property: Decomposability

slide-46
SLIDE 46
[Figure: the logical circuit evaluated on an input; at each OR gate, at most one child is true]

Input: L, K, P, A

Property: Determinism

slide-47
SLIDE 47
[Figure: the same circuit, viewed as a Sentential Decision Diagram]

Input: L, K, P, A

Sentential Decision Diagram (SDD)

slide-50
SLIDE 50

Tractable for Logical Inference

  • Is structured space empty? (SAT)
  • Count size of structured space (#SAT)
  • Check equivalence of spaces
  • Algorithms linear in circuit size

(pass up, pass down, similar to backprop)
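These queries all reduce to the same linear bottom-up pass: literals return a base count, AND nodes multiply, OR nodes add (exact because branches are mutually exclusive). A sketch on a hypothetical miniature smooth circuit for X ⇔ Y:

```python
# Circuit nodes: ('lit', name, polarity), ('and', children), ('or', children).
# For a decomposable, deterministic, and smooth circuit, one bottom-up pass
# computes the model count (#SAT): AND multiplies, OR adds.
def model_count(node):
    kind = node[0]
    if kind == 'lit':
        return 1                      # a literal has exactly one satisfying value
    counts = [model_count(c) for c in node[1]]
    if kind == 'and':
        prod = 1
        for c in counts:
            prod *= c
        return prod
    return sum(counts)                # 'or': branches are mutually exclusive

# X <=> Y, written as (X and Y) or (not X and not Y)
circuit = ('or', [
    ('and', [('lit', 'X', True), ('lit', 'Y', True)]),
    ('and', [('lit', 'X', False), ('lit', 'Y', False)]),
])

print(model_count(circuit))  # 2 models: {X=1,Y=1} and {X=0,Y=0}
```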

slide-52
SLIDE 52
[Figure: the SDD with a local probability distribution on each decision node's branches, e.g. 0.6/0.4, 0.8/0.2, 0.25/0.75, 0.9/0.1, 0.1/0.6/0.3]

PSDD: Probabilistic SDD

slide-53
SLIDE 53
[Figure: the PSDD evaluated on the input]

Input: L, K, P, A

PSDD: Probabilistic SDD

slide-54
SLIDE 54
[Figure: the PSDD evaluated bottom-up, with the parameters used along the branch highlighted]

Input: L, K, P, A

Pr(L,K,P,A) = 0.3 x 1.0 x 0.8 x 0.4 x 0.25 = 0.024

PSDD: Probabilistic SDD
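Evaluating a complete assignment is just a product of the parameters met along its branch. A minimal sketch with the slide's parameter values (which parameter sits on which decision node is illustrative here, not the slide's exact wiring):

```python
# Parameters encountered along the branch for the input shown on the slide
# (the assignment of parameters to decision nodes is illustrative).
branch_parameters = [0.3, 1.0, 0.8, 0.4, 0.25]

probability = 1.0
for theta in branch_parameters:
    probability *= theta

print(round(probability, 3))  # 0.024, matching Pr(L,K,P,A) on the slide
```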

slide-55
SLIDE 55
[Figure: the PSDD with its parameters]

Can read independences off the circuit structure

PSDD nodes induce a normalized distribution!

slide-56
SLIDE 56

Tractable for Probabilistic Inference

  • MAP inference: Find most-likely assignment

(otherwise NP-complete)

  • Computing conditional probabilities Pr(x|y)

(otherwise PP-complete)

  • Sample from Pr(x|y)
  • Algorithms linear in circuit size

(pass up, pass down, similar to backprop)

slide-58
SLIDE 58

[Figure: a Bayesian Network (BN) compiled into an Arithmetic Circuit (AC)]

Known in the ML literature as SPNs (UAI 2011, NIPS 2012 best paper awards)

PSDDs are Arithmetic Circuits (ACs)

[Darwiche, JACM 2003] [ICML 2014] (SPNs equivalent to ACs)

slide-59
SLIDE 59

[Figure: a PSDD decision node with elements (p1, s1, θ1), …, (pn, sn, θn) and the equivalent arithmetic circuit, a sum of products θi · pi · si]

Result: PSDDs are ACs

decomposable+ and deterministic+ ACs (over a structured space)

slide-60
SLIDE 60

Learning PSDDs

Logic + Probability + ML

slide-61
SLIDE 61
[Figure: the PSDD with its parameters]

Parameters are Interpretable

Explainable AI DARPA Program

slide-64
SLIDE 64
[Figure: the PSDD with parameters highlighted for: student takes course L; student takes course P; probability of P given L]

Parameters are Interpretable

Explainable AI DARPA Program

slide-65
SLIDE 65

Learning Algorithms

  • Parameter learning:

Closed form max likelihood from complete data One pass over data to estimate Pr(x|y)

Note a lot to say: very easy!

slide-66
SLIDE 66

Learning Algorithms

  • Parameter learning:

Closed form max likelihood from complete data One pass over data to estimate Pr(x|y)

  • Structure learning:

Compile constraints to SDD

Use SAT solver technology (naive? see later)

Note a lot to say: very easy!

slide-67
SLIDE 67

Learning Algorithms

  • Parameter learning:
    – Closed-form max likelihood from complete data
    – One pass over data to estimate Pr(x|y)
    – Not a lot to say: very easy!
  • Structure learning:
    – Compile constraints to SDD, using SAT-solver technology (naive? see later)
    – Search for structure to fit data (ongoing work)
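Closed-form estimation really is a single counting pass: each parameter is the empirical fraction of examples that take its branch, among those that reach its decision node. A toy sketch on made-up complete data (the grouping by context is illustrative):

```python
# Made-up complete data over (L, P): estimate one PSDD-style parameter, Pr(P | L).
data = [(1, 1), (1, 1), (1, 0), (0, 1), (1, 1), (0, 0)]

reached = [p for (l, p) in data if l == 1]  # examples that reach the L-branch
theta = sum(reached) / len(reached)         # closed-form maximum-likelihood estimate

print(theta)  # 0.75: three of the four examples with L=1 also have P=1
```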

slide-68
SLIDE 68

Learning Preference Distributions

Special-purpose distribution: Mixture-of-Mallows

– # of components from 1 to 20 – EM with 10 random seeds – implementation of Lu & Boutilier PSDD

slide-69
SLIDE 69

Learning Preference Distributions

Special-purpose distribution: Mixture-of-Mallows
  – # of components from 1 to 20
  – EM with 10 random seeds
  – implementation of Lu & Boutilier

vs. PSDD: this is the naive approach, without real structure learning!

slide-70
SLIDE 70

What happens if you ignore constraints?

slide-71
SLIDE 71

[Figure: example tic-tac-toe game traces]

  • optimal, heuristic, random
  • Attribute with 362,880 values (possible game traces)

Structured Naïve Bayes Classifier

[Figure: naive Bayes model with class C over attributes X1, X2, …, Xn]

slide-72
SLIDE 72

[Figure: s-t routes on a grid; naive Bayes model with class C over attributes X1, X2, …, Xn]

  • normal, abnormal
  • Attribute with 789,360,053,252 values (routes in 8 × 8 grid)

Structured Naïve Bayes Classifier

slide-73
SLIDE 73
  • Uber GPS data in SF
  • Project GPS coordinates onto a graph, then learn distributions over routes
  • Applications:
    – Detect anomalies
    – Given a partial route, predict its most likely completion

Learning Route Distributions (ongoing)

slide-75
SLIDE 75

Incomplete Data

id  X   Y   Z
1   x1  y2  z1
2   x2  y1  z2
3   x2  y1  z2
4   x1  y1  z1
5   x1  y2  z2

a classical complete dataset (closed-form; maximum-likelihood estimates are unique)

id  X   Y   Z
1   x1  y2  ?
2   x2  y1  ?
3   ?   ?   z2
4   ?   y1  z1
5   x1  y2  z2

a classical incomplete dataset (EM algorithm)

id  constraint
1   X  Z
2   x2 and (y2 or z2)
3   x2  y1
4   X  Y  Z  1
5   x1 and y2 and z2

a new type of incomplete dataset (missed in the ML literature)
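A constrained example contributes the total probability of its consistent completions. A sketch with a hypothetical uniform joint over X, Y, Z and the dataset's row 2 constraint, "x2 and (y2 or z2)":

```python
from itertools import product

# Hypothetical joint distribution: uniform over the 8 value combinations.
values = {'X': ['x1', 'x2'], 'Y': ['y1', 'y2'], 'Z': ['z1', 'z2']}
joint = {combo: 1 / 8 for combo in product(values['X'], values['Y'], values['Z'])}

def likelihood(constraint):
    """Probability mass of all complete rows consistent with the constraint."""
    return sum(p for combo, p in joint.items() if constraint(*combo))

row2 = lambda x, y, z: x == 'x2' and (y == 'y2' or z == 'z2')
print(likelihood(row2))  # 0.375: three of the eight completions are consistent
```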

slide-76
SLIDE 76

id  1st sushi   2nd sushi   3rd sushi
1   fatty tuna  sea urchin  salmon roe
2   fatty tuna  tuna        shrimp
3   tuna        tuna roll   sea eel
4   fatty tuna  salmon roe  tuna
5   egg         squid       shrimp

a classical complete dataset (e.g., total rankings)

id  1st sushi   2nd sushi   3rd sushi
1   fatty tuna  sea urchin  ?
2   fatty tuna  ?           ?
3   tuna        tuna roll   ?
4   fatty tuna  salmon roe  ?
5   egg         ?           ?

a classical incomplete dataset (e.g., top-k rankings)

Structured Datasets

slide-77
SLIDE 77

id  1st sushi   2nd sushi   3rd sushi
1   fatty tuna  sea urchin  salmon roe
2   fatty tuna  tuna        shrimp
3   tuna        tuna roll   sea eel
4   fatty tuna  salmon roe  tuna
5   egg         squid       shrimp

a classical complete dataset (e.g., total rankings)

id  constraint
1   (fatty tuna > sea urchin) and (tuna > sea eel)
2   (fatty tuna is 1st) and (salmon roe > egg)
3   tuna > squid
4   egg is last
5   egg > squid > shrimp

a new type of incomplete dataset (e.g., partial rankings)
(represents constraints on possible total rankings)

Structured Datasets

slide-78
SLIDE 78

Learning from Incomplete Data

  • Movielens Dataset:
    – 3,900 movies, 6,040 users, 1M ratings
    – take ratings from the 64 most rated movies
    – ratings 1-5 converted to pairwise preferences
  • PSDD for partial rankings:
    – 4 tiers
    – 18,711 parameters

movies by expected tier:
  1. The Godfather
  2. The Usual Suspects
  3. Casablanca
  4. The Shawshank Redemption
  5. Schindler’s List
  6. One Flew Over the Cuckoo’s Nest
  7. The Godfather: Part II
  8. Monty Python and the Holy Grail
  9. Raiders of the Lost Ark
  10. Star Wars IV: A New Hope

slide-79
SLIDE 79

PSDD Sizes

SLIDE 83

Structured Queries

Original top-5:
  1. Star Wars V: The Empire Strikes Back
  2. Star Wars IV: A New Hope
  3. The Godfather
  4. The Shawshank Redemption
  5. The Usual Suspects

Constraints:
  • no other Star Wars movie in top-5
  • at least one comedy in top-5

Constrained top-5:
  1. Star Wars V: The Empire Strikes Back
  2. American Beauty
  3. The Godfather
  4. The Usual Suspects
  5. The Shawshank Redemption

diversified recommendations via logical constraints

slide-87
SLIDE 87

Pr(A,B,C,D,E) = θA θB θC|AB θD|B θE|CD

PSDD_Pr = PSDD_A * PSDD_B * PSDD_C|AB * PSDD_D|B * PSDD_E|CD

Compiling PGMs into PSDDs

[Figure: Bayesian network over A, B, C, D, E]

Sparse tables [Larkin & Dechter 2003], ADDs [Bahar, et al. 1993], AOMDDs [Mateescu, et al., 2008], PDGs [Jaeger, 2004]

slide-88
SLIDE 88

[Figure: multiplying two arithmetic circuits f and g (shown with their truth tables over A, B, C) yields an arithmetic circuit for the product f * g]

slide-89
SLIDE 89

Conclusions

  • Structured spaces are everywhere
  • Roles of Boolean constraints in ML:
    – Domain constraints and combinatorial objects (structured probability space)
    – Incomplete examples (structured datasets)
    – Questions and evidence (structured queries)
  • Learn distributions over combinatorial objects
  • Strong properties for inference and learning: the Probabilistic Sentential Decision Diagram (PSDD)

slide-90
SLIDE 90

Conclusions

[Figure: PSDD at the intersection of Statistical ML (“Probability”), Symbolic AI (“Logic”), and Connectionism (“Deep”)]

slide-91
SLIDE 91

References

slide-92
SLIDE 92

Questions?

PSDD with 15,000 nodes