At the Confluence of Logic and Learning Guy Van den Broeck - PowerPoint PPT Presentation

At the Confluence of Logic and Learning Guy Van den Broeck Dagstuhl September 3, 2019

Outline 1. The AI dilemma: logic vs. learning 2. Deep learning with symbolic knowledge 3. Efficient reasoning during learning 4. New machine learning formalisms 5. Statistical relational learning (tutorial)

The AI Dilemma Pure Learning Pure Logic

The AI Dilemma Pure Learning Pure Logic • Slow thinking: deliberative, cognitive, model-based, extrapolation • Amazing achievements until this day • “ Pure logic is brittle ” noise, uncertainty, incomplete knowledge, …

The AI Dilemma Pure Learning Pure Logic • Fast thinking: instinctive, perceptive, model-free, interpolation • Amazing achievements recently • “ Pure learning is brittle ” bias, algorithmic fairness, interpretability, explainability, adversarial attacks, unknown unknowns, calibration, verification, missing features, missing labels, data efficiency, shift in distribution, general robustness and safety fails to incorporate a sensible model of the world

The FALSE AI Dilemma So all hope is lost? Probabilistic World Models • Joint distribution P(X) • Wealth of representations: can be causal, relational, etc. • Knowledge + data • Reasoning + learning

Then why isn’t everything solved? Pure Logic Probabilistic World Models Pure Learning What did we gain? What did we lose along the way?

Probabilistic World Models Pure Learning Pure Logic A New Synthesis of Learning and Reasoning

Outline 1. The AI dilemma: logic vs. learning 2. Deep learning with symbolic knowledge 3. Efficient reasoning during learning 4. New machine learning formalisms 5. Statistical relational learning (tutorial) 6. Lifted probabilistic inference

Motivation: Vision [Lu, W. L., Ting, J. A., Little, J. J., & Murphy, K. P. (2013). Learning to track and identify players from broadcast sports videos.]

Motivation: Robotics    [Wong, L. L., Kaelbling, L. P., & Lozano-Perez, T., Collision-free state estimation. ICRA 2012]

Motivation: Language • Non-local dependencies: “At least one verb in each sentence” • Sentence compression “If a modifier is kept, its subject is also kept” • NELL ontology and rules … and much more! [Chang, M., Ratinov, L., & Roth, D. (2008). Constraints as prior knowledge], [Ganchev, K., Gillenwater, J., & Taskar, B. (2010). Posterior regularization for structured latent variable models] … and many many more!

Motivation: Deep Learning [Graves, A., Wayne, G., Reynolds, M., Harley, T., Danihelka, I., Grabska- Barwińska , A., et al.. (2016). Hybrid computing using a neural network with dynamic external memory. Nature , 538 (7626), 471-476.]

Motivation: Deep Learning … but …  [Graves, A., Wayne, G., Reynolds, M., Harley, T., Danihelka, I., Grabska- Barwińska , A., et al.. (2016). Hybrid computing using a neural network with dynamic external memory. Nature , 538 (7626), 471-476.]

Knowledge vs. Data • Where did the world knowledge go? – Python scripts • Decode/encode cleverly • Fix inconsistent beliefs – Rule-based decision systems – Dataset design – “a big hack” (with author’s permission) • In some sense we went backwards Less principled, scientific, and intellectually satisfying ways of incorporating knowledge

Learning with Symbolic Knowledge Data + Constraints (Background Knowledge) (Physics) 1. Must take at least one of Probability ( P ) or Logic ( L ). 2. Probability ( P ) is a prerequisite for AI ( A ). 3. The prerequisites for KR ( K ) is either AI ( A ) or Logic ( L ).

Learning with Symbolic Knowledge Data + Constraints (Background Knowledge) (Physics) Learn ML Model Today’s machine learning tools don’t take knowledge as input! 

Deep Learning + Data Constraints with Deep Neural Learn Symbolic Knowledge Network Neural Network Logical Constraint Output Input Output is probability vector p , not Boolean logic!

Semantic Loss Q: How close is output p to satisfying constraint α ? Answer: Semantic loss function L( α , p ) • Axioms, for example: – If α constrains to one label, L( α , p ) is cross-entropy – If α implies β then L( α , p ) ≥ L(β , p ) ( α more strict ) • Implied Properties: SEMANTIC – If α is equivalent to β then L( α , p ) = L( β , p ) Loss! – If p is Boolean and satisfies α then L( α , p ) = 0

Semantic Loss: Definition Theorem: Axioms imply unique semantic loss: Probability of getting state x after flipping coins with probabilities p Probability of satisfying α after flipping coins with probabilities p

Simple Example: Exactly-One • Data must have some label We agree this must be one of the 10 digits: • Exactly-one constraint 𝒚 𝟐 ∨ 𝒚 𝟑 ∨ 𝒚 𝟒 ¬𝒚 𝟐 ∨ ¬𝒚 𝟑 → For 3 classes: ¬𝒚 𝟑 ∨ ¬𝒚 𝟒 • Semantic loss: ¬𝒚 𝟐 ∨ ¬𝒚 𝟒 Only 𝒚 𝒋 = 𝟐 after flipping coins Exactly one true 𝒚 after flipping coins

Semi-Supervised Learning • Intuition: Unlabeled data must have some label Cf. entropy minimization, manifold learning • Minimize exactly-one semantic loss on unlabeled data Train with 𝑓𝑦𝑗𝑡𝑢𝑗𝑜𝑕 𝑚𝑝𝑡𝑡 + 𝑥 ∙ 𝑡𝑓𝑛𝑏𝑜𝑢𝑗𝑑 𝑚𝑝𝑡𝑡

Experimental Evaluation Competitive with state of the art in semi-supervised deep learning Outperforms SoA! Same conclusion on CIFAR10

But what about real constraints? • Path constraint cf. Nature paper vs . • Example: 4x4 grids 2 24 = 184 paths + 16,777,032 non-paths • Easily encoded as logical constraints  [Nishino et al., Choi et al.]

How to Compute Semantic Loss? • In general: #P-hard 

Reasoning Tool: Logical Circuits Representation of logical sentences: 𝐷 ∧ ¬𝐸 ∨ ¬𝐷 ∧ 𝐸 C XOR D

Reasoning Tool: Logical Circuits 1 Representation of 0 1 logical sentences: 1 1 0 1 Input: 0 1 0 1 0 1 1 0 0 0 1 1 0 1 0 1 0

Tractable for Logical Inference • Is there a solution? (SAT) – SAT( 𝛽 ∨ 𝛾 ) iff SAT( 𝛽 ) or SAT( 𝛾 ) ( always ) – SAT( 𝛽 ∧ 𝛾 ) iff ???

Decomposable Circuits Decomposable A B,C,D

Tractable for Logical Inference • Is there a solution? (SAT) ✓ – SAT( 𝛽 ∨ 𝛾 ) iff SAT( 𝛽 ) or SAT( 𝛾 ) ( always ) – SAT( 𝛽 ∧ 𝛾 ) iff SAT( 𝛽 ) and SAT( 𝛾 ) ( decomposable ) • How many solutions are there? (#SAT) • Complexity linear in circuit size 

Deterministic Circuits Deterministic C XOR D

Deterministic Circuits Deterministic C XOR D C ⇔ D

How many solutions are there? (#SAT) x 16 8 8 8 8 1 1 4 4 4 + 2 2 2 2 1 1 1 1 1 1 1 1 1 1

Tractable for Logical Inference • Is there a solution? (SAT) ✓ ✓ • How many solutions are there? (#SAT) • Conjoin, disjoin, equivalence checking, etc. ✓ • Complexity linear in circuit size  • Compilation into circuit by – ↓ exhaustive SAT solver – ↑ conjoin/disjoin/negate [Darwiche and Marquis, JAIR 2002]

How to Compute Semantic Loss? • In general: #P-hard  • With a logical circuit for α : Linear  • Example: exactly-one constraint: L( α , p ) = L( , p ) = - log( ) • Why? Decomposability and determinism!

Predict Shortest Paths Add semantic loss for path constraint Is output Is prediction Are individual a path? the shortest path? edge predictions This is the real task! correct? (same conclusion for predicting sushi preferences, see paper)

Conclusions 1 • Knowledge is (hidden) everywhere in ML • Semantic loss makes logic differentiable • Performs well semi-supervised • Requires hard reasoning in general – Reasoning can be encapsulated in a circuit – No overhead during learning • Performs well on structured prediction • A little bit of reasoning goes a long way!

Another False Dilemma? Classical AI Methods Neural Networks Hungry? $25? Restau Sleep? rant? … “Black Box” Clear Modeling Assumption Empirical performance Well-understood

Probabilistic Circuits 𝐐𝐬(𝑩, 𝑪, 𝑫, 𝑬) = 𝟏. 𝟏𝟘𝟕 0 . 096 .8 x .3 SPNs, ACs .194 .096 1 0 PSDDs, CNs .01 .24 0 (.1x1) + (.9x0) .3 0 .1 .8 Input: 0 0 1 0 1 0 1 0 1 0

At the Confluence of Logic and Learning Guy Van den Broeck - PowerPoint PPT Presentation

At the Confluence of Logic and Learning Guy Van den Broeck Dagstuhl September 3, 2019 Outline 1. The AI dilemma: logic vs. learning 2. Deep learning with symbolic knowledge 3. Efficient reasoning during learning 4. New machine learning

Rewriting Part 5. Confluence of Term Rewriting Systems Temur Kutsia RISC, JKU Linz Confluence

Disproving Confluence of Term Rewriting Systems by Interpretation and Ordering FroCoS 2013

CHINO BASIN WATER CONSERVATION DISTRICT The Confluence Regional Water Resource Project Robert

Deciding Confluence of Certain Term Rewriting Systems in Polynomial Time Ashish Tiwari { tiwari

Confluence Another important property for dont care non-deterministic rule based definitions

Confluence up to Garbage 13th International Conference on Graph Transformation Graham Campbell

7 th International Confluence Competition Takahito Aoto Makoto Hamana Nao Hirokawa Aart

Compositional Z Confluence for Permutative Conversion Koji Nakazawa (Nagoya U) joint work with

EVOLUTION OF THE CISO And the Confluence of IT Security & Audit Thomas Borton, MBA, CISA,

Overview of the Lower Yuba River Overview of the Lower Yuba River Existing Information &

Community Wiki Training http://community.icann.org 22 October 2011 Carole Cornell Agenda

The River Future San Gabriel River Confluence with Cattle Canyon Creek Improvements Project (the

Founded 2009 Top selling Jira app 18 Employees Top selling Confluence app 6 Nationalities Gold Top

CONFLUENCE RIVER PARK Montpelier, Vermont Conceptual Design & Outreach Vermont River

Triveni Sangam- Confluence and Divergence , GST, Accounting and Income-tax Case Studies CA

Understanding Womens Work Experiences in Humboldt Park Through the Confluence of Gender, Race

Consistency Analysis for User Requirements Notation Models Okhaide Akhigbe, Daniel Amyot, Amal

Plya Urns An analytic combinatorics approach Basile Morcrette Algorithms project, INRIA

Area under lattice paths associated with certain urn models. Alois Panholzer, Markus Kuba

Urn models with multiple drawings Markus Kuba joint work with May-Ru Chen; Hosam Mahmoud and

What is an Explicit Construction? Bill Gasarch- U. of MD-College Park gasarch@cs.umd.edu Bill

Computing the Multiplicity Structure from Geometric Involutive Form Xiaoli Wu and Lihong Zhi Key

MA 526 Commutative Algebra / JanApril 2018 (MSc and Ph. D. Programmes) Mobile : 07975952079

Electroweak corrections to W/Z+jet production at the LHC Alexander Mck in collaboration with

At the Confluence of Logic and Learning Guy Van den Broeck - PowerPoint PPT Presentation

At the Confluence of Logic and Learning Guy Van den Broeck Dagstuhl September 3, 2019 Outline 1. The AI dilemma: logic vs. learning 2. Deep learning with symbolic knowledge 3. Efficient reasoning during learning 4. New machine learning

Rewriting Part 5. Confluence of Term Rewriting Systems Temur Kutsia RISC, JKU Linz Confluence

Disproving Confluence of Term Rewriting Systems by Interpretation and Ordering FroCoS 2013

CHINO BASIN WATER CONSERVATION DISTRICT The Confluence Regional Water Resource Project Robert

Deciding Confluence of Certain Term Rewriting Systems in Polynomial Time Ashish Tiwari { tiwari

Confluence Another important property for dont care non-deterministic rule based definitions

Confluence up to Garbage 13th International Conference on Graph Transformation Graham Campbell

7 th International Confluence Competition Takahito Aoto Makoto Hamana Nao Hirokawa Aart

Compositional Z Confluence for Permutative Conversion Koji Nakazawa (Nagoya U) joint work with

EVOLUTION OF THE CISO And the Confluence of IT Security &amp; Audit Thomas Borton, MBA, CISA,

Overview of the Lower Yuba River Overview of the Lower Yuba River Existing Information &amp;

Community Wiki Training http://community.icann.org 22 October 2011 Carole Cornell Agenda

The River Future San Gabriel River Confluence with Cattle Canyon Creek Improvements Project (the

Founded 2009 Top selling Jira app 18 Employees Top selling Confluence app 6 Nationalities Gold Top

CONFLUENCE RIVER PARK Montpelier, Vermont Conceptual Design &amp; Outreach Vermont River

Triveni Sangam- Confluence and Divergence , GST, Accounting and Income-tax Case Studies CA

Understanding Womens Work Experiences in Humboldt Park Through the Confluence of Gender, Race

Consistency Analysis for User Requirements Notation Models Okhaide Akhigbe, Daniel Amyot, Amal

Plya Urns An analytic combinatorics approach Basile Morcrette Algorithms project, INRIA

Area under lattice paths associated with certain urn models. Alois Panholzer, Markus Kuba

Urn models with multiple drawings Markus Kuba joint work with May-Ru Chen; Hosam Mahmoud and

What is an Explicit Construction? Bill Gasarch- U. of MD-College Park gasarch@cs.umd.edu Bill

Computing the Multiplicity Structure from Geometric Involutive Form Xiaoli Wu and Lihong Zhi Key

MA 526 Commutative Algebra / JanApril 2018 (MSc and Ph. D. Programmes) Mobile : 07975952079

Electroweak corrections to W/Z+jet production at the LHC Alexander Mck in collaboration with

EVOLUTION OF THE CISO And the Confluence of IT Security & Audit Thomas Borton, MBA, CISA,

Overview of the Lower Yuba River Overview of the Lower Yuba River Existing Information &

CONFLUENCE RIVER PARK Montpelier, Vermont Conceptual Design & Outreach Vermont River