1/22
Causal Commutative Arrows Revisited

Jeremy Yallop (University of Cambridge)
Hai (Paul) Liu (Intel Labs)

September 21, 2016
2/22
Normalization as an optimization technique?

◮ Plausible, because it preserves semantics.
◮ Effective, when conditions are met:
  ◮ It has to terminate;
  ◮ It gives a simpler program as a result;
  ◮ It enables other optimizations.
◮ ... with a few catches:
  ◮ Strong normalization can be too restrictive;
  ◮ Sharing is hard to preserve;
  ◮ Static or dynamic implementation?
3/22
Arrows
Arrows are a generalization of monads (Hughes 2000).

class Arrow (arr :: ∗ → ∗ → ∗) where
  arr   :: (a → b) → arr a b
  (≫)   :: arr a b → arr b c → arr a c
  first :: arr a b → arr (a, c) (b, c)

class Arrow arr ⇒ ArrowLoop arr where
  loop :: arr (a, c) (b, c) → arr a b
[Diagrams: (a) arr f, (b) f ≫ g, (c) first f, (d) second f, (e) f ⋆⋆⋆ g, (f) loop f]
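As a concrete instance to keep in mind, plain functions themselves form an Arrow (the `(->)` instance from `Control.Arrow` in base). A minimal sketch of the combinators in action; the `pipeline` example is ours, purely illustrative:

```haskell
import Control.Arrow (arr, first, (>>>))

-- arr lifts a pure function, (>>>) composes left-to-right, and
-- first routes a paired value through unchanged.
pipeline :: (Int, String) -> (Int, String)
pipeline = first (arr (+ 1) >>> arr (* 2))

-- pipeline (3, "tag") evaluates to (8, "tag")
main :: IO ()
main = print (pipeline (3, "tag"))
```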
4/22
Arrow and ArrowLoop laws
arr id ≫ f ≡ f
f ≫ arr id ≡ f
(f ≫ g) ≫ h ≡ f ≫ (g ≫ h)
arr (g . f) ≡ arr f ≫ arr g
first (arr f) ≡ arr (f × id)
first (f ≫ g) ≡ first f ≫ first g
first f ≫ arr (id × g) ≡ arr (id × g) ≫ first f
first f ≫ arr fst ≡ arr fst ≫ f
first (first f) ≫ arr assoc ≡ arr assoc ≫ first f

loop (first h ≫ f) ≡ h ≫ loop f
loop (f ≫ first h) ≡ loop f ≫ h
loop (f ≫ arr (id × k)) ≡ loop (arr (id × k) ≫ f)
loop (loop f) ≡ loop (arr assoc⁻¹ . f . arr assoc)
second (loop f) ≡ loop (arr assoc . second f . arr assoc⁻¹)
loop (arr f) ≡ arr (trace f)
5/22
Normalizing arrows (a dataflow example)

[Dataflow diagrams: (a) original, (b) normalized]
6/22
Causal Commutative Arrows

CCA is a more restricted arrow with an additional init combinator:

class ArrowLoop arr ⇒ ArrowInit arr where
  init :: a → arr a a

and two additional arrow laws:

first f ≫ second g ≡ second g ≫ first f
init i ⋆⋆⋆ init j ≡ init (i, j)

Causal Commutative Normal Form (CCNF) is either a pure arrow, or a single loop containing a pure arrow and an initial state:

loopD :: ArrowInit arr ⇒ c → ((a, c) → (b, c)) → arr a b
loopD i f = loop (arr f ≫ second (init i))

Proved by algebraic arrow laws (Liu et al., ICFP 2009; JFP 2010).
7/22
Application: stream transformers as arrows

newtype SF a b = SF {unSF :: a → (b, SF a b)}

instance Arrow SF where
  arr f = g where g = SF (λx → (f x, g))
  f ≫ g = ...
  first f = ...

instance ArrowLoop SF where ...
instance ArrowInit SF where ...

We can run a stream transformer over an input stream:

runSF :: SF a b → [a] → [b]
runSF (SF f) (x : xs) = let (y, f′) = f x in y : runSF f′ xs

nthSF :: Int → SF () a → a
nthSF n sf = runSF sf (repeat ()) !! n
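The elided instance methods can be filled in along these lines. This is a sketch of one natural implementation (the helper names `composeSF`, `firstSF`, `initSF` are ours, not necessarily the paper's exact code):

```haskell
newtype SF a b = SF { unSF :: a -> (b, SF a b) }

-- Sequential composition: feed the input through f, its output
-- through g, and step both transformers.
composeSF :: SF a b -> SF b c -> SF a c
composeSF (SF f) (SF g) = SF $ \x ->
  let (y, f') = f x
      (z, g') = g y
  in (z, composeSF f' g')

-- first applies the transformer to the first component and passes
-- the second component through unchanged.
firstSF :: SF a b -> SF (a, c) (b, c)
firstSF (SF f) = SF $ \(x, c) ->
  let (y, f') = f x in ((y, c), firstSF f')

-- init delays its input stream by one step, emitting the seed first.
initSF :: a -> SF a a
initSF i = SF $ \x -> (i, initSF x)

-- Run over a (finite) input list, as on the slide.
runSF :: SF a b -> [a] -> [b]
runSF _ [] = []
runSF (SF f) (x : xs) = let (y, f') = f x in y : runSF f' xs

main :: IO ()
main = print (runSF (initSF 0) [1, 2, 3])  -- a one-step delay
```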
8/22
Performance Comparison

Orders of magnitude speedup (JFP 2010):

Name         SF    CCNF sf   CCNF tuple
exp          1.0   30.84     672.79
sine         1.0   18.89     442.48
oscSine      1.0   14.28     29.53
50’s sci-fi  1.0   18.72     21.37
robotSim     1.0   24.67     34.93

Table: Performance ratio (greater is better)

Normalization of CCA programs seems very effective! But why is everyone not using it?? Not even used by Euterpea, the music and sound synthesis framework from the same research group!
9/22
Pitfalls of the CCA implementation

The initial CCA library was implemented using Template Haskell, because:

◮ Normalization is a syntactic transformation;
◮ A meta-level implementation guarantees normal form at compile time;
◮ TH is less work than a full-blown pre-processor.

However, TH-based static normalization is:

◮ restricted to first-order, no reactivity, etc.
◮ hard to program with:

f x = ... [| ... x ... |] ...
... $(norm g) ...

◮ perhaps not as effective as we had thought for “real” applications?
10/22
How about run-time normalization?

from: Paul Liu
to: Jeremy Yallop
cc: Paul Hudak, Eric Cheng
date: 18 June 2009

I wonder if there is any way to optimize GHC’s output based on your code since the CCNF is actually running slower

“. . . that the actual construction of CCNF is now at run-time rather than compile-time. Therefore, we cannot rely on GHC to take the pure function and state captured in a CCNF and produce optimized code. . . ” (Liu 2011)
11/22
Normalization by construction

1. Define normal form as a data type:

data CCNF a b where
  Arr   :: (a → b) → CCNF a b
  LoopD :: c → ((a, c) → (b, c)) → CCNF a b

2. Observation function:

observe :: ArrowInit arr ⇒ CCNF a b → arr a b
observe (Arr f)     = arr f
observe (LoopD i f) = loop (arr f ≫ second (init i))

3. Instances for the data type:

instance Arrow CCNF where ...
instance ArrowLoop CCNF where ...
instance ArrowInit CCNF where ...
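The instances are where normalization happens: each combinator maps normal forms to a normal form. A sketch of the composition cases, with the routing inlined (the standalone `compose` and illustrative `runCCNF` helper are ours; the paper derives the full instances from the laws):

```haskell
{-# LANGUAGE GADTs #-}

data CCNF a b where
  Arr   :: (a -> b) -> CCNF a b
  LoopD :: c -> ((a, c) -> (b, c)) -> CCNF a b

-- Composition of two normal forms is again a normal form:
-- pure parts fuse, and two loop states pair up into one state.
compose :: CCNF a b -> CCNF b c -> CCNF a c
compose (Arr f)     (Arr g)     = Arr (g . f)
compose (Arr f)     (LoopD j g) = LoopD j (\(a, t) -> g (f a, t))
compose (LoopD i f) (Arr g)     =
  LoopD i (\(a, s) -> let (b, s') = f (a, s) in (g b, s'))
compose (LoopD i f) (LoopD j g) =
  LoopD (i, j) $ \(a, (s, t)) ->
    let (b, s') = f (a, s)
        (c, t') = g (b, t)
    in (c, (s', t'))

-- Run a normal form over a list of inputs, threading the state.
runCCNF :: CCNF a b -> [a] -> [b]
runCCNF (Arr f)     xs = map f xs
runCCNF (LoopD i f) xs = go i xs
  where
    go _ []       = []
    go s (x : r)  = let (y, s') = f (x, s) in y : go s' r

main :: IO ()
main = print (runCCNF (compose delaySum (Arr (* 2))) [1, 1, 1])
  where delaySum = LoopD (0 :: Int) (\(x, s) -> (s, s + x))
```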
12/22
Optimize the observe function

1. Specialize observe to a concrete instance.

observe   :: ArrowInit arr ⇒ CCNF a b → arr a b
observeSF :: CCNF a b → SF a b

observeSF (Arr f)     = arrSF f
observeSF (LoopD i f) = loopSF (arrSF f ≫SF secondSF (initSF i))

2. Derive an optimized definition.

observeSF (LoopD i f) = loopD i f
  where
    loopD :: c → ((a, c) → (b, c)) → SF a b
    loopD i f = SF (λx → let (y, i′) = f (x, i) in (y, loopD i′ f))

3. Fuse observe with the context in which it is used.

nthCCNF :: Int → CCNF () a → a
nthCCNF n = nthSF n . observeSF
          = ...
13/22
Performance comparison
◮ Compute 44100 × 5 = 2,205,000 samples (≈ 5 seconds of audio)
◮ GHC 7.10.3 using the flags -O2 -funfolding-use-limit=512
◮ 64-bit Linux, Intel Xeon CPU E5-2680 2.70GHz
           Benchmark        Unnormalized   Normalized
Name       States   Loops   SF             CCNF    TH
fib           2       1     1.0            2.29    2.30
exp           1       2     1.0            242     242
sine          2       1     1.0            124     146
oscSine       1       1     1.0            60.6    60.6
sci-fi        3       3     1.0            27.7    27.4
robot         5       4     1.0            104     96.7
flute        16       7     1.0            5.10    16.2
shepard      80      30     1.0            7.47    12.9

(The SF column is the baseline; the CCNF and TH columns are speedup ratios.)
14/22
Why it works

◮ Unlike SF, CCNF is not recursively defined.

data SF a b where
  SF :: (a → (b, SF a b)) → SF a b

data CCNF a b where
  Arr   :: (a → b) → CCNF a b
  LoopD :: c → ((a, c) → (b, c)) → CCNF a b

◮ The hand-optimized observe function is the key to performance.

nthCCNF :: Int → CCNF () a → a
nthCCNF n = nthSF n . observeSF
          = ...

◮ GHC has improved! GHC 6.10 fails to optimize our program.

Compilers help those who help compilers!
15/22
Levels of abstraction

Axiomatic    . . .  Type class (Arrow laws)
      ↓
Denotational . . .  Data type (Interpretation)
      ↓
Operational  . . .  Mealy machine (state and transition)

nthCCNF n (LoopD i f) = next n i
  where
    next n i = if n ≡ 0 then x else next (n − 1) i′
      where (x, i′) = f ((), i)
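A small runnable illustration of the Mealy-machine reading: the fused `nthCCNF` just iterates the transition function, threading the state. The `counter` arrow here is our own illustrative example, not from the talk:

```haskell
{-# LANGUAGE GADTs #-}

data CCNF a b where
  Arr   :: (a -> b) -> CCNF a b
  LoopD :: c -> ((a, c) -> (b, c)) -> CCNF a b

-- Operational view: run the transition n+1 times and return
-- the last output; no intermediate arrow structure remains.
nthCCNF :: Int -> CCNF () a -> a
nthCCNF _ (Arr f)     = f ()
nthCCNF n (LoopD i f) = next n i
  where
    next m s =
      let (x, s') = f ((), s)
      in if m == 0 then x else next (m - 1) s'

-- A counter emitting 0, 1, 2, ...; its state is the next value.
counter :: CCNF () Int
counter = LoopD (0 :: Int) (\((), n) -> (n, n + 1))

main :: IO ()
main = print (nthCCNF 5 counter)
```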
16/22
That is not all (performance we could have)

◮ CCA normalization clusters all states as one nested tuple.

LoopD ((0, ((0, 0), 0)), (((((buf100), 0), 0), ((0), (((buf50), 0), 0))),
       (((0, i), (0, ((0, 0), 0))), ((0, ((0, 0), 0)), (0, ((0, 0), 0))))))
      (λ(((((a, f), e), d), c), ...) → ...)

◮ The transition function destructs/constructs tuples at every iteration!

next n i = if n ≡ 0 then x else next (n − 1) i′
  where (x, i′) = f ((), i)

◮ GHC can only help us so far.
◮ Real applications demand mutable state (for arrays and so on).
17/22
Local mutable state via the ST monad

The ST monad in Haskell:

data ST s a = ...
runST :: (forall s . ST s a) → a
fixST :: (a → ST s a) → ST s a

Use an ST computation as our state:

data CCNF_ST s a b where
  ArrST   :: (a → b) → CCNF_ST s a b
  LoopDST :: ST s c → (c → a → ST s b) → CCNF_ST s a b

The fused observe function:

nth′ST :: Int → CCNF_ST s () a → ST s a
nth′ST n (LoopDST i f) = do
  g ← fmap f i
  let next n = do
        x ← g ()
        if n ≡ 0 then return x else next (n − 1)
  next n
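The same counter as before, recast in the ST style: the state is an STRef produced by an initializer, and the transition mutates it. An illustrative sketch in the shape of nth′ST above (the names `counterInit`, `counterStep`, `nthCounter` are ours):

```haskell
import Control.Monad.ST (ST, runST)
import Data.STRef (STRef, newSTRef, readSTRef, writeSTRef)

-- Initial state: an STRef holding the next value to emit
-- (this plays the role of the "ST s c" field of LoopDST).
counterInit :: ST s (STRef s Int)
counterInit = newSTRef 0

-- Transition: read the current value, bump the ref, return it
-- (the "c -> a -> ST s b" field of LoopDST).
counterStep :: STRef s Int -> () -> ST s Int
counterStep r () = do
  n <- readSTRef r
  writeSTRef r (n + 1)
  return n

-- Fused nth: allocate the state once, then iterate the transition.
nthCounter :: Int -> Int
nthCounter n = runST $ do
  r <- counterInit
  let next k = do
        x <- counterStep r ()
        if k == 0 then return x else next (k - 1)
  next n

main :: IO ()
main = print (nthCounter 5)
```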
18/22
A (recursively defined) sound synthesis example
shepard :: BufferedCircuit a ⇒ Time → a () Double
shepard seconds =
  if seconds ≤ 0.0 then arr (const 0.0)
  else proc () → do
    f ← envLineSeg [800, 100, 100] [4.0, seconds] −≺ ()
    e ← envLineSeg [0, 1, 0, 0] [2.0, 2.0, seconds] −≺ ()
    s ← osc sineTable 0 −≺ f
    r ← delayLine 0.5 ≪ shepard (seconds − 0.5) −≺ ()
    returnA −≺ (e ∗ s ∗ 0.1) + r

Challenges of optimizing a recursively defined arrow:

◮ Static normalization blows up code size.
◮ Nested state builds up quickly and deeply.
19/22
Shepard performance (higher is better)

[Plot: output rate (samples/second, 0K–180K) against input size (5–15), comparing CCNF ST and CCNF Template Haskell]
20/22
That is still not all (performance we would like to have)

◮ The definition of loop requires a recursive monad:

instance ArrowLoop (CCNF_ST s) where
  loop (LoopDST i f) = LoopDST i h
    where h i x = do
            rec (y, j) ← f i (x, j)
            return y

◮ Although in the end all loops are de-coupled, the overhead of the ST type remains in the compiled code.

fixST :: (a → ST s a) → ST s a
fixST k = ST $ λs →
  let ans       = liftST (k r) s
      STret _ r = ans
  in case ans of STret s′ x → (# s′, x #)
21/22
Related work

◮ Representing arrow computations as data (Hughes 2005, Nilsson 2005, Yallop 2010)
◮ Generalized arrows (Joseph 2014)
◮ Deriving implementations by equational reasoning (Bird 1988, Hinze 2000)
◮ Free representations used in optimization (Voigtländer 2008, Kiselyov and Ishii 2015)
22/22
More in the paper
◮ Normalization by construction, in steps.
◮ Equational derivation of the observe function.
◮ Embedding mutable state with the ST monad.
◮ Proving CCNF_ST is an instance of CCA.
◮ Detailed performance analysis.
22/22