physics and phase transitions in parallel computational
play

Physics and phase transitions in parallel computational complexity - PowerPoint PPT Presentation

Physics and phase transitions in parallel computational complexity Jon Machta University of Massachusetts Amherst and Santa Fe Institute Physics of Algorithms August 31, 2009 Tuesday, September 1, 2009 Collaborators Ray Greenlaw,


  1. Physics and phase transitions in parallel computational complexity Jon Machta University of Massachusetts Amherst and Santa Fe Institute Physics of Algorithms August 31, 2009 Tuesday, September 1, 2009

  2. Collaborators • Ray Greenlaw, Armstrong Atlantic University • Cris Moore, University of New Mexico, SFI • Stephan Mertens, Otto-von-Guericke University Magdeburg, SFI • Students: – Ken Moriarty – Xuenan Li – Ben Machta – Dan Tillberg Tuesday, September 1, 2009

  3. Outline • Parallel computing and computational complexity • Parallel complexity of models in statistical physical • Random circuit value problem: complexity of solving and sampling Tuesday, September 1, 2009

  4. Parallel Random Access Machine PRAM •Each processor runs the same program but has a Controller distinct label Processors •Each processor communicates with any 1 2 3 m memory cell in a single time step. •Primary resources:  Parallel time Global Memory  Number of processors Tuesday, September 1, 2009

  5. Parallel Computing Adding n numbers can be carried out in O (log n ) steps using O ( n ) processors. Σ X i + log n + + + + + + X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 Connected components of a graph can be found in O( log 2 n) steps using n 2 processors. Tuesday, September 1, 2009

  6. Complexity Classes and P-completeness • P is the class of feasible problems: solvable with W polynomial work. T T F • NC is the class of problems efficiently solved in parallel (polylog time and polynomial work, NC ⊆ P ). V V V •Are there feasible problems that cannot be solved efficiently in parallel ( P ≠ NC )? D V V V • P -complete problems are the hardest problems in P to solve in parallel. It is believed they are inherently V V V sequential: not solvable in polylog time. •The Circuit Value Problem is P -complete. Tuesday, September 1, 2009

  7. Complexity Classes and P-completeness • P is the class of feasible problems: solvable with T T F polynomial work. • NC is the class of problems efficiently solved in V V T F T parallel (polylog time and polynomial work, NC ⊆ P ). V •Are there feasible problems that cannot be solved V T T F V V efficiently in parallel ( P ≠ NC )? • P -complete problems are the hardest problems in P to V V T T F V solve in parallel. It is believed they are inherently sequential: not solvable in polylog time. T T F •The Circuit Value Problem is P -complete. Tuesday, September 1, 2009

  8. Sampling Complexity Random Bits •Models and algorithms in statistical physics convert PRAM Algorithm random bits into typical system states. Typical System State Tuesday, September 1, 2009

  9. Diffusion Limited Aggregation Witten and Sander, PRL 47, 1400 (1981) •Particles added one at a time with sticking probabilities given by the solution of Laplace’s equation. •Self-organized fractal object d f =1.715… (2D) •Physical systems: Fluid flow in porous media Electrodeposition Bacterial colonies Tuesday, September 1, 2009

  10. Random Walk Dynamics for DLA Tuesday, September 1, 2009

  11. Tuesday, September 1, 2009

  12. The Problem with Parallelizing DLA #3 #1 #1 #2 #2 #3 Parallel dynamics ignores Sequential dynamics interference between 1 and 3 Tuesday, September 1, 2009

  13. Complexity of DLA Theorem : Determining the shape of an aggregate from the random walks of the constituent particles is a P -hard problem. Proof idea: Reduce the Circuit Value Problem to DLA dynamics. output Gadet for NOR gate d c a b power Caveats: input 1 input 2 1. P ≠ NC not proven 2. Average case may be easier than worst case 3. Alternative dynamics may be faster than random walk dynamics for sampling DLA Tuesday, September 1, 2009

  14. Sequential models with polylog parallel complexity • Eden growth • Invasion percolation • Scale free networks • Ballistic deposition • Bak-Sneppen model Eden growth • Internal DLA Scale free network Invasion percolation Tuesday, September 1, 2009

  15. Internal DLA Particles start at the origin, random walk and stick where they first leaves the cluster. •Shape approaches a circle with logarithmic fluctuations . •P-completeness proof fails. (However, IDLA is CC-complete) Tuesday, September 1, 2009

  16. Parallel Algorithm for IDLA C. Moore and JM, J. Stat. Phys. 99 , 661 (2000) 1. Start with seed particle at the origin and N walk trajectories 2. Place particles at expected positions along their trajectories. 3. Iteratively move particles until holes and multiple occupancies are eliminated Average parallel time 1 3 5 polylogarthmic or possibly a small power in N . 2 4 6 Cluster of 2500 particles made in 6 parallel steps. Tuesday, September 1, 2009

  17. Random Monotone CVP W • Circuit arranged in levels with W F T T gates on a level and D levels. • =fraction of TRUE inputs. V τ 0 V V • p =fraction of OR gates. • Gates at level n+1 randomly take k D V V V inputs from gates at level n (with replacement). V V V Monotone CVP is P-complete but how hard is it on average to evaluate the circuit in parallel? Tuesday, September 1, 2009

  18. � Recursion relations, k =2 • Let be the expected fraction of gates τ n evaluating to TRUE at level n . τ n +1 = p (1 − (1 − τ n ) 2 ) + (1 − p ) τ 2 n Absorbing fixed points at and . τ = 1 τ = 0 Tuesday, September 1, 2009

  19. � Recursion relations, k =2 • Let be the expected fraction of gates τ n evaluating to TRUE at level n . τ n +1 = p (1 − (1 − τ n ) 2 ) + (1 − p ) τ 2 n τ n p < 1 / 2 0 1 mainly AND τ n p > 1 / 2 mainly OR 0 1 Tuesday, September 1, 2009

  20. Linearize around fixed points Near the fixed point for p <1/2 the τ = 0 linearized recursion relations are: τ n +1 = 2 p τ n + O ( τ 2 n ) Let T be the time to saturation to all FALSE, τ T ≈ 1 /W ln W T ∼ − ln(2 p ) Tuesday, September 1, 2009

  21. 200 p = 0.52 p = 0.53 p = 0.55 150 p = 0.60 T n 100 50 0 7 4 5 6 8 9 10 ln W log L Time to saturation T as a function of circuit width W for various fractions p of OR gates. Tuesday, September 1, 2009

  22. 30 20 slope m 10 0 0.5 0.55 0.6 0.65 p Slope of the logarithmic scaling of the saturation time vs. p . The solid line is the prediction, -1/ ln (2(1-p)) . Tuesday, September 1, 2009

  23. Critical point at p=1/2 The number of gates, X n evaluating to TRUE at level n obeys a stochastic recursion relation, X n +1 = B ( W, X n /W ) Here B (n,p) is a binomial random variable. After taking the continuum limit, one obtains a diffusion process with absorbing endpoints and a diffusion coefficient that vanishes at the endpoints. Tuesday, September 1, 2009

  24. Critical Saturation Time Using known results for mean first passage times with the spatially non-uniform diffusion coefficient D ( x ) = x 2 (1 − x W ) we obtain a linear saturation time: T = − 2 W [ τ 0 ln τ 0 + (1 − τ 0 ) ln(1 − τ 0 )] Tuesday, September 1, 2009

  25. 2.0 2 T = − 2 W [ τ 0 ln τ 0 + (1 − τ 0 ) ln(1 − τ 0 )] 1.5 slope m 1.0 0.5 0.0 0.4 0.5 0.6 0.7 0.8 0.9 1 τ τ 0 Slope of the linear scaling of the saturation time vs. W . Tuesday, September 1, 2009

  26. Summary for two input gates p � = 1 / 2 • For Circuit evaluation easy T ∼ ln W • For p = 1 / 2 Circuit evaluation hard T ∼ W Tuesday, September 1, 2009

  27. k >2 • For p<1/k or p>1-1/k have T ~ ln W ➜ Fast circuit evaluation . • For 1/k < p < 1-1/k have non-trivial fixed point: 0 < τ ∗ < 1 Circuit does not saturate to a single value except via a large deviation ➜ Slow circuit evaluation . Tuesday, September 1, 2009

  28. Generating Circuit+Solution Pairs F T T • Q: How difficult is it to simultaneously V V T F T V generate an instance of random monotone CVP together with its evaluation? V T T F V V V V T T F V T T F • A: For any values of the parameters, a random instance chosen from the correct distribution and its evaluation can be generated in polylog parallel time on a PRAM. Tuesday, September 1, 2009

  29. Fast Parallel Sampling of Circuit +Evaluation Pairs • Idea: In parallel generate an instance of each level-- gates and their inputs and outputs--then put the levels together into a complete circuit+evaluation. • Difficulty: Inputs to layer n +1 are not known until layer n is evaluated. • Solution: The number of TRUE inputs is all that is required to generate a random level. In parallel construct W +1 instances of each level, one for each number of TRUE inputs. Tuesday, September 1, 2009

  30. Construct one level Given: 2 TRUE, 1 FALSE V V V F T F Tuesday, September 1, 2009

  31. Attaching levels into a circuit+evaluation F T T 0 TRUE, 3 FALSE 1 TRUE, 2 FALSE 2 TRUE, 1 FALSE 3 TRUE, 0 FALSE V V V V V V V V V V V V F F F T F F F T T F T T 0 TRUE, 3 FALSE 1 TRUE, 2 FALSE 2 TRUE, 1 FALSE 3 TRUE, 0 FALSE V V V V V V V V V V V V F F T T F F F T F T T T Tuesday, September 1, 2009

  32. Wiring the circuit F T T 2 TRUE, 1 FALSE V V V F T F 1 TRUE, 2 FALSE V V V F F T Tuesday, September 1, 2009

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend